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Kinesthetic Aftereffect (KAE), once a promising personality index, has been 
abandoned by many investigators because of poor retest reliability and inter- 
mittent validity. In challenging this current consensus, We argue that (a) first- 
session KAE is valid; (b) poor retest reliability simply reflects Jater-session 
bias; (c) hence, multisession studies should not be used to assess validity with- 


out taking this bias into account, 


KAE validity were each multisession in 


Those recent studies which failed to support 
design. If our bias contention is correct, 


these studies should be ignored, and the claim of intermittent validity is thus 
rebutted. Reanalysis of the most recent major multisession, nonsupportive va- 


lidity study indicates (a) Session 1 v 
multisession scores are combined to avoid bias. 


Thus, KAE validly measures personality. 


Jater-session validity when 


In 1970 the Kinesthetic Aftereffect (KAE) 
task was regarded as measuring an important 
personality variable—individual differences in 
| reactivity to stimulus intensity—and was 
~ believed to predict a wide range of behaviors 

(e.g., Petrie, 1967). Today, this task has been 

abandoned by many researchers because of 

abysmally poor test-retest reliability and 
; marginal and/or intermittent validity findings 
M*(cf. below). At this crucial stage in the his- 
| tory of this variable, the present paper chal- 


lenges the current critical consensus and pre- 
sents an alternative conceptualization which 
reinstates KAE as a measure of a promising 
personality construct as follows: 


1. We review the critique of the KAE 
task based on its poor test-retest reliability 


We thank Daniel Weintraub, who went far 
beyond the call of duty in making his data and 
details of his analyses available to us. Thanks are 
also extended to Barbara Duncan, Henrietta Gal- 
lagher, Janet Schmidt, Laraine Schwartz, and Scot 

Tempel for their help in this investigation. A brief 

` report of the Session 1 validity findings was pre- 
sented at the meetings of the Eastern Psychological 
Association in April 1974 at Philadelphia. 
Requests for reprints should be sent to A. Harvey 
| Baker, who is now at the Department of Psychol- 
ogy, Queens College, City University of New York, 
Flushing, New York 11367. 


alidity, 


(b) later-session bias, and (c) 


and then challenge this critique on logical- 
empirical grounds. Low test-retest reliability 
is here reinterpreted to indicate not that KAE 
scores are invalid but rather that second- (and 
later-) session scores are biased. Therefore, only 
the first-session scores provide unbiased mea- 
sures of the dimension tapped by this task. 
It is thus concluded that multisession studies 
should not be used to assess validity without 
explicitly taking this bias into account. 

2. We reexamine the KAE validity litera- 
ture in light of this conclusion, Here we 
demonstrate that those recent studies which 
failed to show positive research findings each 
involved KAE scores from two ur more SeS- 
sions. If our bias formulation is correct, such 
KAE scores are biased and, 


multisession 
ignored, 


therefore, these studies should be 
The claim of intermittent validity is thus 
rebutted. 

3, We provide a test of the conclusions 
reached in this reexamination of KAE reli- 
ability and validity by reanalyzing data from 
the most recent major nonsupportive validity 
study (Weintraub, Green & Herzog, 1973) in 
done in the original study but 
clearly required by the guidelines emerging 
from our reexamination. Evidence supporting 


first-session KAE validity and second- (and 


a manner not 
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Jater-) session KAE bias emerged from these 
reanalyses, 


The KAE Task and the Modulation 
Hypothesis 


Tn brief, KAE involves judgments of the 
width of a standard wooden block before 
(pretest trials) and after (test trials) inter- 
vening tactile stimulation, which consists of 
rubbing a block of wood differing in size from 
the standard. The KAE score is defined as the 
difference between these two sets of judg- 
ments (test minus pretest). KAE validity 
studies have involved from one to seven 
sessions of KAE testing, 

In the Petrie (1967) procedure, the sub- 
ject rubs a stimulus block with one hand 
while resting the other. In another variant of 
the KAE task, the subject simultaneously 
rubs two stimulus blocks—a larger one with 
one hand, a smaller one with the other (e.g., 
Spilker & Callaway, 1969). This paper is de- 
limited to reliability and validity issues which 
arise when the one-hand procedure is used 
and does not consider issues arising with 
employment of the two-hand variant, 

Petrie (1967) hypothesized that KAE 
assesses individual differences in a postulated 
mechanism which modulates the intensity of 
incoming stimulation (modulation hypothesis), 
At one extreme, people who judged the stan- 
dard block as relatively smaller after (test) 
stimulation as compared with before (pretest) 
stimulation are termed reducers because they 
presumably reduce the subjective intensity 
of incoming stimulation, Under normal cir- 
cumstances, they are therefore subjectively 
stimulus deprived and seek more stimulation 
to compensate. At the other extreme, people 
who judged the standard block as relatively 
larger after as compared with before stimula- 
tion are known as augmenters because they 
presumably magnify the subjective intensity of 
incoming stimulation. Augmenters are there- 
fore overloaded with stimulation and seek to 
avoid stimulation, (Subjects who fall between 
the two extremes are called moderates.) 

Early validity studies Supported this view. 
For example, under the high-intensity stimu- 
lation of pain, reducers showed greater toler- 


—* 


ance than augmenters (Petri 


Collins, 
Solomon, 1958; Poser, 1960; Sy 


ney, 1966 


However, when the stimulus ¿ituation il 
volved the very low-intensity s* ulation ( 
sensory deprivation, reducers y+ found 1 
be less tolerant (Petrie et al., 1958) ar 
physically more active (Sales, | 71) 

augmenters. Ryan and Foster (1° 7) furthe 
found that male students who pa: cipated i 
body contact sports such as foot! 1l reduce 
more on KAE and showed higher »ain tole 
ance than did males who did not articipat 
in such sports. In keeping with th. view th 


reducers seek higher levels of 
Petrie (1967) reported that re 
more friends and showed mor 
behavior and a higher activity 
(1971, 1972) extended this line 
in four additional interrelated 
found support for the view that 
disposed to seek out and react ; 
complex and intense stimulus situ 
augmenters tend to seek out and 
and simple stimulus situations. 
Recently, the literature on KA 
a sharp turn. The current cons 


mulation 
cers hal 
smoki 
el. Sal 
resear 
dies ani 
icers a 
ively t 
ns, whi 


(a) KAE lacks test-retest reliabi McDon 
ald, 1974; Morgan, 1972: Morg Hilgard, 
1972; Morgan, Lezard, Prytulal:, & Hilgard, 
1970; Platt, Holtzman, & Le son, 1971}, 
Sales & Throop, 1972: Weintr & Herzog, 
1973; Weintraub et al., 1: Lykken; 
Note 1); (b) KAE shows only marginal, if 
any, validity (McDonald, 197}; Morgan & 


Hilgard, 1972; Sales, Guydosh, & Iacona, 
1974; Weintraub et al., 1973): and (c) use 
of these KAE scores as an index of stable 
individual differences should be discontinued 
(Morgan, 1972: Morgan & Hilgard, 1972} 
Platt et al, 1971; Weintraub & ‘ierzogy 
1973; Weintraub et al., 1973). 

We contend that first-sessi E scores 
Possess true test reliability because they have 
demonstrated concurrent and construct valid: 
ity relationships with personality and cogni- 
tive variables. Our view is based on logical 
and empirical grounds.? 


1 These issues, 
dealt with at 
Mishara, Parker, 


treated somewhat briefly here, 4 
greater length elsewhere (Baket 
& Kostin, Note 2). 
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rhe Reliability Critique 

As described above, a number of critics 
nave declared that KAE is invalid as a mea- 
sure of individual differences because of its 
voor test-retest reliability. They have appar- 
ently reasoned syllogistically as follows: (a) 
Reliability is essential for a construct to be 
meaningful: “Reliability is the sine qua non, 
however, for any measure of individual dif- 
fences” (Weintraub et al., 1973, p. 829). 
(5) The poor correlations between two iden- 
tical administrations of KAE demonstrate 
poor reliability. (c) Therefore, the validity 
findings are chimerical and the KAE variable 
should be abandoned: 


In any case, the . . + aftereffect difference score - - - 
js contraindicated as a measure of individual differ- 
ences because test-retest reliability is poor. (Weintraub 
& Herzog, 1973, p. 421) 


The KFAE [sic] test, using the apparatus employed 
in this experiment, does not tap an enduring charac- 
teristic of S.... It, therefore, seems highly un- 
likely that KFAE [sic] measures a stable character- 
istic of the person such as a cognitive control or a 
cognitive style. (Platt et al., 1971, p. 794) 


The demonstrated reliability in these data was not 
sufficiently high (yr =.18) to justify analysis as a 
cognitive style. (Morgan, 1972, p. 505) 


Low Test-Retest Reliability as Bias 


The above argument is specious. Reliability 
in the first proposition is true reliability, an 
Jideal concept which cannot be measured 
directly but only inferred (Guilford, 1965, 
p. 439). Reliability in the second proposition 
Jis test-retest reliability, one means of esti- 
mating true reliability. Low test-retest reli- 
ability does not necessarily indicate low true 
Î reliability; alternative explanations are pos- 
sible. Practice effects, which can bias scores 
on a later administration of a test, may 
account for low test-retest reliability. 

Is there evidence for such bias effects with 
the KAE task? Yes. First, the group mean of 
second- and later-session KAE pretests gen- 
erally differs from the mean of the first- 
session pretest (Bakan & Thompson, 1962; 
Hilgard, Morgan, & Prytulak, 1968; Wein- 
traub et al., 1973), which suggests in itself 
| the possibility of differential carry-over effects. 
However, if a carry-over effect existed which 
| was essentially the same for all subjects, this 


would not affect test-retest reliability; only 
individual differences in such carry-over ef- 
fects would bias the test-retest reliability 
coefficient. Direct evidence of differential 
carry-over effects has been reported (Bakan 
& Thompson, 1962; Baker, Mishara, Parker, 
& Kostin, Note 2). This evidence indicates 
that bias exists and that the locus of this 
bias is in the later-session pretest trials. 

Tf later-session bias is the proper interpreta- 
tion of low test-retest reliability, then the 
KAE-modulator hypothesis is not disproved 
by poor reliability data, because this indicates 
nothing about the relationship between first- 
session KAE and validity measures. Rather, 
it simply shows that KAE is not a stable 
measure over successive administrations and 
that only first-session KAE scores will relate 
in the predicted direction to external validity 
variables. If one wishes to assess validity evi- 
dence for the KAE-modulator hypothesis from 
published reports, only one-session studies 
should be considered. (In the results section 
below, we describe some procedures for ana- 
lyzing data from a multisession study which 
explicitly takes the bias into account. These 
procedures would allow one to assess validity 
even when u:.og multisession KAE data.) 


Intermittent Validity in Light of 
Later-Session Bias 


Let us look at the validity evidence for the 
KAE-modulator formulation in light of our 
conclusions from the test-retest reliability 
literature. The findings for one-hand KAE 
studies are presented in Table 1. Findings are 
categorized as follows: (a) supportive-non- 
supportive, (b) one session-two or more SeS- 
sions, and (c) before 1969-after 1969. The 
early validity evidence (before 1969) is highly 
favorable (18 supportive, 5 nonsupportive 
studies). It is the recent literature (after 
1969) which is divided (4 supportive, 3 non- 
supportive) and which presumably provides 
the empirical basis for the critics’ charge of 
questionable and/or intermittent validity. 
Such a view is appropriate. As Gardner 
(1966) argues, it is probable that the more 
critical and skeptical researchers enter a new 
research domain later. Therefore, the KAE 


BAKER, MISHARA, KOSTIN, AND PARKER | 
TABLE 1 | 
VALIDITY STUDIES or THE KAE-MODULATOR FORMULATION i | 
Reference of study aa { 
Procedure Supportive Nonsupportive | 
Pre-1969 i 

ji 1. Eysenck (1955) 1. Becker (1960) 
Po 2, Petrie, Collins, & Solomon (1958) : 2. Broadbent (1961) 
Pain tolerance experiment 3. Norcross, Lipman, & Spit- 1961) | 

3. Petrie, Collins, & Solomon (1958) : 4. Howarth (1963) 
Sensory deprivation experiment 5. Brown (1965) 


4. Poser (1960) 
5. Sweeney (1966) 
6. Ryan & Foster (1967) 


7. Petrie (1967): Deafness (p. 68) 
8. Petrie (1967): Alcoholism (p. 92) 


‘Two or more sessions 1. Petrie (1960) 


2, Petrie, McCulloch, & Kazdin (1962) 


3. Petrie, Holland, & Wolk (1963) 


4. Compton (1967) 


5. Petrie (1967): Aspirin (pp. 46-48) 
moking (pp. 94-97) 

: Number of friends, nail 
biting and greater activity level 


6. Petrie (1967) 
7. Petrie (196 


(p. 100) 


8. Petrie (1967) : Hours of sleep preferred . 
00, 


p. 1 
9. Petrie (1967) : Schizophrenia (pp. 62-67) 


Post-1969 


One session 1. Sales (1971) 
Mishara, Baker, 

3. Sales (1972) 

4, Sales & T) hroop (1972) 


‘Two or more sessions 


& Kostin (1972) 


1. Morgan, Lezard lak, 

& Hilgard (1970 
2. Weintraub, Gre 
3. McDonald (197 


x Herzog (1973) 


Note, Total number of supportive studies = ber 
; studies = 21; total numi 
* Because we are dubious of all multiple-session find 
i ings (su 


Ben aiB Parker, 1974) reassessed 
studies of the last few 
most crucial, 

However, it will be noted 
the recent nonsupportive yal 
volve two or more 


years should be the 


that all three of 


charge of 


i 
from the appears 


Pportive, 0 
if one con- 


[14 Supportive—includin, 
(a) the delinquency study reanalyzed E 
Baker, Mishara, Kostin, & Parker (1974) 
which is cited in the footnote to Table is and 


for which sepa- 
-session KAE 
and 5 non- 
of bias sug-! 
to reanalyze 


(b) the schizophrenia 
rate supportive findings ‘~ fir 
were published (Petrie, i) 
Supportive]. Further, the iovic 
ests that it should be pos 
the honsupportive two-or-moi 
and find positive evidence of + 
evidence of the presence of bias. 


A Reanalysis of a Multisession Si 


In the most recent major nonsupportive 
Study, Weintraub et al. (1973) conducted 4 
“Ven-session KAE study and found that 
the first-session KAE scores correlated most 
Poorly with the scores from other sessions: 

vey discarded the first-session scores, de- 
vised a composite score based on scores from 
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> six sessions (these latter did 
ely high test-retest reliability), 
«i this score against a 25-item va- 
‘opnaire. Finding no relationship, 

} that their study offered no 

\E validity. From our perspec- 
tive, 1 KAE scores (the very ones 
they disc 1) are the ones most likely to 
relate to a dity scale. Therefore, we under- 
is of their data since this pro- 
comparison of these two op- 
ts regarding KAE reliability 


took a rea 
vided a dir 
posing vie\ 
and validity 

In under 
by Weintrat 
three questio 
hand KAE scx 
ality? (b) Do 
(and possibly 
later-session K 
sonality wher 
computing thi 
bias into accou 


» a reanalysis of the study 
al. (1973), we focused on 
(a) Are first-session, one- 
a valid measure of person- 
ias effects exist in second- 
r-) KAE sessions? (c) Do 
scores validly measure per- 
e procedures followed in 
cores explicitly take the 


METHOD 
Subjects 
1973) employed three groups 
in their original study. They 
on only one group, namely, 
re stimulated by a block wider 
block. Almost all one-session, 
s have used this large block 
his procedure was quite reason- 
orts a reanalysis of data from 
lata were available for 19 male 
vraduates and 2 subjects whose 
cermined from the records. The 
olunteers from local colleges. 


Weintraub et 
of 37 subjects c 
„reported validity 
those subjects wh 
than the standarc 
one-hand KAE 
type of stimulus, 
able, This paper 
that group. Valid 
and 14 female u 
sex could not b 
subjects were p 


Apparatus 


The apparatus c< 
test block, a 24 


nsisted of a 2-in. (5.08-cm) wide 
(6.35-cm) wide inducing block. 
and a 30-in. (76.20-cm) long tapered comparison 
wedge. The blocks were always to the left of the 
Pojet and the wedge was always to the right, with 
en wider end in front of the subject. During test- 
ng, a subject wore a large cardboard collar which 
Prevented him from ever seeing any of the apparatus. 


Procedure 


ae subjects were tested in seven different sessions 
wae 1, 2, 3, 4, 5, 10, and 15). During the first 
ites ae ae the subject began by feeling the entire 
ie o the wedge and making two practice judg- 
the fi S the Width of the test block. Otherwise, 

ollowing identical procedure was used in each 
of the seven sessions. 


Each subject made four preinduction ny =e 
judgments of the width of the test block A bk rd g? 
it between the thumb and forefinger of his left hand 
and simultaneously indicating the apparent width on 
the tapered wedge with the thumb and forefinger of 
his right hand. On half of the trials a subject began 
the judgments at the narrow end of the wedge sii 
(ascending judgments), and on half of the trials a “i 
subject began judgments at the wide end of the 
wedge (descending judgments), By mounting the ry] 
wedge in a track that allowed its point of objective i 
equality to be moved 24-in. (6.35-em) forward and — 
24-in. (6.35-cm) back with respect to the subjects 7$ 
body, wedge position was also alternated. The same 
fixed order of trials was used for each subject. (For 
further details, see Weintraub et al., 1973.) i 
The pretest was followed by a 60-sec induction 
period during which each subject rubbed the 2h-in. 
(6.35-cm) wide induction block with the thumb and — 
forefinger of his left hand and rested the right hand 
at his side. $ 
The test block was then replaced in the subject's s 
left hand and he made four postinduction (test) 
judgments of its width, The KAE score for each x45 
subject was obtained by subtracting the mean of the 
four pretest judgments from the mean of the four 
test judgments: A à 


KAE score = M test—M pretest. we 


The 2$-in. (6.35-cm) induction block induces a 
contrast effect, With the induction block greater than +% 
the test block, subjects generally tend to under- 
estimate the width of the test block after rubbing 
the induction block. However, reducers (the lowest 
third of the KAE distribution) underestimate the 
width maximally, and augmenters (the highest third) 
underestimate the width minimally or even slightly 
overestimate it. Thus, in general, large negative KAE 
scores represent reducing, and small positive and E 
small negative scores represent augmenting, Based on p 
a random assignment of the two borderline subjects, 
there were 11 subjects in the moderate group and 
12 subjects each in the augmenter and reducer groups: i 
After the KAE sessions, & 25-item questionnaire 


was mailed to each subject. The apo rr 
i h issues as pain reactivity, preferred am 
ee 5 partying, fingernail 


of sleep, drinking: smoking, i 
biting, studying, number of friends, birth De an J 


i i t. These items are 
childhood environmen! (described ear- 


ulated by Petrie and/or by 
1972). 


(e.g 
In many instances, 
similar in content) oF 
items have already rE x 
to KAE performance, xa ‘ 3 
Con has reported significant aa pete i 
ships between KAE and esas ional (ene 
tapped by 12 of the esas a e AURA 
a 3, 4, , 9, 10b, 10c, i 
A i 2 Tes POT has found ae a 
ER for responses to or behavior tapped by 
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TABLE 2 
SCORING SYSTEM FOR THE 18-ITEM INDEX 


: 1. How often do you drink coffee? 


Never SD 
Once in awhile (—.33) 
About 1 cup/day (.33) 


More than 1 cup/day (1) 


2. Sleep—the score for this item is the mean of the 
Scores for 2a and 2b below: 
a. Approximately how many hours of sleep do you 
usually get during each 24-hour period? 


9 or more (—1) 
Between 6 and9 (0) 
6 or less (1) 


b. Would you like to get more than 9 hours of 
sleep each night if you could? 
Yes (—1) 
No (1) 


3. How much do your eyes usually bother you when 
you come out of a movie theater? 
Not at all (1) 
A little (.33) 
Somewhat uncomfortable (—.33) 
Very uncomfortable (—1) 


4. About how many accidental injuries have you had 
in the past three years which required professional 
medical treatment? 


0 (—1) 
Any number (1) 
5. How much do you fear getting an injection? 
Not at all (1) 
A bit (.33) 
A fair amount (—.33) 
Greatly (—1) 


6. Do you enjoy parties that have lots oj 


Joy parti Í noise, action, 
and varied lighting effects? 


Yes (1) 
No (=1) 
7. What is your overall grade point average? 
3.5 to 4.0 =1) 
3 to 3.49 (=.33) 
2.5 to 2.99 (.33) 


Less than 2.49 (1) 


8. Doyou get edgy when you are alone for a long time? 


No (—1) 
Yes (1) 
9. Do you tend to bite your nails? 
No (—1) 
Yes (1) 
Note. A nega 


10. Friends—the score for this item is the 
scores for 10a, 10b, 10c, and 10d belo 

a. With how many friends at a time d 

share recreational activities (n 


1 usually | 


icluding 
dating)? 
lor2 —1) 
3or more (1) 
b. Do you have ten or more close mak nds? 
Yes (1) 
No (—1) 
c. Do you have ten or more close fema' friends? 
Yes (1) 
No (1) 
d. Do you find it easy to meet people } make 
friends? 
Yes 2 
No (—1) 
11. Do you often find yourself worrying L your 
health? 
Yes (—1) 
No (1) 
12. How often do you participate in sports 
Almost never (—1) 
Occasionally (—.33) 
Fairly often (.33) 
Very regularly (1) 
13. About how often do you drink alcohol? 
Rarely 
Occasionally 
Once a month or more (.33) 
Once a week or more (1) 
14. On school nights, about how any hours per night < 
do you study on the average? 
4or more (—1) 
3 to 3.99 (—.33) 
2 to 2.99 -33) 
Less than 2 (1) 
15. a. I would prefer a job which would require a lot 


of traveling. W 
b. I would prefer a job in onc 


16. a. I enjoy the thrills of watching ci d) 
b. I find car races unpleasant. Ta 


17a. I Prefer friends who are exciting and un- 
predictable. (1) 


b. I prefer friends who are reliable and 
predictable. (-1) 
18. Smoking 
a. About how many cigarettes per day do you 


smoke on the average? 


b. If you have £ver smoked cigarettes, at about 
what age did you start?a (if never, write 0) 


„negative score indicates an augmentin, v3 
a and tribution of “aye started smmaneet was divided a2 
smola 18b was combined into a single se; 


H 16), still smokes; 1; 16, still 
+33} started younger than 16, 5 S: 1; started at 16, 
older than 16, stopped smoking: —.66; never se AR 1 16, stopped smo 


sitive score indicates a reducing tendency, 

Started wo wmately equal groups, and informati for Items 
nieces b: a ormation for Ite: 
moes: 66; started Giger than 16, still smokes: 33; started CLONS: Started early (under 

a arte 


king: 0; started at 16, stopped smoking: 


Tire eee 


| 


| 
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additional items (see Table 2, Items 6 and 14). Sales 


and Throop (Note 3) found evidence for a relation- 


ship between KAE and one further item (see 
Table 2, Item 1), while Ryan and Foster (1967) 


| reported a finding which provides the basis for yet 


another item (see Table 2, Item 12). On both theo- 
retical and empirical grounds, Petrie (1967) has con- 
cluded that KAE predicts individual differences in 
tendencies to seek stimulation, a conclusion strongly 
supported by some more recent findings by Sales 
(1971, 1972). Seven of the questionnaire items (see 
Table 2, Items 2a, 10a, 10d, 13, 15, 16, 17), though 
not hitherto studied in terms of their relation to 


| KAE, tap such stimulation-seeking tendencies, The 


remaining two items in the questionnaire, which 
concern birth order and urban versus rural child- 
hood environment, were developed to assess the en- 
vironmental origins of the dimension tapped by KAE. 


Scale Scores 


When the responses of two small groups of sub- 
jects (12 augmenters, 12 reducers) were to be com- 
pared on individual items, it seemed improbable a 
priori to expect that significant differences would 
emerge more frequently than one would expect by 
chance (and, in fact, they did not). Therefore, a 
scale was created by reducing the original 25 items 
assessed by Weintraub et al. (1973) to 18 behavioral 
and psychological domains (see Table 2). Two items, 
birth order and size of community in which the 
subject grew up, were dropped because they do not 
measure psychological or behavioral characteristics. 
Ttems which tapped the same domain were averaged 
together (the four friendship items, the two smoking 
items, and the two sleep items) to obtain a scale in 
which each domain was equally weighted. The scores 
were designed to range from 1 to —1. Positive scores 
were arbitrarily assigned to responses hypothesized 
to reflect reduction, and a scale score was obtained 
by summing across domains. The scale scoring was 
derived prior to examination of the KAE scores, and 
all scoring was done by two independent raters also 
blind to KAE scores. (As a check on the decisions 
involved in devising the 18-item index in which each 
domain was equally weighted, scale scores were also 
derived from the original 25 items. All analyses re- 
ported below were also carried out with this 25-item 
index. The results overall closely parallel those for 
the 18-item index and therefore will not be reported.) 


RESULTS 
First-Session KAE Validity 


The 18-item scale scores of the two extreme 
groups (augmenters and reducers) were com- 
pared by ¢ tests to answer the first question, 
“Are first-session KAE scores valid?” This 
procedure was recommended by Mishara, 
Baker, Parker, and Kostin (1973) because of 
the possibility of nonlinear relationships be- 
tween KAE and validity indices. Augmenters 


(M = —.73,SD = 4.59) differed significantly, 
(23) = 2.56, p < .01, from reducers (M and 
SD = 3.59). (Since all hypotheses in this 
paper are directional, one-tailed tests of 
significance are used throughout.) It was 
concluded that first-session KAE scores are a 
valid predictor of the index. 

A correlation between KAE and the 18-item 
index was also significant, r(33) = —.36, p 
< .02. It was concluded that the KAE/18- 
item index relationship was linear, and only 
correlations were used in the later analyses. 


Bias Effects 


Two analyses were performed to answer 
the second question, “Do bias effects exist in 
later sessions of KAE?” Here, we define bias 
as some effect which changes the relationship 
between a particular score from the KAE task 
and scores from the validity variable. These 
analyses were guided by some earlier find- 
ings: Baker et al. (Note 2) presented an 
internal scaling analysis of a large (N = 65) 
two-session KAE study. In terms of the pat- 
terning of the correlations between pretest 
and test trials across the two sessions, this 
analysis showed that second-session pretest 
scores resembled first- and second-session 
test scores much more than they resembled 
first-session pretest scores. It was concluded 
that (a) there is a bias; (b) its locus is the 
second-session pretest; and (c) its form is 
such that second-session pretest scores are 
functionally similar to first- and second- 
session test scores and functionally dissimilar 
to first-session pretest scores. 

Based on the preceding findings, our first 
bias hypothesis was that later-session KAE 
scores should show less of a relationship with 
the validity scale than first-session KAE scores. 
If pretest scores come to resemble test scores 
in later sessions, then the difference between 
pretest and test scores (i.e., the usual KAE 
score) should increasingly reflect only error 
or noise. They thus should show a lower rela- 
tionship to the criterion variable. Table 3 pre- 
sents the correlation between the KAE score 
for each of the seven sessions and the 18-item 
index. First-session correlation is —.36; the 
median for Sessions 2-7 is —.035. The valid- 
ity correlation does show the predicted decre- 


TABLE 3 


CORRELATIONS BETWEEN SINGLE-Session KAE 
APTEREFFECT SCORES FOR A SPECIFIED SESSION 
AND THE 18-ITem Vauiprry SCALE 


Session 18-item index 


30" 
28 
2h 
—.04 
.08 
—.03 
22 


MOU we 


Mean KAE score for 


all sessions —.14 


Rp <.02, 


ment in later sessions. It may further be 
noted that the variability of these KAE scores 
decreases 27% from Session 1 (SD = 1.44 in, 
or 3.66 cm) to the later sessions (median SD 
for Sessions 2-7 = 1.05 in. or 2.67 cm), a 
finding consistent with the view that pretest 
Scores increasingly resemble test scores, 

The second bias analysis focused on the 
two components of the KAE score and at- 
tempted to pinpoint the locus of the bias. 
Table 4 shows the correlations between the 
18-item validity index and the pretest and 
test scores for each session Separately, 

It should first be noted that the table vali- 
dates the use of a test-minus-pretest change 
Score for the first-session KAE. The logic of 


minus pretest) (index) = —.36, p 
3]. This indicates that ea 
pendently contributes to m 
(aftereffect) score valid. 

If our formulation that later-session pre- 
tests resemble tests is correct, this initial dif- 
ference between pretest and test scores should 

diminish, Specifically, T pretest) (index) should 
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shift from positive to negative 
should always remain negative, a 
nitude of the difference betw: 
(index) and 7; test) (index) Should dec! 
initial (Session 1) value of .29 ( 
Each of these events occurred. A 
Positive to negative in Fipretest) 
after Session 1. We calculated 
T pretesty(index) for Sessions 2-7 
which clearly approximates the va 
first session T(test)index) Of —.16, W 
culated the median value for Sessic 
"ctest)cindex), Which is —.18. No 
change over sessions OCcUrs IN Fiteni 
is also noteworthy that 7, Protest) 
Tctest)(indexy approximate each ot) 
closely during the last four sessions 
Tt is further possible to Specify 
bias first manifests itself in the secor 
Pretest. Even though Pretest 1 
test 2 share 60% common variance () 
the difference between HeProtest 1) cin 
(Protest 2)(index) iS significant, ż(32 


P< .005. No parallel change occurs i: 


lationship between the test and the in 
the first two sessions (£ 
between forest t)cimaexy 
(KPA .89, ns), 


and Forests 


test for the di! 


Taken together, this set of findings is 
strikingly consistent with expectation: Bias 
has been shown, the locus of bias has been 
identified (later-session pretests), and the 
form of the bias Specified (pretest trials cuu 


. TABLE 4 
CORRELATIONS BETWEEN PRETEST AND TE 
(SEPARATELY ComPUTED) 
AND THE 18-ITEM_INDEX 


FOR A SPECIFIED $ 


Correlation Correlation 
between between 
pretest scores test scores 
z and 18-item and 18-item F'pretost 
Session index index = 
1 13 298 
2 —.18 =at 
3 —.09 13 
4 —.15 02 
5 =.10 — 05 
z —.19 “03 
Lf =.20 .06 
Mean score for 
all sessions Af DA 
5132) = 2.18, a 
ep op SP <.08. 
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to function like test trials). Although Baker, 
Mishara, Parker, and Kostin (Note 2) reached 
similar conclusions based upon internal analy- 
ses of the KAE task (i.e., no external validity 
criterion was present), here the same conclu- 
sions are supported on the basis of significant 
differences in the relationship to an external 
validity criterion. This constitutes a much 
more rigorous and stringent test of the bias 
formulation. Bias here does not mean random- 
ness but rather a change in relationship be- 
tween pretest scores and the validity variable. 


Later-Session Validity 


The third question asks, “Do later-session 
KAE scores validly measure personality when 
the procedures followed in computing these 
scores explicitly take the bias into account?” 
Weintraub et al. (1973) speculated that KAE 
as an individual differences measure should 
stabilize and even improve over time. If this 
view is correct, then appropriately defined 
scores using information from later sessions 
should validly tap whatever it is that KAE 
measures. 

Table 5 presents one kind of between- 
session KAE score which takes pretest bias 
into account and which may be compared 
with the more usual within-session KAE 
scores in Table 3. In this between-session 
KAE score, the size of the aftereffect is de- 
fined as the amount of change in size judg- 
ment from the Session 1 pretest to the Session 
n> 1 Test. |This score has been used occa- 
sionally by other researchers (e.g, Bakan & 
Thompson, 1962), although not to assess the 
validity of KAE.] 

If later-session pretests are systematically 
influenced by the first KAE administration, 
but the tests are not so influenced, we would 
then expect KAE scores computed by taking 
the difference between later-session test scores 
and the first-session pretest to be noticeably 
better than the usual within-session KAE 
score based on the corresponding testing ses- 
sion. In fact, this occurred. On the average, 
the validity correlations remain high over the 
course of the seven sessions for these between- 
session KAE scores, in which the Session 1 
pretest is used throughout (median 7 = —.415, 
p< .01, cf. Table 5). For the usual within- 


TABLE 5 
LATER-SESSION VALIDITY FINDINGS: RELATIONSHIP 
or BETWEEN-Session KAE Scores TO 
THE 18-ITEM INDEX 


KAE score 18-item index 


Test 2 — Pretest 1 — 24" 
Test 3 — Pretest 1 Att 
Test 4 — Pretest 1 — 43r 
Test 5 — Pretest 1 =i 
Test 6 — Pretest 1 — 40" 
Test 7 — Pretest 1 ane 
Median r — Ais 


Nole. Between-session KAE score = Testn>1 — Pretesti, 
*p <.10, 

Dp <01. 

*** p < 005. 


session KAE scores (as reported in Table 3), 
the relationship with validity disappears over 
the course of the later sessions (median r for 
Sessions 2-7 is —.035, ns). This analysis 
thus provides evidence that later-session test 
scores contribute at least equally well as 
first-session test scores in the KAE-validity 
relationships. (The amount of explained vari- 
ance associated with the median correlation 
for these between-session KAE scores is 
17%; the amount associated with Session 1 
within-session KAE scores is 13%.) 

Finally, variability associated with these 
between-session KAE scores increases 43% 
from the first of these scores (SD for be- 
tween-session KAE for Test 2 minus Pretest nt 
= 1,14 in. or 2.89 cm) to the later sessions 
(median SD for the remaining five between- 
session KAE scores = 1.63 in. or 4.14 cm), 


Some Further Later-Session Findings 


If later-session pretest scores are function- 
ally similar to test scores, then the difference 
between one of them and Pretest 1 (residual 
KAE score) should correlate with the validity 
index in a manner similar to the between- 
session KAE scores just reported. Table 6 
presents these analyses. As a set, these scores 
were found to predict validly the criterion 
variable. The median correlation between 
these residual KAE scores and the 18-item 
index (r = 385, $ < -02, cf. Table 6) is ap- 
proximately equal to the correlation found 
between the within-session KAE scores for 
Session 1 and the 18-item index (.36, see 


Table 3). 


10 
TABLE 6 


FURTHER Later-Sxssion Bias FINDINGS: 
RELATIONSHIP or REsipuAL KAE SCORES 
TO THE 18-IteM INDEX 


KAE score 18-item index 
Pretest 2 — Pretest 1 e 
Pretest 3 — Pretest 1 —.35* 
Pretest 4 — Pretest 1 mort 
Pretest 5 — Pretest 1 — 29 
Pretest 6 — Pretest 1 pares 
Pretest 7 — Pretest 1 — 41 

Median r a. i 
— Pretesti. 


Note. Residual KAE score = Pretesta>: 
~p <05. 


For these KAE scores reported in Table 6, 
variability increases just as it did for the 
between-session KAE scores reported in Table 
5: The median SD for the five later scores, 
1.40 in. (3.56 cm), is 24% larger than the 
1.13-in. (2.87-cm) SD observed from the very 
first of these scores (Pretest 2 — Pretest 1); 
The finding that variability for both types of 
intersession scores (Tables 5 and 6), which 

_ avoid bias, increases whereas variability for 
the usual within-session KAE score decreases 
is consistent with the view that bias makes 
later-session pretest Scores more like test scores, 

The Pretest, 5 ;— 


sented has Occasionally been used by other 


_ Both types 
of KAE score 
with the 18-it 


(residual and between-session) 
s, then, correlate significantly 
em index, However, since both 
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sets of scores share Pretest 1 as a c ponent, 1 
they are not independent analyses. -till the | 
data support the conclusion of later-: ion va- 
lidity with appropriately defined K scores, 1 
Discussion i 
Session 1 KAE is valid. These KA scores | 
predicted self-reports of stimulation ceking | 
(e.g., coffee drinking, alcohol consi ption, 
enjoyment of parties), pain toleran (eg, 
fear of injection, eyes bothering on: when 
coming out of a theater), and tolera: e for 
stimulus deprivation (e.g., edgy wher lone, 
number of hours spent studying). F` «ther, 
later-session KAE scores are biased. The 
locus and form of the bias have been sp: ifed. 
It is in second- (and later-) session p test. 
When this bias is ignored, by using hin- 
session (Sessions 2-7) KAE scores, the | AE- 
validity relationship disappears, but whe: bias 
is taken into account in devising res 
(between-session and residual KAE). the ` 


KAE-validity relationship persists ove: ‘he 
Seven sessions. All this was found in the e 
data in which Weintraub et al. (1973) id 
no support for KAE validity. 

These findings converge to support our cen- 
tral contention that one must focus 
interrelationship between test-retest reliabil- 
ity, true test reliability, and validity to com- 
prehend apparently discrepant and paradox- 
ical KAE findings, Baker, Mishara, Parke 
and Kostin (Note 2) developed thi 
ment at length, and it is summarized in 
introduction to this study. Loy 
reliability does not necessarily i 
measure. Bias due to practice effects is ; 
alternative explanation. Based on an internal 
Scaling analysis of a two-session KAE study, 
Baker, Mishara, Parker, and Kostin (Note 2) 
concluded that second-session pretest scores 
resemble first- and Second-session test scores 


much more than they resemble first-session 
Pretest scores, 


on tne 


lidat 


evidence for the 
KAE 


onstrates that KAE 
a self-report scale 
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variety of behaviors and preferences—pro- 
vided, of course, that one takes bias into 
account by restricting analyses to the first 
session or by using multiple session scores 
that recognize the bias. Overall, these findings 
show striking consistency both with our a 
priori formulations of the interrelationships 
between KAE validity, test-retest reliability 
and bias, and also with the earlier findings 
from the internal scaling analysis. 

Weintraub et al. (1973) did not do this. 
Some of their observations about mean dif- 
ferences (general effects), however, are con- 
sistent with our own observations about indi- 
vidual differences. For example, they noted 
that the KAE induction procedure causes 
semipermanent or permanent changes in later- 
session pretest scores. This finding parallels 
our contention that there is an enduring 
change in the pretest-validity relationship 
after Session 1. Weintraub et al. further ob- 
served that as the pretest values decrease 
across days, the KAE approaches zero. This 
general effect parallels both the KAE-validity 
relationships across sessions (it disappears 
using the within-session KAE score) as well 
as our argument regarding why this should 
þe the case (i.e., pretest scores become func- 
tionally equivalent to test scores in terms of 
their relationship to the external validity 
criterion). 

We eschew rejecting KAE as a measure of 
individual differences as Weintraub et al. 
(1973) and others have done. Rather, we feel 
that we have correctly identified the phe- 
nomenon of later-session pretest scores being 
similar to test scores as reflecting bias and 
that we have found the proper methodo- 
logical approach to predicting from multi- 
session KAE scores. Although in other con- 
texts one might choose to reject a biased 
score, such action is not indicated here. The 
bias does not reflect error, nor is it unrelated 
to the validity criterion. Rather it takes this 
form: Once the initial aftereffect procedure 
has been conducted, one component (pretest) 
of the KAE score (test minus pretest) be- 
comes functionally equivalent, in terms of its 
relationship to validity variables, to the other 
component (test) of the KAE score. That 
is, the Session 1 aftereffect procedure induces 


a long-lasting carry-over effect which differs 
from person to person. Under these condi- 
tions, there is no need, and it makes no sense, 
to reject these scores; by recognizing what 
has occurred, the proper remedy follows, and 
all scores can be used appropriately to pre- 
dict validity criteria (cf. Tables 5 and 6), 

The test-minus-pretest KAE scores used in 
this and most other KAE studies constitute 
what are psychometrically known as “change 
scores” or “gain scores.” Since there is wide- 
spread controversy both pro (e.g, Benjamin, 
1973) and con (e.g, Cronbach & Furby, 
1970; Etaugh & Etaugh, 1972; O'Connor, 
1972) among psychologists and psychometri- 
cians regarding the appropriateness of their 
use in individual differences studies, some 
brief comment seems indicated. One common 
objection to the use of gain scores is that they 
are not “reliable.” Expressed psychometri- 
cally, this statement means that change scores 
primarily reflect errors of measurement. The 
finding here that the KAE gain scores sig- 
nificantly account for variance on a personal- 
ity scale—an instrument which is clearly 
quite different in form and content from 
the KAE procedure—demonstrates that these 
scores do not simply reflect error variance; 
indeed, the amount of explained variance 
compares favorably with successful attempts 
at prediction in personality and cognitive 
style research generally, Here, the usual cri- 
tique of gain scores is not applicable: Though 
gain scores from tasks other than KAE may 
predominantly reflect error variance, it is 
clearly not the case for KAE. 

A second criticism of the use of gain 
scores as a predictor variable is that they are 
misleading and unnecessary: The same valid- 
ity findings would emerge if either one of the 
constituent scores (in this case, pretest or 
test) were used as the predictor. In } the 
present study, the correlation of the 18-item 
index with the (a) Session 1 pretest scores was 
13 and with the (b) Session 1 test scores 
was —.16. Clearly the validity findings for 
the Session 1 KAE gain score, the most 
widely used in the literature, does not here 
reflect variance associated with the pretest 
or test trials alone, but rather variance 
uniquely associated with the two scores com- 
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bined. Nor can the validity findings from 
the between-session KAE scores be explained 
as reflecting variance associated only with 
the Session 1 Pretest score (which was com- 
mon to all KAE scores used in Table 5); 
these findings too require both components, 
This is all that a legitimate use of a gain 
Score as a predictor variable would seem 

to require, 
The major import of this study is not in 
the particular findings of validity and bias 
but rather in terms of its implications for the 
area of KAE individual differences research: 
Tt destroys the logical and empirical basis for 
the current critical consensus which has con- 
signed the KAE measure to oblivion as a 
measure of individual differences. We feel 
Most strongly that KAE demands an intensive 
research effort, rather than abandonment, 
and that future research should be guided by 
the conceptual framework and methodological 
formulation offered here. Our own research 
Program continues with (a) additional valid- 
ity studies (Baker, Mishara, Parker, & Kostin, 
Note 4; Mishara & Baker, Note 5) and (b) 
new models of the processes underlying the 
KAE task. This latter research, of course, 

is in the most tentative Stage 

Mishara, Schwartz, & Kostin, Note 6) 
In conclusion, the original excitement and 


interest which many psychologists showed in 
the KAE task and the 


ation, April 1971. 
» Mishara, B. Ty Pa 


manuscript, C; 

Mellon University, 1971, es 

4. Baker, A. H. Mishara, B. L, Parker, L, & 
Kostin, I. W, Menstrual cyle affects a bercep 


« Mishara, B, L., & Baker, A, H, Stimulus intensity 
TA berceptual-cognitive approach to 


life style in the elderly (Research Bu 74-35) 
Princeton, N.J.: Educational Tes Service, 
1974. 

6. Baker, A, H., Mishara, B., y i ae 
Kostin, I. W. Kinesthetic aftere fect ne effect 
or two effects? Paper presented at annual 
meeting of the Psychonomic Soci: Boston 


Massachusetts, November 21, 1974. 
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are independent, High 
behaviors do not in ly 
. For example, Birchler et al.'s 
ding of higher rates of positive be- 
nondistressed compared with dis- 
ples does not imply that reciprocity 
for the two groups of couples, Al 
distressed couples may seem to be 
reciprocating Positive behavior more fre- 
quently than distressed 
be an artifact of the 
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ted is that a par- 


). For example, 


bability (P) of a conse- 
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quent positive wife code (W+) given an ante- 

cedent positive husband code (H+) must be 

significantly greater than the nonconditional 
bility of occurrence of W+. This means 

t a knowledge of the antecedent code, H+, 

dds significantly to the ability to predict the 

occurrence of a W+ code, over and above 
prediction, based simply on knowing the rela- 
ive frequency of occurrence of W+. Symboli- 
all at must be demonstrated is that p 

i+) >p (W+) (Raush, Barry, 
|, & Swain, 1974). In the present investi- 

hese information analyses were per- 
| to assess the ability of reciprocity 
ables to discriminate distressed from non- 
ed marriages. 

listinction between high rates of posi- 

des and reciprocity has been ignored in 

inical literature on family interaction. 
umple, Alexander (1973) found that 
ion across families between parent- 
pportive behavior and child-to-par- 
ive behavior was significantly dif- 

ı zero (father-son r = .69, p < .05; 
n r=.59, p<.05). Contrary to 

nder’s conclusions, these correlations do 

nply reciprocity, A family with high 
f supportiveness could be distributing 

-haviors noncontingently throughout a 

on, In this case the correlations ob- 

would be high due to different base 
ross families, but there would still be 

e of reciprocity. 

e base rate-reciprocity issue is also im- 
for distinguishing between behavior 
and social learning theories of mari- 

mflict resolution. Birchler et al. (1975) 

a mix of language from both theories. 

They demonstrated that distressed couples 

eimi: lower rates of positive codes and higher 

rates of negative codes than do nondistressed 
couples. This does not justify their view of 
the positive codes as “social reinforcements.” 

It would be necessary to show that a particu- 

lar code on the part of one spouse affected the 

probability of occurrence of a particular code 
of the other spouse. 

Reciprocity hypotheses, therefore, have yet 
to be adequately tested. Behavior exchange 
hypotheses of marital conflict resolution have 
also been ambiguous in making specific pre- 
dictions. There has, in fact, been little re- 


search on marital interaction from a behavior 
exchange standpoint. For example, standard 
game theory experiments, which are commonly 
associated with behavior exchange theory, 
were not applied to the study of the marital 
dyad until Speer (1972). Speer studied 60 dis- 
tressed couples seeking counseling in a clinic 
and 60 nondistressed couples in an extension 
of the Prisoner’s Dilemma Game, None of the 
four response variables in Speer’s study dis- 
criminated the two groups on four forms of 
the game. 

Perhaps two reasons for the lack of success 
of Speer’s measures are that (a) behavior ex- 
change experiments reduce the behavioral 
repertoire of the couple and (b) they ex- 
ternally control the payoffs of specific behav- 
iors that they exhibit (Gergen, 1969), Another 
potential problem with game theory experi- 
ments applied to the marital dyad has to do 
with how the payoff matrix is interpreted. 
There are many alternative ways of interpret- 
ing a payoff matrix. Behavior exchange theory 
proposes that a dyad engaged in a mutually 
satisfying relationship will exchange behaviors 
that have low cost and high reward to both 
members (Thibaut & Kelley, 1959). One in- 
terpretation of the reward-cost notion is that 
producing a specific behavior has a specific 
reward and a specific cost to the person pro- 
ducing the behavior (Gergen, 1969, p. 36). An 
alternative interpretation of the payoff matrix 
is that in a satisfying dyadic relationship, a 
person will receive behaviors that have high 
reward for himself or herself. This implies 
that a person will perceive a relationship as 
satisfying to the extent that he or she codes 
the behaviors received as a positive. 

Another issue regarding the way payoffs are 
assigned is the comparison level for alterna- 
tive relationships. Presumably, a behavior that 
will be coded as positive in one relationship 
could be coded as negative in another rela- 
tionship in which the receiver’s comparison 
level is much higher. The fact that we do not 
know a spouse’s comparison level for REE 
tives suggests a phenomenological metho! i ie 
measuring payoff. To test this interpreta 3 
of the payoff matrix, then, it Is penne p 
design a measurement procedure in whi B 

ositivity of the behavior received wou è 
E directly by the receiver. In this OE 
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TABLE 1 


UNIVARIATE F Ratios AND MEANS FOR RELATIVE 
FREQUENCY OF Spouse’s CODING OF PARTNER’S 
BEHAVIOR, STUDY 1 


Nondis- 

Distressed tressed 

Variables Fe average average 
Super negative 2.06 -06 -01 
NANE 10.66** 24 -11 
Positive 7.47* .34 50 
Super positive 7.36* 02 7 


a Neutral is not included in the analysis of multivariate F; 
all five variables form a linearly dependent set, since they sum 
to 1,0, 

b degrees of freedom = 1, 28. 

*p <.05, 


** p < 01, 


sents means and univariate F ratios for dis- 
tressed and nondistressed couples. 
These results are consistent with the data 
obtained by Birchler et al. (1975) for ob- 
servers’ coding of the couple’s behavior, Birch- 
ler et al. found that distressed couples were 
more likely to emit negative and less likely to 
emit positive codes than nondistressed couples. 
Furthermore, the results support a communi- 
cation deficit model of marital distress. By 
looking at Table 1 and Table 2 in combina- 
tion, it can be seen that the distributions of 
intent and impact are similar for nondis- 
tressed couples. This was not the case for dis- 
tressed couples. Both groups of couples were 
nearly identical in the way they intended their 
behavior to be coded. Therefore, for dis- 
tressed couples, their behavior was coded as 
far less positive than was their intention. 
Conflict effect. To check the low-high con- 
flict manipulation, the videotapes of four 
couples, two distressed and two nondistressed, 
were randomly selected and coded for agree- 
ment and disagreement. The first 2 minutes of 
each task were coded, with coders tallying 
each occurrence of an agreement or disagree- 
ment, There were four pairs of coders; each 
pair coded only one couple. Coders were blind 
to the task hypothesis and as to whether the 
couple whose tape they were coding was dis- 
tressed or nondistressed, The total number of 
agreements and disagreements were calculated 
separately for each coder, summing over high- 
and low-conflict tasks. Coders were first 
trained using an interaction category system 
developed in our laboratory (Gottman 
Notarius, & Markman, Note 1) and reached 
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criterion on an achievement test, which ` sted 
their knowledge of agreement and dis 
ment codes. One coder was designate: 
coder and the other the reliability che. ker, 


Correlations between coder and relia lity 
checker over the four couples were .9.' for 
agreement and .93 for disagreement. D the 


low-conflict tasks, there were averages of 0.343 
agreements and 6.92 disagreements; in ‘he 
high-conflict tasks, there were averages oi 38 
agreements and 8.00 disagreements, y*(\\ = 
4.23, p < .05. In research on family int 
tion, agreement and disagreement have bem 
shown to be the best consistent discrimina! «ts 
between distressed and nondistressed fami es 
(Riskin & Faunce, 1972). Agreement— is- 
agreement ratios less than one are charac! er- 
istic of distressed families, and ratios gres ter 
than one are characteristic of nondistres ed 
families. For the low-conflict tasks, the agr +e- 
ment—disagreement ratio was 1.49, whereas ‘or 
the high-conflict tasks it was .67. 

To assess the effect of high- or low-conf ct 
tasks, the five received impact codes were con- 
sidered on a 5-point Likert scale ranging from 
1 (super negative) to 5 (super positive), The 
same Likert scale was constructed for the i1- 
tent codes, Scale scores were separately ave <- 
aged for the three low-conflict tasks and tie 
two high-conflict tasks. An analysis of vari- 
ance was performed on the impact data and 
on the intent data, with two levels of distress 
(distressed—nondistressed), two levels of sex 
(husband-wife) , and two levels of conflict 
(high-low). As expected, there was a signifi- 
cant F ratio for the impact data on the 
distress factor, F(1,56) = 15.37, p< .00i. 
Distressed couples averaged 3.04 and nondis- 
tressed couples averaged 3.52 over all tasks. 


TABLE 2 


Univariate F Ratios AND MEANS ror RELATIVE 
FREQUENCY or Spousts’ Intent, 
Stupy 1 


ont x pende: 

s istres se 

C Dienmet Fad 
Super negative 18 4 

3 é 00 i 
Negative 1.72 09 0S 
Positive 33 49 55 
Super positive 12 05 ‘03 
aD 8 = igni xf 
ii is mo ee = 1, 28. All Fs are nonsignificant. Neu- 


in the analysis of multivariate F; all five 


variables i i 
iables form a line: et, since they sum to 1.0. 


arly dependent s 
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There was not a significant conflict effect, 
F(1, 56) = 3.86, p > .05, although the means 
were in the predicted direction (high-conflict 
M = 3.17; low-conflict M = 3.39), There was 
no significant sex main effect or any signifi- 
cant interactions. On the intent variable, there 
were no significant effects for any factor or for 
the interactions. 

” It was thus possible to discriminate dis- 
tressed from nondistressed husbands and wives 
on how positively they coded their spouse’s 
bei wvior regardless of the level of conflict the 
tast induced. These results support the Jacob 
an Davis (1973) findings and extend these 
findings to high-conflict tasks. 

Reciprocity, Four analyses of covariance 
were performed to compare $ (W+/H+) 
with p (W+), p (H+/W+) with p (H+), 
p (W—/H—) with p (W—), and p (H—/ 
W-—) with p (H—) for distressed and nondis- 
tressed couples. Probabilities were separately 
computed for each couple. In each case the 
unconditional probability was the covariate. 
Because of the relatively low frequencies of 
the super-positive and super-negative codes, 
these were combined with positive and nega- 
tive codes, respectively, for the sequential 
analysis, The test of the reciprocity hypothe- 
sis of distressed versus nondistressed marriage 
is the significance of the group F ratio on that 
portion of the conditional probability that 
cannot be predicted by the unconditional 
probability. For example, the residual in re- 
gressing p (W+/H+) on $ (W+) must be 
greater in nondistressed couples than in dis- 
tressed couples. This means that knowing that 
the husband has just coded the wife’s behavior 
as positive increases the prediction over the 
base rate of W+ that the wife will subse- 
quently code the husband’s behavior as posi- 
tive. The direction of the prediction is ob- 
viously reversed for negative reciprocity ; that 
is, there should be more negative reciprocity 
in distressed than in nondistressed couples. 

Table 3 presents the results of the analyses 
of covariance. There is some evidence that 
there is more positive reciprocity in nondis- 
tressed couples than in distressed couples. 
Though the degree of gain in prediction from 
conditional to unconditional probabilities is 
small for nondistressed couples, the analysis 
of covariance controls for the possibility that 
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TABLE 3 


Variable 


; Nondis- Reciprocit 
Distressed treased X Groups K» 


Positive reciproci 


p (WH) 
? W +/H+) 
H 


p (H+) 
p (H+/WH) 


Negative reciprocity 


(w— 


b 
p (H—/W-) 


ity 

340 M1 4.68" 
430 561 

387 635 192 
350 687 

298 161 26 
-450 167 

291 063 3.04 
398 075 


a p (W+) = the probability of a positive wife code, p (W =) 


= the probability of a 
ability of a positive hu 


negative wife code; p (H+) = the prob- 
teat ode, (H =) = the probabillty of 


a negative husband code. 
b Calculated by covariance analysis, with unconditional base 


rate as the covariate, 
*p <.05, 


when unconditioned base rates are initially 
low, greater gain in prediction may be more 


likely. 


Method 


Srupy 2 


Subjects and selection. An independent sample of 


14 clinic and 16 n 


onclinic couples was recruited for 


participation in the research project. All clinic couples 
were seeking therapeutic assistance for marital prob- 
Jems at either of two community mental health 


centers; nonclinic 
lease that briefly 
and nonclinic coup! 
Precisely the same 
MRI as in Study 


couples responded to a press re- 
described the project. Both clinic 
les were paid $10 for participation. 
cutoff scores on the Locke-Wallace 
1 were used to select 12 distressed 


and 8 nondistressed couples for analysis; all 30 


couples completed 
bands and wives i 
cantly differ in a 


the experimental procedures, Hus- 
n the two groups did not signifi- 
ge, educational level, number of 


years married, or salary. Couples in the present study 
were on the average 32.50 years old, had been mar- 


ried an average of 
age of 13.95 years 
age salary of 11 


9.44 years, had completed an aver- 
of schooling, and earned an aver- 
44 thousand dollars. Thus, the 


couples in Study 2 had been married approximately 


6 years longer than the coup! 


approximately 2 y 
difference between 


Jes in Study 1 and had 
ears Jess of schooling. The major 
the samples in the studies seems 


to be the length of time the couples were married. 
Procedure. Based upon the results of Study 1, the 


procedure was 
were selected: 


(IMC) as a high-conflict task ani 


modified as follows: First, two tasks 
the Inventory of Marital Conflict 


d the food task as 


a low-conflict task. The IMC was selected because 


it is a standardize 


high degree of con 


(nearly all couples 
and it could be 
vignettes to ensure 


d task, it appeared to induce a 
flict in Study 1 in both groups 
fought considerably on this task), 
expanded to include additional 
a sufficient sample of interactions 
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TABLE 4 


UNIVARIATE F RATIOS AND Means FOR RELATIVE 
FREQUENCY OF SPoUsE’S CODING OF PARTNER'S 
BEHAVIOR, STUDY 2 


Nondis- 
Distressed tressed 
Variables pS average average 
Super negative 2.98 .05 D 
Negative 3.82 AS : 
Positive 19.87* 30 rh 
Super positive pas 12 AS 


a Neutral is not included in the analysis of multivariate F; 
all five variables form a linearly dependent set, since they sum 
to 1.0. 

v degrees of freedom = 1, 36. 
+p < 001. 


for stable estimates of conditional probabilities. The 
food task was selected because it induced little con- 
flict in either group in Study 1. The IMC was ex- 
panded to include three additional vignettes. 

All sessions were conducted at one of two com- 
munity mental health centers nearest the couple’s 
residence, It may be the case that the lack of differ- 
ences on intent variables in Study 1 may have re- 
sulted from the couples not understanding that they 
were to code how they intended their messages to be 
taken by their spouses. More explanation and a 
specific example was provided for the intended 
positivity codes of the talk table than had been 
provided in Study 1. 


Results and Discussion 


Multivariate analysis of variance of impact 
data, with distressed-nondistressed as a be- 

tween-subjects factor and husband-wife as a 

within-subjects factor resulted in a significant 

effect for the distress factor, Wilks-Lambda 

F(4,33) = 7.01, p< 001, and no significant 

sex or Sex X Distress interaction, Table 4 pre- 

sents the univariate F ratios that contributed 

to the distress multivariate F ratio, Nondis- 
tressed spouses were more likely to code their 
partner’s behavior as positive than their dis- 
tressed counterparts, These results replicate 
those of Study 1, A comparison of the multi- 
variate F ratios Suggests that the results are 
even stronger here, This may be a function of 
the differences in the length of marriage in the 
two studies, since marital happiness is known 
to decrease with length of time married within 
the age ranges of the current two studies 
(Burgess et al., 1971), 

Multivariate analysis of variance of spouses’ 
intent resulted in no significant distress, sex 
or Sex X Distress F ratios. There were no 
significant differences in intention for either 
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husbands or wives on any category. This pot- 
tern of no differences is similar to that cb- 
tained in Study 1. For comparison with Stu ly 
1, Table 5 presents the means and univari: te 
F ratios for distressed and nondistres: èd 
couples. 

These results lend strong support to a co: 1- 
munication deficit explanation of marital dis- 
tress: In the two studies, although distress: d 
and nondistressed couples do not significanti y 
differ in the way they intend their messages to 
be received by their spouses, they do signif - 
cantly differ in how the messages are actually 
received. 

Conflict effect. Conflict effects were assessed 
in the same manner as in Study 1. An analys.s 
of variance on the impact data found no si; - 
nificant main effect for conflict, F(1, 72) 
2.29, p > .05, although, again, results were i1 
the predicted direction (high-conflict M 
3.48, low-conflict M = 3.57). This result js 
consistent with that of Study 1, As expecte: 
there was a significant F ratio for the impac 
data on the distress factor, F(1, 72) = 14.57 
p < .001. Distressed couples averaged 3.32 
and nondistressed couples averaged 3.72 ove 
all tasks. 

There was a significant Distress X Conflict 
interaction for the impact data, F(1,72) = 
8.44, p < .01. It was easier to discriminate 
distressed from nondistressed couples on the 
high-conflict task than on the low-conflict 
task. Distressed couples averaged .70 lower 
on impact scores than nondistressed couples 
on the high-conflict task; distressed couples 
were .13 higher on impact scores than nondis- 
tressed couples on the low-conflict task. There 
were no significant main effects or interaction 
effects on the intent variable. 


TABLE 5 
Univariate F Ratios AND MEAN 


3 S FOR RELATIVE 
FREQUENCY OF Spo 


Intent, Stupy 2 


Variables Fe : arenes A 
Super negative 66 EE 
; 00 ) 
Negative “80 04 v 
Positive 5.43* 41 “60 
Super positive 323 M4 10 


* Neutral is 
all five variabl 
to 1.0, 

Degrees of freedom = 
z Degrees lom = 1, 36, 


not included in the an; 
es form a linearly depen 


alysis of multive 
ndent set, since they s 
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The presence of Group X Conflict interac- 
ion effects in the second study and their ab- 
ence in the first study is interesting. It may 
a that the food task is a different kind of 
isk for the couples of Study 2, since the hus- 
nds in this study are working and hence 
re less likely to have a role in food prepara- 
ion than husbands in Study 1, who are more 
ikely to be in school and share in the house- 
ork, The pilot study with both tasks, men- 
oned in the Procedure section of Study 1, 
yund no significant sex differences in knowl- 
lge on either of the food or NASA tasks; 
vever, a subsequent pilot study with college 
lents found significant sex differences in 
ratings of confidence (on a 9-point scale) 
solution for the NASA task, £(68) = 
< ,04, with females less confident than 
it may be that working husbands in 
2 have as little confidence in their 
ledge of food as the female undergradu- 
had of their knowledge of space. Both 
ips of husbands may have yielded more on 
task and consequently been perceived as 
re positive by their wives. It may also be 
e case that the length of marriage factor 
4s related to the obtained interaction effects. 
’erhaps couples who have been married longer 
(Study 2) have learned to exchange more 
positive messages in low-conflict situations 
and to avoid situations of high conflict. This 
s consistent with the clinical picture of the 
stable-unsatisfactory” marriage (Lederer & 
jackson, 1968) or the “united-front” couple 
Kramer, 1968), who can present a satisfac- 
ory image of themselves to the public in so- 
cial situations but are unable to hide behind 
this image when discussing real marital issues. 

Overall, there seems to be evidence that the 

conflict level of the task influences how 
spouses code each other’s behavior. In the 
first study it was equally easy to discriminate 
distressed from nondistressed couples whether 
they were discussing an unresolved personal 
issue or rank ordering a list of dog breeds. 
In the second study couples could be dis- 
criminated only on the high-conflict_ tasks. 
These results may account, in part, for the 
difficulty of obtaining consistent differences 
across studies between distressed and nondis- 
tressed families (Jacob, 1975). Tt would be 
wise to sample systematically from low- and 


TABLE 6 
RECIPROCITY ANALYSES, STUDY 2 


o oa Reciprocity 
Variables N a L 
Positive reciprocity 
p (W+) 448 700 222 
b (W+/H+) 536 ‘787 
p (A+) 405 “701 58 
p (H+/W+) 1512 756 
Negative reciprocity 
(w—) 223 AL 61 
p (W—/H-) 354 “195 
(H—) 235 079 AS 
p (H—/W-) 331 „121 


ap (W+) = probability of a positive wife code, p (W =) 
= probability of a negative wife code, p (H +) = probability of 
a positive husband code, p (H—) = probability of a negative 
husband code. 

b Calculated by covariance analysis, with unconditional base 
rate as the covariate. 


high-conflict tasks in future investigations of 
couples and family interaction. The results of 
the present investigation would suggest that 
high-conflict tasks are more reliable for dis- 
crimination between distressed and nondis- 
tressed couples. 

Reciprocity, Table 6 presents the results of 
the reciprocity analyses. Reciprocity variables 
did not discriminate between distressed and 
nondistressed couples. In addition, the one 
significant positive reciprocity effect of Study 
1 was not replicated, 

The results of these two studies must bring 
into question the current marriage counseling 
image of positive reciprocity as characteristic 
of nondistressed marriages and not character- 
istic of distressed marriages. The high base 
rate of behavior that is positively coded by 
observers or by spouses may seem to be 
equivalent to reciprocity, but it is not. The 
data from the present investigation support 
a “bank account” model of nondistressed mar- 
riage rather than a reciprocity model. In a 
bank account model, a nondistressed marriage 
differs from a distressed marriage in that there 
are more positive “deposits” than negative 
“withdrawals.” In a nondistressed marriage 
the consequent positive impact codes are not 
contingent upon the spouse’s antecedent cod- 
ing. Perhaps it is precisely this lack of re- 
ciprocity in a context of high positive ex- 
change that characterizes stable positive 1n- 
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The results of the present investigation have 
demonstrated the usefulness of the talk table 
as a means for operationalizing behavior ex- 
change theory. Results obtained are generally 
consistent with more expensive coding studies. 
Furthermore, the power of variables obtained 
by the couple coding their own behavior to 
discriminate between distressed and nondis- 
tressed couples is somewhat greater than that 
obtained by observer-coding studies. This re- 
sult may have some theoretical import. Per- 
haps the differences between observer and 
spouse coding could account for some of 
the unreliability inherent in present coding 
systems, If one observer focuses on the 
facial expressions of the sender of a message, 
a positive code could result from a read- 
ing of intention; if the other coder focuses 
on the message’s impact on the receiver, a 
negative code could result from a reading of 
the impact of the message. It would be fruit- 
ful to control the cues that observers use and 
to study those instances when observers and 
spouses are discrepant. This would be one way 
to study the development of a private message 
system in close relationships in which many 
messages do not mean to the stranger what 
they do to the person for whom they were in- 
tended, Work along these lines is in progress 
in our laboratory. 

One limitation of the present investigation 
deserves mention. Because of the arrange- 
ments with referring agencies and the nature 
of the advertisement, both couples and in- 
terviewers were aware of which couples were 
seeking or starting marriage counseling. The 
MRI scores were, however, not available to 
either the interviewers or the couples, This 
aspect of the design of the Present investiga- 
tion introduces an experimenter expectancy 
criticism and the possibility that couples may 
be presenting themselves in order to be con- 

sistent with their classification. Subsequent 
investigations could improve on the current 
procedures by advertising for couples without 
recruiting couples from therapeutic agencies, 
Couples could be split on MRI scores, and the 
interviewers could remain blind to a couple’s 
classification. Such a procedure, however, may 
draw different samples of couples than the 
present investigation, which used a conver- 
gence of two definitions of marital distress. 


REFERENCE NOTE 


1. Gottman, J., Notarius, C., & Markman, H. Th 
couples interaction scoring system (CISS). Un 
published manuscript, 1976. (Available from Johr 
Gottman, Department of Psychology, Universit 
of Illinois, Children’s Research Center, 51 I 
Gerty Drive, Champaign, Illinois 61820.) 


REFERENCES 


Alexander, J. F. Defensive and supportive communi- 
cations in family systems, Journal of Marriage and 
the Family, 1973, 35, 613-617. 4 

Azrin, N. H., Naster, B. J., & Jones, R. Reciprocity 
counseling: A rapid learning-based procedure for 
marital counseling. Behavior Research and Therapy, 
1973, 11, 365-382. 

Birchler, G. R., Weiss, R. L., & Vincent, J. P. Multi- 
method analysis of social reinforcement exchange 
between maritally distressed and nondistressed 
spouse and stranger dyads, Journal of Personality 
and Social Psychology, 1975, 31, 349-360. 

Burgess, E. W., Locke, J. J., & Thomas, M. M. The 
family. New York: Van Nostrand Reinhold, 1971, 

Gergen, K. The psychology of behavior exchange. 
Reading, Mass.: Addison-Wesley, 1969. 

Haley, J. Research on family patterns: An instru- 
ment measurement. Family Process, 1964, 3, 41-65. 

Haley, J. Speech sequences of normal and abnormal 
families with two children present. Family Process, 
1967, 6, 81-97. 

Hall, J. Decisions, decisions, decisions. Psychology 
Today, November, 1971, 51-88. 

Jacob, T. Family interaction in disturbed and normal 
families: A methodological and substantive review. 
Psychological Bulletin, 1975, 82, 33-65. 

Jacob, T., & Davis, J. Family interaction as a func- 
tion of experimental task. Family Process, 1973, 12, 
415-427. 

Kramer, C. H. The theoretical position: diagnostic 
and therapeutic implications, In Charles Kramer, 
Bernard Liebowitz, Robert Phillips, Sylvia Schmidt, 
& James Gibson (Eds.), Beginning phase of family 
treatment, Chicago: Family Institute of Chicago 
1968. $ 

Lederer, W. J., & Jackson, D. D. The mirages of 
marriage. New York: Norton, 1968. 

Locke, H. J. Prediction adjustment in marriage: A 
comparison of a divorced and happily married 
group. New York: Holt, 1961. 

Navran, L. Communication and adjustment in mar- 

Oe Process, 1967, 6, 173-180, 
son, D. H., & R; i 
conflicts (IMC) re G. Inventory of marital 


‘xperimental interaction pro- 

cedure. Journal of Marria; il 
oes ge and the Family, 1970, 
Patterson, G. R. A basis for identifying stimuli which 


control behaviors in natural settings. il 
velopment, 1974, 45, 900-911 ai) adel 
Ses es AF, & Harrell, J. A behavioral-exchange 
odel for marital counseling, Famil i 
1072, 21, man. ng. Family Coordinator, 


BEHAVIOR EXCHANGE 


Raush, H. L., Barry, W. A., Hertel, R. K., & Swain, 
M. A. Communication, conflict and marriage. San 
Francisco: Josey-Bass, 1974, 
Riskin, J., & Faunce, E. E. An evaluative review of 
family interaction research. Family Process, 1972, 
11, 365-455. 
Speer, D. C, Marital dysfunctionality and two-person 
non-zero-sum game behavior. Journal of Person- 
ality and Social Psychology, 1972, 21, 18-24. 
tuart, R. B. Operant interpersonal treatment for 
marital discord. Journal of Consulting and Clini- 
cal Psychology, 1969, 33, 675-682. 

rman, L. M., & Wallin, P. The validity of marriage 


23 


prediction and marital adjustment test. American 
Sociological Review, 1949, 14, 497-504. 

Thibaut, J. W., & Kelley, H. H. The social psychology 
of groups. New York; Wiley, 1959. 

Weiss, R, L., Hops, H., & Patterson, G, R. A frame- 
work for conceptualizing marital conflict: A tech- 
nology for altering it, some data for evaluating it. 
In Leo A. Hamerlynck, Lee C. Handy, & Eric J. 
Mash (Eds.), Behavior Change: The Fourth Banff 
Conference on Behavior Modification. Champaign, 
TII.: Research Press, 1973. 


(Received April 27, 1975) 


Journal of Personality and 


Social Psychology 
1976, Vol. 34, No, 1, 24-29 


Deindividuation as a Function of Density 
and Group Membership 


Dale O. Jorgenson and Fred O. Dukes 

California State University, Long Beach 
Deindividuation theory predicts that conditions of high objective anonymity 
produce greater deindividuation than conditions of low anonymi Based on 
this prediction, it was hypothesized that subjects in high-densit ettings or 
who are part of large social aggregates would report feeling more deindividu- 
ated and act in a less inhibited manner than subjects in lower-density settings 
or who are part of a smaller social aggregate, Based on a distinction between 


a social aggregate and a social group, it was further hypothesized that members 
hat would be predicted from de- 
lividuated and act more inhibited 
were tested by observing the per- 
situational norm (the measure of 


of interacting social groups, 


individuation theory, would feel less deind 
than solitary individuals, These hypotheses 
centage of individuals who conformed to a 


contrary to wi 


behavioral inhibition) under both high- and low-density conditions in a field 
setting, The combination of density level or aggregate size, sex of subject, and 


group size yielded a 2 X2 X4 design. Separate 
urveyed for self-rep 
d. In addition, 


density conditions were si 


hypotheses were supporte 
conformed also reported less dei 
These results are interpreted as 
deindividuation, 


Zimbardo (1970) has attempted to sys- 
tematically identify the theoretical anteced- 
ents to and consequences of deindividuation, 
an internal psychological state characterized 
by (a) a feeling of indistinguishability from 
one’s environment, (b) a lowered self-aware- 
ness and self-evaluation, and (c) a decrease in 
concern about the evaluation of other people. 
Zimbardo proposed that this state in turn 
leads to an increase in uninhibited behavior, 
Among the antecedent variables which Zim- 
bardo identified were anonymity and group 
size. As predicted by the model, several 
studies have shown that increased anonymity 
or lowered identifiability does lead to an in- 
crease in uninhibited behaviors (Baron, 1971; 
Cannavale, Scarr, & Pepitone, 1970; Diener, 
1974; Diener, Westford, Diener, & Beaman, 
1973; Festinger, Pepitone, & Newcomb, 1952: 
Gergen, Gergen, & Barton, 1973; Singer, 
Brush, & Lublin, 1965: Zimbardo, 1970; 
Kiernan & Kaplan, Note 1; Marwell & 
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samples of 30 subjects in both 
orts of deindividuation. Both 
subjects in the survey sample who 
led to do so. 
ed theory of 


Schmitt, Note 2). Most import: 
also some evidence that the level 
is related to the degree of 
(Arbuthnot & Andrasik, 1973 
Singer, Brush, & Lublin, 
Dineen, & Westford, Note 3). 
Evidence of a simple relationship between 
group size or what some investigators refer to 
as “group presence”? (e.g., Festinger et al, 
1952) or “group mind” (Lebon, 1925) and 
nsistent, Several in- 


ant, there is 
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tinguish the concept of a social aggregate 
from the concept of a social group. A social 
aggregate can be defined as a collection of 
individuals who are not interacting with one 
another, are not organized in any particular 
way, and are independent in most respects. 
\ social group, on the other hand, is a collec- 
tion of individuals who may be interacting, 
who perceive the existence of a group and 
their membership in it, and who are interde- 
pendent in critical respects. Another signifi- 
cant difference is that the members of a group 
may share some explicit goal, whereas mem- 
vers of an aggregate share no explicit goal. 
‘f course, an aggregate such as a lynch mob 
vy temporarily become a group when they 
the goal of lynching someone. This dis- 
n is important because some of the 
es which have compared the feelings and 
savior of people in groups with that of 
tary individuals may have been observing 
embers of aggregates instead, In almost all 
ises, participants in these studies had no con- 
‘act prior to their participation, In such cases, 
nereased size may actually mean increased 
anonymity from both the experimenter and 
other members of the aggregate. In contrast, 
if participants were already members of a 
social group or if experimental procedures 
created one or more of the conditions for be- 
coming one (e.g., forcing them to interact in 
pursuit of a common goal), increased size may 
have a very different effect. In light of this 
distinction, the purpose of this investigation 
was to compare the effects of the size of non- 
interacting aggregates with the effects of the 
size of interacting groups in a field setting. 
The central hypothesis of this study was 
that individuals act less inhibited and feel 
more deindividuated when in a high-density 
setting or a large aggregate than when in a 
low-density setting or small aggregate. In 
terms of a model of the deindividuation pro- 
cess, such a difference would occur because 
the presence of a large crowd or aggregate 1n 
a specific setting may increase anonymity, 
which in turn may produce an increased feel- 
ing of deindividuation and/or a decrease in 
behavioral inhibition. 5 
Regardless of the density of a setting or the 
size of an aggregate, any individual present 
in that setting may either be solitary or a 


member of a face-to-face interacting group. 
For example, it is possible for an aggregate 
to consist of solitary individuals in combina- 
tion with a number of independent social 
groups. If, as both Zimbardo (1970) and 
Festinger et al. (1952) suggest, membership 
in a group produces greater deindividuation, 
solitary individuals should act more inhibited 
and feel less deindividuated than members of 
a face-to-face group. However, when both 
solitary individuals and members of face-to- 
face groups are part of a larger noninteracting 
social aggregate, it seems possible that a mem- 
ber of a group may feel more identifiable, at 
least to other members of that group, than a 
solitary individual and may thus experience 
greater individuation and/or behavioral in- 
hibition, In the present study, it was hypothe- 
sized that when both solitary individuals and 
members of small interacting groups are part 
of a larger noninteracting aggregate, solitary 
individuals act in a less inhibited manner and 
feel more deindividuated than members of 
face-to-face groups, independent of the size 
of the aggregate. 


METHOD 

Subjects 

Subjects were 274 males and 270 females who were 
present at a table in one of the dining areas of the 
student union at a large western university sometime 
between 8:30 am, and 12:30 pm., on Monday 
through Friday, over a 6-week period during Febru- 
ary, March, and April 1975. During these time pe- 
riods, only those individuals who had food trays and 
left the dining area permanently during any period 
were observed as subjects. Because of the large num- 
ber of observation periods, it is possible that a very 
few subjects were observed more than once, despite 
efforts to avoid this. This would be problematic only 
if the proportions of multiple observations in the 
various experimental conditions were substantially 


different. 


Procedure and Design 


On each of the 30 observation days, a 23-year-old 
male observer located himself at a table somewhere 
around the perimeter of the cafeteria, such that none 
of the other people present were behind him, and be- 
haved as if he were studying. At no time did it ap- 
pear obvious that anyone present was aware of the 
observations being made, perhaps because the ob- 
servation forms were kept out of view. 

As an indication of behavioral inhibition or con- 
formity to a situational norm, the observer noted 
whether any person with a food tray who left the 


TABLE 1 


SAMPLE SIZE AND PERCENTAGE OF SUBJECTS RETURNING 
THE TRAY AS A FUNCTION OF Density, GROUP 
SIZE, AND SUBJECT SEX 


High density Low density 

Condition n % n % 
ne 74 50.0 78 68.9 
Are 39: $1.2 46 73.9 
Female 35 48.6 32 62.5 
Dyad BATT 73 87.5 
Male 69 771 34 94.1 
Female 68 73.1 39 79.4 
Tryad 49 734 21 90.4 
Male 28 82.1 © 80.0 
Female 21 61.9 16 93.7 
Quad 45 71.1 7 85.7 
Male 16 75.0 4 100.0 
Female 29 68.9 3 66.6 
Overall 305 68.8 179 80.1 


area during an observation period either returned the 
tray at one of the locations designed for this purpose 
near the exits or left it at the table. Since there are 
signs which ask cafeteria users to return them, re- 
turning the tray was regarded as conformity to this 
fairly explicit norm, As a person with a tray rose to 
leave, the observer recorded the subject’s sex, the 
number and sex of persons at the table, and whether 
or not the tray was returned, All Persons with trays 
at a table who left during the observation period 
were included as subjects, A total of 244 males and 
240 females were observed in this fashion. 
Differences in Aggregate size or density were cre- 
ated naturally by observing only during periods when 
the approximately 90 chairs in the cafeteria were 
either 75% occupied, which was defined as a high- 
density period, or when 25% of them were Occupied, 
which was defined as a low-density period. If the 
occupancy rate changed to a level between these cut- 
off points during an observation period, observations 
were halted until the level on 
of the two Tanges, However, 


+» While low-density peri 
occurred primarily between te a 


Measurement of Deindividuation 


To determine if dens 
level of deindividuation, 
female subjects with tra 
Seated at a table in the ol 
during the first 15 days 
respond to a survey of 
life situations,” 
table, a male and a femal 
in both high- and low-den: 
15 days, If the table chosen contained neither a per- 
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To obtain a measure of the felt level of deindivid 
ation among subjects in each of the two density co 
ditions, each of the subjects approached was ask« 
to indicate how they felt at the moment by anon 
mously filling out a series of 7-point bipolar adjectiy 
scales. The following four scales were used as me 
sures of deindividuation: concerned-carefre €, cor 
spicuous-anonymous, free-restrained, and inhibitec 
uninhibited. Three scales were used as fillers 
reduce possible reactivity: active-passive, friendl 
unfriendly, and sleepy-wide awake. 

After subjects had agreed to participate, they wer 
asked to complete the questionnaire and place it ir 
a large manila envelope without writing their name 
on it. It was hoped that this procedure would in 
crease response honesty, After each questionnaire was 
returned, the observer returned to his table and re 
corded the sex of the subjects, the number of people 
at the table, the number with trays, and whether or 
not the subjects returned their trays, 


RESULTS 


Based on Lunney’s (1970) finding that 
analysis of variance procedures can be used 
on dichotomous data with very little error, 
even when various assumptions are violated, 
the tray-return behavior of the subjects who 
were given no questionnaire was analyzed by 
means of a three-way analysis of variance for 
unequal cell frequencies, with two levels of 
density, two levels of subject sex and four 
levels of group size. Since the number of sub- 
jects observed in groups with five or more 
members was so small (n= 12), it was de- 
cided to collapse these group size categories 
into one which represented those with four or 
more members only, The actual percentage of 
subjects returning their trays in each of the 
resulting conditions is contained in Table 1, 
As predicted, subjects observed during high- 
density periods returned their trays less fre- 
ess conformity or be- 
i than those observed 
during low-density periods, F(1, 468) = 5.42, 
while group size was 


mana 


s ewer solitary sub- 
Jects (60.1%) returned their trays tae those 


ee eee a 
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who were members of a group (78.0%), re- 
gardless of size, F (1,468) = 15.44, p < 01. 
Thus, group presence resulted in greater be- 
havioral inhibition; the size of the group had 
no effect on return rate. Furthermore, as evi- 
denced by the lack of a significant interaction, 
F (1,468) = .07, ns, density level and group 
presence had an additive effect on the likeli- 
hood of a tray being returned. Finally, no sig- 
nificant sex difference in tray return rate was 
‘ound here, F(1, 468) = 2.61, ms, nor did sex 
i subject interact significantly with either 
nsity, (1,468) = .07, ns, or group size, 
3,468) = .34, ns. 
The possibility that the tray return rate 
s lower during high-density than low-den- 
periods because of the effects of deindi- 
tion was partially supported by the re- 
‘es to the questionnaire. Because of the 
| number of subjects in a few of the 
ltant cells, the questionnaire responses of 
jects in groups of all sizes were collapsed 
o a single category, and a 2 X 2 X 2 un- 
eighted means analysis of variance was com- 
eted. Thus, subjects who responded during 
igh-density periods reported feeling signifi- 
intly more anonymous, F (1, 52) = 4.31, p < 
5, less inhibited, F(1, 52) = 3.97, p < .05, 
ut no more free, F(1, 52) = 1.41, ns, and no 
iore carefree, F(1, 52) = .05, ms, than sub- 
cts who responded during low-density pe- 
iods. Furthermore, consistent with what would 
e expected from tray-return rate difference, 
litary subjects reported feeling more care- 
free, F(1, 52) = 5.56, p< .05, more anony- 
mous, F(1,52) = 7.95, p< 01, and more 
free, F(1, 52) = 8.45, p < .01, than members 
f a group. However, they felt no more in- 
hibited, F(1,52) = 1.89, ns, than subjects 
who were members of a group. There was no 
significant sex difference, nor were there any 
significant interactions for any of the adjec- 
tives. Finally, there were no significant effects 
for any of the control adjectives. The means 
for the four adjective pairs used as measures 
of deindividuation as a function of density, 
sex, and group presence are contained in 
Table 2, However, because of the small num- 
ber of cases in some of the low-density cells, 
considerable caution should be used in inter- 


preting these results. 


TABLE 2 


MEAN RESPONSE AS A FUNCTION OF ADJECTIVE Pam 
AND SUBJECT CONDITION 


Adjective pair 


Conspic- Inhib- 
Con- uous-  ited-Un- 
Free-Re- cerned- Anony- inhib- 


Condition strained Carefree mous ited 


High Density 5.4 
Alone 
Male 
Female 
Group 
Male 
Female 
Low Density 
Alone 
Male 
Female 
Group 
Male 
Female 


48 5.5 


PRROQVOREMUUAD 
BowOhRwWUNACHKODAN 
LONE PRON a O n E 
SroryUDeQeUUNows 
LER RNR RRR DOOD 
SSSSHSu NaS 
Ssropnsannnno g 
—weeounsaoeucw 


_Nole. Higher means are assumed to represent greater dein- 
dividuation. 

A comparison of the questionnaire responses 
of the 48 subjects in the questionnaire sample 
who returned their trays and the 12 who left 
them at their table produced additional sup- 
port for the deindividuation theory. Subjects. 
who returned their tray felt significantly less 
carefree, t(58) = 2.02, p < .05, less anony- 
mous, (58) = 2.78, p < .01, less free, ¢(58) 
= 2.47, p < .05, but no more inhibited, +(58) 
= 1.60, ns, than those who left them at their 
table. The means of these four adjective pairs 
for subjects who returned their trays were 
3.83, 4.25, 4.77, and 4.92, respectively, 
whereas those for subjects who left their trays 
at the table were 4.92, 5.50, 5.92, and 5.58, 
respectively. There were no significant differ- 
ences in the responses of these two groups to 
the control adjectives. Since the questionnaire 
was completed by these 60 subjects prior to 
leaving the table, these results might be in- 
terpreted to mean that feelings of deindividua- 
tion preceded nonconforming behaviors. This 
would distinguish this phenomenon from 
Wheeler’s (1966) model of behavioral conta- 
gion, which might predict that deindividuation 
follows rather than precedes unrestrained be- 


havior. 
DISCUSSION 


Evidence obtained here appears to support 
a modified version of deindividuation theory. 
Thus, individuals who were part of a large 
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aggregate (i.e, who were observed during a 
high-density period) behaved in a less in- 
hibited, nonconforming manner and reported 
feeling more deindividuated, at least as mea- 
suted here, than those who were part of 
smaller aggregates, This is consistent with the 
prediction which would be made on the basis 
of Zimbardo’s (1970) model. Nevertheless, 
contrary to what might be predicted by this 
model, members of face-to-face interacting 
groups behaved in a more inhibited or con- 
forming manner and felt more individuated 
than solitary individuals, Thus, simply being 
a member of a face-to-face interacting group 
may in itself produce no deindividuation; the 
occurrence of deindividuation in such groups 
may depend on the presence of several condi- 
tions (e.g., the consumption of alcohol) or 
may vary as a function of the type of group, 
the group’s goals, and their activities, It may 
also be that only increases beyond a certain 
size have a deindividuating effect and then 
only when other conditions are present (e.g., 
level of arousal, diffusion of responsibility). 
For this reason, it is important to consider 
here that only 12 of the 332 people observed 
were in groups which had as many as 5 mem- 
bers and only 52 were in groups which had 4, 
Thus, the comparison made here only crudely 
Suggests what the effects of variations in group 
size might be. 

Tn evaluating these results it is important to 
consider that the majority of individuals ob- 
Served returned their trays, regardless of situ- 
ation, Thus, the increased anonymity pro- 
duced by the Presence of large crowds may 
not always produce intensely unrestrained be- 
havior or widespread nonconformity, Rather, 
it may occasionally result in more subtle 
changes in the way people feel and act. Per- 
haps because the Output behaviors in Zim- 
bardo’s (1970) model of the deindividuation 

Process connote a high behavioral intensity, 
the majority of target behaviors used in dein- 
dividuation research have typically been of 
that type. Even so, it seems that the very same 
antecedent conditions could produce behavior 
which is subject to deindividuation and yet 
has few of the characteristics which Zimbardo 
identified. While aggression and other highly 
visible forms of uninhibited behavior are per- 
haps more significant, it seems appropriate to 


give greater consideration to more subtle ı 
fects in future research, i 

Although the presence of the predicted di 
ferences in deindividuation and tray-retu 
behavior would seem to support a deindivici 
ation interpretation, there are some altern 
tives which should be considered. One obvior 
possibility is that it was simply more difficu 
to return a tray during a high-density perio: 
However, at no time were there any lines : 
the tray-return locations (which were locate 
near all exits). A second possibility is tha‘ 
people may have modeled nonreturn behavior 
more frequently during high-density than low 
density periods. Modeling probably did have 
some influence, but such an interpretation 
fails to explain the differences in the responses 
to the questionnaire, 

Another alternative explanation of the dif- 
ference in high- and low-density return rates 
is that people who were present during these 
two periods differ systematically on some 
stable traits which are potentially related to 
the likelihood of conformity to this particular 
norm, Again, however, the questionnaire re- 
Sponses seem to support a deindividuation in- 
terpretation. In addition, the fact that sub- 
jects in the questionnaire sample who returned 
their trays felt less deindividuated than those 
who left them behind is difficult to attribute 
to the influence of stable personality traits, 

Based on the findings reported here, it 
seems desirable to conduct more field studies 
of the predictions of deindividuation theory. 
For example, it would be worthwhile to find 
out if people really do feel deindividuated or, 
conversely, individuated, in real-life situations 
which many observers expect to produce these 
effects. Do people living in large urban areas 
generally feel more deindividuated than those 


living in rural villages? If not, what conditions 
in urban areas led 


tion process in the more controlled environ- 
ment of the laboratory (all demand character- 
istics and experimenter effects aside), it also 
Seems important to know the distribution of 
the phenomenon in nature as well, Although a 
study such as this has important defects, 
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perhaps what we lack is practice at minimiz- 
ing their influence. 

One problem that would have to be solved 
before either laboratory or field research can 
proceed in an orderly fashion is the clarifica- 
tion of the concept of deindividuation. Zim- 
bardo (1970) describes it as a reduction in 

o»ncern for the evaluation of self and/or 

hers and lists some behavioral manifesta- 
ons; however, a reasonably valid measure of 
1e feeling state itself has yet to appear. Un- 
vrtunately, the few cases in which attempts 
we been made to measure it independent of 

; behavioral manifestations have used differ- 

it measurement strategies. Thus, it is un- 

ar from this study whether a person who 

‘ports feeling highly anonymous is neces- 

wily deindividuated or whether anonymity, 

jective or subjective, is only antecedent to 

This fact, combined with the limited num- 

r of subjects completing the questionnaire, 

rticularly during the low-density periods, 

ikes the questionnaire data only suggestive. 
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The Experience of Crowding: A Two-Factor Theory 
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It was proposed that the experience of crowding occurs in a two-step process: 
First, the individual becomes aroused by violations of his personal space, and 
then he attributes the cause of this arousal to other people in his environment. 
Based on this model it was predicted that violations of personal space rather 
than simple density is the spatial variable associated with crowding, Further, 
it was predicted that the experience of crowding can be alleviated if the indi- 
vidual is distracted from making the attribution that his arousal is caused by 
other people, In order to test these predictions, interaction distance (close and 
far) and density (high and low) were varied separately. In addition, for half 
of the conditions, pictures (attribution inhibitors) were placed on the walls of 
the experimental room, and in the other half, the walls were bare, Subjects 
worked on two tasks and were then questioned about their experience of 
crowding. The results showed that interaction distance was more closely re- 
lated to crowding than was density and that the addition of pictures reduced 
‘Yat experience of crowding only in the close interaction conditions. The results 


Were interpreted as supporting the attribution analysis of crowding. 


The last few years have seen a rapidly in- 
creasing interest among social psychologists in 
the effects of crowding on human behavior. 
‘The early animal studies (e.g., Calhoun, 1962; 
Christian, Flyger, & Davis, 1960) found that 
crowding was stressful, led to a deterioration 
of social order, and was responsible for in- 
creased disease and death. To some extent, 
these findings were substantiated in humans 
by the early demographic work (e.g., Schmitt, 

1957, 1966), which indicated a positive rela- 
tionship between density and such indices as 
infant mortality, incidence of tuberculosis, 
admissions to mental institutions, and juvenile 
delinquency. 2 

More recently, however, laboratory studies 
on the effects of crowding on human behavior 
have yielded mixed results. On one hand 
Grifftt and Veitch (1971) found that high 
density led to increased intragroup hostility. 
This finding was replicated with male subjects 
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by Stokols, Rall, Pinner, and Schopler (197 
On the other hand, Stokols et al, found th t 
high density did not increase intragroup ho - 
tility with female subjects, and Freedma. 
Klevansky, and Ehrlich (1971) found th 


high density did not produce a decrement i 
task performance, 


arousal is supposedly related to some spatial 
his spatial 
he measure 
nost closely 
ensity is a 
pecifies the 


1972; 1973), there is little evidence or theo- 


gesting that high densit 
should be arousing (Freedman, 1978). 7 
x While there is little to Suggest a relationship 
etween simple Measures of space available 
and arousal, there is some theory and data to 


—_—__ 


a 


THE EXPERIENCE OF CROWDING 31 


suggest that certain arrangements of indi- 
viduals within a space may lead to arousal. 
This view focuses not on the space available 
to the individual (density) but on the manner 
in which the space is actually used. In his 
study of proxemics, Hall (1959, 1966) sug- 
gested that man is characterized by interac- 
ion distances or personal space. There are 
ery definite interpersonal distances at which 
n individual is comfortable when interacting 
ith other individuals. The specific distance 
; determined by the type of interaction and 
1e relationship that exists between the indi- 
iduals. According to Hall, stress results when 
1e appropriate distance is violated. The indi- 
idual whose personal space is violated will 
{tempt to rearrange the interaction in order 
o reestablish his or her space. Sommer and 
is associates (Sommer, 1969; Felipe & Som- 
ner, 1966) found that violations of personal 
race cause individuals to become stressed and 
ncomfortable. This suggests that it is not 

e amount of space available to the indi- 
idual per se but the distance between indi- 
duals that determines the degree of stress 
rising from a particular situation. 

Using a personal space approach, it be- 
omes possible to predict that on one hand, an 
ndividual may experience stress in a rather 
urge room with a low density and a small 
umber of other persons; on the other hand, 
n individual may feel relatively uncrowded in 
onditions of relatively high density. The 
‘rmer condition should occur when the indi- 

viduals are interacting at a rather close dis- 
tance and using only a small portion of the 
room, The latter condition could result when 
the individuals are “spread out” in a rather 
small room. The interaction distance approach 
could also account for results such as those 
found by Epstein and Karlin (1975), who re- 
ported that with density measures held con- 
stant, persons in the center of a group felt 
more crowded than individuals at the fringe 
of the group. Persons in the center of the 
group are more likely to have their personal 
space invaded than are persons at the group's 
fringes. 

Most previous research (e.g. Stokols et al., 
1973) has failed to take into account the dis- 
tinction between interaction distance and den- 
sity, and consequently the two measures are 


often confounded: When room size is varied, 
interaction distance is also varied, One aim 
of the present study was to unconfound den- 
sity and interaction distance and to investi- 
gate the relationship of each to the experience 
of stress and crowding. In order to achieve 
this, subjects were divided into two groups: 
In half of the conditions, subjects interacted 
at such a close distance that their personal 
space was violated; in other conditions, the 
interaction distance was greater, and there 
was no violation of personal space. This ma- 
nipulation was crossed with room size (large 
or small). Since the number of persons in a 
group was held constant, this latter manipula- 
tion affected density. It was predicted that 
the interaction distance would more drasti- 
cally affect stress and arousal than would 
simple density variations. 

Even though it may be possible to identify 
the spatial factors that lead to stress and 
arousal, it does not necessarily follow that 
these factors also lead to the experience of 
crowding or to the negative behavioral effects 
associated with crowding, Stokols (1972) has 
suggested that crowding is a psychological 
state that is mediated by spatial, social, and 
personal factors. This position has been 
echoed by numerous other investigators in the 
field (Desor, 1972; Esser, 1973; Choldin, 
Note 1). The question that remains unan- 
swered is how the individual translates these 
spatial and social factors into the experience 
of crowding. Stokols argued that spatial factors 
result in crowding when the people in an indi- 
vidual’s environment begin to interfere with 
his or her activities. This view may be correct, 
put it raises additional questions. First, why 
does an individual experience crowding when 
he (she) is surrounded by many other people 
but is not engaged in any activity? In this 
case, there could be no interference, but the 
experience of crowding could still result, Sec- 
ond, why does crowding rather than some 
other experience such as frustration result 
from having activities interfered with? 

Much of the present conceptualization of 
crowding suggests that it carries the psycho- 
logical status similar to that of an emotion. As 
such, it seems plausible tol Hypothese that 
the process in which one “feels crowded is 


similar to that in which one feels other emo- 


tions, Previous research (e.g., Schachter & 
Singer, 1962) has repeatedly demonstrated 
that the experience of an emotion is dependent 
on a state of arousal or stress and an attribu- 
tion about the cause of that arousal state; 
that is, an arousal is not experienced as a par- 
ticular state until an attribution about the 
cause of that arousal has been made. 
Schachter and Singer were able to demonstrate 
that within limits, the same state of arousal 
could be experienced either as a positive state 
(euphoria) or a negative state (anger). 
It can also be hypothesized that crowding 
is a function of arousal and the attribution 
that the arousal is caused by other people in 
an individual’s environment. The arousal state 
results from some spatial factor. In this case, 
it would be a violation of personal space 
rather than some simple density condition, 
However, the personal space violation should 
not result in the experience of crowding unless 
the attribution is made that the arousal is 
caused by other people. Thus, crowding is a 
two-stage process beginning with a state of 
arousal created by violations of personal space 
and followed by the attribution that other 
people are the cause of this arousal. 

This model has implications for the means 
that can be employed to reduce the experience 
of crowding, It suggests that crowding (as a 
Psychological state) can be reduced eithe 
eliminating arousal or by 
attribution that the aro 
people. Based on the at 
means of interfering wi 
cess would be attentio: 
people surrounding 
be done by placing 
room, showing a m 
tracting event occ 
hibitors” should 


r by 
interfering with the 
usal is due to other 
tribution model, one 
th the attribution pro- 
n diverted from other 
an individual, This could 
interesting Pictures in the 
ovie, or having some dis- 
ur, These “attribution in- 
r reduce the experience of 
crowding and reduce the effects of crowding 
on behavior, In most previous crowding re- 
search (Freedman et al., 1971; Stokols et al 

1973), care has been exercised to remove any 
possible distractions from the experimental 


room and make the room as bare as possible. 
The necessary effect of t 


the individual’s attention on the other people 


around him; there is little else on which he 
can focus. Because his attention would þe 


concentrated on other people, his attribution 


his should be to focus ~ 
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about feeling aroused should also cenie on 
these other people. This attribution ild 
result in the experience of crowding. 

In the present experiment, the de ty 
and interaction distance manipulations re 
crossed with a distractor variable, In > ne 
conditions, posters and wind chimes re 
placed in the experimental room; in th: c- 
maining conditions, the experimental r ns 
were bare with the exception of the neces y 
chairs. The addition of pictures should i: 
fere with the crowding attribution, Thu. t 


was predicted that the addition of pict: s 
should reduce the experience of crowding « d 
decremental behaviors associated with cro 
ing significantly more in the close interacti 
distance condition than in the far interact 
distance condition. This prediction follows | 
cause subjects in the close condition should 
more stressed than subjects in the far con 
tion, and the attribution of causality for t 
Stress should be occurring in the former b 
not in the latter condition. If the hypothe 
that interaction distance and not density 
responsible for the experience of crowdit 
were correct, the picture manipulation shou 
not interact with the density manipulatior 
That is, there should be no attribution proces 
motivated by the density manipulation, anc 
thus the pictures should not affect experience 


METHOD 
Subjects 


Subjects were 315 male intri 
students who participated in a 
performance” in partial fulfill 
quirement. Of these, 35 group; 
groups of 7 subjects were r 
having more than one 7-ma 
unacquainted subjects were 


oductory psychology 
study entitled “group 
ment of a course re- 
S of 8 subjects and 5 
un, with no condition 
n group. Only previously 
included in any group. 


Method 


The subjects met 
experimental room and were 


„er EDE 


a 
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cording everyone’s voice, A “dummy” microphone 

ing from the ceiling. The recording cover story 

as used so that subjects would not move their 
hairs, thereby affecting the interaction distance. 

The three independent manipulations involved 

om characteristics. First, density was manipulated 

rough room size. Half of the groups were run in 

vigh-density room measuring 12 feet 6 inches X 9 

t (3.6 m 13.24 cm X 2.7 m), while the remaining 

ups were run in a low-density room measuring 
eet 6 inches X 11 feet 6 inches (6 m 13.24 cm X 
m 13.24 em). Second, interaction distance was 
ipulated by means of the placement of chairs 
he room. In all cases, the chairs were arranged 
circle. In the close distance condition, the front 
of each chair touched the front legs of the chairs 
sither side of it, whereas in the far distance con- 
n, there was 19.5 inches (49.53 cm) between 
front leg of one chair and the front leg of the 
r next to it. The third manipulation involved the 
ence or absence of pictures and posters. In the 
ures present condition, the room had six prints 
nged on the walls and two wind chimes hung 
n the light fixtures. Three of the prints were from 
Studio 1 Signature Series (Flamenco Guitarist, 
ale Nude, Rock Musician), two were Portal 
ication posters (Burges? Boy and Girl by 
ım and Don Quixote), and one poster was from 
Sierra Club Format Series (Kauai and the Park 
try of Hawaii). In the no-picture condition, the 
. did not contain any pictures or wind chimes. 
ept for the experimental manipulations, the rooms 
e identical with regard to color, ceiling tiles, num- 
of doors (one) and windows (none), and tem- 
ature and ventilation systems. 
\fter subjects had been seated, the experimenter 
e each subject a name badge (lettered A-H) to 

ı onto their shirts, Throughout the study, the ex- 
perimenter’s interaction with the subjects was mini- 
wal, His role was simply to explain the tasks, collect 
the data, and leave the room. In fact, the experi- 
menter was only present in the room with the sub- 
jects for a very short time, Subjects were then given 

ew minutes to get acquainted with each other as 
the experimenter left the room, supposedly to get 
me additional materials. No instructions were given 
to the subjects at this time except that the experi- 
menter would be “right back.” Just prior to giving 
the experimental instructions, the experimenter col- 
lected the subjects’ watches. He explained that the 
experiment involved subjects working on a number 
oi tasks and that subjects often became concerned 

A ; i trate on the tasks. 
with the time and did not concentra 5 
He was thus collecting the watches so that time 
would not be of concern to the subjects. Actually, 
one of the dependent measures was time distortion, 
and the watches were collected so that subjects 
would not know the amount of time that had elapsed 
in the study. 

The experimenter explain 
study were to investigate how well groups perform 
on various tasks and to examine the group processes. 
The subjects were then told that the first task in- 


ed that the aims of the 


volved forming as many words as possible from the 
master word industriously, The group members were 
to come up with one group list and were to work 
until the experimenter told them to stop. After 
ascertaining that everyone understood the task, the 
experimenter left the room and allowed 10 minutes 
for the first task, After this period, he returned to 
the room and collected the list of words. This task 
was employed because the number of words formed 
by the group could serve as a clear index of per- 
formance. 

The experimenter then stated that the second task 
was a human relations problem. He gave the subjects 
a case history of a juvenile delinquent, Johnny 
Rocco. Rocco was described as a young boy from a 
broken home who had committed numerous minor 
offenses. Rocco had just been caught perpetrating a 
more serious crime. The task of the subjects was to 
read the case history and indicate on the accompany- 
ing love-punishment scale which of seven alternative 
courses of action should be taken, The seven al- 
ternatives varied along a punitiveness dimension, 
The subjects were told that they were first to make 
an individual decision as to what course of action to 
take with Rocco. After recording their individual 
recommendation, they were to discuss the Rocco 
case as a group and arrive at a group recommenda- 
tion. The experimenter allowed subjects 10 minutes 
to arrive at the individual decisions:and then gave 
them an additional 10 minutes to arrive at the 
group decision. The use of the Johnny Rocco case 
allowed the opportunity to investigate how crowding 
related to the punitiveness of group decisions. 

After subjects had completed the Johnny Rocco 
case, the experimenter collected their responses. He. 
then asked subjects to complete a 20-item question- 
naire pertaining to their reactions to the experiment 
and their attraction toward fellow group members. 
The subjects were given 17 minutes in which to 
complete the questionnaire. Following this, the ex- 
perimenter returned the watches to the subjects and 
carefully debriefed them. a 


RESULTS 


Measures of Crowding 

Four questions were included to measure 
subjects’ experience of crowding. Subjects 
were asked how comfortable, confined, and ill 
at ease they felt during the experiment and 
how crowded they found the experimental 
room. Since the four questions yielded similar 
results, the ratings from the four were com- 
bined to form one “crowding” index. As can 
be seen from Table 1, the three main effects 
indicate that subjects felt more crowded in 
the small as opposed to large room, in the 
close as opposed to far interaction distances, 
and in a room without pictures as opposed to 
a room with pictures. There are, however, two 


TABLE 1 
MEANS FOR CROWDING Inpices 
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TABLE 3 
TIn ESTIMATES 


Density /interaction Pictures No pictures Density /interaction Pictures s 
Low (large room) Ka, : 
Far 4.07 4.22 “Task 1 9.21 9 
Close 5.92 6.31 Task 2 16.76 16 
i lose 
High (small room) 3 3 c Task i 10.20 1 : 
Far 2 i Task 2 18.70 
Close 97 z High 
mea = il. formed by a composite Far 
aol nda Ombre Gates es ‘crowed Task 1 9.48 10.40 
subjects reported being; 1. = least crowded, 10 = most crowded. Task 2 17.51 18-44 
Close = 
interactions that qualify these effects. The R ee 4 aa 
Density X Interaction Distance interaction 


indicates that density affected the experience 
of crowding much more under far interaction 
distance than under close interaction distance 
(see Table 2). For example, subjects in the 
far interaction condition reported feeling sig- 
nificantly more crowded in the high-density 
than in the low-density condition, F(1, 32) = 
99.00, p < .001, However, there was no sig- 
nificant difference in reports of crowding in 
the high- versus low-density conditions under 
close interaction distance, F(1,32) = 1,96, 
ns. Thus, while interaction distance affected 
the experience of crowding regardless of den- 
sity conditions, variations in density affected 


the experience of crowding only under far 
interaction distance. 

The significant Interaction Distance x Pic- 
ture interaction resulted from the fact that in 
the close interaction condition, the addition 
of pictures to the room significantly reduced 
the experience of crowding, F(1, 32) = 23.04, 


Note. For Task 1, subjects were asked to indicate how uch 
time (in minutes) the experimental (word) task took; fc isk 
2, subjects were asked to indicate how much time (in min ies) 
the experimental (Johnny Rocco) task took. 


the far interaction condition had no ef 1 
Feil. 


Additional Stress Indices 


Subjects were asked to estimate how much 
time Task 1 (word game) and Task 2 
(Johnny Rocco) took. It was predicted t) it 
the more stressed subjects felt, the more thy 
would overestimate the time they spent on 
the experiment. In actuality, Task 1 lasted 10 
minutes, while the second task was 20 minutes 
in duration. As can be seen from the resulis 
presented in Table 3, overestimation of time 
in comparison with actual time occurred only 
in the close interaction conditions. There 
were, however, significant main effects indi- 
cating that estimations of time were greater 


P < .001, However, the addition of Pictures in 1 bigh- as opposed to low-density conditions, 
TABLE 2 ESPERA 
F $ ANALYS p 7 
3 ANALYSIS or vaic FOR CROWDING INDICES Ce ae sh i 
Source gy i =e 
ae A X Source df TaskiF Task27 
A (Density) 1 * a 
B ffateraction distance) 1 aie $ (Density) 2 1 6.06* 12.22** 
C (Pictures) i 10388 3 (Interaction distance) 1 go 4y+* 106.13"* 
AXB 1 36.736 A (Pictes) 1 48.43 33.40" 
XG 1 2 ee 11 
BXG 1 12'63%* A ae rare St 2.06 
AXBXC 1 14 1 26.67** 2334+ 
2 ` AXBXC 1 
Error 32 05 E 0 08 
K Tror 32 1.09 1.23 
SP < 0i. 
*) <.001, SP <.05. 
**p <.001 
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c! se as opposed to far interaction conditions, 
a «| picture as compared with no-picture con- 
« ions, The largest effect was for the interac- 
| on distance variable (see Table 4), The sig- 
ant Interaction Distance X Pictures in- 
ctions occurred because the addition of 
| ures to the experimental room significantly 
r uced the estimation of time in the close 
i) -raction conditions, for Task 1, F(1,32) = 
1 48, p<.001; for Task 2, F(1,32) = 
12, p < .001, but did not affect time esti- 
r tions in the far interaction conditions, for 
k 1, F <1; for Task 2, F <1. 

‘subjects were also asked: “Was there a 
) der or leaders in the group?” Korten 
( 162) has found that leadership is more 
1 ely to emerge in groups working under 
ssful conditions than in groups working 
v der less stress-producing situations. Sub- 
s in the present study reported signifi- 
tly more often that there was a leader in 
close interaction conditions than in the far 
raction conditions, F(1, 32) = 35.85, $ < 

l. 
On an additional measure of stress, subjects 
re asked to recall the names of individuals 
their group. Subjects began the experiment 
* introducing themselves to one another, but 
wing the experiment the identification tags 
orn by subjects contained only a letter. The 
number of names recalled was taken as a mea- 
ire of deindividuation (Zimbardo, 1967). It 
„as felt that the more stressful subjects found 
the experimental situation to be, the more 
they would deindividuate the group by failing 
to recall the names of other group members. 
(This measure has been related to stress and 
crowding by Stokols et al., 1973). Only two 
effects attained an acceptable level of signjfi- 
cance, Subjects recalled significantly fe'e } 
names in the close than in the far interactic 1 
condition, F(1, 32) = 35.97, p < 001. Sub- 
jects also recalled fewer names in the no-pic- 
ture than in the picture condition, F(1, 32) = 


10.78, p < .01. 


Ratings of Experimental Room 


Subjects were asked to rate how pleasant 
al room (1 = not 


they found the experiment 

at all pleasant, 10 = very pleasant) and oe 
distracted they found themselves during he 
experiment (1 = not at all distracted, 10= 


TABLE 5 


PERFORMANCE ON Worp Task: Numper or Worps 
Formep py GROUP 


Density /interaction Pictures No pictures 
Low 
Far 54.40 58.40 
Close 54.60 44.00 
High 
Far 56.80 57.00 
Close 52.80 39.00 


very distracted). The pleasant question 
yielded only a significant main effect for the 
picture variable, F (1,32) = 197.07, p < .001, 
indicating that the room was more pleasant 
with the pictures than without them. Subjects 
also reported being significantly, F(1, 32) = 
139.49, p < .001, more distracted in the pic- 
ture condition (Æ = 4.09) than in the no- 
picture condition (X = 2.44). The only other 
room effect that approached significance was 
that subjects reported being more distracted, 
F(1, 32) = 7.69, p < .05, in the close (X = 
3.45) than far interaction condition (£ = 
3.07). 


Performance 


The number of words each group was able 
to form from the master word (industriously) 
was taken to be a measure of group perform- 
ance, It was predicted that the experience of 
crowding should inhibit performance and re- 
duce the number of words formed by the 
group. As can be seen from the results pre- 
sented in Tables 5 and 6 interaction distance 
significantly affected performance as groups 
performed better in the far than close condi- 
mn 


i TABLE 6 


ANALYSIS OF VARIANCE FOR Worp TASK PERFORMANCE 
Source df R. 
OSE ee 
i 5 
A (Density) 1 AS 
B Mi distance) 1 Usen 
C (Pictures) 1 F 
AXB 1 : 
AXC 1 66), 
BXC 1 10.93 
AXBXC 1 05 
Error 32 46.79 
05. 
sth Si. 


TABLE 7 
INDIVIDUAL AND GROUP PUNITIVENESS SCORES 


Density /interaction Pictures No pictures 
Low 
Far 
ecto 2.20 2.60 
Individual average 2.62 2.80 
Close 
Grou 3.00 3.60 
Individual average 3.00 3.44 
High 
Far 
Group 2.70 2.60 
Individual average 2.78 2.76 
Close 
Group 3.20 4.00 
Individual average 3.03 3.56 


Nole, 1 = most 
punishment, 


benign or all love, 7 = most punitive or all 
tion. Density, however, did not significantly 
affect performance on this task. There was a 
significant effect for pictures due to the fact 
that groups formed more words with pictures 
than without them, The Interaction Distance 
X Pictures interaction 
main effect, since it indicates that while the 
addition of pictures to 
improved performance į 
condition, F(1, 32) = 7.95, p< 01, it did not 
affect performance in th 
dition, F < 1 (see Table 6), 


Punitiveness 


Subjects were asked to indicate which of 
seven alternative treatment programs (1 = 
most benign or all love, 7 = most Punitive or 
all punishment) they would Tecommend for 
Johnny Rocco, Subjects first made their own 
individual recommendations and then the 


Case and make 
ndividual aver- 
ing the average 
group consensus 
g agreed upon by 


age score was computed by taki 
of the individual ratings. The 
Score is the one ratin 
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effect on punitiveness, The significant 
action Distance X Pictures interaction 
group rating resulted because the addit 
pictures significantly reduced punitiven 
the close interaction condition, F(1, 
4.79, p < .05, but it did not in the far 
action condition, F < 1, (see Table 8) 
also interesting to note that the grou 
cision was less punitive than the indi 
average scores in all the far interaction «< 
tions. However, the reverse was true ii 
close interaction conditions, Thus, grou; 
cussion reduced punitiveness in the fo: 
conditions but increased punitiveness in 
latter set of conditions, 


Ratings of Other Group Members 


Subjects were asked to rate the other gr 
members on a number of dimensions, A 
measure of attraction subjects were 
“How much did you like 
group?” and they were requested to rate e 
member on a 10-point scale (1 = not frien: 
10 = friendly), An average attraction meas 


as} 


on the “like” 


TABLE 8 
ANALYSIS oF VARIANCE FOR Ixpivipvar 
NITIVENESS SCORES 


AND Group 


the people in yi ı 


ter- 
is 
de- 
1al 
li- 
he 
is- 
ier 
he 


Individua 
Source if averages Group F 
A (Density) 
y) ety 
B {Interaction distance) 1 ar 35ee Azogeee 
Ç (Pictures) 1 5.78" 14. 19s+* 
Kite 1 02 02 
Bic 1 18 Sri 
A 1 2.49 7.44%" 
XBXC 1 98 94 
Error 32 31 .06 
*? <.05. 
dD <.01. 
* D <001. 
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raction only in the close interaction condi- 
ons (see Table 10). 
Subjects rated each group member as to 
»w nervous (1=calm, 10 = nervous) he 
it that member was. Average “nervous” 
ores suggested that subjects felt other group 
»mbers were significantly more nervous in 
1e close as opposed to far interaction condi- 
ons and in the no-pictures as compared with 
\e picture conditions. Density did not signifi- 
intly affect the attributions of nervousness. 
he significant interaction resulted from the 
ictures reducing the amount of nervousness 
ı the close interaction conditions, F(1, 32) = 
5.83, p < .001, but having no significant ef- 
‘ct in the far interaction conditions, F < 1. 
Subjects rated each group member accord- 
ng to how “aggressive” he felt that member 
vas (1 = passive, 10 = aggressive). Again, 
here were significant interaction distance and 
sictures effects but no density effect on this 
ariable. And, again, the pictures significantly 
educed the attribution of aggressiveness in 
he close, F(1, 32) = 13.16, p < .001, but not 
n the far interaction conditions, F < 1, There 
was a Density X Interaction Distance inter- 
action indicating that interaction distance 


TABLE 9 
RATING or OTHER GROUP MEMBERS 


Density /interaction Picture No picture 
Low 
Far 
Attraction 8.00 7.69 
Nervous 2.59 2.60 
Aggressive 4.60 4.59 
Close 
Attraction 7.18 6.17 
Nervous 2.78 4.06 
Aggressive 4.84 6.22 
High 
Far 
Attraction 7.60 7.15 
Nervous 2.38 2,94 
Aggressive 4.77 5.34 
Close 
Attraction pE re 
Nervous 730 576 


Aggressive 
ects were asked to indi- 


Note. For the attraction variable, subjects i 
cate how much they liked the people in their gronn ee a (2 
friendly—not friendly ratings; 1 =not attract! O obee 
Z very attracted to others. For the nervous and aggressi ni 
e told to rate each member of their group 
or aggressiveness (scores are mean ratings 
r the nervous dimension, 1 = calm, 
essive dimension, 1 = passive, 10 


ables, subjects weri 
to their nervousness 
for each member); for 
= nervous; for the ager 
= aggressive. 
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TABLE 10 val 
ANALYSIS OF VARIANCE FOR RATING OF OTHER Group à 
MEMBERS 


Attraction Nervous Aggressive 
F F F: 4 


Source 
Density (A) 120.10" 5 
emean a k 
distance (B) 1 125.51" 42,35%** 25,324 
Pictures( C) 1 52.94"** 45,07%** poo 
AX B 1 02 19 9.69"" 
AXC 1 -12 3.85 AS 
BXC 1 12.4991% 21:12*** n 15.06%% 
AXBXC» 4 05 00 4,87* 
Error 32 ne AS AS 
*p < 05. 
ep < 0l, 
rep < 001. 


made a significantly greater effect on attribu- 
tion of aggressiveness in the low- than in the 
high-density condition. 


DISCUSSION Gn 


The present results are pertinent to the 
question of what spatial factors are most 
likely to lead to stress and arousal, On each 
measure, the interaction distance manipulation — 
yielded strong effects. Violations of personal — 
space led subjects to report feeling crowded — 
and uncomfortable, Subjects whose personal — 
space was violated also reported feeling that 
they spent longer in the experiment than did 
subjects whose personal space was not vio- 
lated. In addition, the violated subjects per- — 
formed more poorly on the word task and 
were more punitive toward Johnny Rocco and 
toward other members of their group. They 
also attributed more nervousness and aggres- 
siveness to other group members than did sub- 
jects in the far interaction distance condition. 
These effects were found regardless of room 
size (high or low density). Í 

On the other hand, the manipulation of 
simple density or space per subject did not 
yield clear effects. While there were reports 
that subjects felt more crowded in the high- 
as opposed to low-density conditions, this ef- 
fect was qualified by the interaction with in- 
teraction distance; that is, these results were 
found under the close interaction condition 
but not under the far interaction condition. A 
similar pattern of results was found on the 
time estimates and on the attraction for other 
group members. The density manipulation 


_ under the close int 


had no effect on the task performance, on the 
punitiveness, or on ratings of group members’ 
nervousness and aggressiveness. 

Taken together, these results offer strong 
support for the prediction that personal space 


violations are arousing and stressful, while 


conditions of high density are not necessarily 
arousing. Further, the results show that vio- 
lations of personal space can lead to a decre- 
ment in group performance and an increase in 
punitiveness, while density manipulations may 
not have these effects. The implication of 
these findings is that interaction distance and 
not density per se is the spatial variable that 
is related to crowding. 

Given that interaction distance and not 

density is the important spatial variable in 
crowding, we can also make predictions about 
what “social and personal variables” (Stokols, 
1972) will affect crowding. It is often reported 
(see Evans & Howard, 1973) that females 
tend to have smaller personal space zones 
than males, Based on this, we could predict 
that females would report feeling less crowded 
and would show fewer disruptive behaviors 
under high-density conditions than would 
males. This follows because the violation of 
their personal space would not be as severe as 
that for males, Numerous investigations have 
shown that high density increases aggressive- 
ness and inhibits performance in males but 
not in females (Freedman et al., 1972; Stokols 
et al., 1973; Epstein, Note 2), Thus, it may 
be that the personal space of males but not 
that of females was violated in these condi- 
tions. The nature of the task should also af- 
fect reports of crowding, as Hall (1966) re- 
ported that when intimate issues are discussed, 
the size of personal space decreases as com- 
pared with the case when less personal issues 
are discussed. Cultural background should also 
affect crowding, since the size of personal 
space is determined, in part, by the culture of 
the individual (Hall, 1966). 

One more point dealing with spatial factors 
is of interest: Subjects were more punitive 
toward Johnny Rocco under close as compared 
with far interaction distances. Further the 
group discussion heightened the punitiveness 
eraction distance condition. 
S result has relevance for juries where in- 
duals must decide on the fate and punish- 


Thi 
divi 


: 38 STEPHEN WORCHEL AND CHARLES TEDDLIE 


ment of a defendant. It suggests that 
closer jurors are forced to interact, the m 
punitive will be their final decision. Freed n 
et al. (1972) reported a similar finding. T} 
studied the effects of density on the decisi 
meted out by a mock jury and found t} 
harsher sentences resulted from high- as « 
posed to low-density conditions, The pres 
study, however, did not find a density eff 
for punitiveness but did find that interact 
distance determined the decision, It is p 
sible that Freedman et al, simultaneou 
manipulated interaction distance and densit 
and that their results were due to the intera: 
tion distance manipulation rather than simp 
density. 

The present results are also relevant to t} 
attribution theory of crowding presente 
earlier. It was hypothesized that crowding r 
quired both an arousal created by spatial fac 
tors and an attribution that the arousal wa 
caused by other people in the environment 
The critical spatial factor was hypothesize: 
to be violations of personal space rather thar 
a simple density measure, Accordingly, the 
spatial factors that would normally lead to 
crowding should fail to do so if the attribu 
tion process could be interfered with. In the 
present study, pictures were used as “attribu 
tion inhibitors,” and it was predicted that the 
use of pictures would decrease the experience 
of crowding and negative effects associated 
with crowding in close interaction conditions. 
The pictures should have little effect in the 
other conditions because there would be no 
arousal in those conditions, 

In the present study, the pictures interacted 
with interaction distance so that the addition 
of pictures reduced the experience of crowding 
in the close but not in the far interaction dis- 
tance conditions, Pictures also increased per- 
formance and decreased aggressiveness in the 
close but not in the far interaction distance 
conditions, Although the results do not ex- 
plicitly demonstrate the attribution process, 
these are exactly the effects that would be ex- 
pected if the pictures were inhibiting the at- 
tribution that other people in the environment 
were responsible for creating arousal, 

Stokols (1972) has argued that the experi- 
ence of crowding may be alleviated by de- 
creasing the salience of crowding cues around 
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te individual. This view is similar to the 
tribution theory of crowding if we view the 
crease of salience as inhibiting the attribu- 
n process. According to Stokols, distracting 
e€ individual should result in his paying less 
tention to his environment., This explanation 
uld suggest that distractions should inhibit 
rformance on tasks, since the individual 
uld not be paying attention to his environ- 
nt. This, however, was not the case in the 
esent study, as the addition of pictures in- 
eased performance in the close interaction 
nditions. Thus, a simple distraction ex- 
anation would not completely explain the 
sults of the present study. 
The results of the present study are sup- 
itive of the personal space-attribution view 
crowding, There are, however, some limita- 
ons to the design. First, only male subjects 
ere run because this was the only population 
at was available in sufficient numbers to 
plete the study. However, according to 
je theory, the same results should be ob- 
ined for females, although a closer interac- 
ion distance would have to be employed. 
econd, the high-density (small) room was 
arge enough to allow some movement by sub- 
jects. This size room was chosen so that in- 
teraction distance could be varied separately 
from density, Had a smaller room been used, 
a density effect should have resulted, since 
density and interaction distance would have 
been simultaneously varied. A third limitation 
involves the number of pictures used. This 
number was chosen so as not to make the 
room look cluttered. It is possible that had 
many more pictures been employed, the sub- 
ject’s environment would have become suffi- 
ciently complex that an increment rather than 
the decrement in crowding would have re- 
sulted (eg., Milgram, 1970; Ittelson, Pro- 
shansky, Rivlin, & Winkel, 1974). Obviously, 
additional research is needed to test the va- 
lidity of model of crowding. However, the 
focus on personal space variables and attribu- 
tion does account for the present data and 
allows for specific predictions in other situa- 
tions. 
REFERENCE NOTES 


1. Choldin, H. Population density and social relations. 
Paper presented at the meeting of the Population 
Association of America, Toronto, Canada, April 


1972, 


2. Epstein, Y. M. Effects of crowding on cognitive 
performance and social behavior. Unpublished 
manuscript, Rutgers University, 1973. 


REFERENCES 


Altman, I. The environment and social behavior, 
Monterey, Calif.: Brooks/Cole, 1975. 

Calhoun, B. J. A behavioral sink. In E. Bliss (Ed,), 
PRA of behavior, New York: Harper & Row, 

Christian, J. J., Flyger, V., & Davis, D. E. Factors in 
mass mortality of a herd of sika deer (Cervus 
nippon). Chesapeake Science, 1960, 1, 79-95. 

Desor, J. A. Toward a psychological theory of 
crowding. Journal of Personality and Social Psy- 
chology, 1972, 21, 79-83. 

Epstein, Y. M., & Karlin, R. A. Effects of acute ex- 
perimental crowding. Journal of Applied Social 
Psychology, 1975, 5, 34-53. 

Esser, A. H. Experiences of crowding: Tilustration 
of a paradigm for man-environment relations. 
Representative Research in Social Psychology, 1973, 
4, 207-218. 

Evans, G. W., & Howard, R. B. Personal space, Psy- 
chological Bulletin, 1973, 80, 334-344. 

Felipe, N. J., & Sommer, R. Invasions of personal 
space. Social Problems, 1966, 14, 206-214. 

Freedman, J. L, Crowding and behavior. San Fran- 
cisco, Calif.: W. H. Freeman, 1975. 

Freedman, J. L., Klevansky, S., & Ehrlich, P. R. The 
effect of crowding on human task performance. 
Journal of Applied Social Psychology, 1971, 1, 7 
25. 

Freedman, J. L., Levy, A., Buchanan, R. & Price, J. 
Crowding and human aggressiveness. Journal of 
Experimental Social Psychology, 1972, 8, 528-548, 

Griffitt, W., & Veitch, R. Hot and crowded: Influence 
of population density and temperature on inter- 
personal affective behavior. Journal of Personality 
and Social Psychology, 1971, 17, 92-98. 

Hall, E. T. The silent language. Garden City, N.Y.: 
Doubleday, 1959. 

Hall, E. T. The hidden dimension. Garden City, N.Y.: 
Doubleday, 1966. 

Ittelson, W. H., Proshansky, H. M, Rivlin, L. G, & 
Winkel, G. An introduction to environmental psy- 
chology. New York: Holt, Rinehart & Winston, 
1974. 

Korten, D. C. Situational determinants of leadership 
structure, Journal of Conflict Resolution, 1962, 6, 
222-235. 

Loo, C. The effects of spatial density on the social 
behavior of children. Journal of Applied Social 
Psychology, 1972, 2, 372-381. 

Loo, C. Important issues in researching the effects 
of crowding on humans. Representative Research 
in Social Psychology, 1973, 4, 219-226. 

Milgram, S. The experience of living in cities. Science, 
1970, 167, 1461-1468. 

Schachter, S., & Singer, J. E. Cognitive, social and 
physiological determinants of emotional state. 
Psychological Review, 1962, 69, 379-399. 


40 STEPHEN WORCHEL AND CHARLES TEDDLIE 


Schmitt, R. C. Density, delinquency, and crime in Physical, social, and personal determinants 
ti K U. 3 


y ri i wdi: Envi t and Beh 
i ii 957, 41 erception of crowding. Environmen 
pee Sociology and Social Research, 1957, 41, s TR j eg 

mi R lensity, heal and si i- i A. Residential group siz 
Schmitt, R. C. Density, health d social disorgani- Valins, S., & Baum, A. e; dı | 

I Jo rnal oF "the Aani Institute of interaction, and crowding. Environment an 

Planners, 1966 32, 38-40. havior, 1973, 5, 421-440. 

A Je) N . 


i ice: Individuatior 
Sommer, R. Personal space: The behavioral basis of Pee poet iaioe: ene at 
een Englewood Cliffs, N.J.: Prentice-Hall, 1969, ae Bae sas ald aa Tavie 
Stokols, D. On the distinction between density and a t 


i Motivation, 196 
crowding: Some implications for future research. aeoea Symposium on 
Psychological Review, 1972, 79, 275-277. 5 ; 
Stokols, D., Rall, M., Pinner, B., & Schopler, J. (Received August 2, 1975) 


Manuscripts Accepted for Publication in the 
Journal of Personality and Social Psychology 


Boys Meet Girls: Patterns of Interaction and Attribution in Heterosexual Attraction. John 
D. Cunningham (Department of Psychology, Macquarie University, North Ryde, New 
South Wales 2113, Australia). 

Duration of the Effect of Good Mood on Helping: “Footprints on the Sands of Time.” Alice 
M. Isen (Department of Psychology, University of Maryland, Baltimore County, 5401 
Wilkens Avenue, Baltimore, Maryland 21228), Margaret Clark, and Mark F, Schwartz, 

The Power of Suggestion: Another Examination of Misattribution and Insomnia. Richard R 
Bootzin (Department of Psychology, Northwestern University, Evanston, Illinois 
60201), C. Peter Herman, and Perry Nicassio. 

Elastic Shifts of Opinion: Determinants of Direction and Durability. Robert B. Cialdini 
(Department of Psychology, Arizona State University, Tempe, Arizona 85281), Alan 
Levey, C. Peter Herman, Lynn T. Kozlowski, and Richard E, Petty. 

Sex and Social Class as Determinants of Future Orientation (Time Perspective) in Adoles- 
cents. Helmut Lamm (Sozial 


wissenschaften, Universitat, 68 Mannheim, West Germany), 
Rolf W. Schmidt, and Gisela Trommsdorff, 


Some Effects of Congruency of Member Characteristics and Group Structure upon Group 
Behavior, Marvin E, Shaw (Department of Psychology, University of Florida, Gaines- 
ville, Florida 32611) and Blaze Harkey, 

Comparative Effects of the Reward Stimulus and Its Cognitive Representation in Voluntary 
Delay. Bert Moore, Walter Mischel (Departmen y. 


Stanford, California 94305), and Antonette Zeiss, 
Distinctiveness and Consensus: 


Observers’ Attributions, Ranald D. Hansen (Department of Psychology Oakl: i- 
versity, Rochester, Michigan 48063) an ect or 
What the Shadow Knows: Person Percepti 


John C. Barefoot, and Lloyd H. Strickland (Department of Psych Jni 
versity, Ottawa, Canada K1S 5B6). Š S O Eon Uni- 


(Continued on page 91) 


Several authors (Kohlberg, 1958, 1969; 1969, p. 108) 


Journal of Personalit: i 
1976, Vol. 34, No. I, pers} 


Role-Taking Standpoint and Adolescent Usage of 


Kohlberg’s Conventional Stages of Moral Reasoning 


Charles Levine 
Department of Sociology, University of Western Ontario 
London, Can À 


By taking i 7 i i 
as a aie a EBEE role-taking standpoint, this study pro- 
judgm« at Kohlberg’s conventional stages can be 
expected to vary depending upon (a) the identity of the protagonist implicated 
in the moral dilemma and (b) the nature of the issues raised by the moral 
dilemmas used. Different groups of respondents were asked to answer three 
versions of Kohlberg’s moral judgment questionnaire. One version implicated 
strangers in the dilemmas used (fictitious-other questionnaire), while the other 
two versions implicated respondents’ best friends or mothers (primary-other 
questionnaires). A comparison of these questionnaire treatments indicates that 
Stage 3 response rate significantly increases and Stage 4 response rate signifi- 
cantly decreases when respondents answer primary-other questionnaires. It is 
suggested that this finding casts doubt on the adequacy of the displacement 
view of moral stage acquisition. The displacement perspective argues that as 
more cognitively complex structures are acquired, earlier acquired structures 
are displaced. In contrast, the present study proposes that a “nondisplace- 
ment” view of stage acquisition is required in order to explain variation ob- 
served at Stage 3 when it may be assumed that a Stage 4 reasoning capability 


has been acquired. 


. However, there may be reason 


Rest, 1973; Rest, Turiel, & Kohlberg, 1969; 
Turiel, 1966) have asserted that Kohlberg’s 
stage typology (see Table 1)? represents an 
invariant developmental sequence. This im- 
plies that Kohlberg’s stages are hierarchically 
related (i.e., as children mature, they move 
through these stages displacing earlier ones 
as later, more cognitively complex stages are 
acquired). By late adolescence or early adult- 
hood, stage stabilization is thought to occur 
with North American males and females ac- 
quiring modal dispositions to judge at Stages 
4 and 3, respectively (Kohlberg & Kramer, 


This paper is based, in part, on a dissertation sub- 
mitted by the author in partial fulfillment of the 
doctoral requirements at the University of Alberta. 
The author would like to express his thanks to John 
Fox, Orrin E. Klapp, R. J. Turner, W. R. Avison, 
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Requests for reprints should be sent to Charles 
Levine, Department of Sociology, University of 
Western Ontario, London, Ontario, Canada. 

1For reasons of space, the description of Kohl- 
gy is brief. The reader can find 


berg’s stage typolo: 
the stages described in more detail elsewhere (Kohl- 


berg, 1969; Turiel, 1974). 


to argue that Kohlberg and Kramer’s Stage 4 
male and Stage 3 female finding is an artifact 
of their methodology. The questionnaire these 
authors use may impart an “artificial” sta- 
bility to the moral reasoning characteristics 
they observe. Insofar as their respondents 
were asked to respond to dilemmas implicat- 
ing fictitious persons only (e.g, Heinz), they 
were therefore denied the opportunity to dis- 
play other moral reasoning characteristics 
that may be produced by different circum- 
stances (eg, dilemmas implicating best 
friends). 

This study attempts to demonstrate that 
one can predict the relative response rates of 
Stages 4 and 3, regardless of the sex of re- 
spondents, by varying the identity of the 
protagonist implicated in the moral dilemmas 
used, Depending on this identity, one can 
expect the respondent to construct a role- 
taking standpoint or point of view that will 
encourage the use of one stage of moral rea- 
soning over another. This expectation is based 
on the assumption that several moral struc- 
tures are acquired by late adolescence and 
that the one used depends on the respondent's 
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TABLE 1 
LEVELS AND STAGES or MORAL REASONING 


Level Stage 
L L Premoral level 1. Punishment and obedience 
e orientation 


2. Naive instrumental hedon- 
ism 

3, Good-boy morality of main- 
taining good relations, 
approval of others 

4, Authority-maintaining 

morality 


II, Morality of conven- _ 
tional role conformity 


IIL, Morality of self-ac- 5. Morality of contract and of 
j cepted moral princi- democratically accepted 
ples 


law ies < 
. Morality of individual prin- 
ciples of conscience 


2 


Nole, Reprinted from “The Development of Moral Thought; 
Review and Evaluation of Kohlberg’s Approach” by W, Kur- 
tines and E, B, Grief, Psychological Bulletin, 1974, 81, 453-470, 
(Copyright 1974 by the American Psychological Association, 
_ Reprinted by permission.) 


naive definition of what he or she considers to 
be a meaningful response to any specific 
dilemma. 

Because moral dilemmas raise various is- 
sues and implicate several actors, one may 
hypothesize that these dilemmas will stimulate 
different role-taking standpoints and thus 
elicit qualitative and quantitative variation in 
moral reasoning. The direction such variation 
in reasoning is likely to take cannot be pre- 
dicted for the moment, However, hypotheses 
Specifying direction will be offered for the 
variation in reasoning produced by the two 
Standpoints with which this study is con- 
cerned: the primary-other and fictitious-other 
standpoints, 

The primary other has been operationalized 
48 respondents? best friends and mothers, per- 
sons from whom one derives a sense of iden- 
tity (Mead, 1956/1965) and with whom one 
has a stable relationship characterized by a 
fusion of personalities (Cooley, 1909/1962, p. 
23). Therefore, it may be assumed that pri 
mary-other relationships are likely to elicit 
intense empathetic and sympathetic responses 
(Cooley, p. 33), Moreover, the strong norms 
of reciprocity that are activated by these Tre- 
lationships (Gouldner, 1969, p. 171 
that they also elicit “ 
gation that is binding under all cir 
(Shibutani, 19°s p. 68). 


we 
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“general positive relationships among peop! 
(Rest, 1973, p. 94), a motive consistent wi 
empathy, sympathy, and obligation, it is p 
dicted that respondents will probably empl: 
this reasoning when judging their prima 
others. 

On the other hand, because of the assum 
greater social distance created by fictitioy 
other questionnaires (Cooley, 1909/1962, 
42), it is unlikely that they will evoke a 1 
sponse similar to the one predicted for pri 
mary-other questionnaires, It may be becaus 
of their avoidance of the primary-other stand 
point that these fictitious-other questionnaire 
(e.g., Heinz, a man, etc.) elicit the Stage <4 


orientation “toward authority, fixed rules, and 
y, 


the maintenance of social order” (Turiel 
1972, p. 14), which has been reported in the 
literature as the modal characteristic of male 
adolescent moral orientations (Haan, Smith, & 
Block, 1968; Kohlberg & Kramer, 1969). 
The following hypotheses can now be drawn 
from the preceding discussion: (a) Stage 4 
response rate observed under the primary- 
other questionnaire treatments will be signifi- 
cantly lower in comparison with the fictitious- 
other questionnaire treatment; (b) Stage 4 
response rate will vary significantly from one 
moral dilemma to another; (c) Stage 3 re- 
sponse rate observed under the primary-other 
questionnaire treatments will be significantly 
higher in comparison with the fictitious-other 
questionnaire treatment; and (d) Stage 3 re- 


sponse rate will vary significantly from one 
moral dilemma to another, 


Metuop 
Subjects 


significantly 
3 and 4. 


Questionnaire Treatments 


Rotihers's (1958) questionnaire was modified in 
order to present dilemmas in which (a) a stranger 
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of the same sex was implicated, (b) a best friend of 
the same sex was implicated, and (c) the respondent’s 
mother was implicated. The stranger treatment opera- 
tionalizes the fictitious-other standpoint, whereas the 
best friend and mother treatments operationalize the 
primary-other standpoint, Four of Kohlberg’s (1958) 
nine dilemmas were selected (Dilemmas 1, 3, 4, and 
7), and each was modified in the three ways just 
mentioned. Dilemma 1 describes a father breaking a 
promise to his son, Dilemma 3 is concerned with the 
theft of a drug, Dilemma 4 deals with the euthanasia 
problem, and Dilemma 7 concerns the deception of a 
neighbor. Partial examples of the three modified 
questionnaire versions for Dilemma 3 follow: 


1. [Stranger treatment] A woman was near death 
from a special kind of cancer, There was one drug 
that the doctors thought might save her. The drug 
was expensive to make, and the druggist was charg- 
ing $2000 for a small dose of it. The sick woman’s 
husband went to everyone he could to borrow the 
money, but he could only get together about $1000 
which was only half of what the drug cost. [The 
preceding is a male example. For female respon- 
dents, the protagonist was identified as “the sick 
man’s wife.”] 


2. [Best friend treatment] Imagine that a good 
friend of %5 was near death from a special 
kind of cancer. There was one drug that the doctors 
thought might save %s friend. The drug was 
expensive to make and the druggist was charging 
$2000 for a small dose of the drug, —— went to 
everyone he could to borrow the money, but he 
could only get together about $1000 which was 
only half of what the drug cost. [Respondents an- 
swering this treatment filled in the name of their 
best same-sex friend in all the blanks of the ques- 
tionnaire before writing their answers.] 


3. [Mother treatment] Imagine that your mother’s 
best friend was near death from a special kind of 
cancer. There was one drug that the doctors 
thought might save her, The drug was expensive 
to make, and the druggist was charging $2000 for 
a small dose of the drug. Though your mother 
went to everyone she could to borrow the money, 
she could only get together about $1000 which was 
half of what the drug cost. 


After each dilemma, respondents were asked, “Should 
(the man /___/ your mother) have done that? Was 
it right or wrong? Why?” Probe questions were not 
used, and the presentation of the four dilemmas was 
varied so that any contaminating effect due to 
dilemma order would be avoided. 


Data Collection Procedures 


A total of 680 first-year sociology students par- 
ticipated in the present study during one of their 
regular class periods. Instructors of several sociology 
classes were contacted a week in advance and were 
asked to announce the time of the intended question- 
naire administration. They told their students that 
the research being conducted was concerned with 


people’s opinions about moral dilemmas. Student co- s 
operation was solicited on a volunteer basis with no 
incentives, monetary or otherwise, offered for par- — 
ticipation. 2 

One week later the researcher presented himself to 
the various classes. Using a structured introduction, 
he informed the students that the questionnaire in 
no way represented a test but was simply an op- 
portunity for them to voice their opinions concerning 
several moral dilemmas. After assuring respondent — 
anonymity, the researcher encouraged the students — 
to write down the reasons for their answers and — 
asked them to avoid giving simple yes or no replies, 
Finally, the students were thanked for their coopera- í 
tion and were promised that they would be informed si) 
of the results of the study before their semester had 
ended. b- 

In each sociology class, male respondents were di- in 
rected to a male questionnaire stack and were in- — 
structed to take the questionnaire on the top of the 
pile. Females were instructed similarly, The three 
questionnaire treatments were randomly distributed 
in each of the two stacks, From the 680 question- 
naires collected, three questionnaire treatment sub- 
samples were randomly drawn, each consisting of 50 
males and 50 females. Thus, 100 completed question- 
naires for each of the three treatments were ran- 
domly drawn, yielding a total sample size of 300 for 
this analysis. 


x 
Scoring Procedures and Reliability à 
Percent-response scores for Stages 3 and 4 were 


$ 
3 
§ 
3 
% 


computed for the total questionnaire as well as for 
each of the four dilemmas, These scores were de- 
rived by using the sentence-coding scoring technique, 
which requires the scoring of each thought content 
unit that is given in a response (Kramer, 1968; 
Kurtines & Grief, 1974, p. 455). Kramer (1968, Pe 
25) reports that this scoring technique correlates in 
the high 90s with the original global rating tech- J 
niques used by Kohlberg (1958). 

The proportion of agreements between the twò 
coders employed was computed (Robinson, 1987). 
Of the 3,497 ratings made at the conventional stages, 
the coders were in agreement 3,118 times, yielding a 


coefficient of agreement of .89. 
RESULTS 


For each dependent variable 
and Stage 3 response rate) a 2x 3 X 4 (Sex 
X Treatment X Dilemma) analysis of vari- 
ance was computed. The first hypothesis pre- 
dicted that Stage 4 response rate would be 
significantly lower for the best friend and 
mother treatments. The results of the analysis 
indicated a statistically significant main effect 
produced by variation in questionnaire treat- 
ment, F(2, 294) = 13.10, p < .001, with the 


(i.e. Stage 4 


2 Complete versions of the questionnaire can be 


obtained from the author. 


TABLE 2 


MEAN Percent RESPONSE AT STAGES 4 AND 3 BY 
QUESTIONNAIRE TREATMENT 


Treatment means Stage 4 Stage 3 
Stranger (ST) 36.14 43.72 
Best friend (BFT) 23.76 58.65 
Mother (MT) 28.44 56.73 

Comparison by ¢ test® 

ST-BFT 3.58* 4.04* 
ST-MT 2.224% 3.51* 
BFT-MT 1.36 .52 


* Figures listed in this section are ¢ values, 
* p < .001, one-tailed test. 
k p < .025, one-tailed test, 


= mean Tesponse rates for each treatment ex- 
hibiting the expected pattern (see Table 2), 
The main effect produced by variation in 
moral dilemma (Hypothesis b) is also sta- 
tistically significant, F(3, 882) = 3.78, p< 
O18 

The third hypothesis predicted that Stage 3 

response rate would be significantly higher for 

mother treatments, As 

questionnaire treatment 

tically significant main 


y Variation in moral dilemma (Hy- 


) is also Statistically significant for 


Sta 
ial 3 response Tate, F(3, 882) = 6.79, p< 


for Stage 4 res 
ex, X Dilemma g ae 


= 3.09, $ <.05. An examination of 


rate, a § 


higher rate for Dilemma 
Dilemma 4 (p < 05). A Sex 


teraction was also observed for Stage 3 re- 


sponse rate, F (3, 882) = 411, $< 01, with 
females responding at a igher rate for Di- 


lemma 1 (p< .001) and Dil 
:05). Males responded at a 
Dilemma 3 (p < .05). 

For Stage 3 response rate, a Significant Sex 
X Treatment interaction was noted, F(2, 294) 
= 7.04, p < .001. The signi 


emma 7 (p< 
higher rate for 


ficant comparison 
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was observed within the stranger treatmen 
with females responding at a higher rate tha 
males (p < .05). This latter finding is consi 
tent with the performance of females in othe 
samples that have been studied (Haan et al 
1968; Kohlberg & Kramer, 1969). 


x Discussion 
The results of this study lend support t 
the proposition that Stage 3 moral reasonin 
is more likely to be employed and Stage 4 rea 
soning is less likely to be employed when re 
spondents judge dilemmas implicating their 


primary others. The present research also 


demonstrates that different moral dilemmas 
function to increase or decrease response rate 
coded at Kohlberg’s conventional stages. 
Previous research provides one with little 
help in explaining the Sex x Moral Dilemma 
interactions observed for Stage 4 and Stage 3 
response rate. To my knowledge, no investiga 
tions using the Kohlberg instrument report a 
Statistically significant effect due either to 
variation in moral dilemma or to a Sex x 
Moral Dilemma interaction.’ On the contrary, 
Kohlberg (1958, 1968) and Kramer (1968) 
Teport that the quality of moral judgments 
does not vary in significant fashion across the 
dilemmas used in their research. One perspec- 
tive which may help to explain both of the 
observed Sex X Dilemma and Sex X Treat- 
ment interactions is the social involvement 
thesis (Kohlberg & Kramer, 1969, p. 108), 
This perspective views differential involve- 
ment in different roles as the basis for ob- 
Served sex differences and Suggests that ado- 
lescent sex differences in moral reasoning need 
not be considered particularly stable, Conse- 
quently, one would Predict that as the modal 
role involvements of males and females 
change, so do their respective moral reason- 


ing processes, What may be required, then, to 


3 Of the six Possible dilem 
4 response rate, three 
comparisons between 
to be significant, 


: © comparisons between Dilemmas 1 and 4 and 
Dilemmas 3 and 7 fail to be significant. The other 
sons are significant (p < 01). 


ma comparisons for Stage 
are significant (p < 05). The 
Ds, Di-D;, and De-D; fail 


tionnaire in their Tesearch, 
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explain both the Sex x Dilemma and Sex x 
Treatment interactions is further research that 
specifies the relationship between (a) modal 
role-playing categories, (b) stages of moral 
reasoning, and (c) specific moral dilemmas.° 

In conclusion, the present research has been 
generated by a theoretical orientation that 
conceptualizes the moral reasoning process as 
the active construction of a meaningful re- 
sponse from the respondent’s point of view. 
Therefore, it was assumed that the specific 
nature of the social context within which 
moral judgments are made plays an important 
role in the moral reasoning process. By vary- 
ing the identity of the protagonist implicated 
in the dilemma and by varying the content 
of the dilemma itself, it was hypothesized that 
variation in response rate coded at Kohlberg’s 
conventional stages would be observed. The 
analyses conducted provided support for this 
orientation. 

When judgmental variation has been re- 
ported elsewhere, it has typically been ex- 
plained by reliance on the theory of stage 
mixture (Turiel, 1969). The stage-mixture 
perspective, based on the stage-displacement 
model (Kohlberg, 1958, 1969; Rest, 1973; 
Turiel, 1974), asserts that the observed varia- 
tion in response rate is a manifestation of a 
respondent’s current effort to acquire a more 
differentiated moral structure. However, can 
such an explanation be deemed acceptable 
when attempting to explain judgmental vari- 
ation, at Stage 3 for example, on the part of 
respondents who have already acquired a 
Stage 4 ability to reason? It is this problem 
which the present research seems to have un- 
covered, given that the mean age of the sam- 
ple studied was 18.6 years. 

For the above reason, the present study has 
not relied on the stage-displacement model to 
explain the observed variation in conventional 
stage response rate. Rather, it has been sug- 
gested that the acquisition of more, differ- 
entiated stages need not imply the displace- 
ment of less differentiated ones. It may be 


that less differentiated structures, conceptual- 
d entities or as momen- 


ized either as isolate 

tarily stimulated components of more complex 
structures, can always be used; their usage 
depends on the respondent’s naive definition 
of what is an appropriate response to any par- 
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ticular moral dilemma, This “nondisplace- 
ment” thesis is the perspective that I have 
adopted. 

It may be true that the stage-displacement 
thesis is consistent with the cognitive-develop- 
mental paradigm when it asserts that more 
differentiated structures are formed through 
the synthesis of less differentiated structures. 
However, the stage-displacement thesis is 
problematic if it also asserts that the develop- 
ment and existence of higher order structures 
dictates, by definition, their usage. If this is 
the case, the stage-displacement thesis will 
have become incapable of explaining judgmen- 
tal variation coded at lower order stages under 
situations where it may be assumed that more 
advanced reasoning processes could have been 
used, If cognitive-developmental theory is to 
explain such variation, it may have to accept 
a distinction between the process by which 
moral stages are acquired and the processes 
dictating how and when moral stages are used. 
The nondisplacement thesis may be a neces- 
sary component in explaining these processes. 


6 Rest (Note 1) has offered a similar suggestion, 


REFERENCE NOTE 


1, Rest, J. New options in assessing moral judgment 
and criteria for evaluating validity. Paper pre- 
sented at the Society for Research in Child De- 
velopment Convention, Denver, April 10, 1975. 
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Effects of Group Size and Proximity Under Cooperative 
and Competitive Conditions 


John J. Seta, Paul B. Paulus, and Janette K. Schkade 
University of Texas at Arlington 


Research on coaction has concentrated on the arousal-inducing properties of 
others while practically ignoring the possibility that others may be a source 
of arousal reduction. The results of the present study support the notion that 
other coactors can be a source of arousal reduction as well as induction, as 
reflected in their performance on novel tasks. One experiment varied group 
size (two or four) and instructional set (competition or cooperation), while 
a second experiment varied proximity (close or far) and instructional set (com- 


petition or cooperation). 


In both experiments, a significant interaction was 


obtained. In the first experiment, subjects in the cooperative condition per- 
formed better in groups of four than in groups of two, while the reverse was 
obtained in the competitive condition. In the second experiment, subjects in 
the cooperative condition performed better when seated close as opposed to 
far apart, while opposite results were obtained in the competitive condition. 


Past research has demonstrated that an 
individual’s task performance can be affected 
by the mere presence of others (see Cottrell, 
1972, for a review). This research has in- 
volved the use of two paradigms, audience 
and coaction. In the audience paradigm, 
individuals perform a task in the presence 
of observers, while in the coaction para- 
digm a group of individuals concurrently and 
individually perform an identical task. 

Zajonc (1965) has argued that the mere 
presence of others increases an individual's 
general level of arousal which in turn increases 
the tendency to emit dominant responses. If 
the dominant responses are correct (as is pre- 
sumably the case for well-learned tasks), 
heightened arousal results in improved per- 
formance by an individual, while if the domi- 
nant responses are incorrect (as is presumably 
the case for tasks that are not well-learned), 
arousal results in impaired performance. 


This research was partially supported by a grant 
from the Organized Research Fund of the University 
of Texas at Arlington. 3 

Robert W. Matthews’s and B. B. Bruno’s assist- 
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at the meeting of the Southwestern Psychological 
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Requests for reprints should be sent to John J. 
Seta, Department of Psychology, University of Texas 
at Arlington, Arlington, Texas 76019. 
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Cottrell (1972) had modified Zajonc’s con- 
ceptualization by arguing that mere presence 
is not a sufficient condition for increased 
arousal. He proposes that the presence of 
others has nondirective energizing effects only 
if the presence of others creates anticipations 
of positive or negative outcomes. He assumes 
that at birth others are motivationally neu- 
tral; that neither increase nor decrease gen- 
eralized drive or arousal. However, various 
rewarding and punishing events can become 
associated with the presence of other individu- 
als so that others gradually lose their neutral 
qualities and become a sufficient stimulus for 
increasing the general level of arousal, Cottrell 
(1972) states that, 

Through experience the individual learns to anticipate 
subsequent positive or negative outcomes whenever 


others are merely present and not overtly doing any- 
thing that has motivational significance for him. 


(p. 227) 


It is these anticipations elicited by the pres- 
ence of others that supposedly increase the 
individual’s drive level. This hypothesis was 
supported in studies by Paulus and Murdoch 
(1971) and Henchy and Glass (1968). 
Cottrell’s learned drive hypothesis suggests 
that the presence of others can be a learned 
source of increased arousal but ignores the 
possibility that others may be a source of 
arousal reduction. In certain situations indi- 
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viduals may have associated the presence of 
others with a reduction in vulnerability to an 
aversive situation or outcome and conse- 
quently a source of arousal reduction. This 
Suggestion is supported by a series of studies 
on affiliation (Amoroso & Walters, 1969; 

Schachter, 1959; Wrightsman, 1960) which 
have demonstrated that individuals exposed to 
aversive stimuli (e.g., threat of shock) are 
motivated to be with others who are also ex- 
posed to these same aversive stimuli, Further- 
more, if individuals are given an Opportunity 
to wait with other persons expecting the same 
potential aversive experience, they demon- 
Strate a reduction in their level of fear or 
arousal compared to other individuals not 
given such an opportunity (Amoroso & 
Walters, 1969; Wrightsman, 1960). 

Since no one has demonstrated that others 
can be a potential source of arousal reduction 
in a coactive setting (see Epley, 1974, for a 
review), the first experiment was designed to 
investigate the effects of group size under 
conditions in which others were either a poten- 
tial source of arousal induction or reduction. 

In one condition, subjects were told that 
the experimenter was interested in their per- 
formance relative to each other (competitive 
condition), This competitive condition ap- 
proximates the type of situation that is either 
implicitly or explicitly employed in coaction 
research (eg., Carment, 1970; Martens & 
Landers, 1972; Wankel, 1972). In this con- 
dition, competitors should be potentially aver- 
sive stimuli in that they signal potential nega- 
tive outcomes (e.g., embarrassment, loss of 
self-esteem) and should produce increased 
arousal. Therefore, increasing the number of 
competitors in a group should result in 
increasing levels of arousal, 

Tn a second condition, individuals were 
8iven a cooperative instructional set. They 
were told that the experimenter was not inter- 
ested in their individual Scores, but only in 
the collective score of the group relative to 
three other groups. Such a condition should 
be Stressful, but in this case other individuals 
in the group should be perceived as a source 
of Potentially positive outcomes and hence 
their Presence should be arousal reducing. 
That is, in this type of condition, others may 


diffuse the potential responsibility of failure. 
The larger the number of individuals in such 
a group, the greater the diffusion of responsi- 
bility and the greater the reduction in arousal, 

The first experiment reported employed a 
2 X 2 factorial design manipulating instruc- 
tional set (competition or cooperation) and 
group size (two or four members). Since the 
task was a novel learning task, it was ex- 
pected that in the competitive condition the 
large group of coactors would perform worse 
than the small group due to their relatively 
higher state of arousal. On the other hand, in 
the cooperative condition the large group of 
coactors should presumably perform better 
than the small group due to their relatively 
lower state of arousal. 

In sum, an instructional set by group 
size interaction is predicted. The interaction 
should negate any main effects due to group 
size, while the general level of arousal pro- 
duced by the two instructional sets cannot be 
foreseen since both competitive and coopera- 
tive situations should involve some degree 
of stress. 


EXPERIMENT 1 
Method 


Subjects. Sixty-three introductory psychology stu- 
dents were employed as subjects, Subjects arrived 
at the experiment in same-sex groups of 2 or 4, 
with a total of 24 groups participating in the experi- 
ment. Upon their arrival, the subjects were randomly 
assigned to one of the four experimental conditions. 
With one exception, all conditions contained 16 sub- 
jects, 8 males and 8 females. The competitive large 
group condition consisted of 15 subjects because 1 
group consisted of only 3 female subjects and 1 fe- 
male confederate because of 1 subject who failed 
to appear. 

Procedure. The task employed was a list consisting 
of 20 words which represented cards in a normal 
deck of playing cards. Across the top of the list 
were the names of the four suits of cards (hearts, 
diamonds, spades, and clubs) while beneath each of 
the suits were words representing cards within that 
suit. The subjects were given 1 minute to memorize 
the list of words and suit names followed by 2 
minutes to recall as many of the words as they could 
remember. During recall the subjects were told to 
write the four suits of cards across the top of a 
blank piece of paper and to write down beneath 
each suit as many of the 20 words as they could 
temember. They could recall the words in any order 
as long as they put the correct words under the 
Correct ‘suit. Since there were only 20 words, not 
counting the four suits of cards, the subjects were 


COOPERATION AND COMPETITION 


TABLE 1 


Mean NUMBER or Worps RECALLED AS A 
Function OF INSTRUCTIONAL SET 
AND GROUP SIZE 


Instructional set 


Group size Cooperation Competition 
2 8.06 10.56 
9.13 


4 10.00 


told not to write more than 20 words. Geen (1971) 
has successfully used a similar type of memory task 
in an audience paradigm. 

In the competitive condition, subjects were told 
that the experimenter was interested in how each of 
them did relative to each other. In the cooperative 
condition, subjects were told that the experimenter 
was concerned with how the group (either two or 
four subjects) did collectively. They were told that 
their group score was to be compared to three other 
groups. Although the experimenter explained that he 
would see the subject’s individual scores, the experi- 
menter stressed that he was only interested in their 
collective score as a group. 


Results and Discussion 


The means of the four experimental condi- 
tions are shown in Table 1. An analysis of 
variance revealed a significant Instructional 
Set X Group Size interaction, F(1, 59) = 
4.08, p < .05, while neither of the main ef- 
fects reached statistical significance (F < 1). 
Although neither of the two specific compari- 
sons within the cooperative and competitive 
conditions reached significance, one can see 
‘from Table 1 that the form of the interaction 
indicates that within the cooperative condi- 
tion the large groups subjects recalled more 
words than those in the small group, while in 
the competitive condition the small groups 
subjects recalled more words than those in the 
large group. The results suggest that on a 
novel learning task, subjects can perform 
better in large groups than in small groups 
when cooperating but better in small as op- 
posed to large groups when competing. These 
results support the hypothesis that conspe- 
cifics can be a stimulus for positive as well as 
negative outcomes and subsequently can either 
increase or decrease arousal.* 

Beside the implications of the previous ex- 
periment for the coaction and affiliation lit- 
erature, it can also be considered a study of 
crowding. Crowding has been assumed to pro- 
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duce negative effects and reactions in humans 
similar to those found in animal studies (e.g., 
Calhoun, 1962; Christian, 1970). Several lab- 
oratory and field studies have also provided 
evidence that crowding (i.e., increase in group 
size or density) can produce negative psycho- 
logical effects on humans (e.g., Dabbs, 1971; 
Freedman, Levy, Buchanan, & Price, 1972; 
Griffitt & Veitch, 1971; Paulus, Cox, McCain, 
& Chandler, 1975; Smith & Haythorn, 1972; 
Stokols, Rall, Pinner, & Schopler, 1973). 
However, some theorists (Jacobs, 1961; 
Le Corbusier, 1933; Soleri, 1969; Tange, 
1970), without specifying the underlying psy- 
chological processes, have speculated that 
crowding may have positive effects. Indeed 
there is evidence that crowding may have a 
positive effect (MacDonald & Oden, 1973; 
Valins & Baum, 1973) or no effect (Freed- 
man, Klevansky, & Ehrlich, 1971). Calhoun 
(1962) reported differences in the effects of 
conspecifics due to different contexts. He 
found no indication for “deviant behaviors” 
within groups having stable dominance hier- 


1 An alternative explanation of the above results 
could be that the interaction is due to differential 
effort between the conditions. That is, the individu- 
als in the competitive small group may have written 
more words than those in the competitive large 
group, while the cooperative large group subjects 
may have written more words than those in the 
cooperative small group. An analysis of vari- 
ance was performed on the total number of words 
written in the four experimental conditions, This 
analysis indicated that there was no significant 
difference in the total number of words written 
in the four conditions (F <1), A second alterna- 
tive explanation could be that the interaction is 
due to differential guessing between the conditions. 
This possibility was tested by an analysis of variance 
on error scores, with an error defined as an incorrect 
response. The only result approaching significance 
was the interaction between instructional set and 
group size, F(1, 59) = 3.70, p < .07. Within the com- 
petitive condition, there were more errors produced 
by the competitive large group than the competitive 
small group condition, while within the cooperative 
condition there were more errors produced by the 
cooperative small group as compared to the coopera- 
tive large group condition. This result negates dif- 
ferential guessing as a viable explanation since the 
interaction indicates that the subjects who recalled 
the most words also made the fewest errors, In addi- 
tion, this interaction indicates that a more suitable 
task for future research would be one in which 
guessing is minimized. 
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archies while finding strong evidence for 
“deviant behaviors” within groups not having 
stable hierarchies. Valins and Baum (1973) 
found that students who had lived in a 
crowded dorm subsequently performed signifi- 
cantly worse on cooperative tasks and sig- 
nificantly better on competitive tasks than 
those students who had lived in less crowded 
conditions. In other words, nonexcessive 
crowding had both positive and negative ef- 
fects, depending on the task. 

MacDonald and Oden (1973) reported that 
crowding in a dormitory setting increased at- 
traction between marital partners. There was 
a considerable amount of stress in this situa- 
tion since five couples were assigned to one 
room and all activities had to be performed in 
the potential view of all others. Since there 
was stress in the situation, the partners pos- 
sibly cooperated with their spouse to reduce 
the stress and their cooperation may have 
resulted in increased attraction. In this con- 
text, one could assume that the other spouse 
was both a source of positive outcomes and a 
source of arousal reduction. The results of 
the above studies indicate that the effects of 
crowding may depend upon the context of the 
group. 

The second study attempted to determine 
whether the results of the first study general- 
ized to a different task and to another experi- 
mental manipulation generally employed by 
researchers in crowding studies. A 2 X 2 fac- 
torial design was employed varying instruc- 
tional set (competition or cooperation) and 
proximity (close or far). In a coaction set- 
ting, arousal should increase as the distance 
between competing conspecifics decreases. As 
previously mentioned, competitors may be 
perceived as potential sources of aversive out- 
comes (i.e., aversive stimuli). As the distance 
between competitors decreases, the salience 
and intensity of these aversive stimuli should 
increase with a resulting increase in arousal. 
Hence, since the task which is employed was 
not well-learned, competitors seated far from 
other competitors (far condition) may per- 
form better than those more closely spaced 
(close condition). 


In contrast, arousal may decrease as the 
distance between cooperating conspecifics de- 


creases. For example, one often observes that +7 


in some uncertain or stressful situations (e.g., 
war, natural catastrophies) individuals desire 
to be close to their friends or loved ones (co- 
operative others). In these situations, indi- 
viduals may desire close physical distances, 
possibly for the arousal-reducing effects of 
such closeness. Therefore, in a coaction set- 
ting in which there is within-group coopera- 
tion and between-group competition, indi- 
viduals seated close to cooperative others may 
be less aroused and this low level of arousal 
should result in superior learning perform- 
ance. In sum, an Instructional Set X Proxim- 
ity interaction is predicted. The interaction 
should negate any effects due to proximity 
while the general level of arousal produced 
by the two instructional sets cannot be fore- 
seen since both competitive and cooperative 
conditions should involve some degree of 
stress. 


EXPERIMENT 2 
Method 


Subjects. One hundred thirty-two introductory 
psychology students were employed as subjects. Sub- 
jects arrived at the experiment in mixed-sex groups 
of 4. Upon their arrival, the subjects were randomly 
assigned to one of the four experimental conditions. 

Experimental design. A 2 X 2 factorial design vary- 
ing instructional set (competition or cooperation) 
and proximity (close or far) was employed. The sub- 
jects in the far group condition were spaced 5 feet 
(1.5 m) from the nearest conspecific, while the sub- 
jects in the close group condition were spaced 2 feet 
(.6 m) from the nearest conspecific. 

Task and procedure. The task used was a “multi- 
ple-U” maze machined in Plexiglas similar to that 
employed by Hunt and Hillery (1973). On this task, 
the starting point is located near the bottom edge of 
the maze and the goal near the top. The maze has 
12 levels with two alternatives at each level and is 
geometrically balanced, requiring as many left turns 
at the decision points as right turns in order to 
trace it correctly. Located at each cul-de-sac and at 
the goal is an electrical contact. If the subject touches 
a cul-de-sac contact with his stylus, the red light 
signals in front of him to indicate he has made an 
error and a cumulative counter in the experimenter’s 
room is activated. When the contact at the goal is 
touched, white lights on the subject’s apparatus and 
on the experimenter’s desk in the adjacent room 
signal that the subject has completed a trial, After 
completing each trial, the subject immediately starts 
on the next trial. A raised block is located at the 
starting point, enabling the subject to begin each 
trial without looking at the maze and without re- 
ceiving assistance simply by finding the block. 
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The mazes were housed in individual plywood 
boxes with an opening at the bottom of the front 
panel, which allowed access of the subject's hand 
and arm for tracing and prevented subjects from 
viewing the maze during their performance. The 
maze was affixed to the back panel of the box and 
was perpendicular to the top of the table. The signal 
lights were mounted on top of the plywood boxes, 2 
inches (5.1 cm) from the front edge. The lights 
were covered by a box with an open side facing the 
subject and were not visible to other subjects. 

In the experiment, each group of subjects was 
taken en masse from the waiting room to the experi- 
mental room. The groups of four subjects (two 
male and two female) were seated at the diagonal 
points of an imaginary square, facing the center of 
the square. Once they were seated the experimenter 
read the task and the task-set instructions. The task- 
set instructions employed were identical to that of 
Experiment 1. When all questions were answered, 
the experimenter left the room to monitor the equip- 
ment in the adjacent room and to record the data 
with the aid of several assistants. 

The subjects were instructed to continue until told 
to stop. When the slowest person in the group 
finished 10 trials, the maze performance was termi- 
nated, Each time the subject reached the goal, a 
trial was completed. The first trial of each subject 
was not included in the data analysis, since without 
prior experience performance on this trial is based 
on chance. The number of errors made by each 
subject on each of the next nine trials was recorded 
and the total number of errors made by each subject 
on these nine trials was used as the dependent mea- 
sure. Consistent with Hunt and Hillery (1973), the 
maximum number of errors allowed on any given 
trial was 12. This criterion is reasonable since there 
was a total number of 12 correct alleys, and unless 
the subject made the same error twice, only 12 
errors could be made.? 


Results 


The mean number of errors for the nine 
trials for each of the experimental conditions 
are contained in Table 2. As can be seen in 
Table 2, the predicted interaction of instruc- 
tional set and proximity was obtained, F(1, 
128) = 4.30, p < .05. Although neither of the 
two specific comparisons within the coopera- 
tive and competitive conditions reached sig- 
nificance, one can see from Table 2 that the 
form of the interaction indicates that in the 
cooperative condition the widely spaced sub- 
jects performed more poorly than the closely 
spaced subjects, while in the competitive con- 
dition the far subjects performed better than 
the closely spaced subjects. The main effects 
did not reach statistical significance (F < 1). 


TABLE 2 


MEAN NUMBER or Errors AS A FUNCTION oF 
INSTRUCTIONAL SET AND PROXIMITY 


Instructional set 


Proximity Cooperation Competition 
Close 34.67 38.03 
Far 39.85 


34,67 


GENERAL Discussion 


Experiments 1 and 2 have demonstrated 
that during coaction, group size and proxim- 
ity interact with instructional set to determine 
the quality of task performance, The first 
experiment found a significant interaction of 
instructional set and groups size, while in the 
second experiment a significant Instructional 
Set X Proximity interaction was obtained. 
The present results partially support Cottrell’s 
(1972) learned drive position of social facili- 
tation in that the degree of arousal produced 
by other conspecifics appears to depend on the 
potential positive or negative consequences 
that an individual has learned can be derived 
from those conspecifics. In addition, the pres- 
ent results extend the implications of Cot- 
trell’s position. According to Cottrell, conspe- 
cifics are seen as a potential source of arousal 
induction since they can be stimuli signaling 
potential positive and/or negative conse- 
quences. 

However, we propose that conspecifics can 
be a potential source of arousal reduction as 
well as induction. Others can potentially be a 
source of arousal reduction when their pres- 
ence reduces the intensity or the probability 
of an aversive outcome or increases the inten- 
sity or probability of a rewarding outcome, 
This conceptualization also predicts that un- 
der certain conditions, large groups of com- 
petitive coactors might perform better on a 
novel learning task than a smaller group of 
competitive coactors. For example, let Us as- 
sume that coactors were told that the indi- 
vidual who obtained the group’s worst score 


2 Although Hunt and Hillery (1973) in using this 
particular task found that coaction subjects per- 
formed better than alone subjects, several studies in 
our laboratory as well as a subsequent study in the 
Akron laboratory (Byers, Note 1) have found that 
under competitive conditions, small groups perform 
better than large groups in this particular task. 
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would be penalized and that none of the other 
individual scores would be either revealed or 
penalized. In this case, coactors performing in 
‘a large group should be less aroused than 
those performing in a smaller group since they 
have a lower probability of being penalized 
and a higher probability of escaping the 
aversive outcome than those coactors perform- 
ing in the smaller group. Since the large 
group of coactors would be less aroused than 
the small group, they should perform better 
on a novel learning task. 

The intent of most researchers in crowding 
has been to document the long-suspected no- 
_ tion that crowding can be aversive. The re- 
sults of the present study suggest that it is 
important for researchers not only to specify 
the aversive effects of crowding but also to 
identify those variables that lessen these 
aversive effects or even identify those varia- 
bles that could make nonexcessive crowding 
pleasant. By being cognizant of the fact that 
others can be both rewarding and punishing — 
a source of arousal induction or reduction— 
one could arrange variables in such a way as 
to make an otherwise aversive situation plea- 
sant. For example, in some situations it may 
be impossible to reduce stress by reducing the 
number of people living in a particular dormi- 
tory, jail cell, or barracks, but it may be 
possible to reduce the stress by inducing co- 
operation among the individuals living in that 
particular area. However, cooperation is but 
one potential way of reducing the stress of 
crowding. Since it has been shown that under 
certain conditions similar others tend to like 
each other more than nonsimilar others (i.e, 
similar others may be a more positive stimu- 
lus; cf. Byrne, 1971), individuals that are 
confined with similar others may feel less 
threatened and hence less aroused than if 
they were confined with others of dissimilar 
attitudes, beliefs, or abilities, 


REFERENCE NOTE 


1. Byers, P. J, Sex differences and evaluation poten- 

- tial as determinants of social facilitation effects in 
@ cognitive task. Unpublished master’s thesis, Uni- 
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Fear Appeals and Attitude Change: Effects of a Threat’s 
Noxiousness, Probability of Occurrence, and the 
Efficacy of Coping Responses 


Ronald W. Rogers and C. Ronald Mewborn 
University of South Carolina 


Three factorial experiments examined the persuasive effects of the noxiousness 
of a threatened event, its probability of occurrence, and the efficacy of recom- 
mended protective measures. A total of 176 students participated in separate 
studies on the topics of cigarette smoking, driving safety, and venereal disease. 
The results disclosed that increments in the efficacy variable increased inten- 
tions to adopt the recommended practices. Interaction effects revealed that 
when the preventive practices were effective, increments in the noxiousness and 
probability variables facilitated attitude change; however, when the coping 
responses were ineffective, increments in noxiousness and probability either had 
no effect or a deleterious effect, respectively. These interaction effects were 
discussed in terms of a defensive avoidance hypothesis, the crucial component 
of which was an inability to ward off the danger. Furthermore, the effect of 
the emotion of fear upon intentions was found to be mediated by the cognitive 
appraisal of severity of the threat. Finally, similarities with and extensions of 


previous studies were reviewed. 


In one of the earliest theoretical analyses of 
fear arousal and persuasion (Hovland, Janis, & 
Kelley, 1953), fear appeals were characterized 
as communications describing the unfavorable 
consequences that might result from failure to 
adopt the communicator’s recommendations. 
This definition was sufficiently sweeping to 
allow fear-arousing communications to be op- 
erationalized in a variety of ways. For ex- 
ample, fear has been manipulated in some 
studies by presenting information on the 
amount of bodily injury and the likelihood of 
exposure at each level of the manipulation 
(e.g., Chu, 1966), by omitting the latter in- 
formation at one level of the manipulation 
(e.g., Janis & Feshbach, 1953), and by omit- 
ting the latter information entirely (e.g., 
Rogers & Thistlethwaite, 1970). Investigators 
have been aware of different communication 
factors but have pursued other issues, not yet 
examining these factors systematically. If 
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fear-arousing communications are multifac- 
eted stimuli, then their persuasive impact may 
be due to any one or more of the components. 
Thus, the effective content stimuli that pro- 
duce attitude change may not have been 
firmly established. 

If studies have varied different types of 
communication content, then it would be dif- 
ficult to compare experiments and to de- 
termine the communication variables produc- 
ing the theoretically relevant changes in at- 
titudes. We believe that conceptualizations of 
fear appeals have been too global and that 
they now must be refined if more precise and 
unequivocal relations are to be generated. 
An important conceptual and empirical task 
is to identify the effective content variables 
and their associated mediational processes. 
Therefore, the major purpose of the present 
series of three experiments was to investigate 
some of the more important components of a 
fear appeal. 

Rogers (1975) has proposed that an ex- 
pectancy model, which includes all of the fac- 
tors of concern to investigators on fear com- 
munication and to workers with the health 
belief model (e.g., Hochbaum, 1958; Rosen- 
stock, 1974), be applied to the fear com- 
munication problem in a more systematic 


manner. ‘The three most crucial variables in a 
fear appeal are (a) the magnitude of noxious- 
ness of a depicted event, (b) the conditional 
probability that the event will occur provided 

| no adaptive activity is performed, and (c) the 

_ effectiveness of a coping response that might 

© avert the noxious event. Several investigators 
«have recognized similar variables, among a 
g „host of others, but the relationships among 
the components and the mediational processes 
differ from those to be described subsequently. 
Most importantly, these communication vari- 
ables have not been simultaneously varied. 

The noxiousness and probability content 
variables may be derived from Hovland et al.’s 
(1953) discussion of “potential dangers,” and 
the efficacy variable is very similar to their 
concept of “reassurance.” Limiting compo- 
nents of fear appeals to these three variables 
is attractive not only because they seem to be 
pivotal concepts in Hoyland et al.’s analysis 
but also because this differentiation yields con- 
cts similar to those in the more general 
gory of expectancy—value theories. Ac- 
ng to this class of theory, the tendency 
to perform a particular act is a function of the 
expectancy that the act will be followed by 
certain consequences and the value of those 
consequences, Feather (1959) identified psy- 
chologists in diverse areas who have used this 
type of theory (e.g., Lewin, 1938; Tolman, 
1932). Within the field of social psychology, 
expectancy—value formulations have been ap- 
plied to the structure of attitudes (Rosenberg, 
1956), prediction of behavior from self-report 
measures (Fishbein, 1967), and persuasion in 
the health education field (Hochbaum, 1958). 
Hopefully, progress in understanding the fear 
communication problem will be facilitated by 
a theoretically based classification schema that 
is linked to more general psychological the- 
ories. 

Since the theory to be investigated, termed 
protection motivation, has been presented in 
detail elsewhere (see Rogers, 1975), only the 
most salient features are reviewed here. It is 
| assumed that each of the three components of 
a fear appeal initiates a corresponding cogni- 
_ tive mediating process. Each of these processes 
__ appraises communication information about 
(a) noxiousness, (b) probability, or (c) effi- 
cacy by placing each stimulus on dimensions 
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of (a) appraised severity of the depicted 
event, (b) expectancy of exposure to the 
event, or (c) belief in the efficacy of the rec- 
ommended coping response, respectively. It 
will be taken as a working hypothesis that 
these cognitive processes are independent, 
Each of these appraisal processes will be 
roughly proportional to the strength of the 
associated message variable. The representa- 
tion will not be exact, since individuals have 
different styles of appraising threatening 
events (cf. Lazarus, 1968). 

It should be noted that appraised severity 
is a cognitive mediator distinct from fear. This 
position is similar to Leventhal’s (1970) im- 
portant distinction between danger and fear 
control processes, There are several adyan- 
tages of protection motivation theory’s em- 
phasis upon cognitive mediational processes 
rather than an emotional state of fear that 
functions as an acquired drive, First, with 
respect to instrumental avoidance learning, 
Rescorla and Solomon (1967) suggested that 
peripheral physiological activity is merely an 
index of central states that mediate avoidance 
behavior. Second, protection from aversive 
events frequently requires long-sustained pro- 
cesses, like cognitive representations, rather 
than reflexive responsivity to visceral events. 
Third, the traditional association of fear with 
peripheral, visceral activity readily lends it- 
self to an emphasis upon reduction of the 
emotional state rather than avoidance of the 
environmental danger. Most importantly, as 
Leventhal noted, the fear appeal literature 
strongly suggests that cognitive processes are 
more important than emotional ones in medi- 
ating attitude change. Thus, protection mo- 
tivation theory might be viewed as differenti- 
ating Leventhal’s “danger control” process 
into three cognitive mediational constructs 
specifically linked to antecedent communica- 
tion stimuli in a fear appeal. 

Protection motivation theory postulates at- 
titude change to be a multiplicative function 
of the three mediational processes, since this 
relationship is postulated by most expectancy- 
value theorists (see Feather, 1959). Thus, 
this formulation predicts not only positive 
main effects for each variable, but also second- 

and third-order interaction effects. Only the 
latter is described, since it incorporates the 
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other hypothesized effects. For each level of 
a magnitude of noxiousness variable, high 
probability of occurrence and high efficacy of 
coping response should each produce greater 
acceptance of the. communicator’s recom- 
mendation than low levels of these variables. 
Within a low-noxiousness condition, the su- 
periority of a high- over a low-efficacy condi- 
tion should be more pronounced in a high- 
than a low-probability condition, Within a 
high-noxiousness condition, the same order- 
ing of means should be found, but the dif- 
ferences should be more pronounced. There- 
fore, when acceptance of the communicator’s 
recommendation is plotted against any one of 
the independent variables, the other two vari- 
ables should form a fan of diverging curves. 
No experiments have been reported that 
attempted to investigate this triple-order in- 
teraction effect. The scant data that do bear 
upon other effects predicted by protection 
motivation theory provide moderate support 
(see Rogers, 1975). The purpose of the pres- 
ent series of experiments was to test the utility 
of protection motivation theory by factorially 
manipulating the three communication vari- 
ables and investigating their effects upon at- 
titudes toward health dangers that produce 
incalculable, yet preventable, human suffering: 
lung cancer, automobile injuries, and venereal 
disease. 


METHOD 


Design and Subjects 


Separate experiments were performed on each of 
three health topics: cigarette smoking, safe driving, 
and venereal diseases (gonorrhea and syphilis). Each 
of the three experiments employed a 2 X 2 X 2 fac- 
torial design with three between-subjects manipula- 
tions: high versus low magnitude of noxiousness of 
the depicted event, high versus low probability of 
that event’s occurrence, and high versus low ef- 
ficacy of recommended coping responses. 

Subjects were 176 students enrolled in an ele- 
mentary psychology course who participated to 
satisfy a course requirement. Of those, 64 served in 
the venereal disease study, 72 in the driving study, 
and 40 in the smoking study, Students could only 
participate in one study. Requirements for the three 
studies were as follows: For the smoking study, sub- 
jects had to have been smoking an average of at 
least 10 cigarettes per day for the past year. For the 
driving safety study, only subjects who had obtained 
a valid driver’s license could participate. For the 
venereal disease study, participation was restricted to 
subjects who were not allergic to penicillin. 
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Procedure 


Subjects were run in groups ranging from four to 
eight members, and each group was randomly as- 
signed to one of the magnitude of noxiousness con- 
ditions. Within each of these conditions, each stu- 
dent was randomly assigned to one of the probability 
of occurrence and efficacy of response conditions. 
Each experiment was presented as part of a project 
seeking college students’ evaluations of public health 
programs. All subjects were cautioned that they 
might see films of bodily injury (or surgery) and 
were allowed to leave if they so desired. Only nine 
students left, and they were subsequently replaced. 

In each experiment, the film manipulating the 
noxiousness variable was shown first. Each student 
then underlined the key phrase in each paragraph of 
the experimentally varied probability of occurrence 
and efficacy of response written communications, 
The request to underline key phrases tended to in- 
sure that the essays were read and supported the 
cover story that the materials were to be evaluated. 
Finally, evaluation questionnaires, which contained 
the dependent variables to be described, were ad- 
ministered. After each session, students were thanked 
for their help and fully debriefed. They were cau- 
tioned that the essays were written for persuasive 
purposes and then informed of the current state of 
knowledge regarding the independent variables. Most 
importantly, all students were informed that the 
recommended preventive practices (eg, stopping 
smoking) were extremely effective and were urged 
to adopt them, 


Stimulus Materials 


The high- and low-noxiousness movie sequence for 
the smoking study was entitled “One in 20,000.” The 
low-noxiousness condition consisted of the first 18 
minutes of the film. This portion depicts the case 
history of a man with lung cancer, portraying his 
discovery of his condition, an interview with his 
physician and a surgeon, and the surgical prepara- 
tions for removing his lung. The high-noxiousness 
film condition consisted of the same film plus a 5- 
minate presentation of the operation for removing his 
lung. 

In the study on driving, the low-noxious: 
was entitled “Broken Glass,” This film daniel ae 
trolled collisions using remotely operated cars with 
anthropometric dummies seated in them. The high- 
noxiousness film, which was entitled “Wheels of 
Tragedy,” portrays the gory aftermath of fatal col- 
lisions. For this film, a cameraman had traveled with 
state highway patrolmen; thus, cars and victims we 
shown before the arrival of ambulances and pie ig 

The film on venereal disease selected for the low- 
noxiousness condition was entitled “VDRL [Vener 1 
Disease Research Laboratory] Slide Qualitativ: Tes 
with Serum” and was provided by the Naval M dal 
Training Institute, Bethesda, Maryland. This a 
minute film demonstrates the laborato; f re 
for performing a serum test fi ae 
3 j a ‘or venereal disease, Ni 
diseased tissue is shown. The high-noxi 4 

-noxiousness film 
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was entitled “Radical Retroperitoneal Node Dissec- 
tion in the Treatment of Testicular Tumors” and 
was loaned by the Armed Forces Institute of Pathol- 
ogy, Washington, D.C. A 6-minute clip of this film 
depicts the surgical procedures used to remove dis- 
eased tissue from reproductive organs that had been 
destroyed by a venereal disease. 

In each of the three experiments, the probability 
of occurrence and efficacy of response variables were 
manipulated by using written messages of approxi- 
mately equal length.1 Thus, subjects read a com- 
munication arguing that the probability of being ex- 
posed to the threatened event was either high or low. 
For example, in the smoking study, the high-proba- 
bility essay would present a persuasive case, supported 
by logical arguments and descriptive statistics, that 
if one smokes cigarettes, there is a very high chance 
of contracting lung cancer. The low-probability essay 
would argue that although smoking can cause lung 
cafcer, the chances of any given smoker actually 

eloping cancer are very small, The high-efficacy 
ay would urge that the recommended preventive 

‘actices were extremely effective methods of avoid- 
ing the threatened event. The low-efficacy message 
provided little reassurance that the available coping 
response was efficacious. Within each experiment, 
both efficacy messages would mention the same 
recommendations. The probability of occurrence es- 
say was always read before the efficacy of response 
essay. 


Dependent Variables 


A dependent measure of fear arousal, which was 
completed immediately after viewing the film and 
before reading the communications, consisted of six 
mood adjectives: fright, tension, nervousness, anx- 
iety, discomfort, and nausea. In each experiment, as 
manipulation checks, there were three items assessing 
the perceived severity of the depicted event (e.g. 
“Lung cancer is an extremely frightening and dan- 
gerous type of disease”), three items measuring ex- 
pectancy of exposure (e.g., “I think it is likely that I 
will get lung cancer sometime in the years ahead”), 
and two items measuring the perceived efficacy of the 
recommended coping responses (e.g., “For a smoker, 
giving up cigarettes is extremely effective in reducing 
the chances of developing lung cancer”). Four items 
were used in each study to assess intentions to 
comply with the recommended practices (see Foot- 
note 1). Examples of the intention items in the three 
experiments are as follows; “At the present time, I 
intend to stop smoking completely”; “When driving 
in the future, I shall always think ahead and plan 
ways I can avoid dangerous traffic situations”; 
and “I firmly intend to get penicillin treatments if I 
ever get a venereal disease.” All items were rated 
on 10-point graphic rating scales. Items were varied 
in the questionnaire so that on some of the state- 
ments a score of 10 represented complete agreement, 
while on others it represented complete disagreement. 
The latter items were converted in scoring responses 
so that higher numbers represented stronger agree- 
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ment, Scores on all dependent measures are based 
upon the mean ratings for the items in each cluster, 


Resutts 


The three experiments were included as a 
replications factor in the analysis of variance 
design, Thus, all dependent variables were 
analyzed ina 2X 2 X 2 X3 design. 


Manipulation Checks 


Fear arousal. The analysis of the six mood 
adjectives indicated that the high-noxiousness 
films produced higher levels of fear arousal 
than did the low-noxiousness films (Xs = 5,4 
and 3.7, respectively), F(1, 152) = 64.62, p < 
.0001, In addition, there was a main effect as- 
sociated with the replications factor, F (2, 152) 
= 43.66, p < 0001, The mean fear score for 
the driving experiment (5.6) was higher than 
that for the smoking study (4.7), which, in 
turn, was higher than that for the venereal 
disease experiment (3.3). 

Appraised severity. The films did not have 
a significant effect on the measure of appraised 
severity, F(1, 152) = 2.18, p < .15, However, 
the high-probability essay (X = 7.7) pro- 
duced higher scores than the low-probability 
message (Æ = 7.0), F(1, 152) = 11.32, p< 
.001. There were no other significant main or 
interaction effects. 

Expectancy of exposure, The probability of 
occurrence communication was the only inde- 
pendent variable to affect expectancies of 
being exposed to the threatening events, 
F(1, 152) = 37.14, p < 0001. The high-prob- 
ability essay produced higher mean scores 
(5.8) than did the low-probability message 
(4.5). 

Belief in efficacy of coping responses, The 
groups exposed to the high-efficacy communi- 
cation believed the recommended practices to 
be more highly effective preventives than the 
groups who read the low-efficacy messages 
(Xs = 8.3 and 4.5, respectively), F (1, 152) = 
209.96, p < .0001. An Efficacy X Probability 
interaction effect, F(1, 152) = 6.96, p < .01, 
indicated that the efficacy variable’s impact 
was more pronounced in the high-probability- 
of-occurrence condition than in the low-prob- 


1 The experimental communications and the post- 
test questionnaires are available upon request from 


the first author. 
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TABLE 1 


MEAN SCORES ON THE MEASURE OF INTENTIONS TO 
ADOPT THE RECOMMENDED RESPONSES 


Probability 


Experimental condition Low High 
Low noxiousness 
Low efficacy 5.9 DO. 
High efficacy 6.1 6.8 
High noxiousness 
Low efficacy 5.6 5.8 
High efficacy 6.7 7A 


ability-of-occurrence condition. Finally, there 
was a main efféct associated with the noxious- 
ness manipulation, F(1, 152) = 4.69, p < .05, 
with the high-noxiousness mean of 6.7 higher 
than the low-noxiousness mean of 6.1. 


Intent to Adopt Recommended Response 


The predicted triple-order interaction effect 
was not obtained, F(1, 152)= 1.37, p < .25. 
The obtained means are reported in Table 1. 
Compared with the low-efficacy condition (X 
= 5.7), the high-efficacy condition (£ = 6.7) 
produced stronger intentions to accept the 
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Fıcure 1. Interaction effect of probability of oc- 
currence and efficacy of coping response upon intent 
to adopt the recommended response in the cigarette 
smoking experiment. 
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Ficure 2. Interaction effect of magnitude of nox- 
iousness and efficacy of coping response upon intent 
to adopt the recommended response in the venereal, 
disease experiment. 


preventive measures, F(1, 152) = 13.23, p < 
.001). This main effect must be interpreted in 
light of the Efficacy X Probability x Replica- 
tions interaction effect, (2,152) = 4.47, p < 
.02. The Efficacy X Probability interaction ef- 
fect was tested at each level of the Replica- 
tions factor. These analyses revealed that the 
second-order interaction attained significance 
only in the smoking experiment, F(1, 32) = 
5.87, p < .02. Figure 1 shows a plot of the 
obtained mean scores, Examination of this in- 
teraction revealed that it arose from the in- 
crease in the high-efficacy condition (+1.1) 
and the decrease in the low-efficacy condition 
(—1.5). There were no differences among the 
means in the other two studies. 

Theré was also a Noxiousness X Efficacy X 
Replications interaction effect, F(2, 152) =` 
3.08, p < .05. The Noxiousness X Efficacy in- 
teraction was tested at each level of the repli- 
cations factor. These analyses indicated that 
the second-order interaction effect was signifi- 
cant only in the venereal disease experiment 
F(1, 56) = 5.03, p < .03. These results are 
Eman graphically in Figure 2. There 
were no erences amon; i 
other two experiments. E De means ae 
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Regression and Path Analyses 


In many fear appeal studies (e.g., Leven- 
thal & Niles, 1964), supplementary correla- 
tional analyses, which ignore the classification 
variables, are performed to take advantage of 
the large individual differences in responses to 
the manipulations. Since protection motiva- 
tion theory assumes that the mediational pro- 
cesses are independent and directly affect in- 
tentions, an ordinary multiple linear regression 
equation was computed that entered the ma- 
nipulation check items as predictors of the 
criterion variable, intentions, Standardized co- 
efficients in such a multiple regression equa- 
tion can be interpreted as estimates of the 
contribution of predictor variables in “caus- 
ing” the criterion variable. Hence, this analy- 
sis might implicate a specific mediator(s) as 
being causally involved in determining inten- 
tions. The obtained standardized partial re- 
gression coefficients were as follows: for fear- 
arousal, .13; for severity, .19 (p < .05); for 
expectancy of exposure, .03; and for belief in 
efficacy, .32 (p < .01), Thus, belief in the ef- 
ficacy of coping responses explained more of 
the variance than did any other predictor, and 
the appraised severity of the threatened event 
was the only other significant predictor. 

The regression equation examined the sim- 
ple, direct effect of each mediator upon in- 
tentions, A path analysis was performed to 
examine a combination of simultaneous chain 
regressions (see Turner & Stevens, 1959). 
Specifically, this analysis sought to determine 
whether the mediators operated upon inten- 
tions, not directly, but through one or more 
of the other mediators. Hence, in addition to 
the direct relationships indicated by the re- 
gression analysis, the path analysis might re- 
veal some causal cycles or paths. The paths 
that were significant beyond the .05 level are 
diagrammatically presented in Figure 3. 


Discussion 


The hypothesized multiplicative relation- 
ship among the independent variables was re- 
jected. Nevertheless, the results of the present 
series of experiments demonstrated that incre- 
ments in the efficacy of recommended coping 
responses increased intentions to comply with 
the recommended practices. Regardless of 
what the threatened event was, or how noxious 
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Ficure 3. Significant path coefficients among the 
dependent variables. 


it was, or how likely it was to occur, the — 
stronger the belief that a coping response 
could avert a danger, the more strongly people 
intended to adopt the communicator’s recom- 
mendations. The main effect of the efficacy 
factor corroborates previous studies finding 
that this variable enhanced acceptance of 
medication (Chu, 1966), increased intentions 
to stop smoking (Rogers & Thistlethwaite, 
1970), and reduced cigarette consumption 
(Rogers and Deckner, 1975). One practical 
implication is that mass media appeals should 
emphasize that their recommendations can ef- 
fectively control an aversive event. The salu- 
tary effect of the efficacy of the recommenda- 
tions extends the findings of Leventhal and his 
colleagues (see Leventhal, 1970) that specific — 
action instructions facilitate behavioral com- 
pliance. The data indicated that the efficacy 
component acted upon intentions, not by mak- 
ing the threat seem less severe or less likely to 
occur but by directly strengthening beliefs in 
the ability to cope with danger when it is con- 
fronted. - 
The manipulation of noxiousness affected 
fear arousal but not appraised severity and ~ 
did not have a main effect upon intentions. 
The transitory nature of the emotion of fear — 
may account for its failure to affect intentions, 
Leventhal and Niles (1965) found that self- 
reports of emotional arousal drop rapidly after 
exposure to a fear appeal. Mewborn (1975) 
found that although subjects exposed to the 
high-noxiousness_ venereal disease film were 
more aroused (on measures of heart rate, gal- 
yanic skin responsivity, and self-report of fear) 
than those exposed to the low-noxiousness film, 
this difference dissipated in 10 minutes. Thus, 
there were no differences in fear arousal be- 
tween these two groups at posttest time when 
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they indicated the degree of their acceptance 
of the communicator’s recommendation. Mew- 
born also found that the film manipulation 
did not affect intentions. Although the present 
studies did not employ this continuous physio- 
logical monitoring, it seems reasonable to as- 
sume that the emotional arousal had dis- 
sipated by posttest time; thus, we would not 
expect emotional arousal to have affected in- 
tention scores, Furthermore, these studies and 
Leventhal’s (1970) parallel response model 
suggest that the appraised severity of a threat 
should be distinguished from the transient 
emotional reaction to the threat. The path 
analysis of the present data demonstrated that 
fear arousal affected intentions, not directly, 
but by increasing the perceived severity of 
the threat, which, in turn, affected intentions. 
Protection motivation theory requires long- 
sustained cognitive representations (i.e., ap- 
praised severity) to mediate attitude change. 

Increments in the magnitude of noxiousness 
in the veneral disease study facilitated atti- 
tude change only if the recommended coping 
response was highly effective (see Figure 2). 
If the response was relatively ineffective, it 
did not matter how severe the danger was. 
This interaction effect is consistent with Janis 
and Feshbach’s (1953) defensive avoidance 
hypothesis. As elaborated by Rogers and 
Thistlethwaite (1970), this hypothesis sug- 
gests that “increments in fear-arousal should 
produce greater increments in acceptance of 
the recommended action when the audience is 
given high, rather than little or no, reassur- 
ance of the efficacy of the actions” (pp. 227— 
228). However, Rogers and Thistlethwaite 
found support for this interaction only on a 
subsidiary measure, the belief that cigarette 
smoking causes lung cancer. The present data 
provide stronger support, since they involve 
the more crucial measure of acceptance of the 
recommended preventive practices. 

The interaction in the smoking experiment 
showed that increments in the probability 
-variable tended to increase acceptance of the 
recommendation only if the recommendation 
was a highly effective preventive practice. If 
the recommendation was ineffective, incre- 
ments in probability of occurrence tended to 
have a boomerang effect. That is, if smokers 
were told that it was highly likely they could 


contract lung cancer and that stopping smok- 
ing would not improve their lot, they planned 
to increase their cigarette consumption. Leven- 
thal (1970) reviewed evidence indicating that 
if an individual feels vulnerable to a threat, 
then resistances are aroused, and a boomerang 
effect is observed, The present findings sug- 
gest that this may occur only when individuals 
feel incapable of coping with the threat; if 
they can cope, increasing the probability of 
occurrence diminishes resistances and facili- 
tates attitude change. 

The interaction between the efficacy and 
probability variables suggests an extension of 
the defensive avoidance hypothesis: It need 
not be limited to an interaction involving fear 
or noxiousness. The key variable seems to be 
the inability to ward off a danger. Hence, re- 
gardless of the magnitude of noxiousness of a 
threat, when expectancy of exposure to a 
threat is strongly aroused but not fully re- 
lieved by belief in the efficacy of the recom- 
mended coping response, individuals may re- 
sist the communicator’s recommendation. 

However, it might be misleading to label 
the findings of the present experiments as “de- 
fensive,” for not accepting an ineffective cop- 
ing response is quite reasonable. Additionally, 
there was no evidence of denial tendencies 
(Janis & Feshbach, 1953) or defensive reap- 
praisal (Lazarus & Averill, 1972). Consistent 
with Leventhal and Niles’s (1964) study, the 
data revealed no evidence of unrealistic tend- 
encies to minimize the magnitude of the dan- 
ger, minimize the probability it would ma- 
terialize, or maximize the ability to cope. 
However, the present research was not de- 
signed to explore potential intrapsychic ad- 
justments created by the inability to cope with 
a threat. The state wherein one cannot con- 
trol one’s environment and has no adequate 
means of adjustment has been cited as a cru- 
cial component of anxiety (e.g., Lazarus & 
Averill, 1972), stress (e.g., McGrath, 1970), 
and helplessness (Maier, Seligman, & Solo- 
mon, 1969). Thus, there is a conceptual and 
methodological convergence among these cog- 
nate research traditions that may disclose 
fruitful lines for subsequent research. For ex- 
ample, studies of fear appeals may benefit 
from research on anxiety and stress that has 
investigated some of the methods by which 
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people respond to threatening events with 
which they cannot cope; in addition, fear ap- 
peal research may offer a method of experi- 
mentally inducing anxiety and stress by creat- 
ing beliefs in an inability to cope with a 
threat. 

In summary, this series of experiments con- 
solidated and extended previous findings. The 
belief in one’s ability to ward off a danger 
strengthened intentions to adopt recommended 
preventive measures. Although protection mo- 
tivation theory’s hypothesized multiplicative 
combinatorial rule was rejected, the differen- 
tiation of a fear communication into three 
orthogonal components proved to be impor- 
tant because the variables produced different 
forms of interaction effects on intentions to 
comply with different health practices. Fur- 
thermore, the path analyses indicated that the 
effect of an emotional state of fear upon in- 
tentions was mediated by the cognitive ap- 
praisal of severity of the threat. These data 
not only support Leventhal’s separation of 
fear and danger control processes but empha- 
size the relative importance of the latter. 
Protection motivation theory’s cognitive medi- 
ational constructs of appraised severity and 
belief in efficacy of coping response may be 
viewed as more specific components of Leven- 
thal’s theoretical formulation. Hopefully, sys- 
tematic conceptual and empirical development 
of additional communication, cognitive, and 
dispositional variables will extend the theory 
far beyond its present circumspect terrain. 
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Vicarious Eyelid Conditioning 


Guillermo Bernal and Seymour M. Berger 
University of Massachusetts—Amherst 


This study attempted to demonstrate that elicited electromyographic (EMG) 
reactions in observers may become conditioned to environmental events during 
the observation of a modeling sequence. Observers were shown a videotape of 
a model in a simulated direct eyelid discriminative conditioning situation in- 
volving adaptation, acquisition, and extinction trials, Various EMG measures 
were obtained in order to determine the occurrence of vicarious instigation and 
conditioning. The results demonstrated vicarious EMG eyelid responses during 
acquisition and discriminative conditioning of these responses. Also, the motor 
reactions were found to be localized to specific muscular activity rather than 
a result of general arousal. Some theoretical and clinical implications of these 


findings are discussed, 


Several theories assume that some type of 
mediational process is involved in observa- 
tional learning (e.g., Aronfreed, 1968; Ban- 
dura, 1969; Berger, 1968; Rosenbaum & 
Arenson, 1968). All of these theories suggest 
that mediation occurs, at least in part, during 
the period when observers are exposed to the 
model’s performance. One form of mediation, 
motor mimicry, has been investigated by 
Berger (1966) and by Berger and Hadley 
(1975). The theoretical framework for these 
studies suggests that observers tend to mimic 
the motor responses of the model during the 
observation period and that these mimetic or 
vicariously instigated motor responses can be 
conditioned to environmental events that pre- 
cede the model’s behavior (Berger, 1968), 
Through this type of mediation observers 
learn to react to environmental events in the 
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same manner as the model. In order to ost 
this analysis directly, Berger and H: ley 
(1975) examined the extent to which | al- 
ized motor (electromyographic, or EMG re- 
sponses could be vicariously instigated ind 
conditioned during the exposure period. © hey 
found that observers exposed to models er- 
forming specific behaviors (i.e., arm wres’ ing 
and stuttering) reacted with increased 1 JIG 
arm activity to models engaged in rm 
wrestling and with increased EMG lip a: tiv- 
ity to a stuttering model. However, the cvi- 
dence for the conditioning of these EIG 
reactions was equivocal, 

This research finding seems to be relate.) to 
earlier work on the psychophysiology of 
thinking, which demonstrated that thin! ing 
about a particular movement was related to 
performance of that movement, as measured 
by changes in muscle action potentials (e.g., 
McGuigan & Schoonover, 1973), If instruc- 
tions to think about a movement tended to 
elicit the movement, then it would seem rea- 
sonable to expect that merely having someone 
observe another person performing a move- 
ment could influence the observer to think 
about the movement and produce it as well. 
If such movements occurred, it should be 
Possible to condition these movements to 
environmental events. Berger and Hadley 
(1975) demonstrated that movements can be 
mstigated through observation of another’s 
performance, but they were unable to provide 


substantial evidence for conditioning of these 
movements. 
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was to detemine yee a ee 
could be elicited aud ennai tie eee 

; a onditioned while ob- 
servers viewed a videotape of a model. Since 
previous studies used arm and lip movements 
(Berger & Hadley, 1975) or hand motor 
reactions (Berger, 1966; Berger, Irwin, & 
Frommer, 1970), the eyelid response was em- 
ployed in this study so that the findings 
would have greater generality. Furthermore, 
this experiment was designed to provide a 
better test of conditioning than was possible 
in the Berger and Hadley (1975) study. 

Observers viewed a videotape of a model 
undergoing adaptation, acquisition, and ex- 
tinction trials in a simulated eyelid condi- 
tioning situation employing a discrimination- 
learning paradigm. In the acquisition phase 
for some observers, Tone 1 (the positive 
conditioned stimulus, CS+) was paired with 
the air-puff sound, and the model’s blink (the 
unconditioned stimulus, UCS), and Tone 2 
(the negative conditioned stimulus, CS—) re- 
mained unpaired, providing a sensitization 
control. Other observers served as a con- 
trol group to identify the response-eliciting 
qualities of the air puff sound alone. 

On CS+ trials, observer EMG reactions 
from the right eyelid during the model’s blink 
constituted vicarious motor responses. Condi- 
tioning of these vicarious reactions was 
assessed by an analysis of the observer's 
anticipatory responses during acquisition and 
the observer’s resistance to extinction. As a 
sensitization control, the same measures were 
obtained for CS—. These measures were also 
obtained for the right arm of the observers 
in order to determine whether the EMG 
eyelid reactions were part of a general increase 
in motor activity due to arousal rather than 
localized vicarious responses. 

Tt was hypothesized that observer EMG 
eyelid responses would be greater for the CS+ 
trials than for the CS— trials during the 
anticipatory and vicarious instigation periods 
in acquisition and during the comparable 
vicarious instigation period in extinction. 
Furthermore, it was hypothesized that EMG 
vicarious reactions and conditioned discrimi- 
nation would be more apparent in the right 


eye than in the right arm for both acquisition 
and extinction phases. 


METHOD 
Subjects 


Subjects were 72 students (36 males and 36 fe- 
males) recruited from undergraduate psychology. 
courses at the University of Massachusetts who 
volunteered to participate in the experiment as ob- — 
servers. All received extra credit toward their final 
course grade. 


Apparatus 


A closed-circuit 4-inch (1.27 cm) videotape system 
was employed to present the modeling stimuli. The 
observers’ responses were recorded in an adjacent 
room with a Beckman Model R-411 dynograph 
recorder using Beckman 9852A EMG integrating 
couplers on two different channels. Miniature Beck- 
man bipotential electrodes were filled with Grass 
electrode cream (Type EC-2) and attached to the 
appropriate recording sites with surgical adhesive 
tape. It was possible for the experimenter to monitor 
visually the videotape presentation in the adjoining 
recording room by using another monitor that was 
connected to the closed-circuit system, Furthermore, 
the event marker on the Beckman recorder was con- 
trolled by a relay that was operated by the aud'o 
output of the experimenter’s television monitor and, 
therefore, gave a precise indication of specfiic events 
occurring on the videotape. The observers’ physio- 
logical reactions and the videotape audio signals were 
also recorded from the polygraph outputs on a Vetter 
FM tape-recording system for computer digitizing 
and analysis. 


Procedure 


Observers were individually run. Upon arrival, they 
were escorted to a soundproof room and were seated 
in a reclining chair located approximately 9 feet 
from the television monitor. The experimenter (the 
first author) placed the electrodes on the observer 
while explaining that the study involved the record- 
ing of alpha waves or electroencephalograms (EEGs). 

In order to simulate EEG recording, two pairs of 
electrodes were attached to the head region, one pair 
near each eye. Each pair was placed midway above 
the eyebrow or on the upper mid-region of the 
obicularis oculi and at the lateral temple region on 
the corner of each eye or laterally midway on the 
obicularis oculi. A third pair of electrodes were 
placed on the right forearm. This placement was 
identical to that employed by Berger and Hadley 
(1975). The observers were told that the placement 
on the arm was designed to measure differential 
alpha wave responses in a control region of the 


1 The obicularis oculi is a mimetic muscle fiber that 
surrounds the orifice of the eye. This muscle fiber is 
the primary control for the movement of the eyelid. 
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body. A ground electrode was placed on the back 
of the observer's right hand. 
After attaching the electrodes and adjusting the 
recording equipment in the adjacent room, the ex- 
perimenter read the standard instructions, which 
indicated that the purpose of the study was to 
investigate brain-wave changes as a function of 
stimulation, Observers were asked to watch a video- 
tape where neutral tones and aversive stimulation 
(an air puff to the eye) were presented to a subject. 
The purpose of the study, according to the instruc- 
tions, was to compare their brain waves with those 
of subjects in the live situation depicted by the 
videotape. Observers were told that they were a con- 
trol group and that they were only required to watch 
the television set, which would present additional 
instructions. The experimenter went into the ad- 
jacent room to monitor the recording and videotape 
equipment. 

All observer EMG responses were recorded at 
the maximum sensitivity level of the equipment: 
5 mv/cm, with a chart speed of 1 mm/sec. The 
EMG couplers were operated in the integrative mode. 

When the videotape deck was turned on, the 
television monitor screen remained blank while relax- 
ation instructions were presented over the audio 
channel. Subsequently, the upper right side of the 
model’s face appeared on the screen, the right eye, 
the nose, and part of the forehead being visible. 
The observer was able to see the air-puff apparatus 
that was attached to the right side of the model’s 
head, with a glass tube located 1 cm from the 
cornea of the model's eye. Also visible to the ob- 
server was a set of electrodes attached to the model 
in the same way as the electrodes were attached to 
the observer. 

Observers were given 10 trials in order to adapt 
their responses to 400 and 2,000 cycle CS tones. 
Each of these tone presentations was 4-sec in dura- 
tion and occurred without any UCS pairings or any 
contingent responses from the model. The adaptation 
period was followed by a random presentation of 
10 CS+ and 10 CS— trials. Following the acquisition 
stage, extinction was initiated without pause, which 
included 5 CS+ and 5 CS— trials presented in a 
random order, The CS presentations during extinc- 
tion, as in adaptation, occurred without any UCS 
pairings or any contingent response from the model. 
The intertrial interval ranged from 10 to 45 sec and 
was randomly varied. The CS+ and CS— tones 
were presented for 4 sec, and the UCS consisted of 
an audible 50-msec air puff followed by the model's 
eye blink. This procedure was identical for Groups 
1 and 2, except for the counterbalancing of the CS 
tones. The tone presentation for Group 3 (control) 
was the same as for Group 1 (experimental), except 
that for Group 3, the UCS following the CS+ tone 
consisted of an audible 50-msec air puff without 
any contingent reactions or blink from the model. 
Following the completion of the videotape, the 
experimenter returned to the observer's room, re- 
moved the electrodes, and asked the observer to fill 
out a questionnaire concerning the experiment. Sub- 
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sequently, the observer was thoroughly debri 
shown the polygraph record, given a full explan. 
of the true purpose of the experiment, and a 
not to discuss this experiment with other stud 


Scoring 


Measures of vicarious reactions and conditi 
were based on the size of the largest EMG res; 
that occurred during specific periods. A ba 
measure was obtained from the 2-see period p 
each onset of the CS in acquisition and extin 
A measure of vicarious instigation 
from the 2-sec period following each CS offset 
during the presentation of UCS with CS l 
during the comparable period with CS—). Ob r 
EMG reactions during the 2-sec period prior t 
CS offset during acquisition were define 
anticipatory responses and constituted a measu í 
conditioning when associated with CS+-. In extin 
a measure of vicarious instigation was obt i 
2 sec after each CS offset. ce all CS tones 
presented alone, observer reactions following 
during this period constituted a measure of resis 
to extinction. 

The EMG electrical signals recorded for h 
observer on audiotape were digitized and played k 
on a Hewlitt Packard 2100A computer that | 
and summarized the data in arbitrary units c 
programs calculated the maximum deflection d g 
each of the specified 2-sec periods for each CS il 
and computed the means for each period sepa y 
by averaging across CS+ and for CS— tri 1 
each location (i.e., eyelid and arm) for eac! 
Server. Separate sets of means were obtainc: 
acquisition and extinction.2 


was ot 


RESULTS 

The results were analyzed using analysis of 
variance computed programs (Dixon, 1973 
Finn, 1972) involving a mixed factorial ce- 
sign, with groups (either Groups 1 and ? or 
Groups 1 and 2 vs. Group 3) and sex (males 
and females) as between-groups factors and 
stimulus (CS+ and CS—) and measure 
(baseline and “after” periods) as withir 
subjects factors. Separate analyses of vari- 
ance were performed for each location (i.e., 


* During the actual running of the experiment, a 
Problem occurred with an electrical circuit that 
recorded the stimulus from the event marker signal- 
ing the onset of the CS on audiotape. This resulted 
in completely losing the stimulus channel for some 
of the observers. However, since the stimuli were 
audible on another channel of the tape recorder, 
Sie of the computer program per- 
nae aes signaling to the computer of the 
is $ e stimuli for these observers. The proce- 

e made it possible to digitize and average the 


data for all observers 
through the us ewl 
Packard computer, Op i Rear stag 
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TABLE 1 


Mean ELECTROMYOGRAPHIC EYELID Responses DUR- 
ING ACQUISITION FOR THE BASELINE, VICARIOUS 
INSTIGATION, AND ANTICIPATORY PERIODS 


Period 

A Vicarious 

Group mhis -Baseline patory tion 
ee 

1 & 2) CS+ 181.35 246.83 407.58 

CcS— 201.83 184.21 182.48 

Control CS+ 201.96 213.75 208.79 

cs= 228.46 193.63 183.25 


Note. Electromyographic responses are given in arbitrary 
units. 


eyelid and arm) and for each period (i.e., 
anticipatory and vicarious instigation period 
in acquisition and vicarious instigation period 
in extinction) .* 

The organization and presentation of the 
results were simplified by focusing on the 
critical sources of variance that suggested 
discriminative responding. A discriminative ef- 
fect was indicated by a Stimulus X Measure 
interaction that showed a greater increase in 
responding to CS+ than to CS—, the increase 
being measured from the baseline for each 
of the specific periods (i.e., anticipatory or 
vicarious instigation periods). 


Vicarious Instigation During Acquisition 


An analysis of variance for the EMG eye- 
lid reactions comparing baseline and vicarious 
instigation periods during the acquisition trials 
revealed discriminative responding for the 
instigation period, since there was increased 
responding for CS+ but not for CS—. The 
Stimulus x Measure interaction, F(1, 44) = 
72.30, p < .001, indicated that this difference 
was reliable. As shown in Table 1, the inter- 
action was due to significant increases in 
EMG eyelid activity during vicarious instiga- 
tion over the baseline period with CS+, F 
(1, 47) = 60.09, p < .001, and no reliable dif- 
ferences were found for CS—. The analysis 
revealed a Stimulus X Measure X Sex inter- 
action, F(1, 44) = 12.24, p < .005, indicating 
that the vicarious discriminative responding 
effect was dependent on sex. The mean EMG 
eyelid reactions for the baseline and vicarious 
instigation periods as a function of stimuli 
and sex were analyzed. These data indi- 
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cated that both males, F(1,35) = 10.79, p< 
01, and females, F(1,35) = 45.29, p < .005, 
tended to discriminate vicariously, but females 
generally responded at a higher level than 
males, F(1,44) = 6.10, p < .025. The analy- 
sis also showed that the two experimental 
groups (Groups 1 and 2) did not differ reli- 
ably from each other. This finding suggests 
that the responses were not a function of the 
specific tones per se. 

The relative significance of the response- 
eliciting qualities of the air-puff sound on the 
observer's discriminative responses was inves- 
tigated by combining the experimental groups 
(Groups 1 and 2) and contrasting them with 
the control group (Group 3, where model did 
not react to air-puff sound). An analysis of 
variance revealed a Stimulus X Measure X 
Group interaction, F (1, 66) = 20.29, p < .001, 
which was produced by the occurrence of dis- 
criminative responding in the experimental 
groups but not in the control group (see 
Table 1). A significant Stimulus X Measure 
interaction was present in the experimental 
groups, F(1,47) = 69.01, p < .001, due to 
increases in activity during the vicarious in- 
stigation period with CS+ and no reliable 
changes with CS—. However, the significant 
Stimulus X Measure interaction for the con- 
trol group, F(1, 23) = 7.69, p < .05, was ap- 
parently produced by nonsignificant increases 
in EMG eyelid reactions during vicarious 


8Interclass correlations were performed for each 
of the major analyses of variance in order to estab- 
lish the degree of consistency or reliability for each 
of the within-subjects factors (i.c., levels or classes) 
and for different combinations of levels of these 
factors (Hays, 1973, p. 535). The correlations for 
electromyographic responses obtained from the vari- 
ous analyses are not reported here to conserve space 
and are available from the author upon request. The 
correlations revealed interesting information concern- 
ing the use of electromyograms as a dependent mea- 
sure. Since these correlations may be thought of as 
reliability coefficients, given the generally high levels 
of the correlations found, it was reasonable to con- 
clude that the dependent measures were highly reli- 
able. Although this study was based upon group 
data, generalizations to the individual level are indeed 
meaningful given the high consistency found in indi- 
vidual responses with respect to baseline, anticipa- 
tory, and vicarious instigation periods. Furthermore, 
the consistencies found in the level of subject re- 
sponses to the different conditioned stimuli indicated 
that using a within-subjects design was advantageous. 
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TABLE 2 


MEAN ELECTROMYOGRAPHIC EYELID RESPONSES 
DURING EXTINCTION FOR BASELINE AND 
Vicarious INSTIGATION PERIODS 


Period 

Vicarious 
Stim- instiga- 

Group ulus Baseline tion 
i a ef 204.96 326.67 
Experimental (1 & 2) ae an fae 
Control CS+ 264.38 228.71 
‘CS 244.58 248.88 


Note, Electromyographic responses are given in arbitrary 
units. 


instigation with CS+ and decreases in these 
reactions with CS— that approached sta- 
tistical significance, F(1, 23) = 4.01, p < 10. 


Vicarious Conditioning During Acquisition 


The analysis of variance for the EMG eye- 
lid reactions comparing baseline and anticipa- 
tory periods during the acquisition trials was 
performed. Evidence for discriminative condi- 
tioning was found for the anticipatory re- 
sponse due to increases in responding for 
CS+ but not for CS—, as may be seen in 
Table 1. The Stimulus X Measure interaction, 
F(1,44) = 15.09, p < .001, showed that this 
effect was reliable and produced by a tend- 
ency for observers to show increases during 
the anticipatory period relative to baseline 
with CS+, F(1,47) = 11.06, but not with 
CS—. The analysis also revealed no effects 
or interactions involving groups, which sug- 
gests that there were no reliable differences 
between Groups 1 and 2, 

The effects of the air puff in the absence 
of the model’s response on the vicariously 
conditioned discrimination effect was ana- 
lyzed by contrasting Groups 1 and 2 (com- 
bined) with Group 3 (control). There were 
no effects due to groups or any other signifi- 
cant effects or interactions. Since all three 
groups showed a vicariously conditioned dis- 
crimination effect, it appears that the air 

puff was sufficient to produce conditioned 
anticipatory responses. 


Vicarious Resistance to Extinction 


The analysis of variance for the baseline 
and vicarious instigation periods based upon 
observer EMG eyelid reactions during extinc- 


tion showed no group effects. As shov 
Table 2, evidence for vicarious resistan: 
extinction was found during the instiga) on 
period, since there was increased respor 

for CS+ and not CS—. A significant S 

lus X Measure, F(1,44) = 13.96, p -< 
interaction appeared to be a result of y 

ous discrimination; significant increas n 
responding during the vicarious instigati > 
riod relative to the baseline period were id 
with CS+, F(1,47) = 5.04, p < .05, an. aot 
with CS—, 

In order to determine what effects th: ir- 
puff sound and the model’s response h n 
the vicarious discrimination effects « g 
extinction, extinction data for Groups 1 2 
(combined) were compared with Gro 
Table 2 presents the mean EMG eyeli 
sponses during extinction for baseline l 
vicarious instigation periods as a functio of 
stimulus and group. The analysis reve od 
a Stimulus X Measure X Group interactio F 
(1,66) = 8.52, p < .005. Resistance to 
tinction was found for the experimental gro 
There was an increase in these obser 
responses during the vicarious instigation 
riod with CS+, F(1,47) = 6.04, p< 
and no significant changes with CS—. 
vicarious discrimination effect was not fo 
with the control group. 


Specificity of EMG Responses 


In order to examine the extent to which 
the EMG eyelid responses were localized, 
analyses performed on the eyelid data were 
repeated using the EMG arm responses, Over- 
all, these analyses showed no evidence of dis- 
criminative responding or conditioning in the 
arm. Therefore, the eyelid responses appear 


to be localized and not to result from general 
tension. 


Postexperimental Questionnaire 


The effectiveness of the cover story was 
assessed by asking observers to rate on a 
S-point scale how trustful or suspicious they 
felt about the instructions given, While 52.8% 
of the observers indicated that they trusted 
the instructions by a rating that fell above 
the midpoint of the scale, 22.2% indicated 
Suspiciousness by ratings below the midpoint: 


di 
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Apparently 25% of the observers were neither 
trustful nor suspicious, since they rated the 
instructions at the midpoint of the scale, 
When asked specifically about the physio- 
logical measurements, 62.5% of the observers 
believed that brain waves were being recorded, 
20.8% were uncertain about what was re- 
corded, 11.1% did not respond, and 5.6% 
did not believe that EEGs were being re- 
corded. Of the observers who indicated dis- 
belief about the measurements, 2.8% felt that 
temperature was being recorded and 2.8% 
felt that muscle movements were being mea- 
sured. 

Overall, the postexperimental questionnaire 
data suggested that the instructions were rela- 
tively effective in disguising the true nature 
of the physiological measurements. Other data 
showed that most observers were unable to 
detect the sex of the model. 


DISCUSSION 

The results of the investigation reported 
here demonstrated vicarious EMG eyelid 
responses during acquisition and vicarious 
conditioned discrimination involving these re- 
sponses. The vicarious reactions and condi- 
tioned discrimination effects obtained from 
the EMG eyelid responses were not found in 
the arm, which indicated that the eyelid 
responses resulted from localized muscular 
activity rather than generalized tension. 

Sex differences were found when measuring 
vicarious reactions during the acquisition pe- 
riod, with females responding at a higher 
level than males. However, this sex difference 
was not found for either measure of condi- 
tioning (i.e, anticipatory responses during 
acquisition or resistance to extinction). It 
appeared that the stimulus control for the 
females’ high level of reactivity during acqui- 
sition was due to the model’s response. When 
the model’s response was not presented 
(during the anticipatory period and during 
extinction), the female level of EMG activity 
did not differ from that of male observers. 

In the present investigation, the compo- 
nents of the UCS (air puff and model’s blink) 
appear to have affected differentially the two 
indices of vicarious conditioning. Observer 
EMG reactions during the anticipatory period 
of the acquisition trials were shown to be 
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related to the air-puff sound per se, During 
the vicarious instigation periods of extinction, 
observer EMG reactions appeared to be a re- 
sult of prior exposure to the model’s response 
during the acquisition trials, since no effects 
due to the air-puff sound were evident for the 
control group during extinction. 

_ This study provides stronger evidence for 
vicarious responding and conditioning than 
the earlier investigation by Berger and Hadley 
(1975). One important difference between 
these experiments involves the baseline mea- 
sure. In the present study, observers could 
not anticipate the presentation of the CS 
because the intertrial interval varied. In the 
previous study, observers could anticipate the 
CS presentation because the period preceding 
the occurrence of the CS was standardized, 
Since baseline measures were obtained just 
prior to CS presentation in both studies, it is 
possible that the baseline in the Berger and 
Hadley study was elevated by anticipatory 
responding. Thus, the actual changes from the 
true baseline may have been greater than 
those that were measured. In addition, in the 
earlier study, the model stuttered different 
words on each trial of the stuttering sequence, 
which may have reduced the effectiveness of 
the UCS for lip conditioning. Finally, certain 
types of responses may be more susceptible to 
vicarious instigation and conditioning than 
others, depending upon the degree to which 
they are under voluntary control. There may 
be more voluntary control of arm and lip 
movements than eyelid movements, 


Implications and Conclusions 


The theoretical significance of these findings 
for observational learning is apparent. Al- 
though the overt performance of a previously 
learned imitative response may be regulated 
by subsequent reinforcing contingencies (eg. 
Bandura, 1969), the acquisition of the imi- 
tative response may be influenced by a vari- 
ety of mediating factors. Because it is difficult 
to measure covert mediation, theories of ob- 
servational learning have had to assume the 
presence of mediating processes. These results 
provide a direct assessment of one form of 
mediation, vicarious instigation and condi- 
tioning of motor responses. 
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The findings have implications for condi- 
tioning therapies as well. Given earlier work 
on the relationship between thinking and 
covert activity, there appears to be some 
empirical foundation for covert conditioning 
therapies that rely upon verbal instruc- 
tions concerning imagined activities (Cautela, 
1970). For instance, in covert reinforcement 
therapy, clients are verbally instructed to imag- 
ine themselves rehearsing a particular behav- 
ior and then to imagine a contingent fantasy 
reinforcer. The present results appear to have 
implications for behavior therapies involving 
the use of models (e.g., Bandura, 1969). 
Exposure to a model’s performance tends to 
elicit similar observer responses that can be 
conditioned to environmental cues, These two 
behavior therapy techniques may have much 
in common. In a sense, the covert condition- 
ing approach, based upon instructions, re- 
quires the client to visualize himself/herself 
as a model engaged in certain activities. 
Therefore, the verbal instructions may create 
a self-modeling situation that operates in the 
same way as in an observational situation. 
In other words, both techniques seem to 
involve the elicitation and conditioning of 
selected covert responses. 

In summary, the results demonstrated un- 
equivocal evidence of vicariously conditioned 
discrimination during acquisition of a strong 
tendency of vicarious resistance to extinction. 
The conditioned discrimination found indi- 
cates that the effect was not due to either 
pseudoconditioning or sensitization. Further- 
more, the vicarious reactions and condi- 
tioned discrimination effect were found to 


be a result of localized motor activity insti- % 


gated by specific model behavior rather than 
generalized tension. 
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Expectations of Choice Behavior Held by Cooperators, 
Competitors, and Individualists Across 
Four Classes of Experimental Game 


D. Michael Kuhlman and David L, Wimberley 
University of Delaware 


Kelley and Stahelski have provided data from Prisoner's Dilemma which sug- 
gest that competitive subjects are generally more likely than cooperators to 
assume that others are similar to themselves, The present paper evaluates this 
suggestion by examining expectations of these groups across games of variable 
motivational structure, including Prisoner’s Dilemma, Sixty-seven males and 
sixty-one females were classified as cooperators, competitors, or individualists, 
based on their choices across four classes of three-choice decomposed games, 
Next, in the absence of any information as to their partner's choice, each sub- 
ject estimated what proportion of the population would choose each alternative 
in each of the four game classes. Next, subjects estimated population choice 
proportions in a standard Prisoner’s Dilemma game and then played 30 trials 
of this game against either a totally cooperative or totally competitive stooge. 
In the Prisoner’s Dilemma task, subjects’ behavior and population estimates 
were wholly consistent with Kelley and Stahelski’s findings, Choice estimates 
in the decomposed task, however, strongly suggest that (a) each motivational 
category assumes most others are similar to itself, and (b) expectations in 


Prisoner’s Dilemma are the result of the particular structure of the game. 


The present work was inspired by two 
separate but theoretically overlapping lines of 
research in the area of experimental gaming. 
One line of research concerns the delineation 
of the motivational bases of choice in ex- 
perimental games. In this regard, the work 


demonstrates how a person’s orientation co- 
varies with his/her beliefs about the orienta- 
tions of others, Specifically, Kelley and 
Stahelski have shown that subjects who de- 
scribe their own intentions as competitive at 
the outset of a gaming study show a strong 


by Messick and McClintock (1968) and tendency to assume that most others are 
McClintock, Messick, Kuhlman, and Campos competitively inclined as well, Among sub- 4 
(1973) suggests the existence of three moti- jects who describe themselves as coopera- 
vational orientations which direct a great deal tive, however, a much more heterogeneous < 


of choice behavior in game settings: (a) 
Cooperation is the orientation to maximize 
the joint gain accruing to both self and the 
other person; (b) Individualism is the orien- 
tation to maximize one’s own gain with no 
concern for the gains or losses of the other; 
and (c) Competition is the orientation to 
maximize one’s gains relative to that of 
the other. 

The second line of research, which inspired 
the present study, is that of Kelley and 
Stahelski (1970). Their work concerns indi- 
vidual differences in gaming orientation and 


Requests for reprints should be sent to D. Michael 
Kuhlman, Department of Psychology, University of 
Delaware, Newark, Delaware 19711. 
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set of expectations emerges. Cooperators do 
not manifest the assumed similarity of com- 
petitors but rather expect some people to co- 
operate and others to compete, Kelley and 
Stahelski demonstrate the existence of this 
pattern in the context of the Prisoner's 
Dilemma game. The present study seeks to 
extend the work of Kelly and Stahelski in 
two dimensions: (a) motives of individual 
subjects, and (b) the structure of the conflict 
setting itself. The following paragraphs will 
develop the proposed extensions. 


Motivation oj Individual Subjects 


In addition to the orientations of coopera- 
tion and competition considered by Kelley 
and Stahelski, we will also examine a third 
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TABLE 1 
A PRISONER'S DILEMMA GAME IN MATRIX FORMAT 


Person 2 
Item Column X Column Y 
Person 1 
Row A 5,5 —10, 10 
Row B 10, —10 1,1 
Nole. The payoff to the left of the comma in each cell is 


Person 1's outcome; Person 2's outcome is to the right of the 
comma. 


orientation, individualism. Recent research 
(Kuhlman & Marshello, 1975a) has shown 
that individual subjects differ in terms of 
which of these orientations is most salient. 
They had subjects make 44 choices across 
four classes of decomposed game (described 
in the Method section of this paper) such 
that neither subject knew the choice of the 
other. Sixty-six percent of their 205 subjects 
made their choices consistent with one of the 
social motives mentioned above across all 
four games: (a) 23% chose to maximize own 
gain (individualists) ; (b) 29% chose to 
maximize relative gain (competitors); and 
(c) 14% chose to maximize joint gain 
(cooperators). A subsequent study (Kuhlman 
& Marshello, 1975b) showed subjects in each 
of these motivational categories to produce 
a unique pattern of responding to three 
different preprogrammed strategies (100%C, 
0%C, and tit for tat, where C = cooperative 
choice) in a Prisoner’s Dilemma game. The 
implication of such data is that at the be- 
havioral level (e.g., actual play in Prisoner’s 
Dilemma), each of these three orientations 
is unique. It seems quite straightforward to 
ask if these three groups also differ at the 
attributional level that Kelley and Stahelski 


have been examining. 


Structure of the Conflict Setting 


It is the case that most of Kelley and 
Stahelski’s theorizing is based on choice be- 
havior in the Prisoner’s Dilemma game. Also, 
their article suggests that the difference be- 
tween cooperators and competitors is a gen- 
eral one that is not bound to a specific situ- 
ation. One must wonder whether cooperators, 
competitors, and, in addition, individualists 
would hold constant beliefs about others 


choices across a variety of game settings. In 
fact, a recent study by Miller and Holmes 
(1975) strongly suggests that a game’s struc- 
ture can moderate expectancies of cooperators 
and competitors. The present study will ex- 
amine the expectations of each type of sub- 
ject across four different types of game. The 
games differ from one another in terms of 
their motivational structure, which will now 
be explained. 

Messick and McClintock (1968) show that 
an experimental game can be described in 
terms of how the choices it offers to subjects 
dominate with respect to each type of social 
goal (i.e., own gain, relative gain, and joint 
gain). Consider the Prisoner’s Dilemma game 
in this way. Table 1 shows the Prisoner’s 
Dilemma game used in the present study. It 
is clear that if one wishes to maximize joint 
gain (cooperation), Row A dominates Row B. 
On the other hand, if one wishes to maximize 
relative gain (competition) or own gain 
(individualism), then Row B dominates Row 
A. We can describe the motivational struc- 
ture of Prisoners Dilemma as OR-J; the 
notation indicates that own and relative gain 
are dominated by a single alternative, and 
joint gain is dominated by a different alter- 
native. In addition to games possessing this 
OR-J structure, the present paper also ex- 
amines choice expectations in three additional 
games that possess the following structures: 
(a) OJ-R, in which a single alternative 
maximizes both own and joint gain, and an- 
other alternative maximizes relative gain; (b) 
O-R-J, in which each type of social goal is 
maximized by a unique choice; and (c) ORJ, 
in which all three types of gain are maxi- 
mized by the same choice. To the degree that 
competitors show greater assumed similarity 
than cooperators across most (or all) of 
these classes, the generality of Kelley and 
Stahelski’s thinking will be bolstered. 

To achieve the two extensions just dis- 
cussed, the present paper must classify indi- 
vidual subjects as either cooperators, com- 
petitors, or individualists. This study makes 
use of a classification technique (to be de- 
scribed in the Method section) that differs 
considerably from Kelley and Stahelski’s. 
Therefore, it is highly desirable to demon- 
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strate the comparability of subjects we desig- 
nate as cooperators and competitors to 
Kelley and Stahelski’s categories. Such a 
demonstration will be made in two ways. 

First, we will acquaint subjects with a 
standard Prisoner’s Dilemma game (see 
Table 1). Then, in the absence of any 
information concerning the choice behavior of 
others, each subject will be asked to estimate 
how many people out of 100 would choose 
Row A and how many would choose Row B. 
Given that our groups of cooperators and 
competitors are comparable to Kelley and 
Stahelski’s, we should see the following: (a) 
Competitors will expect most others to choose 
Row B, and (b) cooperators will expect 
sizable subgroups to make each choice. 

The second demonstration of comparability 
between our groups and those of Kelley and 
Stahelski is in terms of actual choice behav- 
ior in the Prisoner’s Dilemma game. Kelley 
and Stahelski have shown that cooperators 
and competitors differ in their responses to 
their partner’s style of play. Competitive 
subjects choose competitively, independent of 
the nature of their partner’s responses, Co- 
operators, on the other hand, show what 
Kelley and Stahelski call “behavioral assimi- 
lation.” That is, they show high levels of 
cooperative choice when their partner is co- 
operative, but they show high levels of defec- 
tion when their partner is competitive. In 
a previous study (Kuhlman & Marshello, 
1975b), cooperators and competitors (clas- 
sified according to procedures similar to those 
of the present paper) responded precisely 
in this fashion to preprogrammed strategies 
which were either totally cooperative or to- 
tally defecting. The present paper will at- 
tempt to replicate this result; a successful 
replication would argue for the comparability 
of our cooperators and competitors with those 
of Kelley and Stahelski. 

Given that both demonstrations of compa- 
rability can be made, it would seem appropri- 
ate to examine the expectations of each moti- 
vational category across four classes of ex- 
perimental game in order to determine if the 
expectation patterns of Kelley and Stahelski 
are indeed general. 


Rare 


METHOD 
Subjects 


Sixty-cight male and 64 female students, selected 
at random from introductory psychology Classes, 
served as subjects, Their participation served as 
partial fulfillment of a labor atory experience require- 
ment, Subjects were run in same-sex dyads. The 
data from four subjects were not used. One male 
had participated in a similar study prior to the 
experiment. Two of the females in one dyad were 
well acquainted before the experiment, and another 
female did not fully understand the instructions. 


Apparatus 


Centered on one wall of the experimental room 
was an easel which supported a cardboard projec. 
tion screen and, later in the experiment, various 
visual aids. Facing the easel were three tables, each 
with a chair. The experimenter sat behind the 
middle table, upon which was placed a slide projec- 
tor, tape recorder, a 24-V de power supply, and a 
number of switches used to control other equipment 
used in the experiment. Subject 1 sat to the left of 
the experimenter and Subject 2 to the experimenter" 
right. Between each subject’s desk and the exper 
menter’s was a large cardboard partition whi 
served to visually isolate each subject from thi 
experimenter and the other subject. 

On each subject's table was a vertical panel wit 
three switches and one light which was used b; 
subjects to register responses in the first part of 
the experimental se « In the second half of the 
session, a box with four lights and two switches 
was used for the play of the matrix Prisoner's) 
Dilemma game, 


Procedure 


Subjects were escorted into the experimental roo: 
in pairs. They were assigned to the role of Subject 
and Subject 2 by the toss of a coin. Subjects wer 
seated, and experimental instructions were presented! 
on a tape recorder. Throughout the instructions, 
subjects were addressed as Subject 1 and Subject 2. 
The other player was called the other person, and 
the words game, player, partner, and opponent were 
never used. The outcome of each game was described 
in “units” rather than points. 


Procedure for Decomposed Games 


During the first part of the session, subjects were 
instructed in and required to make choices in what 
are called decomposed games. (See Table 2 for aw 
example of a decomposed game.) The decomposed 
game confronts each subject with a choice betwee! 
n alternatives, each of which possesses two nume 
ical outcomes: (a) Sf is the outcome to the self, a 
(b) Ot is the outcome to the other. On a give! 
trial, a subject’s total outcome is the sum of § 
(resulting from his/her own choice) and Ot (result! 
ing from the other subject's choice). While an # 
alternative decomposed game is formally equivalent 
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TABLE 2 
EXAMPLE or A Decomposep O-R-J Game 


Alternative 


A B c 
Self 40 50 40 
Other 40 20 0 


ote. Self indicates units that go to one's self; other indicates 
units that go to the other. (In this game, Alternative A maxi- 
mizes joint gain, Alternative B maximizes own gain, and Alter- 
native C maximizes relative gain.) 


to an nXn# game in matrix form, Messick and 
McClintock argue that the decomposed format de- 
emphasizes strategic considerations among its players. 
Such strategic deemphasis is felt to create greater 
f correspondence between a subject’s choice behavior 
and motivation. Table 2 presents an example of one 
of the decomposed games used in the present study. 

During the instructions for decomposed games, the 
subjects were told that they were participating in an 
experiment in decision making. They were told that 
they would be assigning units to themselves and to 
the other person and were informed they would 
receive a small amount of money at the end of the 
| experiment based on the total number of units they 
accumulated. 

Subjects were thoroughly instructed as to how 
„units were awarded when players selected various 
ove in the decomposed games. To ensure 
complete understanding, a quiz was administered 

asking subjects to compute the outcomes for each 
subject, given the nine possible pairs of alternative 
selections by the two subjects. 

Once each subject demonstrated full comprehen- 

sion of the decomposed game, the first part of the 
EET commenced. During this part, subjects 

responded to a series of 24 slides of decomposed 
games. Included in the series were slides of each of 
the following types: OR-J, OJ-R, O-R-J, and ORJ. 

Three different games of each type were used (by 
j different we mean that different numerical values 

were associated with each alternative), and each 
game was presented twice.* 

Trials proceeded as follows. slide was projected, 
and shortly thereafter the signal light on each sub- 
ject’s panel was illuminated to indicate the start of 
the trial. Each subject then selected Column A, 
Column B, or Column C and pushed up the corre- 
sponding switch. The experimenter recorded the 
subjects’ responses and then turned off the signal 
lights to indicate that the subjects should reset their 

;witches. When both subjects had reset their 
witches, the next slide was projected. During this 
part of the experiment, subjects received no feed- 
‘back as to the choice behavior of the other subject. 
Slides were presented such that one example of each 
game class appeared every four trials. 

Following the 24th decomposed trial, subjects were 
shown four additional decomposed games (one of 
each of the four types) and asked to indicate how 
many university undergraduates out of 100 would 


choose each alternative for each of the four types of 
game. At the time when subjects made these pre- 
dictions they had no knowledge as to the choices of 
the other person. 


Classification of Subjects 


The series of 24 choices made by each subject 
was used to classify him/her as an individualist, 
cooperator, or competitor. In previous research using 
decomposed games (Kuhlman & Marshello, 1975a, 
1975b), criteria were imposed on subject’s choices 
such that he/she had to make x (2/3 or 1/2, de- 
pending on the study) or more choices consistent 
with a single social goal within and across all four 
games. The technique for classification used in the 
present study departs from past practice and so is 
explained here. For each subject an estimate of the 
strengths of all three social motives was made; the 
subject was classified in terms of which motive was 
strongest in him or her. This, of course, requires a 
means of describing the relative strengths of the 
motives within each subject. Kuhlman and Marshello 
(Note 1) have shown that Messick and McClin- 
tock’s stochastic choice model can provide such 
a description, 

The stochastic choice model assumes that at any 
given time the subject is in one of four motivational 
states (individualistic, competitive, cooperative, and 
indifferent) with some fixed probability (w, x, y, 
and z, respectively, where w+x+y+s=1.0). If 
the subject is in one of the first three states, he/she 
chooses the game alternative which maximizes the 
type of gain appropriate to the state; if the subject 
is in the indifference state, he/she chooses between 
the alternatives randomly. Hence, the probability 
that a given alternative will be chosen is simply a 
sum of the probabilities associated with goals maxi- 
mized by that alternative, plus s/n, where n is the 
total number of alternatives in the game. For ex- 
ample, consider an OR-J in which Alternative A 
maximizes both own (individualistic) and relative 
(competitive) gain, Choice B maximizes joint (co- 
operative) gin, and Choice C maximizes none of 
these types of gain. The probability of an A choice 
here is w +x + (2/3); the probability of a B choice 
is y + (2/3); the probability of a C choice is </3. 
Hence, if one has & different classes of game (in this 
study k =4), each class having m alternatives (here 
n= 3), one can represent a subject's choice propor- 
tions across this k-n set as a group of simultaneous 
equations involving four unknowns (w, x, y, and 2). 
Solution of such a set of equations would allow 
estimates of the four motivational parameters for 
a given subject. 

Kuhlman and Marshello (Note 1) estimated these 
parameters for 205 subjects using a least-squares 
technique. Correlations between actual choice behav- 
ior in an O-R-J game (which was not used in the 


1A complete listing of all the decomposed games 
used here appears in Kuhlman and Marshello 
(1975b). 
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estimation of the parameters) and predictions based 
on estimated parameters were quite high: (a) for 
own gain choices, r= .81 (males) and .83 (females) ; 
(b) for relative gain choices, r=.89 (males) and 
-90 (females); and (c) for joint gain choices, r= .92 
(males) and ,90 (females). The use of this technique 
in the present study seems desirable for two reasons: 
First, it allows for classification of many subjects 
who would otherwise fail to meet (arbitrarily im- 
posed) criteria, and second, it appears to provide a 
fuller description of a subject's motivational sys- 
tem than the discrete categorical approach used 
heretofore. 


Procedure for the Prisoner's Dilemma Game 


During this second part of the session, subjects 
were introduced to a two-choice Prisoner's Dilemma 
game in standard matrix form. At the beginning of 
these (tape-recorded) instructions, a metal box con- 
taining two switches and four lights (the lights 
located at the corners of a 2-inch (5.08 em) square) 
was placed on each subject’s table. At this point, 
no game matrix was present. When both subjects 
pushed a switch, one of the four lights was illumi- 
nated on both of their boxes. The position of the 
light was determined by the specific switches the 
two subjects selected. Subject 1 could ensure that 
the light appeared on the upper row by selecting 
Row A; he or she could ensure that the light 
appeared on the lower row by selecting Switch B. 
Subject 2 could determine in which column the light 
would appear. If Subject 2 selected Switch X, the 
light would appear in the left column; if he selected 
Column Y, the light would appear in the right 
column. Thus, if Subject 1 pushed the B switch and 
Subject 2 pushed the X switch, the light would 
come on in the lower left-hand corner of each 
display. Subjects were thoroughly instructed as to 
how their switch selection determined the location 
of the light. 

Next, cardboard overlays, upon which outcome 
matrices were drawn, were placed over the control 
boxes. Each overlay had four holes that accom- 
modated the four light fixtures. At this point, sub- 
jects were instructed as to how their choices deter- 
mined the specific outcome received by each subject. 
Subjects were informed that during this part of the 
study, they would earn money at the rate of 10¢ for 
every 200 units accumulated; this established a pay- 
off rate comparable to that used in the series of 
decomposed games. After demonstrating their com- 
prehension of the game, and before actually playing 
it, each subject was asked to estimate how many 
university undergraduates out of 100 would choose 
each alternative in this game. 

Following this, subjects played 30 trials of the 
matrix Prisoner's Dilemma game pictured in Table 1. 
The trials proceeded as follows: First, the signal 
lights were turned on, indicating that subjects were 
free to make their choices. When each subject had 


TABLE 3 


FINAL CELL Sizes FOR THE PRISONER'S 
Diam Srssiox 


Other's strategy 


Subject’s EMOT a 

motivation» 10%C oC 100% C ORC 
Cooperation 17 17 13 12 
Individualism 8 8 10 13 
Competition 7 10 6 7 


Note, 100% C =totally cooperative; 09% C = totally defect 


* Motivational classification ix based uita of 
stochastic choice model analysit, ect 


made his choice, the responses were recorded, and ` 
the signal lights were turned off, indicating that - 
the subjects were to reset their switches for the 
next trial. 

Subjects were led to believe that they were play- 
ing with one another during this portion of the 
experiment. In reality, the experimenter intercepted 
the signal from the other person and substituted 
series of 30 cooperative responses or 30 defectiv 
responses. Whether a subject received cooperativi 
feedback or defective feedback depended on his/her 
behavior in the decomposed games, During the ex 
perimental session, subjects were given tentative 
classification as cooperators, competitors, individual- 
ists, or inconsistents. (Final classification was made 
on the basis of the previously described stochastic 
model.) In order to merit classification other than 
inconsistent, a subject had to select three or more 
alternatives consistent with a given motive within 
each of the four game classes, (Bach type of game 
was represented by six games.) Thus, for a subject 
to be classified as a cooperator he or she wouid 
have to choose the alternative which maximized 
joint gain in three out of the six games of each 
type. Subjects who failed to meet this criterion, 
or who met the criteria for two different categories, 
were designated inconsistent. Members of cach cate- 
gory were alternately assigned to cooperative and 
defective feedback. This procedure was continued 
until a minimum number of five such tentatively 
classified subjects appeared in each cell of a 2 (sex 
of subject) by 3 (motive of subject) by 2 (strategy 
of confederate) design. Each subject's final clasaifiv 
cation was based on the estimated values of te, 3, 
and y from the stochastic choice model. These 
estimates were made subsequent to the experimental 
session, Table 3 presents the final cell sizes obtained 
in this study. 

After completing 30 trials of the matrix Prisoner's 
Dilemma game, subjects were asked to complete two 
questionnaires and were awarded their earnings for 
the experiment. Subjects were asked not to discuss 
the experiment with anyone until the end of the I 
semester, but they were encouraged to consult the ` 
experimenter at the end of the semester regarding 


the outcome of the experiment. 
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TABLE 4 
YSIS OF PREDICTED 
REQUENCIES 
IN THE O-R-J GAME 
Choice 
p ee T 
Variable Joint Own Relative 
Mean observed 2.52 2.07 1.41 
Mean predicted 2.32 2.15 1.52 
Fiohsorved, predicted) 884 807 879 
a Slope 1.16 937 1.18 
\ m Y intercept - ISE 16 05 39 
a 
> RESULTS 
+ 


The results are presented in the following 
sequence: (a) an evaluation of the technique 
used to classify subjects; (b) analyses of 
choice expectations and actual play in the 
Prisoner’s Dilemma game; and (c) an analy- 
sis of choice expectations across the four 
classes of decomposed games. 


Evaluation of the Motivational Classification 
Technique 


As mentioned in the Method section, the 
probability of making a specific choice in any 
of the decomposed games used here can be 

as), Pressed as a linear combination of w, x, y, 
saand Z, which are the probabilities of being in 


` the individualistic, competitive, cooperative, 
and indifferent states, respectively. In the 
| present study, this produces a system of f1 
* equations in 4 unknowns. For example, one 
equation indicates that the probability of an 
OJ choice in the OJ-R game = 1 (w) + 0(x) 

+ 1(y) + 1/3(z). 

The estimation of each subject’s w, x, y, 
and z proceeded as follows. First, probability 
of choice in each game class was estimated 

from the subject’s decisions: If, for example, 
a subject chose the OJ alternative in the 
OJ-R game four times, then the probability 
of OJ/OJ-R was set equal to 4/6. Next, the 
| System of equations was solved. This system 
of equations is overdetermined and, in gen- 
eral, no solution exists. It is possible, how- 
ever, to solve for the four parameters such 
that the estimates of choice probabilities are 
a best fit in the least-squares sense. The 
technique used (Franklin, 1968) reduce g 
equations to a solvable set of four equations 
in four unknowns.? A Gauss-Seidel iterative 


method was then used to solve this set 
of equations. Iterations continued until a 
Convergence criterion of .00005 was achieved, 

The predictive accuracy of these estimates 
was evaluated by computing each subject’s 
w, x, y, and z based solely on choices in the 
OR-J, OJ-R, and ORJ games, These values 
were then used to predict choice frequencies 
in the O-R-J game. Table 4 presents the 
average obtained and predicted choice fre- 
quencies in the O-R-J game; in addition, 
it presents correlations and regression line 
values for observed and predicted frequencies, 
As the table indicates, observed choice in 
this game is strongly related to predicted 
choice derived from the model. It appears 
that this parameter estimation technique pro- 
duces values of w, X, Y, and z which provide 
an accurate picture of subject’s preferences 
in the decomposed task, 

The previous analyses indicate the internal 
consistency of the Present data. Hence, the 
reliability of parameter estimates for a given 
subject is very likely enhanced by using all 
choice data for that subject. Therefore, the 
final motivational classification of each sub- 
ject was based on values of w, x, Y, and z 
computed from all four game classes. Subjects 
were assigned to one of three motivational 
categories based on the maximum value of 
w, x, and y. Thus, for a subject to have been 
classified as a cooperator, y had to be higher 
than either w or x but not necessarily higher 
than z. 

Using this classification scheme, all sub- 
jects could be classified. Of 128 subjects, 
there were 34 cooperative, 16 individualistic, 


2 The overdetermined system is reduced to a de- 
termined one by (A*-A)-Q=A™-B, In this system 
(a) A is an 11 X 4 matrix of coefficients of w, X, yY, 
and z. Each row of A corresponds to a given alter- 
native in a decomposed game. For example, the 
coefficients for the own gain choice in the O-R-J 
game are 1, 0, 0, and 1/3. (b) A? ig the 4X11 
transpose of A. (c) Qisa four-element column 
vector whose elements are w, x, y, and z. (d) B is 
the 11-element column vector whose elements are 
response probabilities in the decomposed game alter- 
natives. For example, the element of B correspond- 
ing to the J choice in the O-R-J game is equal to 
(number J choices in O-R-J/6). The authors wish 
to thank Ronald Menlove for bringing this technique 
to their attention, 
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and 17 competitive males and 25 cooperative, 
23 individualistic, and 13 competitive females. 
A chi-square computed on the 2 (sex) by 
3 (motive) contingency table was nonsignifi- 
cant. This finding is consistent with a lack 
of sex differences reported in two prior studies 
(Kuhlman & Marshello, 1975a, 1975b), 

A one-way multivariate analysis of vari- 
ance was performed to test for sex differences 
in the values of the three motivational pa- 
rameters, w, x, and y.* The sex effect was 
not significant, multivariate F(3, 124) = 1.55, 
p= .2053. A similar result is reported in 
Kuhlman and Marshello (Note 1). 

Table 5 shows the mean value of the four 
parameters (i.e., w, x, y and z) for the three 
motive types. The average value of the 
highest motivational parameter for each 
group was about .70. Cooperative subjects’ 
motivational parameters fall in the order 
joint, own, relative. The motivational param- 
eters of competitive subjects, on the other 
hand, assume the opposite order: relative, 
own, and joint. Notice that the average rela- 
tive parameter for cooperative subjects is 
negative and that the ayerage joint param- 
eter for competitive subjects is also negative. 
Negative parameter values do not necessarily 
imply a shortcoming in the stochastic model. 
First, the current estimates were based on a 
least-squares (as opposed to maximum-likeli- 
hood) technique. Second, each subject's values 
were based on a very small amount of data 
(only six games of each type were used). In 
spite of the reliability problems posed by 
relatively few data points, the model appears 
to provide an acceptably accurate description 
of preferences in the decomposed task. 


Expectations and Cooperation Level in the 
Prisoner's Dilemma Game 


One now asks if the established motiva- 
tional categories are comparable to those of 
Kelley and Stahelski. First, we consider the 
levels of cooperation shown by cooperators, 
competitors, and individualists in response to 
wholly cooperative (100%C) or defective 
(0%C) preprogrammed strategies. Kelley and 
Stahelski’s cooperators show behavioral as- 
similation; that is, they cooperate with a co- 
operative strategy but become competitive 


TABLE 5 


M EAN VALUE OF w, x, Y AND 3 POR THY 
THREE MOTIVATIONAL TYPES 


Parameter 


Group w s y ‘ 
Individuatist J11 055 069 168 
Competitor 038 «703 190 265 
Cooperator 061 —.120 660 420 


against a defective one. Their competitors 
maintain a defective stance regardless of 
their opponent’s actions. As mentioned pre- 
viously, Kelley and Stahelski’s analysis does 
not speak to an explicitly labeled group of 
individualists. It appears reasonable, how- 
ever, to suggest that Kelley and Stahelski’s 
“competitive” groups contain mixtures of 
competitive and individualistic subjects. (To 
be classified as a competitor by Kelley and 
Stahelski, a subject had to describe his goals 
as follows: “I will work for myself, against 
the other player, and will be concerned only 
with my own score” [p. 68].) One might well 
expect similar behavior of competitors and 
individualists in the present Prisoner's Di- 
lemma study, In fact, previous work of the 
first author, Kuhlman & Marshello, 1975b) 
found. that individualists and competitors 
defected against both a 100%C and a 0% 
strategy. 

Table 6 shows the average proportion 
(over 30 trials) of cooperative choices made 
by each motivational category to the two 
preprogrammed strategies. Tt is very clear 
that the cooperators show behavioral assimi- 
lation, whereas competitors and individual- 
ists do not. A 2 (sex) by 3 (motive) by 
2 (strategy) analysis of variance on these 
data showed no main or interaction effects: 
for sex. Main effects for motive, F(1, 116) 
= 39.70, p<.001, and for strategy, F 
(1,116) = 63.23, p < .001, obtained. The 
nature of these effects are evident in the 
margins of Table 6: (a) Defection level was 
highest among competitors, followed by indi- 
vidualists and cooperators; (b) a O07%C 
strategy produced more defection than a 


2 The Multivariance Program, Version 5 (Finn, 
1972), on the University of Delaware Burroughs 
6700 computer was used for all analyses of variance. 
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TABLE 6 


MEAN Proportion oF COOPERATIVE RESPONSES IN 
Prisoner’s DILEMMA MADE BY EACH Motive 
CATEGORY AS A FUNCTION OF 
PREPROGRAMMED STRATEGY 


Strategy 
Group 100% C 0% C Overall 
Cooperator 84 28 37 
Individualist 35 -18 26 
Competitor 7 «13 AS 
Overall 56 21 


Note. 100% C = cooperative; 0% C = defective. 


100%C strategy. In addition to these main 
effects, the interaction of Motive x Strategy 
was significant, F(2, 232) = 14.95, p < .0001. 
As previously stated, the means associated 
with this interaction are very consistent with 
the findings reported by Kelley and Stahelski, 

Recall that prior to actual play of Pris- 
oner’s Dilemma, each subject was asked to 
estimate how many of 100 university under- 
graduates would choose each alternative in 
the game. Kelley and Stahelski found their 
competitors to assume most others are com- 
petitive, whereas Cooperators expect some 
people to cooperate and others to compete. 
Table 7 shows expected choice proportions 
for the current cooperative, competitive, and 
individualistic groups; the data are wholly 
consistent with Kelley and Stahelski’s pat- 
tern. A 2 (sex) by 3 (motive) analysis of 
variance, taking expected number of coopera- 
tive choices as the dependent variable, 
yielded but one significant effect (for motiva- 
tional category, F(2, 122) = 7.41, p< 001). 

The data presented thus far argue strongly 
for the comparability of the current coopera- 
tors and competitors with those of Kelley 
and Stahelski. Second, they suggest an inter- 


TABLE 7 


PROPORTIONS or Otters Exprcrep TO CHOOSE 
COOPERATIVELY OR COMPETITIVELY ry 
PRISONER’S DILEMMA AS A FUNCTION 
OF SUBJECTS ORIENTATION 
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esting possibility for the location indi- 
vidualists along a cooperative/co: titive 
continuum. Behavioral data (see Tole 6) 
and expectation data (see Table show 
individualists to lie somewhere bety -n co- 
operators and competitors. Subsequ: data 
may suggest whether or not such a limen- 
sional conceptualization is useful. 
Expectations of Motivational Catego: 
Across Four Different Games 

Does the pattern of behavioral expec ations 
evident in Table 7 persist as the na re of 
the game changes? Recall that each ibject 
was asked to estimate how many peo) e out 
of 100 would choose each of the thre ilter- 
natives across the four classes of deco: vosed 
game used here. The motivational st: cture 
of the OR-J game is most similar u Pris- 
oner’s Dilemma and so will be ex: ‘ined 
first. Figure 1 shows the choice expec: tions 
of cooperators (represented by J for joint 
gain), competitors (represented by for 
relative gain) and individualists (repre ated 
by O for own gain) across the three al rna- 
tives in the OR: J game, The alternativ are 
indicated in terms of which social g l(s) 
they maximize. 

These data form a pattern that is re. ark- 


ably consistent with that in Table Co- 
operators expect equal numbers of peo; œ to 
choose the joint gain alternative (.46) and 
the relative/own alternative (.46). Comipeti- 
tors expect most people to choose vela- 
tive/own (.65) and only a few (.17) to 
choose joint. The pattern for individuali«is is 
(like Table 7) very similar to competitors: 
expected choice of own/relative is .70 and of 
joint is 20. A 2 (sex of subject) by 3 (moti- 
vational orientation) multivariate analysis of 
variance was computed on these data, taking 
estimated numbers of people choosing the R 
and OJ alternatives as the two-element depen- 
dent variable vector, The analysis yielded no 
significant multivariate (or univariate) Fs 
for sex or for the Sex X Motive interaction. 
A significant multivariate effect, F(4, 242) 


Subjects Choice 
orientation Cooperative Defective 
Cooperation -532 468 
Individualism 386 -614 
Competition :293 -107 


= 15.144, p < -0001, obtained for motiva- 
tional orientation. The univariate effects of 
motivation were significant for both mea- 
sures: (a) for the OR choice, F(2, 122) = 
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14.14, p < .0001, and (b) for the J choice, 
F (2,122) = 28.75, p< .0001. The results 
easily allow for the conclusion that choice 
expectations of cooperators and competitors 
in a three-choice decomposed OR-J game 
are identical to those in a two-choice matrix 
game of similar motivational structure. In 
addition, they show the expectations of indi- 
vidualists to remain constant across two 
games of similar motivational structure. 

Figure 1 also suggests that if a choice alter- 
native in a decomposed game maximizes none 
of the three social goals considered here (the 
“None” alternative in Figure 1), these sub- 
jects do not expect many people to make 
that choice. Such a finding is not surprising 
for the competitive group in that all data 
to this point suggest they expect most others 
to do as they do. For the cooperative group, 
however, the result is important. Up to this 
point, cooperators have expected approxi- 
mately equal numbers of people to make each 
choice in Prisoner’s Dilemma. Since Prisoner’s 
Dilemma is a two-choice game, one interpre- 
tation of the cooperators’ expectation is that 
they expect approximately 1/nth (where n 
is the number of available alternatives) of 
the population to make each choice. The 
present data are clearly inconsistent with 
this interpretation. 

Having established that choice expectancies 
are invariant over decomposed and matrix 


+ 


Proportion Expected 
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= T SII 
Joint Relative None 
&Own 
Goal Maximized 


Ficurr 1. Mean expectations of ccoperators (J), 
competitors (R), and individualists (0) across the 
three alternatives of the OR-J game. 


Proportion Expected 


Joint Own = Relative 
Goal Maximized 


Ficure 2. Mean expectations of cooperators (J), 
competitors (R), and individualists (O) across the 
three alternatives of the O-R:J game. 


games of similar motivational structure 
(OR-J), we now examine expectancies in 
other structures. First, consider the O-R-J 
game. Unlike, the Prisoner’s Dilemma or 
OR-J game, in which a noncooperative choice 
implies either individualism or competition, the 
O-R-J game offers a unique choice for each 
type of social goal. Figure 2 represents 
the expectancies of each motivational group 
across‘the three O-R- J alternatives. 

A 2 (sex) by 3 (motivation) multivariate 
analysis of variance was computed, taking 
the expected number of joint and relative 
gain choices as the two-element dependent 
vector, As was the case in the OR:J game, 
no multivariate (or univariate) effects 0C- 
curred for sex or for the Sex X Motive inter- 
action. A significant multivariate effect, F 
(4, 242) = 21.36, p < .0001, for motivation 
occurred. The univariate F for each measure 
was significant: (a) for joint choices, F 
(2,122) = 39.27, p < .0001, and (b) for rela- 
tive choices, F(2, 122) = 24.56, p < 0001. 

As far as joint gain responding is con- 
cerned, the three groups have expectations 
very similar to those in the OR-J and 
Prisoner’s Dilemma games. However, the 
figure indicates two departures from results 
reported to this point. First, individualistic 
subjects are no longer identical to competi- 
tors, A post hoc Scheffé comparison (Harris, 
1075) between relative gain expectations of 
cooperators and individualists yielded an F 
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3 

Fıcure 3. Mean expectations of cooperators (J), 
competitors (R), and individualists (O) across the 
three alternatives in the OJ-R game. 


(5, 122) = 11.43, p < .05. The second depar- 
ture shown in Figure 2 rests with the co- 
operators. These subjects are not expecting 
equal amounts of each type of game behav- 
ior. Rather, they expect most others to 
pursue joint gain (.48 in Figure 2), then own 
(.31), and finally relative (.21). To this 
point, then, Figure 2 suggests that expecta- 
tions of various groups are not identical from 
one game to another, 

Much more important, however, is the fact 
that Figure 2 Suggests why the various groups 
hold the expectations they do in Prisoner’s 

Dilemma and the OR-J games. First, con- 

sider cooperators, Figure 2 implies they ex- 

pect around half the population to pursue 
joint gain. Their Second strongest expecta- 
tion is own gain (.31), and finally they 
expect .21 of the population to maximize 
relative gain. When a cooperator is asked to 
imagine the level of OR in an OR-J game 

or the level of defection in Prisoner’s Di- 

Iemma, the resulting expectation may be 

approximated by the sum of his/her expecta- 

tions for own gain and relative gain 

behavior Separately. In this case, we would 

observe the cooperator’s “50/50” expecta- 

tions in Prisoner’s Dilemma and the OR-J 
- games, 

Individualists expect relatively little 
amounts (.18) of joint gain and higher equal 
amounts of own gain (42) and relative gain 
(.39). Hence, when these subjects are asked 
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to estimate response level in OR-> or Pris. i 
oner’s Dilemma, they show a stro `xpecta- ~ 
tion of OR or defection, respectiv: `y, Com- 
petitive subjects expect most others 9 pursue 
relative gain (.60) and next, own 1 (.25), 
The same process suggested for < verators 
and individualists implies that c petitors 
would expect high amounts of Ok defec- ~ 
tion in OR-J and Prisoner’s Dilem games. 
The line of reasoning being veloped 
here has implications for expectanc, .atterns 
in a class of game yet to be exar ed, the 
OJ-R game. In this game, unlike ( ‘J and | 
Prisoner’s Dilemma, cooperators sho `d show 
a strong bias in their choice exp: ‘ations, | 
Specifically, they should expect mo. others < 
to make the OJ choice. Individualis! should 
continue to display a bias, but now i will be 
toward a “cooperative” (OJ) form o behav- 
ior. Finally, the bias shown by con. etitors 
should be reduced relative to that wn in 
OR-J and Prisoner’s Dilemma. The hould 
expect a somewhat larger number make 
the cooperative (OJ) choice sin such 
behavior maximizes own gain. i 
Figure 3 indicates the expectation each 
group in the OJ-R game. The ¢. a are 
wholly consistent with the present ‘ne of 
thought. A 2 (sex) by 3 (motive) multi- 
variate analysis of variance was comp: ted on 
these data, taking expected numi rs of 
own/joint and of relative choices as tco two- 
element dependent variable vector. As prior 
analyses, the only multivariate (or univari- 
ate) effect to reach significance was that due 
to motive, multivariate (4, 242) = 21.01, Ż 
< -0001; for the own/joint measure, univari- 


ate F(2, 122) = 49.09, p < .0001; for the rela- 
tive measure, F(2,122) = 40.08, p < .0001. 

We now turn our attention to the final 
game class used to measure expectations, the 
ORJ game. Here, all three motives lead to 
the same choice, and the line of reasoning 
developed in Preceding paragraphs suggest 
that all three motivational groups will evi- 
dence a strong bias in their expectations of 
the ORJ response. Although each group evi- 
denced such a bias (expectations were .63, .78, 
and .78 for Cooperators, individualists, and 
competitors, Tespectively), the cooperators 
apparently expect fewer ORJ choices than 


| 
| 


| 
l 
| 
l 
| 
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either competitors or individualists. A 2 (sex) 
by 3 (motive) univariate analysis of vari- 
ance was performed on expected number of 
ORJ ‘choices. Results showed a single signifi- 
cant effect for motive, F(2,122) = 7.1536, 
p < .001. 

Such a result leads to the following ques- 
tion: How were these groups responding to 
the two remaining alternatives in the ORJ 
game? Results were as follows: For Alterna- 
tive B (40 for self, 30 for other), mean 
expectancies were .27, .15, and .12 for co- 
operators, individualists, and competitors, re- 
spectively. A 2 (sex) by 3 (motive) analysis 
of variance yielded a single significant effect 
for motive, (2,122) = 10.476, p < .0001. 
For Alternative C (50 for self, 10 for other), 
mean expectancies were .10, .07, and .09 for 
cooperators, individualists, and competitors, 
respectively. The analysis of variance on this 
response yielded no significant Fs for any 
effect. Hence, it appears that subjects in the 
cooperative group expect more Alternative B 
choices than do competitors or individual- 
ists, whereas the groups do not differ on 
Alternative C. 

This result suggests why the cooperators 
did not generate higher expectations of ORJ 
than they did. Specifically, past research 
(Kuhlman & Marshello, 1975a) strongly 
suggests the existence of a fourth group of 
subjects who appear motivated to maxi- 
mize the other person’s gains. Kuhlman and 
Marshello found that such subjects (known 
as altruists) showed a decided preference for 
Alternative B over the ORJ choice when 
playing this particular game. In the three 
other classes of game (O-R-J, OR‘J, and 
OJ-R), altruistic subjects chose the joint 
gain alternative. This is consistent with their 
motive because joint gain and altruistic gain 
lead to the same choice in these three games. 
In the present study, it is very likely that 
subjects whose actual motive was altruism 
were classified as cooperators; this is because 
most of their choices (i.e., those other than 
in the ORJ game) appear to be maximizing 
joint gain. By including altruists and joint 
gainers in a single category of cooperators, 
ORJ expectations of the type shown here 
could emerge. 


To check on this possibility, subjects 
were classified into one of four categories: 
individualistic, competitive, altruistic, and 
cooperative. Classification was based on 
simple choice behavior: if 50% or more of 
a subject’s choices were consistent with a 
single social goal within and across all four 
classes of game, the subject was classified 
according to that goal. Otherwise, the subject 
was not classified. This procedure yielded 109 
classifiable subjects distributed across cate- 
gories as follows: 14 altruists (8 males), 39 
cooperators (23 males), 33 individualists (15 
males), and 23 competitors (13 males). A 
chi-square computed on the 2 (sex) by 4 
(motive) contingency table was nonsignificant. 

The expectancies of these four groups were 
submitted to a 2 (sex) by 4 (motive) multi- 
variate analysis of variance, taking expected 
number of ORJ and Alternative B responses 
as the two-element dependent vector. The only 
multivariate (or univariate) effect was for 
motive, multivariate F(6, 200) = 6.12, p= 
0001. The univariate F for each dependent 
variable was also significant. For ORJ, F 
(3,101) = 10.86, p < .0001, and for Alterna- 
tive B, F(3, 101) = 11.51, p< .0001. Mean 
expectations for the ORJ choice were .48 
(altruists), .70 (cooperators) , .82 (individu- 
alists), and .80 (competitors). A Scheffé 
comparison indicated that cooperators expect 
more ORJ responding than altruists, 7(1, 101) 
= 12.93, p< .01. Mean expectations for 
Alternative B were .37 (altruists), 23 (co- 
operators), .12 (individualists), and .11 
(competitors). Here, a Scheffé comparison 
indicated that altruists expected more Alter- 
native B responding than cooperators, F 
(1, 101) = 10.51, p < 01. 

Even when altruists are separated from 
cooperators, it is still the case that coopera- 
tors are less extreme in their expectations of 
ORJ than either individualists or competi- 
tors. This is apparently due to the fact 
that cooperators expect more Alternative B 
behavior than either of these two groups. 

The final question considered in this sec- 
tion deals with expectations of the altruistic 
group across the three previously examined 
game classes of OR-J, O-R-J, and OJ-R. 
As has already been stated, the maximize- 
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other (altruistic) and maximize-joint (coop- 
erative) responses always occur together in 
these three classes of game. Hence, it seems 
reasonable to expect cooperators and altruists 
to have quite similar behavioral expectations 
in these cases. The data are remarkably con- 
sistent with this expectation. Expectations of 
these two groups across the various games 
were as follows: (a) defection in Prisoner’s 
Dilemma = .42 and .47 (altruists and coop- 
erators, respectively); (b) J in OR-J = 46 
and .48; (c) OR in OR:J = .44 and .48; 
(d) O in O-R-J = .20 and 31; (e) R in 
O-R-J = .18 and .19; (f) OJ in OJ-R = .80 
and .85; and (g) R in OJ:R = .12 and .09. 
The expectations of competitors and individu- 
alists across these games is wholly consistent 
with data reported in Figures 1, 2, and 3 and 
in Table 6. 
A series of three (one for each game class) 
2 (sex) by 4 (motive) multivariate analyses 
was computed on choice expectations of sub- 
jects classified according to choice criteria. 
Each analysis yielded results identical to 
those reported earlier for subjects classified 
according to the stochastic model; that is, 
the only effect to reach significance was that 
of motivational category. Also, all multivari- 


ate and univariate Fs computed on this factor 
were significant. 


Discussion 


Results of the Present study, and previous 
research as well (Kuhlman & Marshello, 
1975a, 1975b), demonstrate the existence of 
four different types of subject in undergradu- 
ate populations, 


in this discussion is that it appears reasonable 


are consistent with this ordering. Specifically, 
they show cooperators’ weakest motive to be 
competition, with individualism taking on an 
intermediate value. Competitors’ weakest mo- 
tive is cooperation, 
being intermediate. 

In addition to the present data, recent 
models of gaming motivation have been de- 
veloped which also suggest a continuum of 


this type. Griesinger and Living (1973) — 
have developed a two-dimensiona! eometric © 
model of game motivation whic! lescribes 
a subject’s preference in terms of he angle 
of a “motivational vector.” As <e angle 
changes from —45° to 90°, motivati changes 
from competition to individualism oopera- 


tion to altruism. Although Grice er and ` 
Livingston’s article is primarily retical, 
they do point out a formal equiv. -nce be- 
tween their model and an algeb: model 
developed by Wyer (1969). Wye model 
describes a subject’s preference in rms of 


the weight that he or she assigns to wn gain 
and the weight assigned to other’s ; Wyer 
shows that his model can predict gai © choice ” 


behavior of individual subjects. H ce, the 
continuum being proposed here is b à theo- 
retically tractable and empirically sı ported. 

The second point suggested by th current 
data concerns a Subject’s motivation. expec- 
tations (beliefs about others) as a :nction 
of his/her own motivation. It woul: appear 
that subjects expect most others to be » their 
“motivational neighborhood.” As a gi n mo- 
tive becomes further removed from sub- 
ject’s own orientation, the Subject xpects 
fewer and fewer people to possess 1 +t mo- 
tive. Hence, each motivational cates -y can 
be thought to Possess a unimodal “: :pecta- 
tion of motive gradient” over the « rently 
Proposed motivational continuum. nen a 
subject is asked to generate a be’ wvioral 
expectation in a given game, the rsult is 
understandable as a combination of the sub- 
ject’s expectation gradient and the motiva- 


tional structure of the game. 

From what has been presented thus far, it 
is clear that we take issue with Kelley and 
Stahelski’s suggestion that cooperators and 
Competitors differ in terms of assumed simi- 
larity. We feel our data suggest that assumed 
similarity is common to all groups examined 
here. We also feel, however, that the Pris- 
oner’s Dilemma expectation patterns demon- 
strated by Kelley and Stahelski and repli- 
cated here (in the Prisoner’s Dilemma and 
OR-J games) are representative of the world 
view of these Sroups in the great majority of 
conflict Settings both inside and outside the 
laboratory. 


CHOICE EXPECTATIONS IN FOUR EXPERIMENTAL GAMES 81 


This is due to the fact that in most “real 
world” conflict there is a limited supply of 
the commodity sought after. This creates a 
situation in which people solely motivated to 
maximize own gain and those motivated to 
maximize relative gain would be difficult to 
distinguish, Hence, most conflict situations 
might be characterized as essentially OR-J 
in structure. To the extent this is so, Kelley 
and Stahelski’s expectational patterns would 
be demonstrable in conflict settings other 
than the formalized version of Prisoner’s 
Dilemma. In fact, Kelley and Stahelski cite 
data from two (non-Prisoner’s Dilemma) 
bargaining studies and one survey on student 
protest which show expectational patterns of 
the same type they obtain with the Prisoner’s 
Dilemma game. 

It is important to restate, however, the 
major implication of the current results: The 
stable world view of cooperators and competi- 
tors in Prisoner’s Dilemma and other OR: J 
structures does not result from differential 
amounts of assumed similarity between these 
groups. Rather, the expectations result from 
the shape of each group’s expectation gradi- 
ent and the motivational structure of these 
particular games. 
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Coalition Formation in Small Groups with Incomple: 
Communication Networks 
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The question of how coalitions form within grcups whose members cannot all 
communicate with each other was investigated with an experiment involving 
41 adolescent West German subjects. In groups of six or seven members, sub- 


jects played a computer-based coalition 


variably restricted to certain pairs of players. Subgroups with more communi- 


individual players 
whom they could 
channels, voting 


the frequency with which these channels were used and 


example, senders 


voted against receivers less than receivers voted against senders. The effects of 


Research on communication networks in 
task-oriented small groups has made it clear 
that these networks affect group behavior 
and that individuals at different positions in 
a network behave differently. In most in- 
stances, problem-solving efficiency, morale or 
satisfaction, and leadership or influence in 
teaching the solution have responded to ma- 
nipulation of the network (Bavelas, 1951; 
Gilchrist, Shaw, & Walker, 1954; Leavitt, 
1951; Macy, Christie, & Luce, 1953; Shaw, 
1954, 1955, 1958: Shaw, Rothschild, & 
Strickland, 1957). The relationships are still 
far from completely known and appear to be 
complex. Although task performance tends to 
be more efficient in denser networks, that is, 
those with more channels, this may not be the 
case if the task is quite simple or highly 
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Alexander von Humboldt-Stiftung through a re- 
search fellowship, and by training received under a 
Social Science Research Council Research Training 


manipulations of 


dependent on central coordination ise & 
Miller, 1951; Shaw, 1958: Snadows! 1972) 
or if the subjects have accumulate much 
experience working within the same twork 
(Flament, 1963, pp. 82-84: Guet: ow & 
Simon, 1966; Mulder, 1960: Shaw 954). 
Typically, subjects located at mor: entral 
Positions of a network have tende: to be 
More satisfied and to be seen as mo influ- 
ential in reaching the solution, but ex: -ptions 
have been reported, Many of the ex}. :imen- 
tal results have been reviewed by Hare (1962, 
chap. 10), Béssmann (1967, chap. and 
Shaw (1971, pp. 137-145), 

How do different networks and network 
positions affect intragroup conflict as opposed 
to cooperation? In particular, how does coali- 


ized by incomplete communication 
which 


municate directly with every other member? 


tions that were formed, the ways they divided 
the reward, and several measures of bargain- 
ing behavior, “Throughout all these results,” 
however, “there Were no differences due to 
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the various communication variables” (Rapo- 
port & Kahan, Note 1, p. 13). Only the order 
of message transmission was found to have 
an effect: Subjects permitted to communi- 
cate first had an advantage. But this was the 
case only in four-person games and not in 
five-person ones. 

If Rapoport and Kahan’s (Note 1) largely 
negative findings were to be generalized to 
all manipulations of the communication net- 
work in coalition situations, the result would 
be most surprising. It would go against the 
belief among students of coalition behavior 
(Francis, 1970; Kelley, 1968) that communi- 
cation opportunities affect the formation and 
composition of coalitions. It would also con- 
trast with, even if not logically contradict, 
experimental findings that, at least under 
some conditions, bargaining outcomes are in- 
fluenced by the type of communication per- 
mitted (Daniels, 1967; Westen & Buckley, 
1974), cooperation is more likely when com- 
munication is facilitated (review by Santi & 
Wells, 1975), hostility and the willingness to 
hurt are reduced by closer communication 
(Cappello, 1972; Milgram, 1965), and satis- 
faction with other members of a group is 
greater when one can communicate with them 
(Snadowsky, 1974).* 

The classical variable in communication 
network experiments—who may communicate 
with whom—is one that Rapoport and 
Kahan (Note 1) did not manipulate. It is 
to be expected that if this variable were 
manipulated in a coalition situation, it would 
have a strong effect on who enters the winning 
coalition and on the bargaining process lead- 
ing to its formation. In coalition situations, 
participants exchange more than factual in- 
formation and more, even, than offers of a 
given percentage of the reward. They also 
exchange offers of and bids for solidarity, or 
joint membership in an emerging group. Here 
it is to be expected that messages will lose 
much of their effectiveness when relayed 
through intermediaries, in addition to the loss 
of accuracy that is present for factual mes- 
sages (Flament, 1963, p. 86; Verba, 1961, pp- 
235-236). What should be important in coa- 
lition situations is not the indirect “reach- 
ability” of one group member by another but 
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rather their “adjacency” (Harary, Norman, 
& Cartwright, 1965), that is, the presence 
or absence of a communication channel link- 
ing them directly, Subjects permitted to com- 
municate with each other should be more 
likely to enter the winning coalition together, 
and subjects should be more likely to try 
to bring into the winning coalition those with 
whom they can communicate than those with 
whom they cannot. 


Hypotheses 


These and some related expectations can 
be formulated as a set of eight hypotheses. 
These hypotheses share a common pair of 
antecedent conditions: (a) There is a group 
in which some pairs of members are adjacent 
(i.e., the two members are linked by a direct, 
two-way communication channel) and other 
pairs are not (i.e., the two members cannot 
communicate directly). (b) If a coalition is 
formed by more than a certain number (the 
number being at least one half the size of 
the group) but less than the total number of 
the members of the group, the members of 
the coalition will be rewarded. Under these 
conditions, the following predictions can be 
made: 


Hypothesis 1; A coalition that is actually 
formed will tend to contain more adjacent 
pairs of members than would be expected by 
chance. This hypothesis deals with the out- 
comes of the coalition-forming process. All 
of the remaining hypotheses deal with the 
process itself. 

Hypothesis 2: A coalition that is proposed 
by any member of the group will tend to con- 
tain more members adjacent to the proposing 
member than would be expected by chance. 

Hypothesis 3: At the individual level, the 
likelihood of a given member p, proposing to 
include a given other member py in a coalition 
will be greater if they are adjacent than if 
they are not. 

Hypothesis 4: Given that two members of 
the group, pı and py, are adjacent (i.e, are 


1 The last finding emerges from an inspection of 
Snadowsky’s (1974) data for satisfaction with 
leaders and nonleaders in complete and incomplete 
networks, although the significance of this interaction 


was not reported. 
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able to communicate), the likelihood of pi receiver). In contrast, extreme 
Proposing to include p; in a coalition will vary might lead p; to offer to sacrifice 
with the frequency with which the communi- pjs benefit by suggesting that P 
cation channel between them has been used coalition including p; but exclud 
at the most recent opportunity. Specifically, should expect pj to take offense ; 
the likelihood will be greatest if communica- type of communication, which 
tions have traveled between them in both di- hostile communication, and to take 
rections, next greatest if in only one direction, of the second extreme type, whic 
and least if no communication has traveled in a “martyr” communication. Thi 
either direction. The next hypothesis elabo- the next prediction: 

rates this one. 

Hypothesis 5: If communication has 
traveled in one and only one direction be- > an 5 
tween p; and p; at the latest opportunity, the ™¢lude p; in a coalition will be 
sender will be more likely to propose the in- ‘Pe latest Opportunity, pı has sent | 
clusion of the receiver in a coalition than the °° @ martyr communication than 
receiver to propose the inclusion of the sender, SeMt P; no communication at all, « 
Deutsch (1958) found in a bargaining experi- for whether any communication ha 
ment that those who sent messages proposing i” the other direction. ‘ 
Cooperation were more likely to cooperate _ Hypothesis 7: The effects in Hyp. 
thereafter than those who received such mes- 2) and 3 will vary directly with the 
sages. He offered the explanation that when Which members of the group are 
he sends a Message, the sender commits him- t° the importance of the comn 
self, but does not commit the receiver, as long Channels and barriers in the group. 
as the situation is not so competitive as to Hypothesis 8: The effects in Hy 
make deception legitimate, 1-6 will vary inversely with the ar 

- information available to group 
The hypotheses above have ignored the about the preferences of other mem! 

content of communications and the motiva- sources other than member-memb: 
tions of the group members, but these may munication. These last two hypothese 
be related to coalition-forming behavior and that the various communication effe 
e and Krauss (1962) be strengthened when the (subjec 
aaide tien os ee channels objective) Importance of the commu 
competitively o AE a thaniby network (both hypotheses) or of 
group members develo, oner U so, as communication (Hypothesis 8) is in 
; P cooperative orienta- (Crott, 1972; Hartnett & Cu 


Hypothesis 6: Given that Pi 
adjacent, the likelihood of py pre 
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€ alescing, communication METHOD 
behavior should change. Those intending to Subjects 
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tion volume so as to Support Hypothesis 4,  extraverted had been 
they may affect Communication content in a Rother experiment, M 
way that counteracts that hypothesis, Ex- “aium students. Subjects Saved in’, six 
treme antagonism might lead p; not to avoid 


communicating with Pi, but instead ibavendl Suhiject ahs ea reduced to six members 


him offensive communications, such as a sug- Placed. A ratio 


gestion that p; Support a coalition that in- ene by accepting all female volunteers but not all 
cludes p; (the sender) but excludes p, (the alied mee’, distribution was approximately 


equalized among the six groups. 


; 
i 
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COALITION FORMATION WITH INCOMPLETE COMMUNICATION 


Apparatus 


The experiment took place in the computer- 
assisted instruction laboratory of a large training 
institution. Each subject was assigned to an IBM 
2740 typewriter terminal, located inside a three- 
walled carrel. Terminals were connected to an IBM 
370/155 computer, which executed the randomiza- 
tions, instruction giving, training, message taking, 
message delivery, vote taking, result tabulation, feed- 
back delivery, rule enforcement, timekeeping, event 
recording, and data analysis to be described below, 
via programs written in APL-PLUS. Subjects were 
already familiar with the use of the computer termi- 
nals, because they had participated in a 300-round 
Prisoner’s Dilemma experiment the same morning 
using this equipment.? 


Procedure 


Subjects and the experimenter had lunch together 
before the experiment. Upon their return to the 
laboratory, the experimenter told them that they 
would now play a game against each other rather 
than against the computer as they had done dwing 
the morning. He then seated the subjects at their 
terminals, so that they were out of visual contact 
with each other. He next turned on the terminals, 
which proceeded to give each subject the rules and 
training for playing the game. 

The game involved all six or seven persons in the 
group, had a coalition-forming task, and gave a 
monetary reward to those who succeeded in forming 
a winning coalition. The game was called Tod auf 
See (Death at Sea). According to the scenario, the 
subjects were passengers in a sinking lifeboat that 
could not hold them all and would continue to sink 
unless four of the passengers could reach an agree- 
ment as to which bare minority of the group (ies 
two in the six-person game or three in the seven-person 
game) should be thrown overboard. Passengers were 
informed at the beginning as to how long they had 
to reach this agreement before the boat would sink. 
If an agreement was reached in time, the four 
passengers who remained in the boat would reach 
an island, where they would find a buried treasure 
of 20 German marks, to be divided equally among 
them. In accordance with this scenario, the four 
winners of each game were paid five marks apiece 
(approximately $2), the losers were paid nothing, 
and, if the time limit passed without an agreement, 
no players were paid anything. 

The mechanism that was provided for officially 
forming a coalition was voting. The game went on 
for as many rounds as were necessary to reach an 
agreement or to exhaust the time limit, and on each 
round each player could vote by naming the players 
he wanted to exclude from the winning coalition. 
If on any round four players cast identical votes, 
the four passengers left on board by their votes 
became the winning coalition, and the players were 
informed of the outcome of the game. Otherwise, if 
there was still time left before the deadline, the fail- 
ure to reach agreement was announced, together with 


85 
6 PERSON GAMES 7-PERSON GAMES 
2 2 
a A 
2y be ĉr Sa 
i i 
j \ 
he 
Cee yee à i 
y . 
y y * 
GAME 2 
GAMES 3an04 
Uk 


Ns 
pra 
N 
l 


Ficure 1. Communication networks in 
Games 2, 3, and 4. 


the amount of time remaining, and the next round 
was begun. Nothing was stated in the instructions 
about the possibility of voting against oneself, but 
such a vote was accepted and processed if cast.# 

Of the experimental conditions that were varied, 
the most important for the present study was com- 
munication, Each group played the game four times. 
In Game 1, no communication was permitted. This 
game, plus a training exercise, helped to acquaint the 
subjects with the rules and procedures and is ex- 
cluded from the analysis below. In Games 2, 3, 
and 4, each player had the opportunity to send 
messages to other players before the votes were cast 
for each round. Each message went privately from 
one sender to one receiver. The number of messages 
that a given player could send in a round as well 
as the set of players he was allowed to send them to 
differed from player to player but remained in effect 
throughout a game. Figure 1 shows the commun 
tion networks that were provided in these games. 


2 Support for the programming, program revision, 
and conduct of this experiment was provided by 
Stiftung Rehabilitation (Heidelberg), the Deutscher 
Akademischer Austauschdienst, the Council for Euro- 
pean Studies, the Institut fiir Soz chaften 
der Universitit Mannheim, the State University of 
New York, and Sonderforschungsbereich 24 der 
Universitit Mannheim. Assistance of various kinds 
was rendered by Rudolf Wildenmann, Betty Weneser, 
Walter Wehrli, Wilfried K. Schreiber, Margit Oswald, 
Martin Irle, Jeannette Huber, Dietrich Franz, 
Wolfgang E. Fendt, Günter Dhom, Walter Augs- 
burger, and others. 

$Thus the winning and deciding coalitions need 
not be identical here. Since in practice they usually 
are, analysis is limited to winning coalitions. 
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Each point represents a player, each line represents 
a two-way communication channel, and each number 
represents the number of messages per round that 
the player in question was allowed to send. Since 
all channels are two-way and every player is 
“reachable from,” although not necessarily adjacent 
to, every other player, the networks shown in 
Figure 1 are all incomplete, symmetric, strongly 
connected digraphs (Harary et al., 1965). Pairs of 
players differ in the distances between them and, in 
particular, in whether they are adjacent. 

Messages were all of a fixed form and could only 
be voting suggestions or promises of side payments, 
A voting suggestion simply consisted of a recom- 
mendation as to which players should be excluded 
from the winning coalition. A promise of a side pay- 
ment was a binding commitment to pay the receiver 
some chosen amount of money at the end of the 
game if both the sender and the receiver were mem- 
bers of the winning coalition. A voting suggestion 
could name no more and no fewer players than the 
number (two or three) to be excluded. Side- 
payment promises could be made in any amount of 
pfennigs, with the restriction that, in the course of 
any game, no player could promise a total of more 
than the reward out of which the promises were to 
be paid (i.e, 5 marks). Each message could take 
the form of a suggestion or a promise, but not both. 

To prevent any single player from filibustering, a 
time limit of 30 sec was placed on each choice of 
message receiver, message type, message content, and 
actual vote. Players exceeding this limit were tem- 
porarily ignored so that the game could proceed. An 

fewer than the per- 
round but not for 


t however, a player 
could abstain from voting by exceeding the time 
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Ficure 2. Communication channels within 
winning coalitions, 
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failure to reach an agreement. In Game 


1oWever, 
each player was given a complete tabula! a of the 
votes cast that round, that is, who had vo. } against 
whom (and who had failed to vote). Ir cet, this 
condition removed ballot secrecy, permitt players 
to compare voting suggestions with a: Votes, 
and provided perfect information abou ow the 
process of coalition formation was proc ig. The 
other condition varied (Hypothesis 7) as the 
meaning attributed to the communicati: barrier, 
In every odd-numbered group, those su ts who 
were not adjacent to everyone else were ven an 
elaborated scenario, in which the pa: whom 
they could send messages spoke Ge nd the 
others spoke an “incomprehensible foreic ngue,” 
The players were identified only by nu r, and 
no one knew which player had whic! 


imber, 


except for himself. At the start of c 
numbers were randomly reassigned to ti 


bjects 
and also to the positions of the communic 1 net- 
work for that game. Each player was the: ld his 
own number, the numbers of the $ vhom 
he could send messages, and how y ssages 


he could send each round, along with 
requirements of the game.5 

Upon completion of the fourth game 
were paid whatever they had won in the fo 


other 


bjects 


ames, 
plus 10 marks for the day’s participation, were 
debriefed. 
REsuLTs 
Two of the expected effects were ent, 
The tendency to vote against nona ent 
players was identical, whether the sub -tive 


importance of the communication no work 
was high or low and whether the 


unt 
of extracommunicational information «bout 
player preferences was large or small. Thus 
neither Hypothesis 7 nor Hypothesis $ was 
confirmed, and the data for both values of 
the two independent variables in these 
hypotheses are combined in the remaining 
analysis. 


The data relevant to Hypothesis 1 are 
shown in Figure 2. In the 18 times Death at 


Sea was played in this experiment, the groups 


4This procedure avoids the criticism of small- 
group experiments in which every member is, un- 
realistically, essential to the group’s success (see 
Glanzer & Glaser, 1965). 

ê One of these other 


20, 20, and 14 minutes in the six- 


aching agree- 
and different 
priori by the 


tt i ya 


| 
1 


COALITION FORMATION WITH INCOMPLETE COMMUNICATION 87 


were successful in coming to an agreement on 
a winning coalition before the deadline in all 
but one case. If all of the 17 winning coali- 
tions had contained the least dense communi- 
cation networks allowed by their experimental 
-onditions, then the (cumulative relative) 
coalition network density distribution would 
have followed the curve marked “minimum.” 
If winning coalitions had been selected by 
chance, without regard to communication 
opportunities, the distribution of their net- 
work densities would have followed the 
“random” curve, And if the most communica- 
tively endowed subgroup had become the 
winning coalition each time, the “maximum” 
curve would have indicated the distribution 
of their network densities. The actual dis- 
tribution is the curve labeled “observed.” At 
some points it coincides with the “maximum” 
curve; elsewhere it lies considerably toward 
the “maximum” side of the “random” curve. 
This means that the winning coalitions had 
considerably more adjacent pairs than would 
have been expected by chance (p < .003).° 
The four-person subgroups in each experi- 
mental condition with the largest number of 
adjacent pairs would have been expected by 
chance to become the winning coalition only 
2.6 times, yet actually did become the coa- 
lition 9 times. The least densely linked sub- 
group(s) in each condition would have been 
randomly expected to form two of the coali- 
tions but in fact formed none of them at all. 
This result confirms Hypothesis 1. 
Hypothesis 2 deals with the votes cast in 
the process of reaching (or attempting to 
reach) a coalition, The only votes that are 
relevant to the hypothesis are those cast by 
players who were members of at least one 
adjacent and at least one nonadjacent pair. 
A total of 450 votes were cast by such players 
in the 18 games. Their distribution is shown 
in Figure 3. The pattern here is similar to 
that in Figure 2, although not so extreme. 
When each vote is rated as to its likelihood 
in the experimental condition where it was 
cast, it can be seen that the voting was defi- 
nitely more favorable to adjacent players 
than random voting would have been ($ 
=e ~ 0). In this case, the maximum 
curve shows the distribution that would have 
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Ficure 3. The tendency to vote against 
nonadjacent players. 


resulted if everyone had always voted against 
the largest possible number of nonadjacent 
players. The result confirms Hypothesis 2. 
Hypothesis 3 brings the level of analysis 
down from the vote to the vote element, that 
is, an instance of one player naming another 
player in a vote. There were a total of 1,409 
vote elements, as shown in Table 1. Of these, 
57% were against nonadjacent players, as 
compared with the 44% expected and the 
75% theoretically possible. This result 
(x = 148, P< .001) confirms Hypothesis 3. 
We can see the result of the test of Hy- 
pothesis 4 in Table 2. As expected, the tend- 
ency to vote against a potential communica- 


6 Figure 2 was composed by rating cach possible 
coalition in each of the four communication condi- 
tions (six-person Vs. seven-person game, circle-shaped 
vs. webb-shaped network) as to the probability 
of a coalition being formed in that condition with 
an equally dense or denser network of channels 
among its members. Density was simply measured 
by the total number of lines, that is, adjacent pairs. 
But since a coalition with a given number of chan- 
nels may be one of the densest possible coalitions 
in one condition and one of the least dense possible 
in another, all densities had to be translated into 
(within condition) probabilities for aggregation 
across conditions. The significance of the deviation 
of the observed distribution from the random one 
is measured here and in Figure 3 by the Smirnov 
test for maximum differences between cumulative 
distribution functions. ees 

7 By the likelihood of a vote is meant the likelihood 
that a vote cast by a person at that position in 
that communication network would, by chance, name 
as victims that many ot more nonadjacent players, 
after having taken into account whether one of the 
players voted against was the voter himself. 
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TABLE 1 


COMMUNICATION OPPORTUNITY AND 
COALITION CHOICE 


Vote elements against 
players not adjacent to 
voter 


As % of 

those 

As% subject 
No. vote ofall to 

Item elements cast choice 
Theoretical maximum 1054 15 100 
Observed 798 57 71 
Chance expected 617.2 44 50 
Theoretical minimum 173 12 0 


Note. N = 1409. 


tion partner varies greatly with the degree of 
actual communication that has just taken 
place between the two. Taking as the base 
the situation in which messages have flowed 
in both directions earlier during the round, 
we find a player more than twice as likely 
to vote against someone if messages have 
flowed in only one direction, and five times 
as likely to vote against him if they have 
not exchanged messages at all, This result 
(x? = 223, p< -001) confirms Hypothesis 4,8 
The same table confirms Hypothesis 5 as 
well. Among those who have engaged in one- 
way communication with another player, re- 
ceivers are more than twice as likely to vote 
against senders as senders are likely to vote 
against receivers (Y= 49, p< 001), 
Looking at the table in another way, we 
can ask: Given whether Pi Sent a message to 
Pj, does p,’s communication toward pi help 
us to predict either p,’s or bjs voting behav- 
ior? The answer is that each is significantly 
less likely to vote against the other if Pj 


TABLE 2 


Communication CONDITIONS AND 
COALITION CHotcE 


No. vote elements against 
other player 


Communication between Ob- Ey- see 
players served pected expected 
Meads in both 
irections 86 256.2 
Message(s) in only i z 
one direction 266 378.8 702 
From voter 65 1738 374 
‘0 voter 201 205.0 9 
Communication channel ey SO 
but no messages 259 156.8 1.652 


Note. N = 1409, 
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has sent p; a message during the r id. But 
the messages a player sends tell us mch more 
about his likely voting than do the essages 
he gets. 

Hypothesis 6 is only partly con’ ned by 
the data, which are shown in Table When 
player p; has sent a message to Pj, F +5 more 
likely to vote against pj if pj has se : hima 
martyr or hostile message (row 3) nif pj 
has sent him no message at all (ro ) and 
least likely to vote against him if he «s sent 
pı an ordinary “friendly” message (ro 1 ), as 
hypothesized (x? = 22, P< .001). B | when 
pı has sent no message to Pj, the o con- 
firmed part of the hypothesis is th pi is 
least likely to vote against Py if the la’ er has 
sent him an ordinary message (rov 5; x? 
= 38, p< .001); his likelihood oí oting 
against pj is greater—and about equal ' so— 
if p; has sent him a martyr message, ostile 
message, Or no message at all (rows 10). 
The greater tendency to vote agai the 
senders of martyr messages than aga the 
senders of hostile ones, although consis nt in 
both halves of Table 3, does not atta — sta- 
tistical significance, Finally, the table veals 
a regularity further elaborating H> othe- 
sis 5: Given p,’s Message behavior, p; © nore 
likely to vote against him if p; has n sent 
him a message during the round, is 
shown by the fact that ratios in rows »—10 
are greater than respective ratios in ro\. 1-5 


(P < .001, .001, .02, .02, and .25). 


SUMMARY AND Discussion 


Some hypotheses about coalition formation 
in small groups having incomplete communi- 
cation networks were tested by a computer- 
based experimental game played by West 
German adolescent subjects. As hypothesized, 
subgroups whose members had many channels 
of potential communication among themselves 
were far more likely to become the winning 
coalition than subgroups containing few chan- 
nels. Subjects were considerably more likely 
to vote to exclude someone from the winning 
Coalition if they were unable to communicate 


8 Chi-squares for 
puted imposed limits on 
Xpected frequencies and are 
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with him than if they were able to. Among 
those able to communicate with each other, 
there was a strong relationship between actual 
use of this opportunity and coalition partner 
choice: Those who had been communication 
vartners on any round were more often 
selected as coalition partners. Two-way com- 
munication partners were preferred to one- 
way partners and among the one-way part- 
ners senders were more benevolent toward 
receivers than vice versa. 

When messages were classified by content, 
it was found that recipients of hostile or self- 
sacrificial (“martyr”) messages were more 
likely than recipients of ordinary (coopera- 
tive) messages to vote against the message 
sender. Although players rarely voted against 
the same people they sent messages to, the 
likelihood of such a vote appeared to be very 
responsive to the communication behavior of 
the message target. A negative vote was some- 
what more likely if the target sent no message 
to the voter, and much more likely if the 
target sent him a hostile or martyr message. 
Why, then, was voting behavior less influ- 
enced by communications from those who 
were not among the voter’s message targets? 
Why did it make no difference whether they 
sent the voter a hostile or martyr message or 
did not send him anything? One commonsense 
explanation is that people usually fail to send 
messages to a potential communication part- 
ner because they intend to vote against him. 
In such a case, it would not make much dif- 
ference whether they received no message, 4 
hostile message, or a martyr message from 
this person: Their intention to vote against 
him would not change. The intention might 
well be altered, however, by receipt of a 
friendly message; this pattern is present in 
the lower half of Table 3. In the upper por- 
tion of Table 3, voters have sent messages to 
their potential communication partner, pre- 
sumably indicating an intention in most cases 
to vote for rather than against him. Although 
receiving a friendly message from him would 
reinforce that intention, receiving a hostile or 
martyr message would upset it, and receiving 
no message would subject it to reconsidera- 
tion. The pattern of behavior observed is 
what this reasoning would suggest. 


TABLE 3 
MESSAGE ‘Type AND COALITION Crore AMONG 
ADJACENT PLAYER 


No. vote elements against 
t other player 


Communicati ae 
nication between Ob- served? 
players served pected expected 
Voter has sent one or more 
messages to the other and 
has received: 
One or more ordinary 
messages 63 2284 
No messages OS 1738 
A hostile or martyr message 23 21.8 
A hostile message 10 17.1 
‘A martyr message 13 10.7 
Voter has sent no messages 
to the other and has re- 
ceived: 
One or more ordinary 
messages 172 186.9 
No messages 259 156.8 
A hostile or martyr message 29 18.1 
A hostile message 16 10.6 
A martyr message 13 15 


The effects that were discovered did not 
vary when the barriers of the communication 
network were reinforced with fictional lin- 
guistic differences, nor did they vary when 
complete feedback about the voting results 
(ie., who voted against whom) was provided 
at the end of each round. These negative 
findings disconfirmed the respective hypothe- 
ses. What might account for the stability of 
the communication effect when the impor- 
tance of communication was manipulated? 
The differences in “linguistic” identity might 
have had no effect because they coincided 
exactly with the already announced com- 
munication barriers and thus introduced no 
new information. Identity differences failing 
to coincide with the communication barriers 
might make a difference; this is being investi- 
gated in current experiments. 

As for the information contained in the voting 
feedback, two lines of reasoning are possible. 
On the one hand, more information should 
reduce the players’ dependency on communi- 
cation. But, on the other hand, feedback can 
make it harder to get away with misrepresen- 
tation of one’s voting intentions through mes- 
sages. If players are forced by complete voting 
feedback to send only sincere messages, then 
these messages may be trusted more by their 
recipients and thus may be more effective in 
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reaching a consensus.’ Perhaps what has been 
observed is a case of countervailing relation- 
ships canceling each other out. 

A question for future analysis is the follow- 
ing: What experimental conditions and bar- 
gaining tactics are most conducive to success 
in becoming a member of the winning coali- 
tion? Beyond this, the patterns of coalition 
formation in various communication networks 
under different boundary conditions (group 
size, time pressure, reward structure, unidirec- 
tional vs. bidirectional channels, probabilistic 
vs. deterministic channels, etc.) remain to be 
investigated. Interactive computing systems 
should provide increasingly attractive sites for 
the relevant experiments (Pool & Grofman, 
1975), and the combination of the communi- 
cation-network and the coalition-formation 
traditions of experiments may provide a 
successful paradigm, 


9 Compare Burton (1969, p. 56). The comparison 
of Games 3 and 4 obviously falls short of a Strict 
test of the effect of feedback, since players’ experi- 
ence and skill also increased from each game to 
the next. 
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This experiment was designed to explore the effects of externally imposed 


deadlines on individuals’ 


task performance and their subsequent interest in the 


task. In two deadline conditions, subjects were given an explicit time limit for 


solving a series of initially interesting 


portance of finishing was stated explicitly ; 


word games. In one condition, the im- 
in the other condition, the deadline 


was left implicit. In two control conditions, subjects worked on the puzzles 
without any explicit time limit, In one condition, subjects were asked to work 


at their own pace; in the other, they 


were asked to solve the puzzles as fast 


as possible. Virtually all subjects finished in the allotted time, Unobtrusive 


measures of subsequent interest indica 


ted that in the absence of external con- 


Straints, subjects in the deadline conditions were less interested in the game 
than subjects in the nondeadline conditions. The theoretical implications of 


these findings for the overjustification 


Deadline: a line drawn within or around a prison 
that a prisoner passes only at the risk of being in- 
stantly shot. (Webster’s Third New International 
Dictionary of the English Language, 1966, p. 580) 


Although the word deadline now refers to 
time rather than Space, and the consequences 
of going beyond the deadline are no longer 
quite so severe, its connotative meaning has 
remained largely unchanged. Deadlines are a 
pervasive fact of life in American society, 
exerting coercive power over our allocation of 
time and our expenditure of effort, We not 
only have an April 15 deadline for filing tax 
returns but also a time limit for filing forms to 
request an extension on the original deadline. 
Although it is generally accepted that dead- 
lines are often an unavoidable safeguard 
against procrastination, the external imposi- 
tion of a deadline may have unintended con- 
Sequences for future task enjoyment, Insofar 
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hypothesis are discussed, 


as a deadline causes us to see our es as 
extrinsically motivated, we may no onger 
desire to do something we once enjoy: after 
we have been forced to do it for the -ke of 
meeting a deadline. 

In fact, although the immediate env -izing 
effects of deadlines have been docume: `d by 
researchers interested in productivity í  -ved- 
son, 1974; Webb, Note 1), relatively li `e at- 
tention has been given to the long-ter: con- 
Sequences of deadlines, One such conse ence 
has been studied by Aronson and col- 
leagues (Aronson & Gerard, 1967: Aron on & 
Landy, 1967; Landy, McCuen, & Arison, 
1969). Their research has shown that su jects 
given either an arbitrarily long or short icad- 
line for finishing a task will later judge that 
Particular amount of time to be necessary for 
completion of the task. While these studies 


raise a number of interesting issues, sorme of 
the most important questions about the long- 
term effects of externally imposed deadlines 
have been overlooked: How do deadlines affect 
subjects? subsequent performance, the quality 
of their work, or their intrinsic interest in the 
activity at a later time, when external con- 
straints are no longer present? The present 
study was designed to test the proposition 
that, indeed, deadlines may be detrimental to 
subsequent intrinsic interest, 

Recently, a growing body of literature has 
been concerned with this issue of intrinsic 
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motivation and the ways in which external 
constraints may undermine it. Focusing pri- 
marily on the introduction of extrinsic re- 
wards for task performance, these studies 
(e.g, Deci, 1972; Lepper & Greene, 1975; 
epper, Greene, & Nisbett, 1973) have sug- 
gested that when an individual engages in an 
activity as a means to achieve some external 
reward, his or her subsequent interest in that 
activity is likely to be undermined. Clearly, 
giving an individual a deadline for completion 
of a task is quite different from offering a 
tangible reward for engaging in the task. 
"There is one crucial similarity, however; in 
both cases, it can be said that the individual 
undertakes the task as a means to accomplish- 
ing a specific end. In one case, the end is 
winning the reward; in the other, it is meeting 
the deadline. Thus, to the extent that im- 
posing a deadline leads the individual to con- 
sider his or her behavior as extrinsically mo- 
tivated, such a procedure may similarly result 
in a subsequent decrease in intrinsic interest 
in the activity performed under that con- 
straint. 

To examine this hypothesis, college students 
were asked to play with five sets of an en- 
joyable word game under different sets of in- 
structions. In a no-deadline condition, subjects 
played with the game with no performance re- 
quirements or time constraints. Two deadline 
conditions were run, differing only in the ex- 
plicitness of the deadline. In both conditions 
subjects were told they would be allowed to 
play with the game for 15 minutes, were asked 
to work as quickly as they could, and were in- 
formed that most students could finish the 
game within that time period. In the implicit- 
deadline condition, no further instructions 
were given, In the explicit-deadline condition, 
however, subjects were additionally told that 
they were required to finish the game in the 
time allotted in order for their data to be 
useful. During the second part of the experi- 
mental session, subsequent intrinsic interest in 
the word game was assessed through both 
unobtrusive measures of subjects’ desire to 
engage further in the activity and measures of 
subjects’ attitudes toward the activity. 

There are, of course, other compelling ex- 
planations for the hypothesized effect of dead- 
lines on subsequent intrinsic interest. One 


93 


might argue that the presence of a deadline 
causes an individual to feel pressured to work 
faster, which, in turn, may lead to more 
fatigue, satiation with the task, or frustration 
with set-backs. Furthermore, such factors 
might cause the quality of the individual's 
performance to deteriorate, and his or her dis- 
satisfaction with that performance may be- 
come associated with the task. 

To test the viability of these alternative 
hypotheses, a fourth condition was included, 
In this work-fast condition, as in the no-dead- 
line condition, subjects were given no time 
limit or information about the performance of 
others. As in the two deadline conditions, how- 
ever, they were asked to work as quickly as 
they could. If the performance (eg, speed 
of working, quality of work) of the subjects 
in the work-fast condition during the first 
part of the experiment did not differ from that 
of the subjects in the two deadline conditions, 
their failure to show a similar decrement in 
subsequent intrinsic interest in the task would 
indicate that the speed-of-work explanations 
cannot adequately account for any lessened 
interest shown by the subjects working under 
the deadline. 

It should be noted that the deadline phase 
of the experimental session was deliberately 
set at 15 minutes. Extensive pretesting indi- 
cated that this time limit would be perceived 
as a real deadline by subjects in the two dead- 
line conditions but would still allow sufficient 
time for nondeadline subjects to complete the 
task, This functionally superfluous deadline 
was used in order to eliminate possible sources 
of variance that might arise if a number of 
subjects failed to finish within the allotted 
time. 

To summarize, the following predictions 
were made: (a) Subjects in the two deadline 
conditions would show less subsequent in- 
trinsic interest in the game than subjects in 
the no-deadline and work-fast conditions; (b) 
subjects in the explicit-deadline condition 
would show less intrinsic interest in the game 
than subjects in the implicit-deadline condi- 
tion; and (c) although subjects in the work- 
fast condition would show more subsequent 
interest in the game than subjects in the dead- 
line conditions, these three conditions were not 
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expected to differ on measures of immediate 
performance. 


METHOD 
Subjects and Procedure 


Subjects were 40 male undergraduates at Stanford 
University recruited to participate for $2 in an ex- 
periment on “crossword games,” Each subject was 
greeted in the hall by a male experimenter and es- 
corted into a first experimental room (Room 1). The 
subject’s books, coat, and other belongings were 
placed in a cabinet, and the subject was asked to 
sign a standard consent form. The experimenter then 
introduced him to the crossword game “Ad-Lib” 
(Milton Bradley Company), in which players try to 
form crosswords from the letters facing up on a set 
of 13 dice. Using a prearranged set of examples, the 
experimenter showed the subject how to build such 
crosswords, He then explained that the study was 
concerned with the kinds of crosswords subjects 
could come up with, using as many of the 13 avail- 
able letters as possible, in order to test whether such 
word games were a feasible way of improving vo- 
cabulary skills, 

In order to compensate for differential familiarity 
with word games, the subject was provided with a 
set of hints for using all of the letters, although it was 
emphasized that the experimenters would consider 
a crossword complete if at least 11 of the 13 letters 
were used. The subject was then given 5 minutes to 
toss his own letters and Practice forming crosswords, 
During this time the experimenter answered any 
questions the subject raised and pointed out any 
tule violations or mistakes he had made in forming 
crosswords, 


Following this warm-up period, the practice sets 
Were removed and 


the tape player, left t 


quietly entered Room 2, where he could observe the 
subject through the one- 


the experimenter re- 
dition until the post- 


to the subject diverged 


conditions. In the firsts (no-deadline) condition, the 
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subject was told that he would not b quired to 
work with all five sets of letters; rather was told 
to play with the crossword games onl; much as 
he wished. In the work-fast condition, he Subject 
was asked to work as fast as he could a each of 
the five sets and to try to use at least of the 13 
letters in each set. As in the no-deadli: ndition, 
the subject was not given any inform on the 
amount of time he would have ay Y work- 
ing at this task. In the implicit-deadli ndition, 
the subject was also asked to work a t as he 
could on each of the five sets of letter l to try 
to use at least 11 of the 13 letters in eac? In ad- 
dition, the subject was informed that he ild have 
15 minutes to work at this task, a time | 1 which 
had proved sufficient for most Stanford lents to 
complete the task, Subjects in the exp] leadline 
condition received instructions identical ose for 
the implicit-deadline condition, with an litional 
explicit stricture that the subjects must c lete all 
five sets within the allotted time period for ir data 
to be of any use in the experiment, 

At the end of the tape, all subjects wer d that 
they should feel free to relax and look th La set 
of magazines on a nearby table if they had me be- 
fore the experimenter returned, At the cor ion of 
the instructions, the subject was asked t rn off 
the tape player and begin working on t} izzles, 
When the subject turned off the tape play< le ex- 
perimenter, sitting in Room 2, proceeded ecord 
the amount of time the subject spent wit ch of 
the five sets of letters. Subjects in all conditi- were 
given 15 minutes to work with the Ad-Lib ; all 
subjects who finished early did spend the inder 
of the 15 minutes looking through the maga that 
had been set out. At the end of this period ex- 
perimenter returned to Room 1 and recor the 
subject’s crosswords.2 

Measurement of intrinsic interest. The expe enter 
then explained to the subject that scheduline con- 
straints made it necessary to move him to ther 
room to fill out a questionnaire on the ex; ient, 
The subject was escorted to Room 2, which was 
filled with equipment and boxes to give it ap- 
Pearance of being a storage area, The subject was 
seated at a table next to a concealed one-way mirror. 

1 Post-experimental debriefing revealed that six 
Subjects were suspicious of the true purpose of the 
experiment: three in the no-deadline condition, one 


challenge to deadline subjects yet would allo 


otted time, Since virtu 
task (whether given 


uld appear to be a realistic 


w con- 


ally all 
a time 


limit or not), the deadline itself was, as intended. 


functionally superfluous, 
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On the table in front of him was a set of Ad-Lib 
cubes and blocks from a “Soma-Cube” puzzle (Skor- 
Mor Corporation), which extensive pretesting had re- 
vealed to be of approximately equal initial interest to 
subjects. 

The experimenter took the subject’s questionnaire 

3t of its envelope and as he explained the instruc- 
ions to the subject, he discovered that the second 
jage of the questionnaire was totally illegible. The 
experimenter then “decided” he should get another 
copy of the questionnaire and asked the subject to 
it and relax while he went down to the basement of 
the building to get one. In fact, the experimenter sur- 
reptitiously returned to Room 1 and observed the 
subject for a period of 15 minutes, recording the 
sunt of time he spent with the Ad-Lib and the 
na~Cube games, At the end of 15 minutes, the ex- 

imenter returned to Room 2 with a fresh copy of 
he questionnaire, which the subject then filled out. 

The questionnaire included a number of questions 

designed to assess the subject’s intrinsic interest in the 
Ad-Lib game. The subject was asked whether he 
would be willing to participate without pay in future 
experiments involving the game and whether he 
viewed the game as work or leisure activity. He was 
also asked to indicate how much satisfaction he re- 
ceived from the game and how much he had enjoyed 
playing with it. Finally, the subject was asked 
whether his play with the game during the experi- 
mental session was intrinsically or extrinsically mo- 
tivated, 

Upon completion of the questionnaire, the subject 
was probed for suspicions. concerning the true pur- 
pose of the experiment. Each subject was then ex- 
posed to a thorough debriefing concerning the true 
purposes of the experiment and the necessity for the 
deception involved. All subjects appeared interested 
in the experiment and its potential implications, and 
they expressed no negative feelings about the decep- 
tion or participation in the experiment. 
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RESULTS 


Three sets of dependent measures were col- 
lected: (a) behavioral measures of subsequent 
intrinsic interest in the task during the free- 
time situation which followed the deadline pe- 
riod, (b) attitudinal measures of intrinsic in- 
terest in the task collected at the end of the 
experiment, and (c) performance measures as- 
sessing the speed and quality of subjects’ 
work during the deadline period. The data 
from each of these measures will be con- 
sidered, in turn, with respect to the experi- 
mental hypotheses. 


Behavioral Measures of Subsequent Intrinsic 
Interest 


Of primary interest in this study was the 
amount of time subjects would choose to 


TABLE 1 


MEAN SUBSEQUENT INTRINSIC INTEREST IN THY TASK, 
ON BEHAVIORAL AND ATTrrUDINAL MEASURES, 
BY CONDITION 


Experimental condition 


No Implicit Explicit 
K dead- Work dead- dead- 
Intrinsic interest line fast line line 


Mean percent of 
free-choice time 
spent with task 62.34 54.16 3298 2448 

Mean summed rat- 
ings of task in- 
terest 1.86 1.62 


—.13 —1.94 


Nole, n = 10 subjects per cell. 


spend playing with the Ad-Lib game during 
the subsequent free-time period when they be- 
lieved that their behavior was no longer being 
monitored. The mean time spent with the Ad- 
Lib game in each condition, presented as a 
percentage of the total 15-minute period avail- 
able, is presented as the first item in Table 1. 

To test the hypothesis that imposition of 
an externally imposed deadline would decrease 
the likelihood that subjects would subse- 
quently choose to engage in the activity in the 
absence of external constraints, a planned 
contrast comparing the two deadline condi- 
tions to the two nondeadline conditions was 
performed, This contrast proved to be sig- 
nificant, F(1, 36) = 6.46, p < .025, account- 
ing for most of the systematic treatment vari- 
ance. However, a second planned comparison 
between the two deadline conditions, to test 
the hypothesis that subsequent interest would 
be more adversely affected by an explicit 
deadline, failed to approach significance (F 
Sale 

The same decrease in interest among sub- 
jects in the deadline conditions is also ap- 
parent if the data are viewed nonparametri- 
cally, indicating that the effect does not seem 
to be the result of a small number of extreme 
cases. A median split performed on the 
amount of time subjects chose to engage in 
the task indicated that 65% of the subjects 
in the nondeadline conditions fell above the 
median, while only 35% of the subjects in the 
two deadline conditions fell above the median 
in interest. This difference between the two 
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deadline and nondeadline groups is significant 
by Fisher’s exact test (p = .05) 2 

Finally, a similar pattern of results was ap- 
parent in terms of the game subjects first 
chose to play with after they had been left 
alone, Of subjects in the nondeadline groups, 
60% played first with the Ad-Lib game; in 
the two deadline conditions, only 20% chose 
to play first with Ad-Lib. Again, this differ- 
ence is significant by Fisher’s exact test (p 
= .04). 


Attitudinal Measures of Interest 


In order to investigate the relationship be- 
tween behavioral measures and verbal reports 
of interest in the game, subjects were later 
asked to complete a brief questionnaire which 
contained five questions designed to measure 
subjects’ enjoyment of and interest in the Ad- 
Lib game. From these questions a single mea- 
sure of interest was obtained for each subject 
by normalizing the responses to each particu- 
lar question and summing the five z scores for 
each subject. 

The mean ratings of interest in the task ob- 
tained by this procedure are presented as the 
second item in Table 1, where it is apparent 
that the results of these attitude measures 
closely parallel those obtained on the behav- 
ioral measures of interest, Again, a planned 
comparison assessing the difference in interest 
between the deadline and nondeadline condi- 
tions proved significant, F(1, 36) = 11.96, p 
< .025. As before, the contrast between the 
two deadline conditions did not approach sig- 
nificance (F < 1), 

As might be expected from these results, 

this overall attitude measure was significantly 
correlated with the Previous behavioral mea- 
sure, 7(38) = 40, p < 05. Not surprisingly, 
each of the five component questions which 
entered into this overall attitude measure also 
correlated positively with the amount of time 
subjects chose to play with the game, though 
only three of these correlations—willingness to 
participate in future Ad-Lib experiments with- 
out pay, perceived satisfaction while doing 
the Ad-Lib task, and perceived enjoyment of 
the Ad-Lib task—were individually statisti- 
cally significant. 
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Perjormance Measures During i} Deadline 
Period 

To examine possible differences i the qual- 
ity of subjects’ performances duri. > the ini- 
tial part of the experimental sessi: :, several 
qualitative performance indices v e exam- 
ined. Measures of the average leng f words 
formed in subjects’ crosswords, the ın num- 
ber of letters included in subjects’ « swords, 
and the number of misspellings « her er- 
rors, for example, were subjected \ series 
of analyses of variance. In none of e cases 
was there any significant effect of experi- 
mental treatments on the quality o bjects’ 
performance. 

Since subjects in all conditions ned to < 
demonstrate an equivalent proficienc n their 
use of the Ad-Lib game during ti “xperi- 
mental period, it is possible to con re the 
amount of time subjects required to- mplete 
the five crossword sets. A one-way an «ysis of 
variance performed on these time dat: ielded 
a significant treatment effect, F (3,36 4,09, 
$ < 05. Not surprisingly, a consider: « por- 
tion of this systematic treatment va: nce is 
accounted for by the longer times en by 
subjects in the no-deadline conditio: (M = 
13.5 min.) who had been told to 


k at 
their own speed, compared with sul -ts in 
the work-fast condition (M = 11.7 mi ) and 


the two deadline conditions (M = 10." min.) 
who all had been asked to complete five 
crosswords as quickly as they could, 2 36) 
= 7.99, p < 01, 

Of greater potential interest, how T, is 
the comparison between the work-fast and the 
two deadline conditions, Comparing osiy the 
work-fast condition with the two deadline 
groups on the amount of time needed to com- 
plete the crosswords produced no evidence of 
performance differences during the deadline 


period) # (1, 27)\= 1.27, ms, However, a simi- 
lar comparison between the work-fast condi- 
tion and the two deadline groups did yield a 
significant effect on the attitudinal measure of 
intrinsic interest, F(1, 27) = 7.26, p < .025, 
and a nearly significant effect on the behav- 
ioral measure of intrinsic interest, F(1, 27) = 
3.89, p < .06. These data would appear to 


SAI $ values reported in this paper are based on 
two-tailed tests of significance, 
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indicate that performance differences during 


* he initial part of the experimental session 


were not a necessary precondition for subse- 
juent decrements in interest shown by sub- 
jesis in the two deadline conditions. 


DISCUSSION 


This study demonstrates that an externally 
d deadline for completion of a task can 
ult in a decrement in subsequent intrinsic 
interest in that task. Compared with subjects 
in the two no-deadline conditions, subjects in 
both deadline conditions showed a marked de- 
interest that was evident in unobtru- 
havioral measures as well as attitudinal 

veasures of interest. Thus, this study provides 
additional support for the “overjustification” 
hypothesis (cf. Lepper et al., 1973), which 
suggests that imposition of a means—end rela- 
tionship between an activity and an external 
onstraint may undermine intrinsic interest in 
that activity. 

There are a number of other explanations 
that could be offered for the decreased in- 
trinsic interest of subjects in the two deadline 
conditions, but these are not supported by the 
data. First, it could be argued that the pres- 
sure of working quickly may have conse- 
quences (e.g., greater fatigue, satiation, dis- 
satisfaction with performance, frustration 
caused by setbacks) that could lead to a sub- 
sequent decline in interest. It will be recalled, 
however, that while the level of interest shown 
by subjects in the work-fast condition was 
higher than that shown by subjects in the two 
deadline conditions, there were no perform- 
ance differences in the time spent or the qual- 
ity of subjects’ work across these three condi- 
tions. The data, therefore, provide no evidence 
to support this class of alternative explana- 
tions. 

Second, it might be argued that subjects 
were responding on the basis of their infer- 
ų ences regarding the experimenter’s own inter- 

est in the task, In other words, his imposition 

of a deadline might communicate to the sub- 
jects that he found the task to be aversive. 

To examine this possibility, one item on the 

questionnaire asked subjects to judge how en- 

joyable the experimenter considered the Ad- 

Lib game to be. Since there were no sig- 

nificant differences in response across the 


' Ai 
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no-deadline (M = 9.7), work-fast (M = 7.7), 
and deadline (M = 7.4) conditions, however, 
this explanation receives no support from the 
data. 

Finally, it could have been argued that sub- 
jects in either the no-deadline or work-fast 
conditions would be more willing to play with 
the game later because of their failure to 
complete the word games during the initial 
experimental session (a type of Zeigarnik 
effect). Since virtually all subjects finished 
the game within the allotted time, this poten- 
tial explanation has little merit. 

It seems, then, that the overjustification hy- 
pothesis provides the best explanation of the 
data, Previous research (Lepper et al., 1973; 
Deci, 1972) has typically studied the effect of 
tangible rewards on intrinsic motivation, but 
the present study shows that the extrinsic goal 
or end need not be tangible to result in a 
subsequent decrease in intrinsic interest. It 
appears that simply engaging in an activity 
in order to achieve something as abstract as 
“meeting the deadline” can produce the same 
effect, Other research (Lepper & Greene, 
1976) has shown that other external con- 
straints, such as overt surveillance over an 
activity, can similarly undermine future in- 
terest in that activity. Thus, the perception of 
the external constraint itself, rather than the 
particular form of constraint, appears to be 
the critical variable. In an important sense, 
it is the psychology of extrinsic constraints— 
rather than the psychology of rewards, dead- 
lines, or surveillance—which is illuminated by 
these studies. i 

Whereas the data provided significant sup- 
port for the hypothesis that subsequent in- 
trinsic interest in the activity would be less 
for subjects undertaking the task under an 
externally imposed deadline compared pele 
subjects undertaking the task in the absence 
of such a constraint, the results failed to re- 
veal an expected difference between the 
deadline conditions. Although subjects in the 
explicit-deadline condition appeared to show 
less intrinsic interest in the task than the im- 
plicit-deadline subjects on each of the no 
sures of subsequent interest, none oe 
comparisons approached statistical signifi- 
cance. Whether this lack of difference between 
the two deadline procedures is 4 reflection of 
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a “floor effect” in the measures of subsequent ance. Journal of Personality and Soc 
interest, the relatively small differences in ee ee lasts er, 
those procedures, or the all-or-none nature of Parkinsons Tees A ATRA ae 
attributions of intrinsic versus extrinsic moti- 


the excess time effect. Journal of Ex 
vation awaits further empirical research. cial Psychology, 1967, 3, 274-285. 


Of course, whether the imposition of any Arvedson, b A: ae and organis 
particular deadline will have positive or nega- PE ue ese a E 40, A 
tive effects on later intrinsic interest in the lished doctoral dissertation, Stan! 
activity will obviously depend on a number of 


1974, 
factors in addition to those specifically ex- Deci, E. L. The effects of contingent 


amined in the present study (e.g. the indi- tingent rewards and controls on int 
vidual’s initial skills and interests), as well as EOE mei Shand ga 
the context in which the deadline is presented Landy, D., McCuen, MERC Atonion, 
(cf. Lepper & Greene, 1976). Undoubtedly, 


Parkinson’s Law: III, the effect of pri 
deadlines frequently serve a useful function;  contractive distractions on the wastin 


employed to excess, however, they may also e Ana Journal of Applied 
prove ultimately dysfunctional and even SA iE an? 7490839; ; 

S Sins epper, M. R., & Greene, D. Turning pla 
perpetuating. The constructive implementa- Effects of adult surveillance and extri 
tion of deadlines, then, will require a sensi- 


on children’s intrinsic motivation, Jour 
tivity to both their immediate and their sonality and Social 


Psychology, 1975, 
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The Effect of Premenstrual Anxiety and Depression 


on Cognitive Function 


Sharon Golub 
College of New Rochelle 


The Depression Adjective Check List, State-Trait Anxiety Inventory, and a 
battery of factor analytically derived cognitive tests sensitive to anxiety or 
depression were administered to 50 women between the ages of 30 and 45 
during the 4 days prior to the onset of menstruation and again 2 weeks later. 
Testing sessions were scheduled on the basis of a previously completed Men- 
strual Distress Questionnaire. There were significant increases in anxiety and 
depression. during the premenstruum. However, no statistically significant dif- 
ferences were found in cognitive test performance, and correlation data failed 
to support any consistent relationship between premenstrual mood and cogni- 
tive function. Moreover, no significant correlations were found between pre- 
menstrual complaints on the Menstrual Distress Questionnaire and either 
cognitive test performance or mood scores obtained during the premenstrual 
testing sessions, It was concluded that the magnitude of the premenstrual mood 
change was not great enough to affect intellectual function. Alternative explana- 


i li There are some who believe that a woman’s 
“f ability to function intellectually is impaired 
|j ‘during the premenstrual phase of the men- 
‘strual cycle. Among them is Tiger (1970), 
‘who has suggested that women may be handi- 
capped intellectually by the phenomenon of 
‘menstruation. He is concerned that 


an American girl writing her Graduate Record Ex- 
aminations over a two day period or a week-long 
set of finals during the premenstruum begins with a 
disadvantage which almost certainly condemns her 
me to no higher than a second-class grade. A whole 
career in the educational system can be unfairly 
jeopardized because of this phenomenon. (p. 132) 


h What evidence exists to support or refute this 
belief? 

Studies of the effects of the premenstrual 
phase of the menstrual cycle on cognitive 
function to date have been inconclusive. 

>" (1937) tested college students during 
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-tions of the absence of decrements in performance are discussed. 


four phases of the menstrual cycle and found 
no statistically significant differences in per- 
formance on a digit symbol learning test. 
Wickham (1958) gave a battery of intelli- 
gence, spatial, mechanical, math, and verbal 
tests to 4,000 young women and found no sig- 
nificant decrement in performance during the 
“period” phase, which consisted of the 4 days 
before and after the onset of menstruation. 
Unfortunately, Wickham’s inclusion of the 
menstrual days could have diluted the effects 
of the premenstrual mood change, since it has 
been observed that after the onset of men- 
struation there is a relaxation of premenstrual 
tension and irritability and an improvement 
in mood (Benedek & Rubenstein, 1942; 
Janowsky, Berens, & Davis, Note 1). Sommer 
(1972, 1973) reported no association between 
menstrual cycle phase and the performance of 
college women on the Watson-Glaser Critical 
Thinking Appraisal, regular class examina- 
tions, or matched forms of the Repetitive Psy- 
chometrics Measures. However, it should be 
noted that the subjects in these studies, as in 
the others cited, were college students or 
young women. Since premenstrual symptoms 
become most pronounced during the fourth 
decade, peaking in the mid-thirties (Dalton, 
1964; Lloyd, 1963; Moos, 1969), it is pos- 
sible that the negative findings reported in 
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these studies were related to minimal mood 
changes in young subjects. Simultaneous mea- 
sures of mood and cognitive function would 
have demonstrated whether or not a pre- 
menstrual mood change was present at the 
time of testing. 
Only Dalton’s studies report a premenstrual 
decrement in intellectual performance, and 
these studies have been criticized by Sherman 
(1971) and Sommer (1973) on methodologi- 
cal grounds. In a study of school performance, 
Dalton (1960) analyzed the effect of the pre- 
menstrual and menstrual phases of the cycle 
on weekly grades of students at a boarding 
school. She found a fall in the standard of 
school work during the premenstruum in 27% 
of the students, However, 17% improved their 
performance and 56% showed no change, The 
magnitude of the changes was not reported. 
Nor did Dalton provide any statistical valida- 
tion of her data. Dalton (1968) also reported 
lower exam scores among English schoolgirls 
during the premenstruum, but again no statis- 
tical tests were reported for these results; her 
conclusions were based on two different 
groups, which may or may not have been 
comparable. 

Tn contrast to these contradictory cognitive 
findings, premenstrual mood changes have 
been consistently reported in menstruating 
women (Bardwick, 1971; Benedek, 1959; 
Benedek & Rubenstein, 1939; Dalton, 1964: 
Golub, 1976; Ivey & Bardwick, 1968 ; Janow- 
sky, Gorney, & Kelley, 1966a, 1966b; Kessel 
& Coppen, 1963; Kutner, Phillips, & Hoag, 
1971; Kutner & Brown, 1972; Lloyd, 1963; 
McCance, Luff, & Widdowson, 1937; Moos, 
Kopell, Melges, Yalom, Lunde, Clayton, & 
Hamburg, 1969; Paige, RIRE Pennington, 
1957; Sherman, 1971). Furthermore, studies 
unrelated to the menstrual cycle indicate that 

affective states do influence cognition, An- 
xiety impairs such cognitive processes as prob- 
lem solving, incidental learning, ability to 
communicate, and performance on standard- 
ized intelligence tests, particularly timed tests 
(Levitt, 1967). And Rosenwald (1961) found 
that performance on very difficult tasks js 
impaired by anxiety. In an attempt to demon- 
Strate the effect of depression on both cogni- 
tion and personality, Messick ( 1965) com- 
pared two groups of college students and 
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found significant differences in performance £ 
cognitive tasks involving flexibility and Speed 
of closure. More recently, significant decre- i 
ments in graphic expression, writingaspeed, fi 
and digit symbol test performance were’ founds. 
among subjects in whom a depress mood 
had been induced (Hale & Strickland, Note 2; K 
Strickland, Hale, & Anderson, Note 3), ib 

Since previous research has consistently ty, 
shown that there are changes in mood during 
the premenstruum, and other studies have = 
demonstrated the adverse effect of depression 4 
and anxiety on cognitive function, this study 4 
attempted to test the hypothesis that there is 
a premenstrual mood change of sufficient mag- 
nitude to affect cognition, Women over the Y 
age of 30 were sampled, and simultaneous 
measures of mood and cognitive function were i 
obtained. Cognitive tests were selected that * 
have been shown to reflect the effect of de- ( 
Pression and anxiety. And confusion between Í 
Premenstrual and menstrual effects, whichéy. 
has limited other studies (Parlee, 1973; Som?” 
mer, 1973), was avoided by confining the pre’. 
menstrual observations to the 4 days’ before sc 
the onset of menstruation, The indrmenstrualsis. 
observations were made at the midpoint of % 
the intermenstruum, which is the interval be- y 
tween the menstrual and premenstrual phases. E 
of the cycle, 


MeETHop 
Subjects j! 


intermenstrual, Although no attempt was made t 

match the two groups, they were very similar with 
education, and number of children, 

The mean age in both groups was between 36 and al 

of both groups were college gradu- 

ates, and all but one subject in each group had had t t; 


Some college education, 3 


Tests and Questionnaire 


Menstrual Distress Questionnaire (A; 
e (MDOQ). 
MDQ (Moos, 1968) asses: 4 


toms during the Premenstrual menstrual, 


henstrual phases of a woman’s most recent menstrual 
cle, Three of the eight MDQ symptom scale groups 
were used in this study to record subjects’ complaints 
of premenstrual negative affect, difficulty concentrat- 
ing, and behavioral change. Negative affect refers to 
anxiety and depression; the concentration and be- 

) yavioral change scales are made up of such com- 
E ponents as forgetfulness, difficulty concentrating, and 
4) impaired School or work performance. The MDQ 
A also served to establish the subjects’ menstrual cycle 
L > length and schedule. 

Depression Adjective Check List (DACL). The 
DACL (Lubin, 1967) is a brief, reliable, and valid 
sure of self-reported transient depressive mood, 
E lists consist of 32 adjectives connoting varying 
$ degrees of depression and elation. Forms A and D, 
| with an intercorrelation of .90, were used. Form A 
was administered at the first testing session and Form 
D at the second. 

j State-Trait Anxiety Inventory (STAI). The STAI 
j (Spielberger, Gorsuch, & Lushene, 1970) consists of 
\ {wo separate 20-item self-report rating scales measur- 
} g state and trait anxiety. Instructions on the state 
|scale require that the examinee report how she feels 
at the time of administration, whereas the trait scale 
instructions ask that she indicate how she generally 
feels. 

The cognitive battery. This set of cognitive mea- 
sures was selected primarily from the factor analyti- 
cally determined revised Kit of Reference Tests 
(French, Ekstrom, & Price, 1963). They were chosen 
to appraise a representative sample of intellectual 
functions thought to be vulnerable to changes in 
affect and included measures of sensory-perceptual 
factors, memory, problem solving, induction, concept 
> formation, and creativity, The cognitive battery was 
comprised of the tests listed in Table 2. Each test is 
divided into two parts; Part A was administered at 
the first testing session and Part B at the second. All 
are timed tests that range in difficulty from rela- 

tively simple to extremely difficult. 


Procedure 


The DACL, STAT, and the cognitive battery were 
administered twice to each subject—once when the 
subject was within 4 days of the onset of her next 
menstrual period (premenstrual) and once about 2 
weeks after the onset of menstruation (intermen- 
strual). Testing sessions were scheduled on the basis 
of the previously completed MDQ that each subject 


told that the MDQ was part of a separate study of 
the consistency of menstrual cycle symptoms and 
was unaware of the purpose of testing session tim- 
ing. Follow-up phone calls verified the actual date 
of onset of the next menstrual period and checked 
the consistency of MDQ symptom complaints. Sub- 
jects were told that the purpose of the testing ses- 
sions was to determine the relationship between 
women’s personality and creativity. All of the tests 
were administered by one experimenter, a woman, 
and subjects followed the written directions on each 
test. 
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TABLE 1 


COMPARISON OF DEPRESSION Apjecrive Cueck List 
(DACL) anp Starr Anxtery (A-Stare) Scorns 
WITH Normative DATA 


Mean score 
Subjects DACL A-State 
Intermenstrual 
Group 1 7,88 35.46 
Group 2 5.80 31.92 
Combined mean 6.84 33.64 
Premenstrual 
Group 1 9.40 40,52 
Group 2 9.20 35.08 
Combined mean 9.30 38.10 
Normative data 
Normal adult women 7.80, 7.87% 
Female senior citizens 6.21, 7.08 
Female psychiatric pa- 
tients (depressed) 16.03, 14,95 
Female college students 
Unstressed 35.12 
Freshman orientation 39.39 
During exam 43.69 
After stressful movie 60.94 
Male psychiatric patients N 
Anxiety reaction 49,02 
Depressive reaction 54.43 


a The two norms given are for Forms A and D of the DACL, 


RESULTS 


Difference scores were computed for the 
mood and cognitive measures by subtracting 
the second test score from the first; multi- 
variate analysis (Hotelling’s T*) was then 
used to test whether or not the sets of differ- 
ence scores of the two independent groups of 
subjects were significantly different from zero 
(Harris, 1975). A similar analysis was done 
comparing the premenstrual and intermen- 
strual MDQ scales. In addition, Pearson prod- 
uct-moment correlations were computed be- 
tween the cognitive varia 
mood and MDQ scores. 


Mood Measures 

Table 1 shows the mean depression and 
state anxiety scores obtained during the pre- 
menstrual and intermenstrual testing sessions, 
as well as normative data reported by Lubin 
(1967) and Spielberger et al. (1970). Higher 
premenstrual scores were seen on both the 
DACL and A-State (state anxiety) scales. 
Multivariate analysis of the difference scores 
indicated that the difference between pre- 
menstrual and intermenstrual depression and 


bles and both the 
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TABLE 2 ,, 
MEANS AND STANDARD DEVIATIONS FOR COGNITIVE MEASURES 


Premenstrual Intermenstrual 
Group 1 Group 2 Group 1 Gro. 

Variable M SD M SD M SD M 
Hidden patterns 36.44 10.92 43.44 12.00 22:00 13.53 poe 
Nearer point 13.20 6.56 13.64 5.20 ae 5.74 13.12 
Gestalt 5.92 2.55 6.04 2.38 5.72 2.99 5.60 
Anagrams 12.72 3.97 13.16 3.56 13.20 3.78 12,28 
Inference 5.56 2.04 CIS e273 7.60 1.63 6.16 
Picture number 12.52 5.41 12.68 4.11 12.20 5.15 1 1.92 
Utility (fluency) 14.20 4.46 12.48 6.00 14.00 4.85 13.72 
Utility ee 11.32 3.95 9.44 4.73 9.68 3.72 11.80 
Associational fluency 8.84 2.15 7.96 2.52 8.48 2.40 10.44 
Plot titles (fluency) 5.96 2.07 6.44 2.66 6.44 2.40 6.40 
Plot titles (cleverness) 2.40 1.61 1.64 1.29 1.88 1.13 2.16 
Possible jobs 12.36 2.87 12.20 3.42 11.68 3.91 11.08 
Expressional fluency 4.68 1.77 4.68 1.89 5.04 1.74 4.12 
Hidden figures 4.36 2.80 6.20 3.48 4.32 2.64 6.00 
Figure classification 46.76 20.17 40.40 20.29 39.76 21.71 47.12 


anxiety was indeed significant, F(2,47) = 
5.60, p < OL. 

Some idea of the magnitude of the pre- 
anxiety scores can 
mean scores with 


intermenstrual 
DACL mean scores are similar to those of 


normal adult women and nonmenstruating fe- 
male senior citizens, The intermenstrual A- 


cated in Group 2, There were no othe 
tistically significant correlations betwee: 
menstrual mood and cognitive measure: 


Menstrual Distress Questionnaire 


Premenstrual and intermenstrual roc: 
scale scores are shown in Table 3, Higher 
menstrual scores were Seen in negative a 
concentration, and behavioral change, M 
variate analysis of the difference scores 
these three MDQ scales was highly signific 
F(3, 46) = 32.70, p < 001, However, no 
nificant correlations were found in eit 


TABLE 3 
MENSTRUAL DISTRESS QuEsTIONNATRE (MDQ) Meas 


sion and markedly different from the scores ae SAVARD Deviations 

of psychiatric patients or stressed students, MDQ scale 

Cognitive Measures Group Nee caa Senora 

Premenstrual and intermenstrual mean Premenstrual 

scores and standard deviations for the cogni- Group 1 

tive tests are presented in Table 2. Small and u 21.52 11.88 7.36 
inconsistent differences between the premen- Goi 740 3.69 2.84 
strual and intermenstrual testings were found M 18.68 9.96 6.04 
for both the simple and more complex tests SD 8.64 4.09 2.26 
Multivariate analysis of the difference scores Tntermenstrual 
showed no significant menstrual cycle effect Soup ; 
P15, 34) 61 $ = .12. A single significant SD Ne a ae 
positive correlation between premenstrual Group 2 ` ; i 
state anxiety and one of the 13 cognitive tests e ey Se) 5:36 
was found in Group 1, but it was not repli- È 198 a 


a 


ENE a 


Wha 
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group between premenstrual MDQ complaints 
of negative affect and mood scores obtained 
during the premenstrual testing sessions. 
Moreover, no significant correlations were 
found in either group between premenstrual 
MDQ complaints of difficulty concentrating 
or behavioral change and performance on any 
of the cognitive tests. 


DISCUSSION 
By measuring mood and cognitive function 
simultaneously, it was possible to determine 


e of depression and anxiety present 
of testing. Subjects were signifi- 
cantly more depressed and anxious premen- 
strually. However, this mood change had no 
effect on cognitive test performance, in spite 
of the fact that tests sensitive to depression 
and anxiety were used, 

The question therefore remains: Why 
doesn’t the premenstrual mood shift affect 
cognition? Research by Rodin (1976) sug- 

gests the explanation that subjects who are 

aware of being disturbed by symptoms which 
they attribute to the premenstruum may bol- 
ster their test performance by trying harder 
to overcome any handicap. According to 
Rodin’s findings, women who are aware of 
experiencing premenstrual anxiety or depres- 
sion would show an improvement in perform- 
ance, Alternatively, women with increased an- 
xiety that they do not attribute to the 
menstrual cycle would show impaired per- 
formance. However, the absence of positive 
correlations between MDQ premenstrual com- 
plaints and premenstrual scores on the cogni- 
tive tests in this study suggests that there was 
no relationship between awareness of pre- 
menstrual symptoms and performance. More- 
over, in a study by Hale and Strickland (Note 

2), in which subjects were aware of the cause 

their mood change, induced depression did 
Whdeed adversely affect cognitive performance. 

The magnitude of the mood change appears to 

be the important factor in determining the 

effect of mood on cognition. The mean DACL 
` score for Hale and Strickland’s depressed sub- 

Jects was 13.75 (Strickland, Note 4), which 

is considerably higher than the premenstrual 

mean scores in this study and close to the 

Scores given for psychiatric patients, as shown 

in Table 1, The magnitude of the premen- 


ne ee UMP ets CSE ee In ee Beh FAP er Ne ert ten 


103 


strual increase in anxiety and depression is 
small, as indicated by comparison with the 
normative data, and it is apparently not great 
enough to impair women’s ability to perform 
cognitive tasks. 

The absence of a deleterious effect of the 
premenstruum on cognitive function, in spite 
of MDQ complaints of premenstrual difficulty 
in concentration and performance, is consis- 
tent with Sommer’s (1973) observations that 
studies using subjective measures generally 
indicate a behavioral decrement associated 
with the premenstruum, whereas “studies uti- 
lizing objective performance measures gen- _ 
erally fail to demonstrate menstrual cycle re- 
lated changes.” Parlee (1974) has suggested 
that both men and women hold stereotypic be- 
liefs regarding menstrual cycle symptoms. 
This research demonstrates that some of these 
beliefs are erroneous: Despite complaints or 
expectations to the contrary, a woman’s abil- 
ity to function intellectually is not affected 
by the premenstruum. 
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Anticipatory Belief Change: Persuasion or 
Impression Management? 


R. Glen Hass and Robert W. Mann 
Brooklyn College of the City University of New York 


Two experiments tested whether anticipatory belief change represents inter- 
nalized attitude change or a presentational strategy. In the first experiment, the 
attitudes of subjects in one condition were measured while they anticipated 
hearing a counterattitudinal message, replicating the procedure of previous 
s es of anticipatory belief change. In a second condition, attitudes were mea- 
sured after subjects learned that the anticipated message was accidentally un- 
available. Subjects in a third condition were told of the existence of the 
message but never expected to hear it. In a fourth, control condition, subjects 
completed the attitude measure without knowing of or anticipating the message. 
Only subjects who anticipated hearing the message were found to hold attitudes 
that differed from the control condition. Canceling the message eliminated the 
attitude change, suggesting that precommunication belief changes result from 
impression management rather than internalized attitude change. The second 
experiment replicated the results of the first study and found additional support 
for the impression management interpretation of anticipatory belief change 


b'a 


phenomena. 


Precommunication persuasion toward a com- 
municator’s position, called anticipatory be- 
lief change, has been explained as resulting 
from either arousal of cognitive dissonance 
(Papageorgis, 1967), attempts to protect one’s 
self-esteem (McGuire & Millman, 1965), or 
moderation of view when expecting new infor- 
mation (Cialdini, Levy, Herman, & Evenbeck, 
1973; Hass, 1975). 

According to self-esteem theory (McGuire 
& Millman, 1965), people tend to fear ap- 
pearing gullible or being unable to resist social 
pressure. By adopting a communicator’s posi- 
tion following forewarning of an impending 
persuasive attack, but before the message is 
actually presented, an individual can preserve 
self-esteem by avoiding the appearance of 
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having succumbed to the persuasive force of 
the presentation. The illusion of agreeing all 
along or having been persuaded only slightly 
will be established and one’s self-esteem 
maintained. 

Another explanation for anticipatory belief 
change is based on the desire of individualsgto 
maintain cognitive consistency (Papageorgis, 
1967). Learning before a message is presented 
that a person who is knowledgeable on the 
topic of the communication disagrees with 
one’s own position creates a state of inconsist- 
ency among the recipient’s cognitions and 
evokes pressures to reduce the inconsistency. 
Changing one’s attitude before the message is 
delivered to agree more with the position of 
the communicator would be one way of re- 
storing cognitive consistency. Furthermore, 
knowledge that an expert disagrees with 
one’s own position may be an argument suffi- 
ciently powerful to stimulate attitude change 
toward the communicator’s position. 

The third explanation for precommunica- 
tion shifts in attitude focuses on people’s in- 
clination to moderate their viewpoint when 
they are about to be exposed to new informa- 
tion. When the topic is unfamiliar, a neutral 
position may result from a self-admission of 
ignorance on the issue. Or when the topic is 
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_ more familiar to the recipient, a moderation 
of opinion could result from an unbiased con- 
templation of arguments that favor as well 
as those that oppose the upcoming speaker’s 
position. Moreover, a moderate view is often 
associated with a broad-minded willingness to 
consider both sides of an issue, making neu- 
trality a defensible position that is difficult 
for others to criticize when one’s topic knowl- 
edge is low or the issue is controversial, As 
a result, an opinion shift toward moderation 
may be a logical synthesis of conflicting argu- 
ments on a topic on which he or she is poorly 
informed and uncommitted, or it may stem 
from a strategy to avoid criticism and protect 
the image one presents to others (Hass, 1975). 
Although the self-esteem theory is based 
: upon a person’s concern over appearing 
gullible, from the existing discussions of the 
É model, it is not clear whether one wishes to 
= prevent the appearance of gullibility to them- 
selves or to others, In other words, does one 
change one’s attitudes before a message is 
presented to affect one’s own self-perception 
(internalized attitude change) or as a strategy 
of self-presentation? 

Both the moderation model and the self- 
esteem theory are equivocal on this question. 
On the other hand, the cognitive consistency 
explanation unequivocally predicts real, inter- 
nalized attitude change produced by a re- 
structuring of one’s cognitions on the topic. 
The present experiment was designed to test 
whether anticipatory belief change is best 
explained in terms of internalized attitude 
change or a presentational strategy. If the 
change of opinion produced by expecting to 
hear a persuasive communication represents 
a presentational strategy, learning that the 
message will not be presented after all should 
remove the need for that strategy and elimi- 
nate any indication of opinion change. If the 

change has been internalized, cancellation of 
the communication should leave the attitude 
change intact and unaffected. 
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EXPERIMENT 1 
Method 


Subjects. Subjects were 56 undergraduates re- 
cruited from introductory Psychology classes who 
received $1 each for participating in the experiment. 
Although 1 to 5 subjects participated simultane- 
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ously during each session, subjects wer: ted apart 
in order to minimize interaction between (hem. Only 
one condition was run during each session, and the 
sessions were randomly assigned to onc the four 
conditions before the subjects arrived. 

Procedure. The experiment was an a only de- 
sign with four conditions intended to i upt the 
ongoing process at different points in o to mea- 
sure subjects’ attitudes. The procedure adapted 
from that used earlier by Cooper and J (1970) 
and Hass (1975). 

Subjects in the control condition merc ponded 
to the dependent measure, “There will be an 
economic depression in the United State y indi- 
cating their opinion on a 31-point scale w e end- 
points labeled “Completely Agree” and ‘ pletely 
Disagree,” 

Subjects in the three experimental cond were 
informed that the experiment was investi what 
“makes a successful persuasive message and t they 
would be listening to a message that othc liences 
had found very persuasive, A tape recor und a 
Pile of tape reels and boxes were con Lously 
displayed on a table near the experime The 
subjects were told that the communication were 
going to hear was entitled, “The comin; omic 
Prosperity.” The speaker was described a idely 
respected expert in his area, and his belic t the 
United States economy was on the ve of a 
period of “unparalleled prosperity, full em nent, 
and abundance for Americans at every vomic 
level” was emphasized. A preliminary sur had 
indicated that most subjects could be ex; d to 
disagree mildly with the speaker's position, eving 
a recession or depression was more likely. jects 
in the anticipation condition then completed de- 
pendent measure while awaiting the commu tion, 
replicating the procedure of earlier anticipatory atti- 
tude change studies. In the cancellation condition, 
prior to introducing the message, the experimenter 
sent an assistant out of the room to get the appro- 


priate tape, which he had apparently been unable 
to find among those on the table. A few moments 
after the tape had been introduced, the assistant 
reentered the room and reported: “The tape is not 
in the cabinet either, Dr, Brannon must have taken 


it by mistake.” The experimenter thought for several 
seconds and then explained to the subjects that the 
experiment was being conducted in two locations at 


once, and because of a mix-up, 
the tape recording after all, However, in order to 
make use of them as subjects, they would be included 
in a control group that would not hear the record- 


ing. Then they completed a short questionnaire that 
=e 


they would not hear 


*At the time Experiment 1 was conducted, the 
economy had peaked and was in a downturn, pro- 
ducing the subjects’ mildly unfavorable economic 
forecast. Continued Worsening of the economy and 
greatly increased pessimism on the part of the subject 
population were among the reasons a different topic 
was chosen for Experiment 2, 


with the main dependent item. Finally, sub- 
jn the no-anticipation condition were told of 
existence of the tape recording but never believed 
that they would hear it. 

The anticipation condition was included to obtain 
re of the subjects’ attitude while they were 
ing to hear the message. However, after they 
completed that single-item questionnaire, they 

iso told that the message was unavailable in 
ime manner as the subjects in the cancellation 

i. The experimenter told them that the data 

d ady provided could not be used, col- 
le | the questionnaires, tore them in half, and 
f nto o trash can. The subjects were then 
c same rationale as the cancellation group 

z filling out the “control group questionnaire.” As 
l out the second questionnaire, the experi- 
looked at the booklet and noted that one 
1 on it repeated the item on the previous 

e. He suggested that the subjects not 
t trying to duplicate their earlier response 
just answer all of the questions in terms of their 
yt attitudes. Of course, the data that they had 
ded on the first questionnaire had not actually 
been destroyed. The questionnaire had been prepared 
so that a tear across the middle of the page left 
the dependent item intact. The charade was per- 
formed to try to convince the subjects that the 
attitude they had indicated on the first questionnaire 
would never be looked at by anyone. 

No message was actually presented to any subjects, 
and the experiment was terminated following com- 
pletion of the questionnaires. The true nature of the 
experiment was then revealed to the subjects and the 
hypotheses and need for deception explained. Before 
] subjects’ cooperation in not revealing the 
of the experiment to other potential 
s solicited, and all readily agreed. 


began 


> pas 
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Results 


The results strongly supported an explana- 
tion of anticipatory belief change based on a 
presentational strategy used when a person 
expects to receive a persuasive message (see 
Table 1). An overall single-factor analysis of 
variance performed on the data yielded a 
significant result, F(3,52) = 4.15, $ < .02. 
More specifically, only when subjects antici- 
pated actually hearing the message did their 
reported attitude differ from that of subjects 
in the control condition, Dunnett’s ¢ = 2.75, 
p < .03.? Subjects who anticipated hearing 
the message moderated their opinion toward 
the position to be recommended, which repli- 
cates the results of previous anticipatory be- 
lief change studies; however, when presenta- 
tion of the message was canceled (in the 
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TABLE 1 
Mean ATTITUDE AND STANDARD DEVIATIONS, 
EXPERIMENT 1 
n 
Condition M SD 
Control _ 10.57 4.36 
No anticipation 9.86 4.57 
Anticipation 
Before cancellation 15.57 5.50 
After cancellation 15.29 6,04 
Cancellation 10.71 4.15 


nar Sle; Attitudes are elven on. Si-point scale, The higher thig ae 
D greater is the agreement with the tion of tl ti 
anticipated speaker. n = 14 per condition, SS 


cancellation condition), subjects’ description 
of their own attitude apparently reverted to 4 
their preexperimental position. The attitudes 
of subjects in the cancellation condition did 
not differ from those in the control condition 
but did differ from those in the anticipation — 
condition, F(1,52) = 7.12, p< .02. Simi- 
larly, as Cooper and Jones (1970) also found, 
there was no evidence of attitude change by 
subjects who knew of the existence of the 
message but never expected to hear it (no- 
anticipation condition). 


4 


Interestingly, when the attitudes of subjects 


in the anticipation condition were measured 
for a second time following the “discovery” 
that the tape recording was not available, — 
there was no evidence of reversion in attitude 
toward the control condition (see Table ia 
Apparently, having recorded their attitude 

while awaiting the communication prevented — 
the return to original attitude found among — 
subjects in the cancellation condition. This — 
absence of change may have been due to an 
internalization process such as commitment 
after recording their attitude on the first 
questionnaire (Kiesler, 1971), or it could — 
have been the result of a presentational strat- 
egy to appear consistent with the earlier mea- 
surement (Tedeschi, Schlenker, & Bonoma, 
1971; Goethals, Reckman, & Rothman, Note X 
1), although every effort was made to cons 
vince the subjects that their first questionnaire 
had been destroyed and discarded. i 


Il treat- 

2 Dunnett’s method was used to compare al i, 
ment conditions with the control group (Winer, 
1962). All statistical tests in both experiments are 


two-tailed. 
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EXPERIMENT 2 


A second experiment was designed to test 
whether the unexpected absence of attitude 
reversion following the cancellation of the 
message in the anticipation condition of Ex- 
periment 1 was due to an internalized change 
of attitude and restructured cognitions or to 
a presentational strategy. These competing 
hypotheses were tested by using different 
measures of opinion before and after cancella- 
tion of the message. On the one hand, if the 
failure to find attitude reversion were due to 
internalized attitude change, then the change 
should also be revealed on a different index 
designed to measure attitudes on the same 
topic. On the other hand, if the absence 
of reversion stemmed from a presentational 
strategy to appear consistent, changing the 
attitude scale should reduce any perceived 
Pressure to try to duplicate the prior mea- 
surement and should produce the same 
attitude reversion found in the cancellation 
condition of Experiment 1, 
Another explanation is also possible for the 
failure to find reversion in the anticipation 
condition of the first experiment. Subjects in 
the anticipation and cancellation conditions 
may have expressed different attitudes after 
the message had been canceled because of a 
difference between the two conditions in the 
length of time that elapsed between learning 
of the message and recording their attitude. 
The time duration between the description of 
the message and filling out the attitude index 
after being told the message was unavailable 
was longer in the anticipation condition be- 
Cause of the extra time required to fill out 
the first, single-item attitude scale. Although 
the additional time was only about 20-30 
seconds, it could have provided those subjects 
with extra time necessary to incorporate the 
new information into their cognitive structure, 
The additional time explanation does not 
seem likely because the first attitude mea- 
Surement in the anticipation condition, which 
did show attitude change, was taken after 
the shortest time lapse of all between noti- 
fication of the message and measurement of 
attitude, Nevertheless, in order to test more 
precisely an alternative explanation based on 
differences in elapsed time, the period between 
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informing the subjects of the mess and 
measuring attitude following cancel was 
made equal across all experimental cor . ions 
of Experiment 2. 
Method 

Subjects. Subjects were 75 introductory psy ogy 
students who participated in the experiment j der 
to satisfy a course requirement. They select he 
experiment from among a large number tha re 
available during the semester. From 1 to 6 s ts 
participated simultaneously during each sessio ut 
they were separated from each other in booth ut 
prevented interaction between them. One cor n 


was run during each session, and the session 
randomly assigned to one of five conditions 
the subjects arrived. 

Procedure. The experiment was an after 
design with three experimental and two cc 
groups. Two of the conditions replicated the ai 
pation and cancellation conditions of Experimer: 
The third experimental condition was identical to { 
anticipation condition of the first experiment exce 
that two different attitude scales were used: one fi 
the attitude measurement taken while the subjec 
anticipated the message and a second for the asses: 
ment following cancellation of the message. Subject 
in the two control conditions merely responded to 
the dependent measures. There were two control 
groups to allow the attitude scales to be presented 
to subjects in each order, 

The topic of the message subjects expected to hear 
in Experiment 2 was changed from the economic 
future of the United States to “No quick cure for 
cancer,” A preliminary survey indicated that most 
subjects were rather optimistic that a cure for cancer 
will be found soon. The introduction of both the 
experiment and the message was similar to that used 
in the previous experiment adapted to the topic of 
finding a cure for cancer, 

In an attempt to minimize further any suspicion 
subjects might have about the missing tape or 
whether their first questionnaire had actually been 
discarded (in the anticipation conditions), several 
minor changes in the procedure were made jn 
Experiment 2, 


several places he had looked. The experimenter and 
t tapes in the room 
once again and spoke to each other in tones that 
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w varently not intended for the subjects to 
} 1 yw to salvage the experimental session. 
riment then continued in the same manner 
previous study except that: (a) when the 


fi ynnaire was collected and thrown away in 
th on conditions, the experimenter tore the 
sh ıt the scale was torn in half, and (b) 
w ond questionnaire was distributed, the 


lid not mention to the subjects that 
the previous questionnaire had been 


asures. While anticipating the mes- 
the anticipation-different scales 
ed to indicate their best estimate 
that a cure for cancer will be 
hecking one of nine statements. The 
stateme: r expressed the belief that an immi- 
nent © was certain or that the chances were 
good, good, moderate, fair, poor, very 
istent, 

ipating the message, subjects in the 
ime scale condition responded to the 
{edical research will soon discover a cure 
on a 43-point scale with evenly spaced 
cd “Completely Agree,” “Strongly Agree,” 
Agree,” “Slightly Agree,” “Slightly 
“Moderately Disagree,” “Strongly Dis- 

Completely Disagree.” 
-g cancellation of the message, subjects in 
experimental conditions indicated their atti- 
the 43-point scale item as part of the 

m “control group questionnaire.” 


found so 


Results 


Che control conditions in the second ex- 
seriment served two functions. The first mea- 
urement taken in each group provided an 
attitudinal baseline against which the experi- 
mental conditions could be compared. The 
second measurement made it possible to deter- 
mine whether there were any order effects 
from consecutive presentation of two scales de- 
signed to measure the same attitude (r = .71). 

There was no evidence of an order effect 
(see Table 2). The attitude of subjects in the 
control conditions as measured by the 43- 
point scale was not affected by having been 
preceded by the nine-statement scale, (28) 
< 1, ns. Similarly, there was no difference in 
attitude as measured by the nine-statement 
scale between subjects who first completed 
the 43-point scale and those who filled out the 
nine-statement scale first, £(28) = 1.02, ms. 

The results of Experiment 2 successfully 
replicated the findings of the first experiment. 


When compared with their respective control _ 


groups, both of the anticipation conditions 
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TABLE 2 
Mean ATTITUDES, EXPERIMENT 2 


Attitude measurement 


While an- 
5 ticipating Foliowing 
Experimental condition message cancellation 
Anticipation- 
same scale 17.804 18.13, 
Anticipation— 
different scales 5.00), 13.13, 
Cancellation 10.874 
First Second 
Control condition measure measure 
a-b order 11.00, 3.33, 
b-a order 3.73), W475 


Note. Means with the same subscript were obtained on the 
same attitude scale. The letter a indicates the 43-point scale; 
the letter b signifies the nine-statement scale, The higher the 
number on each scale, the greater is the agreement with the 
anticipated speaker, n = 15 per row, 


showed evidence of anticipatory belief change _ 


while the subjects were expecting to hear the 
message: anticipation-same scale condition, 
t(28) = 3.59, p < .01; anticipation-different 
scales condition, ¢(28) = 2.87, p < .01. De- 
spite the attempts to reduce any doubts sub- 
jects in the anticipation conditions may have 
had about their first questionnaire having 
been destroyed and thrown away, once again 
no reversion in attitude following cancellation 
of the message was found among subjects in 
the anticipation-same scale condition, those 
who had previously completed the same atti- 
tude scale while awaiting the message, 
correlated samples #(14) = 1.05, ms. 

The primary focus of the second experi- 
ment was the effect of the cancellation of the 
message on measured attitude change. An 
overall single-factor analysis of variance per- 
formed on the attitude measurements taken 
after cancellation of the message plus the 
control group yielded a significant result, Æ 
(3,56) = 6.32, p< 01. Following cancella- 
tion, only those subjects who had previously 
used the same attitude scale to indicate their 
opinion reported attitudes that differed from 
the control condition, Dunnett’s +(56) = 3.73, 
p < 01. As in the first experiment, the atti- 
tudes of subjects who were not asked to fill 
out an opinion measure while they anticipated 
the message did not differ from the control 
group, Dunnett’s t(56) < 1, ns. More impor- 
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tantly, subjects who had previously reported 
= their attitude while awaiting the message, but 
_ on a different scale, reported attitudes that did 
not differ from the control group, Dunnett’s 
(56) = 1.12, ns. Furthermore, following can- 
cellation of the message, the attitude of 
these subjects (anticipation-different scales 
condition) differed from the reported attitude 
of subjects in the anticipation-same scale 
condition, F(1, 56) = 6.85, p < .02. 

Having previously recorded their attitude 
was not what prevented subjects in the antici- 
pation condition of Experiment 1 from return- 
ing to their preexperimental position. Having 
reported their opinion on the same scale each 
time it was measured prevented the attitude 
reversion. Apparently the high correspondence 
between subjects’ attitudes measured before 
and after cancellation of the message came 
from a desire to appear consistent rather than 
an entrenchment of attitude stimulated by 
recording their opinion. Even though the ex- 
perimenters had tried to convince subjects 
that their first questionnaire had been de- 
stroyed and thrown away, several subjects in 
the same scale condition said during the de- 
briefing that they suspected that the experi- 
menter might retrieve the remains of their 
first questionnaire from the trash can and 
compare their pair of responses. (Who was 
deceived, the subject or the experimenter?) 
Almost all of the remaining subjects in the 
same scale condition admitted that they had 
used their memory of their earlier response 

when they completed the same item on the 
second questionnaire. They tried to remember 
what they had written before and then made, 
as closely as possible, the same response again. 
Their concern was for what they had put 
on the first questionnaire, not for evaluating 
their current opinion, 
Discussion 
There was little support in either of the 
esent experiments for the cognitive con- 
explanation of anticipatory belief 
which predicts cognitive restructuring 
] a counterattitudinal forewarn- 
other model based on inter- 
data suggest that 
measured opin- 
rage the impres- 


d by 
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sion others form of us. The experiments did 
not test whether the impression man» ement 
tactics derive from fear of appearing , vllible, 
as self-esteem theory predicts, or te: encies 
to present oneself as broad-minded, =s the 
moderation model argues. However, self- 
esteem theory predicts attitude change ‘ward 
the expected speaker’s position, wher -s the 
moderation model predicts opinion shifts 
toward neutrality, regardless of the position 
the speaker will advocate (Cialdini, !evy, 
Herman, & Evenbeck, 1973; Hass, | 75), 
When these two predictions were com red, 
support was found for both processes: in !ina- 


tions to shift toward the communicator = po- 
sition as well as to take a cautious, ne (ral 


position. Thus, it seems that the app: ent 
precommunication changes in opinion are 
produced by a mixture of not wanting to 
appear easily persuaded and not wanting to 


be caught in a defenseless position. A neutral 
attitude may be protected from criticism bhe- 
cause it does not necessarily imply a firm 
rejection of any other position. Instead, it 
may suggest a willingness to consider both 
sides of the issue. On the other hand, agreeing 
with the speaker also protects someone from 
their persuasive attack. As a result, the opin- 
ion expressed will be one that combines the 
presentational strategies of open-mindedness 
and agreement in the most defensible w: y. 
In a recently completed series of experiments, 
Cialdini, Herman, Levy, Kozlowski, and 
Petty (in press) pursued similar reasoning and 
found that the direction of anticipatory shifts, 
as well as their durability following cancel- 
lation of the message, can be influenced by 
the extent of their instrumental value in the 
situation. 

The finding by Cooper and Jones (1970) 
that anticipatory belief change occurs only 
when the experiment is described to subjects 
as investigating attitude change is also con- 
sistent with an impression management expla- 
nation. The only time it is necessary to man- 
age the impression our attitudes provide to 
another person is when those attitudes are 
being monitored. If no one is paying atten- 
tion to our attitude (or we do not know that 
they are), such a presentational strategy will 
not be salient to us. 
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anther experiment, Cialdini, Levy, 
| and Evenbeck (1973) found evidence 


He } 

that expects to engage in face-to- 
fac ion with a peer, moderation of 
opi y also occur as a result of a tactic 
of tation. Cialdini et al. used a 
co in-subjects before-after de- 


nt scales used to measure 
ind postexperimental attitudes. 
admit, their procedure makes 

e explanations based on the 
dependent variables before 
ipulations and may raise such 
artifactual ibilities as sensitization effects. 
Althc . interpretations of the Cialdini 
et é o not seem likely, by using an 
ign and a single technique for 

bjects’ attitudes in the present 

first experiment and the replica- 

tic of the second experiment are not 
» same alternative explanations. 

n together, the studies demonstrate that 
lucti is in attitude may occur as a tactic 
ol selfpresentation both when one is about 


and after the r 


to envage in a discussion with a nonexpert 
peer (Cialdini et al., 1973) and when one 
anticipates listening to an expert. 


Of course, impression management is not 
ething that occurs only when psycholo- 
are around. Everyone wants to look 
good, especially if they are being evaluated 
by someone whose opinion is important to 
them. Still, it is not clear to what extent these 
changes in measured attitude represent con- 
scious, deliberate efforts to dissemble or less 
deceitful oscillation within the limits of one’s 
self-concept determined by momentary situa- 
tional cues (Kaplan, Firestone, Degnore, & 
Moore, 1974). For example, apparent changes 
in attitude could result from fluctuations 
within the bounds of one’s latitude of accept- 
ance (Sherif & Hovland, 1961) as different 
cognitive components of the attitude are made 
more or less salient by the situation. 
Fluctuations in how one presents oneself 
and whether the changes are intentional or 
operate outside of an individual’s awareness 


Sol 
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are issues that not only affect anticipatory 
attitude change phenomena but potentially 
affect all self-descriptive measurements. Espe- 
cially creative research will be necessary to 
understand these processes more completely. 


REFERENCE NOTES 


1. Goethals, G. R., Reckman, R. F., & Rothman, 
R. E. Impression management as a determinant 
of attitude statements. Paper presented at the 
meeting of the Eastern Psychological Association, 
Philadelphia, April 1974. 


REFERENCES 


Cialdini, R. B., Herman, C. P., Levy, A., Kozlow- 
ski, L. T., & Petty, R. E. Elastic shifts: Determi- 
nants of direction and durability. Journal of Per- 
sonality and Social Psychology, in press. 

Cialdini, R. B., Levy, A-, Herman, P., & Evenbeck, 
S. Attitudinal politics: The strategy of moderation. 
Journal of Personality and Social Psychology, 
1973, 25, 100-108. 

Cooper, J., & Jones, R. A. Self-esteem and con- 
sistency as determinants of anticipatory opinion 
change. Journal of Personality and Social Psychol- 
ogy, 1970, 14, 312-320. 

Hass, R. G. Persuasion or moderation? Two experi- 
ments on anticipatory belief change. Journal of 
Personality and Social Psychology, 1975, 31, 1155- 
1162. 

Kaplan, K. J., Firestone, I. J., Degnore, R, & 
Moore, M. Gradients of attraction as a function 
of disclosure probe intimacy and setting formality: 
On distinguishing attitude oscillation from attitude 
change: Study one. Journal of Personality and 
Social Psychology, 1974, 30, 638-646. 

Kiesler, C. A. The psychology of commitment. New 
York: Academic Press, 1971. 

McGuire, W. J., & Millman, S. Anticipatory belief 
lowering following forewarning of a persuasive 
attack. Journal of Personality and Social Psychol- 
ogy, 1965, 2, 471-479. 

Papageorgis, D. Anticipation of exposure to per- 
suasive messages and belief change. Journal of 
Personality and Social Psychology, 1967, 5, 490- 
496. 

Sherif, M., & Hovland, C. I. Social judgment. New 
Haven, Conn.: Yale University Press, 1961. 

Tedeschi, J. T., Schlenker, B. R., & Bonoma, TANA 
Cognitive dissonance: Private ratiocination or 
public spectacle? American Psychologist, 1971, 26, 
685-695. : 3 

Winer, B. J. Statistical principles in experimental 
design. New York: McGraw-Hill, 1962. 


(Received September 3, 1975) 


Journal of Personality and Social Psychology 
1976, Vol. 34, No. 1, 112-124 


Altruism, Ambiance, and Action: The Effects of Rur: 
and Urban Rearing on Helping Behavior 


Ferne H. Weiner 3 
University of Hawaii at Manoa, Honolulu, Hawaii 


y situation. The threefold purpose was to examine the behavior of male and 
female rural- and urban-reared college students in their prosocial responsiveness 


of the complexity measure on the residence variable, and in turn, on helping 
behavior. The results were ascribed to differences in social-perceptual learning 
as a function of the complexity of stimuli afforded by background experiences. 
Traditionally, it has been a common notion vestigation was conducted to test indepe: lent 
that city dwellers are less responsive to their variables chosen from each class of dete mi- 
fellows than are rural residents, Explanations nants as follows: residence (demograpl'c), 
of the lack of socially responsible behavior overload (situational), sex of subject (so- 
by passive bystanders to the murder of Kitty cial roles), and flexibility in cognitive style 
Genovese in 1964 and other emergency epi- (dispositional). 
sodes in New York City have focused on a The results of research directed to the res.- 
specific target: urbanism. The stereotype of dence variable have not been clear-cut. Or iy 
the “detached urbanite” has been editorialized one assessment of helping behavior has ben 
in the media and expressed in the views of made in true rural-urban areas (Schneide & 
professionals, whereas people who come from Mockus, 1974), in which it was reported ‘hat 
small towns have reportedly affirmed that such the helping pattern slightly favored urban 
incidents could never happen in their com- subjects. In field research conducted to com- 
munity (Rosenthal, 1964), However, the ex- pare Tesponses in a large urban center with 
istence of rural-urban differences in prosocial those in several small towns Krupat and 
behavior has not been empirically established, Coury (Note 1) also failed b support the 
This study was an attempt at an integrated hypothesis that urban subjects were less help- 
approach to factors that could serve to induce ful than those from small towns. In addition, 
or inhibit helping behavior. A controlled in- Milgram (1970) cited an unpublished study 
in which urban rates of helping behavior were 
This article is based on a dissertation submitted Pacey, high. However, Korte and Kerr 
to the University of Hawaii in partial fulfillment of (1975) reported a tendency for townspeople 
the requirements for the doctoral degree. The au- to offer greater assistance than their big-city 7 
d 
Í 


thor wishes to thank her major advisor, Ronald Oi counterparts. 
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ate, and to Anthony J, Marsella and Ervin Staub results were at odds with the original effects, 
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Weiner, Department of Psychology, University of y compared, a significantly higher 


Hawaii at Manoa, 2430 Campus Road, Honolulu, Percentage of subjects helped a stranger in 
Hawaii 96822. midwestern cities of the United States as com- 
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the megalopolis of New York 
73). Likewise, a higher overall 
te was reported for a southern 
ison with New York (Morgan, 
in a replication designed to 
iuence effects, strikingly higher 
g behavior were obtained in 
n were found in New York 
vk and Word (1972) posited 
differences in urban popula- 

‘s. However, in a recent 

a a New York population, 
were found to match those 
aller southwest city (Lesk & 


helping respor 
obtained ir 
Zippel 
\dings supported theorists who 
issumption that urbanites were 
sin ssponsive in providing help to 
an 1 other (e.g., Latané & Darley, 
70 n & Hollander, 1964). Rosenthal 
(is wed apathy as a problem in the 
huma ition. He suggested that a big-city 
ipathy appeared to be prominent 
world centers such as Tokyo, 
ind New York (attributable to the 
iber of people who could turn away 
© another), whereas most likely some 
ms of apathy existed in towns and 
xes that would preclude one’s prosocial 
iivement, 
bes and Gromoll (1971) reported a 
significantly lower helping frequency from 
small-town residents as compared with dwell- 
ers of either large cities or moderately sized 
stern cities, with the latter two having 
identical response rates. These unexpected re- 
sults were attributed to social anonymity be- 
tween the benefactor and the recipient; they 
were strangers. It has been found that de- 
creased helping rates were directly dependent 
on the condition of social anonymity (Latané 
& Darley, 1970; Macaulay, 1975), and the 
willingness to intervene was governed by es- 
tablished interpersonal relationships (Hackler, 
Ho, & Urquhart-Ross, 1974). Moreover, co- 
hesive group identification has been ascribed 
as the ingroup factor for enhanced altruistic 
attitudes between fraternity brothers (Fried- 
richs, 1960) and for accentuated helping ac- 
tion within college dormitories (Bickman, 
Teger, Gabriele, McLaughlin, Berger, & 


113 


Sunaday, 1973). Also, ingroup identification 
by a potential helper in classifying strangers 
has been proposed as a significant influence 
in affecting helping rates (Hornstein, 1972; 
Sole, Marton, & Hornstein, 1975). This 
distinction may be an important one, 
Perhaps the superior responsiveness attrib- 
uted to rural residents would be realized 
only under the conditions of friendship, or 
at least acquaintanceship. 

Milgram’s (1970) formulation of overload is 
a coordinate variable that provides a psycho- 
logical link between an individual’s experience 
and the demographic facts of residence for 
the exploration of rural-urban differences. 
The term overload refers to an excess of 
inputs from environmental stimuli that go 
beyond a person’s processing capacity, Ac- 
cording to Milgram, in order to cope with 
the excessive stimulation that abounds in 
the urban ambiance, a person gradually de- 
velops adaptations to overload. These defen- 
sive strategies serve as limiting, filtering, and 
selective devices among multiple impinging 
stimulus inputs. Consequently, the person 
becomes desensitized to the needs of an- 
other. Milgrim hypothesized that the lesser 
exposure to overload by rural and small- 
town residents should foster a greater social 
responsiveness toward an unknown other. 

Whereas several researchers have reported 
social inhibition effects that appear analogous 
to countering overload, Krupat and Epstein 
(1973) directly demonstrated its inhibiting 
effects on nonemergency helping behavior by 
simulating task situations with varying levels 
of stimulus inputs. Sherrod and Downs (1974) 
likewise reported decreased helping as over- 
load increased but also found that a subject’s 
perceived control over the environment acted 
as a mediating mechanism between altruism 
and the negative effects of overload. 

The sex variable interjects another dimen- 
sion, and the subjects’ sex as a determi- 
nant of helping behavior has received in- 
creasing attention since Gergen, Gergen, and 
Meter’s (1972) before-and-after intercorrela- 
tional analysis demonstrated variations in re- 
sults when sex was 4 moderator variable. 
However, there usually have not been sex of 
subject effects when (a) the behaviors re- 
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quired were equally masculine and feminine 
in orientation rather than directed to cultural 
sex role perceptions and expectancies, (b) the 
behaviors were free from high-cost and exter- 
nal threat factors, and (c) reportorial or 
indirect means of aiding were available and 
direct assistance was avoidable. Tt would seem 
that the absence of sex effects are generally 
Predictable when these factors are controlled 
(Latané & Darley, 1970; Deaux, Note 2). 
Tndividual-difference personality traits con- 
stitute the final class of independent variables 
in the altruism domain, Gergen et al. 
have contended that there is a necessary 
interrelation between 
situation, 
ality variables have often failed to correlate 
with altruism when behavioral indexes haye 
been tested (eg., Darley & Batson, 1973), 
Flexible-rigid modes of helping have emerged 


subjects’ styles of approach when aiding a vic- 
tim; these modes Serve as a springboard for 
identifying a relevant and untested individual- 
difference 
as inappropriate perseveration (Scott, 1966), 
whereas flexibility pertains to an individual’s 
alternative means of construing a situation, In 
the conceptual domain under consideration, 
cognitive interference and Cognitive complex- 


construct of “flexibilit —tigidity,” and each 
has been shown to be 
social judgment and interpersonal functioning, 
The purpose of this study was to explore 
the interrelation among the five independent 
variables as they related to helping behavior. 
It is indicated by the literature that evidence 
exists for hypothesizing 
stimulus overload 
helping behavior ( 
1974) and for Predicting that sex of subject 
effects should not b 
low in costs and free from sex-linked behay- 
Darley and 
of styles of 
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tions for favoring increased help behavior 
for persons who grew up in rura! areas æ 
compared with those who grew ub in urban 
centers (e.g., Korte & Kerr, 197 Schneider 


& Mockus, 1974). 


METHOD 
Subjects 
Initially, 198 experimentally naive rgraduate 
students enrolled in introductory ] science 


noa and 
stionnaire 
rales and 
ticipated 
the sub- 
juipment 
tal pur- 
iparably 
atments. 
> repre- 
of the 
tiethnic 
niddle- 


courses at the University of Hawaii's 
Kauai campuses participated in the 
session. Of these potential subjects, 67 

66 males who met the residence criteria 

in the experimental session,1 Data from 7 
jects were discarded from the analysis—fo 
malfunction or suspicion of the experir 
pose—leaving a total of 126 subjects 
distributed within and across the three 
All subjects were American-born. The sar 
sented a cross-section of the major regic 
United States. The subjects were of m 
origins and from predominantly bureaucrati 
class backgrounds. The mean age of the 
19.99 years. Each Participant in the q 
session received a bonus Point for course cr 
those who participated in the experimental 
were paid $2, 


nnaire 
lit, and 
ssion 


Design 
The experimental design was a 3 X 2 X 2 factorial 
with residence, Overload, and sex of subjeci as 


between-subjects independent variables. All su jects 
Were randomly represented within each of thre 
dence groups, operationalized on the basis o com- 


a Population of 500,000 or more, including areas 
within 40 minutes commuting time to city center 
(eg. Cambridge/Boston, Massachusetts) and exclud- 


1 There were 42 students who were unqualified: 
persons foreign-born and 


migrated to Hawaii during 
(specified as through 16 years of age) or had been 
born and reared elsewhere and had resided in Hawaii 


le was i 


ALTRUISM, AMBIANCE, AND ACTION 115 


measures. Cognitive interference, as 
he Stroop Color and Word Test 
<), and cognitive complexity, as assessed 
1 Complexity Scale (Barron, 1953, 

1l as measures of personality disposi- 
ing flexibility in cognitive style, the 
d Word Test principally measures 
ibility to handle the interference 
ompeting stimuli, The group form 
in this study was developed by 


Golden ( nd the test-retest reliabilities (.89, 
84, and ctively, for each of the three pages 
that € ie test) have favorably compared 
with ae corresponding individual form 
(.86, . a 

The »mplexity Scale measures the dif- 


onent (ie., available alternative 
ng interpersonal events) of the 
xity dimensions in perceptual-cogni- 
8 ie dimensions consistently have not 
related t Crockett, 1965). The Barron Com- 

revised version, 1963) controls for 
ibility in verbal fluency and level of 


ferent 


know providing 50 objectively equivalent 
stimulu ments in a forced-choice format and 
contai se items to control for response set. 
The se scored in the direction of complexity, 
the v lex individual preferring open-ended and 
asym stimuli. Barron reported a test-retest 


reliab o 
20 r 


$ 


74 and a Kuder-Richardson Formula 
efficient of 54. 

For the subjects’ task, and to vary over- 
and visual stimuli were used in the 
on. The visual stimulus items were 


load, auditor 


experimer 


patte pted from Raven’s (1956) Progres- 
sive ch were developed into a consecu- 
tively num lide series. Each set of six alterna- 
tive patte eces (one correctly fits the blank space 
in the matrices design) was numbered to correspond 
with its slide mate and laminated into separate 
rectangular cards. For the auditory stimulus, a series 
of git numbers selected from lists of 
random were taped by the experimenter. 


The number 6 was incorporated among any (one or 
more) of the digits at specific positions. Two tapes 
were made on which the patterning sequence and 
the 4-sec intervals between numbers were invariant, 
but the tapes differed in the amount of numbers 
recorded. 

Equipment. The tapes were played on a Panasonic 
Model RQ4208 auto-stop cassette recorder. A GAF 
Anscorama Model 980 slide projector with automatic 
timer and focus was used for viewing the slides on 
a portable screen. The large table that held the 
eee was full of contrived clutter for over- 
loaded subjects or devoid of unessentials for the 


nonoverloaded subjects. 
Procedure 


Se a study was ostensibly to investigate 
s iiterences. The personality measures were ad- 


ministered, and the background information was col- 
lected in the 30-minute group questionnaire session, 
At the close of these procedures, each qualified sub- 
ject self-selected an individual appointment time for 
the second session. 

The subject reported for the 35-minute experi- 
mental session to the “temporary quarters” occupied 
by the experimenter among a complex of offices, 
Subjects were randomly assigned to one of the two 
overload conditions, which were presented on a preset 
counterbalanced schedule. Then the subject was taken 
to the nearby experimental room, approximately 
14 X 18 feet (23.44 m2), which was identified as the 
office that the experimenter shared with a colleague. 
The subject was directed to read the printed instruc- 
tions, in which a training pattern and examples of 
number sets were incorporated, 

Three tasks were required of all subjects: (a) to 
view the slides while matching the missing part of 
the design from the cards, (b) to listen to the series 
of three-digit numbers and attend to the number 6, 
and (c) to record the answers from each set of 
stimuli. The overloaded (high-overload) subjects 
were given simultaneous tasks and multifocus de- 
mands for sensory bombardment. A complex answer 
sheet with multiple-column grids and specified lines 
for positioning the numerical answers was presented. 
High-overload subjects were exposed to competing 
visual and auditory stimuli and instructed to record 
both the correct matrices pattern number and the 
critical digit sets at the same time. The slides auto- 
matically advanced every 5.5 sec, for a total of 49 
different slides and cards. During this time 49 sets of 
numbers were played, of which 23 contained the 
critical number 6. 

The subjects who were not overloaded (low- 
overload subjects) were instructed to listen to the 
tape after having viewed the slides and to record the 
answers in two broadly spaced columns as pre- 
sented consecutively with each separate set of stim- 
uli. Low-overload subjects heard 14 number sets with 
6 critical ones to record, or 26% of the auditory 
stimulus items required of high-overload subjects. 
They viewed a total of 14 slides at 11.5 sec. viewing 
time intervals, or less than 29% of the visual stimu- 
lus items confronting subjects in the high-overload 
condition. Subjects in pretesting confirmed the ex- 
perimental intent that the low-overload condition 
represented a leisurely atmosphere, whereas the high- 
overload condition simulated the ambiance of a 
frenetic urban environment. 

At this point, all subjects were told that the word 
RerrrAL would appear as a cue when the total 
amount of novel stimuli had been presented. Then, 
the items would be automatically repeated. The 
emergency episode (described separately) occurred 
during this retrial period. After one repetition se- 
quence of the stimulus items, a transition was re- 
quired for the final experimental demand. This 
transition was conveyed by a slide that read: “Please 
Turn Off Projector and Proceed to Answer Post- 
experimental Questionnaire.” On this self-report 
postexperimental measure, the subjects rated the dif- 
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j or each experimental task on assess magnitude and type of res} Interjudg 

e ee a Sa ee as to awareness agreement was .94 among a pani 10 facul 
of the experimental purpose by means of an open- members who were asked to r order th 
ended inquiry (Orne, 1962). scrambled and unnumbered categories, € omparab] 
At the close of the instructions, the subjects rankings have been reported (e.g., D: & Batson 
practiced operating the uncomplicated on-off mecha- 1973). The scaled helping measure \ AS follows 
nisms of each machine, on which pasted strips served 0—the subject did not offer a respons any kind 
to emphasize placement and direction of the Switches. 1—the Subject remained in his (her) se id verbally 
Before the experimenter focused the first slide and set communicated with the victim, askir hether shi 
the automatic timer, she stated that she would be was hurt or needed help, but did n ifer help 
working in her temporary quarters and would return 2—the subject sought indirect help by 1g for the 
in 15 minutes for discussion and collection of the experimenter; 3—the subject went di over to 
materials. Familiarizing subjects with Operating the the victim and stayed briefly (and ma ve helped 
machines and telling them where to find the experi- her to the chair Provided); 4—the ect went 
menter were points designed to augment the subjects’ directly over to the victim and then t for the 

5 perceived control of the experimental environment, experimenter; 5—the subject offered of the 

ar The addition of these two options could provide the following forms of direct prolonged elp: (a) 

f: Opportunity for a wider range of response alter. flexible-passive—asked the victim ho» to help 

natives in handling the staged emergency situation. (b) flexible-active—initiated helping act’ 1 by sug. 

s The episode. The confederate was a 31-year-old gestions to the victim, or (c) rigid—toli he victim 

= small-framed female of Anglo-American origin, who the Proper treatment for the injury. 

P was dressed in the campus style of slacks and over- The second dependent measure Was f: uency of 
shirt. She was familiarized with and rehearsed in her response in absolute terms of dichotomize: tegories, 
performance only and was blind to the experimental In its relation to the Magnitude measure, he cate- 
hypotheses and the independent variables. She re- gories were (a) no help (Scaled Response.» and 1) 
corded each response to the incident, and in each and (b) help (Responses 2, 3, 4, and 5). 
case it was corroborated with the subject’s subse- $ 
quent report, One victim was kept constant through- Debriefing 
out the study and was chosen to minimize effects The detailed debriefing was opened with erence 
due to stimulus Person characteristics in relation to to the subject’s Perception of the demands f the 

the sample, task as rated on the Postexperimental questiow naire, 

The inokdent occurred midway during the retrial 


The subjects who provided help usually mentioned 

. Ages the episode Spontaneously, in contrast with the ron- 

items. The Purpose of this timing was to Maintain helping subjects, re the 

'ediDN | s nuty task. The subjects in the high-overload co ition 

minimizing evaluation apprehension, which is most consistently expressed their Perception of the ask as 

highly present during the initial presentation ofa “hectic” or “so many things to attend to 

task (Rosenberg, 1969). The victim knocked on the same time.” The victim was perceived as : 

door of the experimental room, opened it, and tional stimulus with which to cope. Thes 

loudly called, ‘Tom—Tom, are you here?” She Were conspicuously absent for low. 

tripped on the threshold, and as the door slammed All nonsuspicious subjects re 

shut, she landed on the floor and yelled, “Oh, my ; 

foot,” while the heavy book i i i 

s Se} v] 

clattered behind her, She remained on the floor, ; ie ees. the iy 

intermittently moaning and clutching her right ankle hi î 
al ad perceiv 

until either the subject helped or 1 minute had i Peeve 


the left to see the victim, who was in unobstructed i inci 
view at a 14-foot (4.27-m) distance from the sub- wi, the ag 
ject’s chair. If the victim was ignored, she looked 
up at the subject before limping out and said the 
preset phrase, “I think my ankle will be better 
soon.” In response to verbal reactions, the Victim 
avoided eye contact and said, “I hurt my ankle.” 
When Specifically asked if help was needed, she 
neutrally stated, “I don’t know,” and continued to 
moan. When prolonged help was given, the inter- ject appeared upset 
change was kept minimal, but natural, for approxi- 

mately 2 minutes. In all cases, the victim initiated ? This subject and five othe 
leaving Procedures, and for all helping responses she helpers who expressed concern 
thanked the subject, F 


Dependent measures, The primary helping behavior 
rating was a 6-point quantified scale formulated to 


assured them that she was fine 
them at length. 
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sul i to support the deception until a 
ed announcing the study’s termina- 

ddressing an envelope to receive a 
nmary of the results, each subject 
ər participating and reminded to take 


notice 
tion 
forthe 


was th 
the pa 
RESULTS 
Effecti of the Overload Manipulation 
The e! eness of the overload treatment 
was asse vy each subject’s self-report on 


the postex;, imental questionnaire. The sub- 
jects rated veir perceived level of difficulty 
in concentra ion for the task on a 7-point 
scale, whic’ ranged from very easy to very 
difficult. ighly significant difference be- 


tween tl > conditions was revealed by an 
analysis ariance, F(1, 114) = 323.47, p 
001.2 T high-overload subjects rated the 
combine ands of the task as almost three 
times n ifficult (M = 5.56) than did the 
low-0y subjects (M = 1.95). The sta- 
tistica s corroborate the perceptions of 
the t pressed by the subjects in the 
debric eriod as well as those in pilot 
proce 
Sca Dichotomized Helping Measures 


I ans and standard deviations, in 
of response, for the three resi- 
we presented in Table 1, and 
in each overload condition are 
é Table 2. 

om Table 1 it is evident that a lower 
rate of helping behavior was provided by the 
rural group than by either of the urban 
groups. Furthermore, the lowest rate of help- 
ing, regardless of condition, was demonstrated 
by the rural groups in the low-overload con- 
dition (see Table 2). The results of the analy- 
sis of variance revealed a significant main 
effect for residence, F(2, 114) = 4.37, p < 


TABLE 1 


DARD DEVIATIONS OF SCALED 
EFFECTS OF RESIDENCE 


MEANS AND S 
HELPING R 


Residences M SD 


Mainland 2.86 2.04 
Honolulu 2.51 1.85 
Rural 1.67 1.94 


PEA 
Mainland » = 42, Honolulu n = 41, rural n = 43. 
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TABLE 2 


MEANS AND STANDARD DEVIATIONS OF SCALED 
HELPING Responses: EFFECTS OF 
RESIDENCE X OVERLOAD INTERACTION 


High overload* Low overload* 


Residence M SD M SD 
Mainland 214 2.01 3.57 1.81 
Honolulu 186 1.52 3.20 191 
Rural 1.90 2.14 1.45 1.70 
an = 63. 


.02, and the Residence X Overload interac- 
tion, F(2, 114) = 3.37, p < .05. A Newman- 
Keuls test for multiple comparisons showed 
that overall, the mainland and Honolulu ur- 
ban groups did not differ from each other in 
magnitude of response, whereas a significant 
difference was demonstrated between the 
rural group and each urban group ($ < .05). 
In the measurement of the Residence X Over- 
load interaction, both urban groups were sig- 
nificantly different from the rural group in 
the low-overload condition (p < .05). The 
rural group mean in the low-overload condi- 
tion did not significantly differ from any of 
the group means in the high-overload condi- 
tion. 

The main overload effect, F (1, 114) = 5.23, 
p < .05, showed that low-overload subjects 
helped significantly more (M = 2.71) than 
did high-overload subjects (M = 1.97). How- 
ever, the significant difference was totally ac- 
counted for by the urban groups, as shown by 
the Residence X Overload interaction means. 
A significant within-groups difference between 
overload conditions was revealed by the 
Newman-Keuls test for mainland subjects 
only (p < .05). The sex variable did not ap- 
proach significance as a main effect or as an 
interaction. 

The response frequency dependent measure 
showed that overall, 51% (64) of the sample 
helped the victim; 59% of the urban subjects 
provided help as compared with 35% of the 
rural subjects, x°(2) = 6.62, $ < .05. The 
helping frequencies were evenly distributed 
across residence groups for high-overload sub- 


isti tanti- 
3 Darley and Batson (1973) statistically subsi 
ated the 7 alidity of the use of parametric methods 
of analysis on ordinal data. 
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e scaled and dichotomized measures of help- 
z behavior. A Newman-Keuls test showed 
mainland urban males scored signifi- 
cantly higher than males from the other groups 
and from rural group females (p < .05) but 
did not significantly differ from either Hono- 
luiu or mainland urban females. While no 
si- nificant gender effects were obtained on the 
he'ping measures, the only cell in which 100% 
of ‘he subjects provided helping responses 
w:o mainland male low-overload. 
summary, in the above analyses involy- 
inz the dispositional variables, a significant 
rol-tionship was obtained between the cogni- 
t complexity measure of flexibility and 
dependent measures of helping behavior. 
s revealed by an analysis of covariance 
he residence variable was affected by 
live complexity as a factor influencing 
>` helping responses. When the Barron 
lexity Scale scores were treated as the 
dependent measure, the same pattern of re- 
st was reflected for the residence groups 
t obtained in the corresponding analysis 
of led helping responses: The rural group 
dei onstrated significantly less complexity than 


di urban groups. 

mplexity score relationships. The rela- 
tionship between the residence variable and 
the measure of cognitive simplicity-complex- 
ity was examined, The subjects’ scores ranged 
from 12.0 to 37.0, with the mean at 22.61 


and a standard deviation of 5.69. The sample 
mean was comparable to those for college 
student populations. The scores were catego- 
rized into classes of performance on the basis 
of standard deviation units. Despite no abso- 
lute difference in range of scores for the total 
sample, group differences were apparent in 
the number of subjects classed as less com- 
plex. The less complex were those who scored 
at levels of more than one standard deviation 
below the mean. Each urban group had an 
equal number of less complex scorers. The 
rural group had a significantly higher number 
of less complex scorers than did either of 
the other groups, x2(1) = 4.77, p < .05. 
Furthermore, 94% of the subjects who were 
helpers scored in the range of complexity 
(from one standard deviation below the mean 
through three standard deviations above the 


TABLE 3 


MEANS AND STANDARD DEVIATIONS or COMPLEXITY 
SCALE Scores; RESIDENCE X Sex INTERACTION 


Male Female 
Residence M SD M SD 
Mainland 26.35 543 23.50 . 5.23 
Honolulu 21.62 444 24.55 4.62 
Rural 21.00 6.24 19.04 4.62 


Note. The Barron Complexity Scale is scored as a positive 


integer from 0 to 50, Higher scores indicate more complexity, 


mean). In contrast, 85% of those who scored 


more than one standard deviation below the 


mean were nonhelpers. For the rural group, — 


62% of the scores were one or more standard 
deviations below the mean, and 88% of these 


subjects were nonhelpers. 

In relation to the overload variable, differ- 
ences were found among the less complex 
scorers in the low-overload condition. The 


high-overload less complex scorers usually did ` y 


not help. However, this combination of low 
Barron Complexity Scale scores and non- 
helping response differed significantly between 
residence groups in low overload, x(2) = 8.67, 
p< .02. There was no effect on helping be- 
havior due to depressed complexity scores for 
urban group low-overload subjects, whereas 
the rural group subjects who scored as less 
complex were nonhelpers more often than 
helpers, irrespective of overload condition. 
The effects of more and less complexity -on 
subjects’ helping behavior were demonstrated 
by the analysis of cognitive complexity scores. 
A significantly higher number of rural sub- 
jects than urban subjects had low Barron 


Complexity Scale scores and were nonhelpers. 


These findings served to clarify the pattern of 
low magnitude and frequency of helping 
behavior shown by the rural group. 


Qualitative Differences in Helping Responses 


The qualitative nature of the subjects’ re- 
sponses to the victim’s fall were examined. 
Among the nonhelping subjects, a significantly 
larger number of rural group subjects ignored 
the victim (60%) as compared with the Hono- 
lulu (17%), x°(1) = 6.26, # < .02, and a 
Jand (23%), x*(1) = 4.00, $ < .05, samples. 
For the rural group alone, Category Re- 
sponse 0 exceeded verbal reaction (Category 
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Response 1) and was also the dominant re- 
sponse made to the emergency among all 
category choices. No subjects who scored 
in the range of complexity gave Category 
Response 0. 

Among the helping subjects, 28% in the 
high-overload condition sought only indirect 
aid for the victim by going for the experi- 
menter (Category Response 2). No subjects 
in low overload chose this lowest type of 
helping response. Also, 21% who were over- 
loaded turned off the machines to aid the 
victim. Prolonged help (Category Response 5) 
was offered by 56% of the helpers, for a 
marked difference between this most helpful 
response and the lesser categories, y*(3) = 
34.00, p < .001. In regard to styles of re- 
sponse within the highest helping category, 
the flexible-passive response form was preva- 
lent among subjects who provided prolonged 
aid, x*(2) = 10.17, p<.01, and the two 
flexible forms comprised 78% of the response 
modes. The flexible-active form was used ex- 
clusively by males in all of the groups. No 
member of the Honolulu group used the rigid 
form, whereas the rural group employed 
this form more often than any one of the 
flexible forms. 

In summary, the primary response in the 
nonhelping categories for rural group subjects 
was to ignore the victim, whereas urban group 
subjects offered verbal communication. In the 
helping categories, the method of helping 
behavior varied between overload conditions. 
The majority of helpers provided prolonged 
aid, and the dominant mode of helping was 
a flexible one. 


Discussion 


The finding that rural subjects helped the 
victim less than did urban subjects does not 
conform to the popular notion but is in accord 
with the views that detachment is not nec- 
essarily an urban phenomenon (Latané & 
Darley, 1970; Milgram & Hollander, 1964; 
Rosenthal, 1964). The results substantiate 
reported findings of social inhibition under 
exposure to stimulus overload (Krupat & 
Epstein, 1973; Sherrod & Downs, 1974) and 
the predicted absence of gender effects. It was 
initially puzzling to find that the rural 
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group’s behavior was neither affecte: by the 
situational factor of overload nor © tingent 
on experimental task performance. } wever, 
the consistent pattern that emerged  ovided 
confirmation for the influence of flexi- 
bility-complexity variable and dem: strated 
that the observed group differences i) \elping 
behavior appear to be a function of : nitive 
complexity scores, The complexity d \ension 
was revealed as a very important ele: ont for 
helping action to have occurred. 

Of central importance in accountin: ‘or the 
results is Barron’s (1953) hypothesi ut the 
social cognition of cognitive comple may 
be acquired by early experience in > viron- 
mental mastery and exposure to com, °x de- 
velopmental stimuli. Put to empirica! vst by 
Sechrest and Jackson (1961), a six: ‘cant 
positive relationship between the cor Jexity 
of developmental stimuli afforded by amily 
background experiences and social « ctive- 
ness was demonstrated, which these «thors 
concluded was “of especial importance ‘cause 
that complexity measure might hay ausal 
implications” (p. 180), Berlyne (196 cited 
additional evidence to suggest that t more 
complex environment fosters an “ou oing” 
(p. 196) attitude to the external w d. In 
addition, Krug and Kulhavy (197: have 
discovered aspects of personality as.. ciated 
with less effective role interaction, whi ı they 
attributed to restricted opportunities r: ulting 
from relative social isolation due to lemo- 


graphic origin. Moreover, there is c\ idence 
that variations in cognitive styles are ‘slated 
to differences in family experiences and that 
stylistic tendencies are “end-producis 
particular socialization experiences” (\ 
1967, p. 234). Therefore, the present findings 
seem aptly attuned to the interpretation 
that perhaps circumscribed patterns of social- 
perceptual learning during the socialization 
years for rural-reared subjects have resulted 
in their being lower in social effectiveness 
than the urban group subjects. The similarity 
of age, educational level, and socioeconomic 
status in this sample attests to observed dif- 
ferences between groups as due to a hetero- 
geneity of background experiences. In addi- 
tion, differences in ethnic origin did not affect 
the frequency of helping response. 
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ecessary to consider how differences 
round experiences could have differ- 


enti `y influenced helping behavior between 
resi ice groups. Friedrichs’ (1960) sugges- 
tion sat altruism may depend on the relative 
ing cup or outgroup nature of the reference 
gr involved is appropriate. People who 
g! ıp in rural areas may have a maximum 
rap ort and cohesiveness within the family— 
fr reference group, which is likely to be 
ba ed by a minimum rapport and cohesive- 
ne with the secondary environment. Similar 


in us is the proposal that psychological 
n ership in a potential helper’s ingroup is 
bi upon clearly established bonds of simi- 
k (in shared beliefs), which serves to 
I ‘te a helping relationship (Sole et al., 
io). Thus, the victim as a total stranger 
ë yutsider” was not a member of the rural 
subjects’ perceived ingroup. Further- 
those low in cognitive complexity have 
found to be high in social distance 
ptions (Vannoy, 1965). 
offshoots of the ingroup—outgroup con- 
three sets of factors pertinent to the 
ism research could have acted as determi- 
ts for the rural-urban diversity. First, a 
‘lar outlook may foster an individual’s 
‘ocial responsiveness toward a stranger 
e London, 1970) and can be viewed from 
perspective of Becker’s (1957) sacred— 
cular typology. Becker described the sacred 
Lype of person on the continuum as “a kind 
of rural and small town American version of 
rineteenth century Protestant Christianity” 
(p. 146), who stresses ingroup loyalties and 
is reluctant to countenance change. In con- 
trast, the secular-oriented person endorses 
values emphasizing viable change and main- 
tains an allegiance to principles, while being 
free of prejudice. It has been shown that 
people who are socially aware and construc- 
tively active in social issues are responsive to 
change and have adopted values that reflect 
the contemporary scene (e.g, see Block, 
Haan, & Smith, 1969). They have been found 
to be at the highest level of moral reason- 
ing, rejecting both conservatism and violent 
radicalism (Fishkin, Keniston, & MacKinnon, 
1973). 
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Second, Feldman (1968) indicated the im- 
portance of isolating specific antecedent vari- 
ables and conjunctive learning patterns for 
tapping outgroup discriminations in helping 
behavior to a stranger. Defining the ingroup 
as “people like me” (p. 213), a different 
range of cognitive definitions seems to exist 
among subgroups within a general population, 
Because a more expansive rather than con- 
stricted view of the world is afforded by cog- 
nitive complexity (Barron, 1953), the com- 
plex person’s reference group is likely to be 
more inclusive than that of the less complex. 
As such, this reference group would probably 
extend to a stranger, especially to a victim 
of an empathy-evoking accident within the 
common milieu of the college campus. The 
less complex person’s reference group may be 
increasingly exclusive and, therefore, con- 
fined to significant others seen in terms of 
functional relevance. 

Third, the two orientations toward social 
dealings, reciprocity and responsibility, have 
been linked to differences in backgrounds as 
influencing one’s social outlook (Berkowitz, 
1972). Reciprocity calls for socially respon- 
sible behavior only toward those who com- 
prise the intimate group. In contrast, the re- 
sponsibility orientation is considered as being 
beyond the constrictions of social exchange 
and distributive justice considerations, moti- 
vated by the more abstract ideal of freely 
providing service. Perhaps reciprocity versus 
responsibility served to delimit rural from 
urban to account for differences in helping 
behavior between groups. 

As an alternative explanation, it may have 
been that cognitive complexity afforded in- 
creased response versatility (e.g., see Crockett, 
1965). In the framework of the decision- 
making model formulated by Latané and 
Darley (1970), only one set of choices leads 
to prosocial action. First, the subject must 
notice the event and then define it as one that 
calls for help, then decide to assume personal 
responsibility for taking action and finally, 
choose the type of action and implement it. 
As established in the debriefing period, all 
subjects were at the point of whether or not 
to assume responsibility, and one of several 
courses could be followed: (a) resolution of 
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conflict in a decision to help, (b) no conflict, 
(c) cycling in conflict, and (d) resolution of 
conflict in a decision not to help. The social 
costs of not helping were low. There was no 
expectation of either future interaction with 
the victim or of reproach from anyone out- 
side of the presumably casual encounter, and 
extrinsic rewards were also unlikely. Respond- 
ing to the total environment required moving 
beyond the specified task requirements to the 
implicit demands created by the accident. 
People who are low in cognitive complexity 
tend to eliminate complex and unwelcome 
aspects of the environment by psychological 
withdrawal of involvement and thus control 
= potential dissonance and minimize the pos- 
= sibility of conflict (e.g., Crockett, 1965). In 
this manner of restricting response variability, 
low-complexity subjects were not likely to 
experience moral pressure to help or to per- 
ceive the event as within the boundaries of 
personal relevance. By preventing the stimu- 
lus cues in the situation from becoming 
salient, responsibility can be denied by avoid- 
ance, and conflict can be eliminated by in- 
hibiting empathic arousal. The statements 
repeatedly made by the substantial number 
of low-complexity scorers in the rural group 
who ignored the victim were as follows: “It 
was not my concern” or “It was not my 
doing.” These comments conveyed the lack 
of association between the incident and the 
self, and, in their seeming emotional flatness, 
reflected a detachment from the situation. 
In contrast, when faced with multifaceted 
demas, the more complex urban group sub- 
uld have at their disposal a broader 
f options for processing information 
social environment than do persons 
omplexity, and thus have a more 
re ice as to whether or not to help. 
iting effects of the high-overload 
tre e revealed by the difference in the 


ma helping behavior between the 
two for example, by the minimal 
degree ise offered by high-overload 
subjects ved. This result was con- 
sistent w am’s (1970) theory for 
superficial ic ‘avolvement as an adap- 
tive mechani coping with excessive 
stimuli but upport his supposition 
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that rural and small-town people would dis- 
play more willingness to help and a greater 


degree of involvement than would urbanites, 
Corroborating the findings of Sherrod and 
Downs (1974) for decreased magnitude but 
not frequency of altruistic response, the nega- 
tive consequences of overload for pr: social 
action were apparently mediated by the fact 
that subjects had the options of securing 
other assistance and/or operating tho ma- 
chines. The substantial percentage of su jects 
(49%) who employed either of these means 
uniquely in the high-overload condition sup- 
plies an explanation for the lack of diffe: ence 
in response frequency between subjects i the 
two overload conditions. If it is assume: ‘hat 
the absence of these options would have pre- 
cluded helping responses for high-ove: load 
subjects, then the inclusion of these » iced 
choices would have served to doublo the 
number of helpers despite exposure to simu- 
lus overload. Slight variations in the situ- tion 
have repeatedly been shown to affect h«' ing 
rates drastically (e.g., Latané & D> ley, 
1970; Clark & Word, 1972). It follows ‘hat 
an increase in perceived control of the envi- 
ronment could serve to deflect the impact of 
the inhibitory effects of overload and re- 
sult in involvement with one in need on a 


limited basis. 

The singular ineffectiveness of the overload 
manipulation for the rural group is somewhat 
enigmatic, but it could be explained in the 
following manner. It seems reasonable to as- 
sume that the base rate of helping behavior 
was reflected in the low-overload condition. 
Exposure to stimulus overload was expected 
to inhibit prosocial response. The lack of a 
significant difference between the two levels 
for the rural group alone may have been due 
to the fact that their base rate of helping was 
so low that manipulations designed to de- 
Crease or inhibit this behavior further were 
ineffective. In other words, there may have 
been a “floor effect” operating for the rural 
group, who did not differ from the urban 
groups in task performance when they differed 
in helping behavior and vice-versa. It appears 
that low complexity affected the rural group 
subjects similarly in both overload conditions 
and thus superseded the overload treatment. 
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The importance of using multiple independent 
variables and dependent measures is clearly 
illustra in this instance, because otherwise, 


the effects of the residence variable on over- 
load w« have been masked. 

As expected, the sex of the subject proved 
incons ntial in affecting either the magni- 
tude or frequency of helping behavior. It is 
interesting to note that within the prolonged 
helpin egory on the scaled measure, only 
males ivitiated helping action. At this finer 
level ponse gradations, opportunities for 
cultu <pectations to come forth were 
prese the situation, in that males are 
usually trained to be more assertive. 

The ‘| lack of a relationship between 
cogni interference and helping behavior 
serve mphasize the multidimensionality 
of si broad constructs as “flexibility— 
rigidit However, flexibility was demon- 
strate her among the cognitively complex 
subje ithin the helping domain, as those 
who | prolonged help to the victim 
predominantly used the flexible forms of 
respon 

The question of generalizability of results 
is periiuent when the population studied is 
culturally or geographically dissimilar from 
the mainstream, The findings of this study 
would seem to have external validity based 
on the following: (a) all subjects were born 
and re l in the American core culture, and 


differex 
quency 
grounds 


in ethnicity did not affect the fre- 
helping behavior, (b) the back- 
of the mainland urban and rural 
I s covered 21 states representing 
> major regions of the United States, 
(c) the Honolulu urban group’s helping be- 
havior and cognitive complexity did not 
differ from the mainland urban group, and 
(d) controls were instituted in establishing 
residence criteria that eliminated students 
who had migrated to Hawaii during their 
Stowing-up years, and those classified as 
mainland urban had resided in Hawaii for 
less than 3 years, Therefore, the results ap- 
Pear to be a function of true rural-urban 
differences in subjects of college age. 

The implications derived from this study 
ate that social-learning methods, such as the 
techniques of role playing and responsibility 
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training, may be fruitful means of extending 
an individual’s experience during the sociali- 
zation years within the confines of a given 
environment. The finding of an interrelation 
among variables from the separate classes of 
determinants leads to the following sugges- 
tion: The readiness to respond is the contri- 
bution of personality, while the situation af- 
fects its activation, and both claim an inter- 
dependence with the demographic aspects of 
a potential helping agent’s experiential world. 
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Mistreating Other Persons Through Simple Discrepant Role 
Playing: Dissonance Arousal or Response Contagion 


Herman Verhaeghe 
Catholic University of Louvain (K. U. Leuven) 
Louvain, Belgium 


Nuttin recently advanced a theory of response contagion to account for the 
attitudinal effects of elaborate counterattitudinal advocacy, Two studies are 
reported that examine the relative merits of dissonance theory and response 
contagion theory in explaining the evaluative changes produced by a simple 
discrepant speech that effects negative outcomes for the audience. Subjects 
were asked to encourage their experimental partners to perform a dull task 
for a fairly long period of time by telling them that it was enjoyable. In line 
with both theories, subjects displayed attitudinal changes only: (a) under a 
low monetary inducement, (b) when perceived choice was high, (c) when the 
aversive consequences were inevitable, and (d) when the audience was not 
compensated for the negative outcomes incurred. Consistent with dissonance 
theory but contrary to response contagion theory: (a) perceived choice and 
liking for the audience decreased when dissonance reduction through attitude 
change was blocked, (b) no attitude change occurred when the subjects had 
been previously mistreated by the audience, and (c) the particular attitude 
cognitions changed depended upon the specific aversive consequences pro- 
duced rather than upon the specific discrepant statements professed. In addi- 
tion, postexperimental attitude responses were directly related to perceived 
choice, However, perceived responsibility for behavioral consequences was not 
related to perceived choice and attitude change, and discrepant role playing 


was not found to produce behavioral change. 


ent dissonance theorizing (Collins, 1969; 
Col: us & Hoyt, 1972; Cooper & Worchel, 
1970; Nel, Helmreich, & Aronson, 1969) 
post:!ates that an individual experiences cog- 
nitive dissonance when he freely chooses to 
deliver an attitude-discrepant speech pro- 
duciny negative consequences for himself 
and/or ‘or his audience. In order to reduce 
dissonance, he would then change his attitude 


to be more consistent with overt behavior. 
The studies reported were conducted at the 
m voor Experimentele Sociale Psychol- 

' at Louvain, Belgium, and constitute a second 
Section of the author’s doctoral dissertation, in 
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An accumulating body of research supports 
this prediction (Calder, Ross, & Insko, 1973; 
Hoyt, Henley, & Collins, 1972; Verhaeghe, 
in press), but its theoretical interpretations 
remain controversial. At least two theories 
can adequately account for prodissonance re- 
search without hypothesizing a motivational 
process of dissonance reduction: self-percep- 
tion theory (Bem, 1967, 1972; Kelley, 1967) 
and response contagion theory (Nuttin, 1975). 

Nuttin (1975) recently proposed a response 
contagion theory of persuasion that ap- 
proaches the classical attitude change phe- 
nomenon with a “mere evaluative verbal re- 
sponse” paradigm. Abandoning the traditional 
conceptualization of social attitudes as “en- 
during organized interdependent systems of 
cognitions, affects and behavioural disposi- 
tions” (p. 178), Nuttin defined attitude re- 
sponses as evaluative verbal responses which, 
as any other form of verbal or nonverbal 
behavior, are determined by previous (old or 
more recent) responses and some specific 
features of the behavioral situation. Persua- 
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sion occurs when subsequent evaluative re- 
sponses (emitted during the postexperimental 
“attitude” measurement) are more similar to 
the prevailing evaluative characteristics of the 
prior evaluative responses (emitted during the 
persuasive message). Such evaluative assimi- 
lation of subsequent responses to prior re- 
sponses is said to result from a noncognitive 
process of response contagion, which in turn 
is assumed to be a direct function of the 
nature of the prior responses, the degree of 
perturbation or arousal under which the re- 
sponses are emitted, and the temporal contigu- 
ity between prior and subsequent responses. 
Nuttin’s (1975) response contagion model 

was developed on the basis of a progressive 
research program on the attitudinal effects of 
proattitudinal and counterattitudinal advo- 
cacy. In all of his studies, subjects were in- 
vited to generate arguments against or in 
support of their attitudes on the important 
issue of exam reforms. Postexperimental at- 
titudes were assessed immediately after sub- 
jects had presented their written or oral 
advocacy and had been passively exposed to 
the product of their improvisation, An initial 
study, a critical replication of the Rosenberg 
(1965) and Helmreich and Collins (1968) 
studies, produced the dissonance-predicted, 
inverse incentive-attitude change relation- 
ship. Several other findings, however, clearly 
disconfirmed the cognitive dissonance view on 
the role of a monetary reward for compliance, 
the discrepant nature of the behavior, and 
its potential aversive consequences in pro- 
ducing dissonance arousal and subsequent 
attitude change. First, a small monetary re- 
ward produced significantly more attitude 
change than no reward. Second, the atti- 
tudinal effects of various rewarded counter- 
attitudinal conditions did not differ signifi- 
cantly from those obtained in proattitudinal 
conditions. Finally, a public and identified 
discrepant advocacy on a nationwide tele- 
vision broadcast did not produce more atti- 
tude change than a similar private and 
anonymous advocacy. 

Further research revealed that greater post- 
advocacy change resulting from low reward 
relates to the embarrassing properties of a 
trivial reward rather than to the subject’s 
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feelings that such a reward does noi justify 


his behavior sufficiently. A similar inc:case in 
postadvocacy attitude change occurre when 
subjects received severe but undeserved \!ame, 
when they received an important but ` egiti- 
mate reward, and when the experimenic wore 
an embarrassing outfit. From these data, 
Nuttin (1975) concluded that any pertusbing 


im- 
1ge 
nce 


or arousal-increasing event enhances ti 
pact of a self-generated persuasive m 
on a subsequent attitudinal response. 
this effect was equally strong for both ; ro- 
attitudinal and counterattitudinal condit: ns 
and obtained when subjects were asked to 
either maintain a most convincing plea or 
merely show off their verbal creativity, an 
interpretation in terms of cognitive dissonan e 
or self-persuasion was not very plausible. . 
noncognitive process of response contagion 
was therefore advanced to account for the 
results. The response contagion hypothesis 
essentially maintains that an increase in 
arousal enhances the relative impact of the 
most recent ongoing responses on immediate!y 
following attitude responses. As a resuit, 
with increasing arousal, postadvocacy attitude 
responses would become increasingly similar to 
the prevailing evaluative characteristics of ‘he 
previously ongoing responses. 

This response contagion view implies that 
counterattitudinal advocacy does not chenge 
an “underlying organized and structured so- 
cial attitude” (Nuttin, 1975, p. 171) but 
merely affects the evaluative loading of the 
particular evaluative response that has been 
emitted in close temporal contiguity to the 
evaluative responses emitted during role play- 
ing. Nuttin’s data confirmed this view. When 
the postexperimental attitudinal response was 
tapped for the second time 5 weeks after the 
advocacy, the experimental treatment proved 
to have a long-lasting effect on this previously 
changed evaluative response. However, when 
this same response or a similar attitudinal 
response was tapped for the first time 5 weeks 
after the advocacy, there was not the slightest 
sign of enduring attitude change. These find- 
ings clearly challenge the traditional interpre- 
tation of the observed postadvocacy change 
in an attitude response as a symptom of a 
modification in an underlying attitude. Ac- 
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cordingly, classical concepts of social 
attitudes a titude change were abandoned 
by Nuttin, «nd the response contagion hy- 
pothesis v ormulated in terms of a mere 
evaluative ıl response view on persuasion. 

Nuttin ı obviously are more clear in 
invalidati issonance interpretation of the 
attitudin ts of discrepant advocacy than 
in clarify ie hypothesized process of re- 
sponse «í m. They nevertheless provide 
a seriou ‘enge to recent dissonance theo- 
rizing an sest a possibly valid alternative 
interpre or the data obtained and inter- 
preted w his framework. Indeed, freely 
performi act that produces aversive 
consequ likely to be a perturbing 
event th ases arousal and fosters re- 
sponse «í n. Prodissonance research can 
therefore egrated by response contagion 


theory =: ily as by recent dissonance 


iments were conducted to exam- 


ine furt relative merits of both inter- 
pretation 1e studies were especially de- 
signed to test the plausibility of recent 
dissonance theorizing on the cognitive effects 
of simple discrepant role playing that pro- 


duces important aversive consequences for the 


audience, (b) control for the alternative re- 
sponse cx ion theory, and (c) examine the 
effects of simple discrepant role playing upon 
Subsequent attitude-related, overt behavior. 


EXPERIMENT 1 


The purpose of the first study was twofold. 
First, it attempted to replicate the disso- 
hance-predicted, inverse relationship between 
amount of incentive for compliance and atti- 
tude change under conditions of high choice 
and high consequences. Second, it tested the 
hypothesis that discrepant role playing does 
Not produce dissonance arousal and subse- 
quent attitude change when the audience is 
Compensated for the negative outcomes it 
incurs or when it deserves to be misled. Since 
the consequences of an attitude-discrepant 
Speech become less aversive under such cir- 
Cumstances, less dissonance is present, and a 
tole Player would not have to change his 
attitudinal cognitions, 
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Response contagion theory makes similar 
predictions. Like dissonance theory, it pre- 
dicts the inverse incentive-attitude change 
relationship. It also predicts no change when 
the audience gets compensated for the aver- 
sive consequences received, because such 
compensation is likely to reduce the arousal 
produced by effecting these negative conse- 
quences. Unlike the dissonance arousal hy- 
pothesis, however, it does predict a change 
in attitudinal responses when the audience 
deserves to be misled. Indeed, if the audience 
has first severely mistreated the role player— 
and hence deserves the aversive conse- 
quences—a sharp increase in arousal can be 
expected and the stage is set for response 
contagion to operate. 

To test these hypotheses, the traditional 
baseline, low-incentive, and high-incentive 
conditions were supplemented with a com- 
pensation and retaliation condition. In the 
compensation condition, subjects were given 
an opportunity to compensate the victims of 
their discrepant behavior by having them 
receive an appreciated reward. In the retalia- 
tion condition, subjects were seriously mis- 
treated by their collegaues before engaging 
in discrepant role playing. The subjects were 
subsequently asked to mislead their col- 
leagues, which would give them a chance to 
retaliate for the mistreatment received. 
Whereas dissonance theory predicts more at- 
titude change in the low-incentive condition 
than in all other conditions, response conta- 
gion theory also predicts a change in the 
retaliation condition. 


Method 
Subjects 


Subjects were 46 male high school students (from 
14 to 16 years old) who volunteered to participate 
in a 2-hour experiment of “group discussion and 
group decision making.” 


Procedure 


Since the basic procedure for the retaliation and 
compensation conditions is identical to that for 
the low- and high-incentive conditions, the latter 
conditions are described in detail first. 

Overview. Under the guise of an experiment on 
the effects of an expert's advice on a group decision, 
subjects played the role of expert and were first 
induced to perform a dull and monotonous task. 
Next, they were informed that the group (actually 
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two confederates) intended to perform the task for 
15 to 20 minutes. The experimenter offered them 
either 5 (low incentive) or 50 (high incentive) 
Belgian Francs for telling the group that the dull 
task was enjoyable and for advising the group to 
engage in the task for 50 minutes.? After subjects 
had delivered their counterattitudinal advice, they 
discovered that the group had decided to perform 
the task for 45 minutes, They then anonymously 
indicated their attitudes toward the task and decided 
on a performance period for themselves. 
Preliminary instructions. Upon the arrival of one 
subject and two confederates, the “group decision- 
making” experimenter (Experimenter 1), pretending 
to be interested in how members of a small group 
proceed in reaching a preliminary consensus and 
how the advice of an expert might contribute to the 
group’s final decision making, told the subjects that 
they had to perform a simple task. First, however, 
they should decide upon how long to perform the 
task. The task was said to be studied by Mra 
(Experimenter 2) of the General Experimental Sec- 
tion of the Institute, who wanted to administer the 
task to a large number of people over periods vary- 
ing in length from 5 to 60 minutes. The partici- 
pants, therefore, were said to be completely free 
in choosing whatever performance period they liked. 
Experimenter 1 further explained that one of the 
three participants would play the role of expert. 
His task was first to perform the task and then, 
as an expert, to recount his impressions and advise 
some particular performance period to the group. 
The other participants, as a group, first had to 
make a preliminary decision on the basis of a short 
description of the task, and after receiving the ex- 
pert’s advice, make a final and unanimous decision by 
voting. Experimenter 1 emphasized that the expert’s 
role was strictly advisory, so that he would not 
participate in the final group decision making. After- 
wards, however, he would be given an opportunity 
to freely choose a performance period for himself, 
Experimenter 1 then proceeded to “randomly” assign 
the role of expert to one of the three participants, 
and the only true subject was selected. A first ques- 
tionnaire, intended to enhance the credibility of the 
cover story, was handed to all participants, and 
Mr (Experimenter 2) was invited by phone 
to come and administer the task to the subject. 
Experimenter 1 left the subject to Experimenter 2 
and supposedly went to start the group discussion, 
Task performance. Experimenter 2 conducted the 
Subject to a room on another floor of the building 
where he made him perform a dull and boring task. 
Over a set of headphones, the subject listened to a 
series of simple additions and subtractions (e.g., 
es) monotonously read at a rate of one per 
5 sec. When the answer given was incorrect, the 
subject had to cross out the word correct on an 
answering sheet. Actually only 1 out of every 60 
answers was incorrect. 
Inducement of counterattitudinal behavior. When 
the subject had performed the task for 20 minutes, 
Experimenter 2 was informed by Experimenter 1 
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that the group discussion was over anc s asked 
to return with the subject. Experimente: ien gave 
the following instructions: 

Well, the others, in the meantime, ha nore or 
less agreed to perform the task for to 20 
minutes. As I told you, I am specificall erested 
in studying the effects of an expert’ ice on 
a group decision, Advice which devi from 
the preliminary group agreement would ‘hus be 
in my best interests. Therefore, I my. í have 
Prepared two sets of advice. 

[While showing them to the subjects, he c vues.) 
The first one advises the group to perform task 
for 10 minutes, while the other one ad: ses 50 
minutes. The group now is spontaneously villing 
to participate for 15 to 20 minutes, and nce I 
wish to have advice that clearly deviate: from 
their preliminary choice, I would like ask 
you to advise 50 minutes. Of course, it om- 
pletely up to you to decide, but if you are lling 
to do it for me, I can give you 5 Belgian ancs 
(50 Belgian Francs, respectively). Do you hink 
you can do it.. .? 

After the subject agreed, he was given the low or gh 


monetary reward. Experimenter 1 then left the r: om 
to give the subject the Opportunity to rehearse his 
discrepant advice. The standardized text stated: 
“Well, after all, it is a rather simple task abe at 
which there is not so much to tell, but I enjo: ed 
it very much. I found the task very pleasant, ond 
I therefore propose that it is really worthwhile to 
perform the task for 50 minutes.” 


Counterattitudinal behavior and behavioral ose- 


quences. The subject was then taken to the con- 
federates and delivered his advice, One confedirate 
(always carrying his swimming trunks and towel 
with him) replied that he had planned to go swim- 
ming but that he was willing to give up this plan 


in order to Perform the task for a longer 
since the subject said that the task was to be 
ant and enjoyable. Hereafter, the confederates 
their final decision 


nade 
by voting. They were instructed 
to vote first 45 and 40 minutes, then 40 and 45 


minutes, and finally 45 and 45 minutes. Experi- 
menter 1, after restating the final decision, handed 
out a questionnaire, mainly intended to check the 
effectiveness of the Consequence manipulation. On a 
17-point scale, subjects indicated how strongly or 
weakly they felt their advice had affected the group 
decision. Experimenter 1, in the meantime, invited 
Experimenter 2 by phone to come and administer 
the task. 

Dependent measures. Experimenter 2, informed by 
Experimenter 1 of the group decision, conducted the 
subject and the confederates to his section of the 


1 Dollar equivalents are $.125 and $1.25, respec- 
tively. 
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instructed two of his assist- 
onfederate perform the task 
separate room) and to have 
ionnaire afterwards. He then 
ject for having to leave 
e other person (Experi- 
arrive to take over. Experi- 
) the condition the subject 
m the latter’s point of view, 
any of the events that had 
imenter bias or social desir- 
only minimally affect the 
he experiment. 
handing a questionnaire to 
u have already performed 
give you a questionnaire 
st impression on it. Please 
d sincerely as possible.” The 
several 15-point scales, de- 
ee index scores. Summing 
; varied—monotonous, enjoy- 
pleasant produced an “enjoy- 
the task, with values ranging 
) through 24 (the midpoint) 
Since additional justification 
| task for a long time might 
ing the importance of the 
ising its tiresome nature, two 
s were planned. The “impor- 
| the scales important-unimpor- 
meaningless. The “hardness 
s tiresome-not tiresome and 


re 


asked to choose a perform- 
“As you know, there is a 
periods ranging from 5 to 60 
sll of these periods are equally 

for this research, so you may 

il What period do you then 
3 circled the period chosen 
a form and invited him to 


s and postexperimental inter- 
t had performed the task for 
was conducted by Experimenter 3 
1 who administered a final ques- 
subject to evaluate both his 
and Experimenter 1 on 8-point 
on these scales produced an 
nging from —68 (most negative) 


the 


through 0 (the midpoint) to +68 (most positive). 


to 
tioy 


aion index toward the confederates served 
k m the manipulations of the retaliation condi- 
* “he attraction indexes also examined whether 


$ subj 
\Subjects would eventually attempt to find additional 


Justification for their 
victims of the discrep 
Positive feelings towar 
for whom they had 


colleagues, A fin 
Menter 1 att 
ied: heen an 


a 


behavior by derogating the 
ant act or by enhancing their 
d the person (Experimenter 1) 
been willing to mislead their 


al interview, during which Experi- 
ki pted to detect whether the subject 
n suspicious about the procedures used, 
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concluded the experiment. No subject indicated 
having the least suspicion as to the manipulations 
of the experiment. The subject was then carefully 
debriefed and sworn to secrecy. 

Compensation condition. In all but one respect 
this condition was identical to the low-incentive 
condition. After the group decision phase had been 
completed, Experimenter 2 mentioned that he had 
some film tickets as a reward for those who would 
cooperate with his research. Unfortunately, there was 
a shortage of tickets so that only two of three per- 
sons could receive a ticket. He therefore wondered 
if one of the subjects perhaps did not care so much 
about receiving a free film ticket. If so, he would 
be able to give the available film tickets to the 
other participants. The confederates, of course, never 
showed any sign of disinterest, If the subject did 
not spontaneously propose that the reward be given 
to the victims of his behavior, Experimenter 2 sug- 
gested that they express on a 10-point scale how 
strongly they desired to receive the ticket. The 
confederates always indicated the number 10, and 
the subject was thus asked if he would agree to 
give the tickets to the confederates. All but one 
subject agreed to do so. One of the confederates 
then expressed his happiness with the reward by 
saying that for such a reward, he would be quite 
willing to perform the task for an even longer time. 
Subjects thus contributed either spontaneously or 
more passively to the compensation of the victims 
of their discrepant act. 

Retaliation condition. In this condition, otherwise 
identical to the low-incentive condition, an informal 
conversation was introduced at the end of the pre- 
liminary group decision and the expert's first per- 
formance of the task. The alleged purpose was to 
have an opportunity to improve each other’s ac- 
quaintance. As soon as Experimenter 1 had left the 
room, the subject was asked by a confederate to 
choose a topic of conversation. The subject’s pro- 
posal, however, was rejected as uninteresting. The 
confederates then claimed not to be interested at 
all in having a talk anymore and started reading a 
newspaper. Some time later, a confederate happened 
to discover some bottles of cola and cups and served 
a drink to himself and the other confederate. 
After the bottle and the cups had been emptied, 
they were carefully put back. When Experimenter 1 
returned, he noticed the empty bottle and asked 
if someone had perhaps drunk a cola. At first there 
was no reaction, but then one confederate suddenly 
accused the innocent subject. Apparently annoyed, 
Experimenter 1 said that the colas were not his 
and that such behavior certainly was not as it 
should be. He then immediately continued in a 
neutral way. v. 

Baseline condition. In the baseline condition, the 
dependent measures were assessed after the task had 
been performed for the first time. aseline subjects 
thus indicated their attitude to’ the task and 
chose a performance period in the absence of any 
discrepant behavior or monetary incentive. 
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TABLE 1 
MEANS For POSTTEST VARIABLES, EXPERIMENT 1 


Enjoyment Attractiveness cries veness 
if oi 


oi oi 
Group tasks audience experimenter 
High incentive 18.25 +35.7 +43.95 
Low incentive 27.75 +34.65 +4745 
Compensation 15 +25.6 +37.8 
Retaliation 14.75 —15 +44.33 


Nole. For the compensation and retaliation groups, n = 6; 
for the other groups, » = 10. Higher (positive) numbers repre- 
sent more positive evaluations, 

* Baseline group M = 15,4, 


Results 


The data from four subjects were not in- 
cluded in the final analysis. Two subjects (in 
the low-incentive condition) refused to accept 
the money. The results from another subject 
(in the low-incentive condition) were dis- 
regarded because of the confederates’ acting 
out of character. Finally, the data from a 
subject assigned to the compensation condi- 
tion were deleted, since he was not willing to 
give up the free film ticket. 


Effectiveness of the Manipulations 


All subjects indicated that their discrepant 
advice had strongly affected the group deci- 
sion. No significant differences among the 
four experimental conditions were found. In 
the retaliation condition, negative feelings 
toward the confederates were effectively in- 
duced. They were the only subjects to give 
negative scores on the attraction index 
(M = —15). These scores were significantly 
different from the responses of the other 
subjects (M = +31.95). 


Cognitive Changes 


Aversive consequences consisted of causing 
the confederates to perform the dull task for 
a long time by asserting that the task was 
pleasant and enjoyable. The enjoyability in- 
dex toward the task, therefore, is considered 
the major dependent measure of the experi- 
ment, The mean scores on this index are 
presented in Table 1. The results strongly 
corroborate the predictions derived from dis- 
sonance theory. Only low-incentive subjects 
rated the task as more enjoyable than base- 

line subjects (¢ = 3.69, p < .005).2 Planned 
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t tests further revealed that the ;onance- 
predicted, inverse relationship betwee: amount 
of monetary reward and subsequen: attitude 
change is significant (¢ = 2.45, p < .025). As 
predicted, compensation subjects an: retalia- 
tion subjects evaluated the task as less enjoy- 
able than low-incentive subjects (p < 01 and 
p < .005, respectively). The results of the 
retaliation condition thus do not sup: ort the 
prediction derived from response c i‘agion 
theory. No significant differences were found 
on the importance index, the hardnes. ‘ndex, 
and the attraction indexes toward the xperi- 
menter and the audience (retaliation si >jects 


excluded). 


Behavior Changes 


Subjects’ postexperimental choices of a per- 
formance period provide a measure of the 
behavioral effects of discrepant role playing. 
If discrepant role playing is effective in pro- 
ducing behavior change, low-incentive sub- 
jects should have chosen a longer perform ince 
period than the others. Differences be‘ een 
conditions were in the predicted directioi. but 
nonsignificant. The low-incentive manipula- 
tions, although quite effective in producing 
attitude change, did not affect subse: went 
overt behavior. It appears that simple dis- 
crepant role playing, by itself, is not effective 
in producing behavioral changes. 


Discussion 


The data from the first study are consistent 
with the notion that mistreating other per- 
sons through discrepant role playing leads to 
the arousal of cognitive dissonance only when 
the recipients of the discrepant speech are not 
compensated for the negative outcomes in- 
curred or when they do not deserve to be 
misled. The results thus support the predic- 
tions derived from dissonance theory, while the 
differential prediction derived from response 
contagion theory is not upheld. 


*The present findings cannot be attributed to 4 
Process of self-selection, since the results remain sta- 
tistically significant when the data for all subjects 
are included in the analysis. This statement also 
holds for all other instances where data were deleted: 

* Planned ¢ tests are one-tailed, whereas the com- 
parisons reported for unpredicted effects are tw0- 
tailed. 


A 
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It could be argued, however, that the 
retaliation subjects (and to some extent com- 
pensation subjects who were deprived of an 
appreciated reward) did not evaluate the task 
more tively because of the very negative 
expel »s associated with incurring a severe 
but | erved blame. This could likely re- 
sult egative feelings toward the entire 
exper tal situation, including the task. 
This «:;ument becomes less convincing, how- 
ever, when one notes that the attraction index 
toward experimenter was not affected by 
the n > treatments, In addition, the re- 
sponst filler item asking the subjects to 
rate t eeableness (or disagreeableness) 
of the listing the sums during the task 
perforn did not reveal any such gener- 
alized ney for negative evaluation. This 
altern interpretation is therefore not 
very pl le 

More the striking similarity between 
the re \btained in the present study and 
those her recent experiment, designed 
indeper (Cooper, Zanna, & Goethals, 
1974), r adds to the validity of our 

, interpr In that study subjects changed 
i their at > in the direction of the discrep- 
ant sp only when the recipient of that 
speech convinced by the discrepant 
statements and liked by the role player. In 
the pre study, all subjects were success- 
ful in convincing their colleagues of the dis- 
crepant yn advocated, but only low- 
incentive subjects who liked their audience 


(M on attraction index = +34.65) displayed 
the dissonance-predicted cognitive changes. 
Retaliatic subjects who disliked their au- 
dience (M = —15) did not manifest any 
tendency to reduce dissonance. 


EXPERIMENT 2 


The purpose of the second experiment was 
to clarify further the plausibility of disso- 
nance theory and/or response contagion the- 
ory by means of studying the role of the 
Specific content of discrepant role playing in 
Producing subsequent changes in attitude re- 
Sponses, examining the cognitive consequences 
Of blocking off the most appropriate mode of 
oo reduction, and examining the roles 

Perceived choice and uncertain postbehav- 
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ioral consequences in producing dissonance 
arousal. 

Response contagion theory emphasizes the 
very specificity of the forced-compliance ef- 
fect. Only those postexperimental responses 
directly related to the specific content of the 
previously ongoing responses are predicted to 
be affected. According to recent dissonance 
theorizing, the production of aversive conse- 
quences rather than the overt profession of 
attitude-discrepant statements is said to be a 
source of dissonance arousal and its reduction 
through attitude change. Consequently, one 
would expect that the postcompliance changes 
in attitude responses depend upon the specific 
nature of the aversive consequences rather 
than upon the particular content of discrepant 
role playing. 

In order to test this hypothesis, the nature 
and importance of the aversive consequences 
were kept constant, but the content of dis- 
crepant role playing was systematically 
varied. Subjects were induced for a small 
monetary incentive to convince the confeder- 
ates to engage in the dull task for 50 minutes 
by telling them either that the task was 
pleasant and enjoyable (enjoyable condition), 
that it was worthwhile to perform the task 
for 50 minutes (worthwhile condition), or 
that it was a simple task (simple condition). 
All subjects were successful in convincing 
their colleagues. It was predicted from disso- 
nance theory that all subjects would come to 
believe that the task was somewhat enjoyable. 
Response contagion theory, however, predicts 
this effect only in the enjoyable condition. 

A second hypothesis derived from disso- 
nance theory is that when it is made difficult 
for subjects to come to believe that the task 
is rather enjoyable, they will tend to reduce 
dissonance by minimizing the degree of choice 
they had in deciding whether or not to accept 
the experimenter’s proposal and/or by re- 
ducing the attractiveness of the persons they 
have misled. Response contagion theory, of 
course, can neither predict nor account for 
postexperimental changes in responses that do 
not relate to previously ongoing responses. 
This hypothesis was tested by restating the 
dull and monotonous characteristics of the 
task after discrepant role playing with its 


_—s 
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attending consequences had been performed 
(restatement condition). 

To further check the plausibility of a dis- 
sonance interpretation of the previous find- 
ings, a no-choice condition was run. Since 
freedom of choice is necessary for the produc- 
tion of dissonance arousal, no-choice subjects 
were predicted not to evidence attitudinal 
change. It was also expected that they would 
tend to reduce their positive feelings toward 
the experimenter, since he forced them to 
mistreat some innocent colleagues. 

Two conditions were run to study the ef- 
fects of uncertain behavioral consequences. 
In one condition, the subjects knew for cer- 
tain that the confederates had to perform the 
dull task for the period of their final decision, 
but they did not receive feedback about the 
influence of their discrepant speech on the 
confederates’ choice of a performance period 

(uncertain influence condition). In the other 
condition, subjects perceived that they were 
highly successful in convincing the confeder- 
ates, but it was left unclear whether the con- 
federates would or would not have to perform 
the dull task afterwards (uncertain perform- 
ance condition). An additional condition was 
run in which the subjects were equally suc- 
cessful in convincing the audience but knew 
beforehand that the recipients of their ad- 
vice would not have to perform the dull task 
(no-aversive-consequence condition Ne 

It was predicted that both no-aversive- 
consequence and uncertain performance sub- 
jects would not manifest dissonance reduction 
through attitude change. In the no-aversive- 
consequence condition, there were no aversive 
consequences to produce dissonance arousal. 
In the uncertain performance condition, the 
subject’s knowledge that his discrepant speech 
would possibly not effect negative conse- 
sequences should reduce the magnitude of dis- 

sonance arousal. In addition, the subject 
likely could reduce dissonance more readily 
by convincing himself that the confederates 
would probably be dismissed from performing 
the dull task rather than by convincing him- 
self that the task was not dull and boring. The 
prediction for the uncertain influence condi- 
tion was less straightforward. Subjects had 
the status of expert when presenting the dis- 
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crepant statements, which could ce it dif: 
ficult for them to deny any pote ial influ- 
ence on their audience. Nevert! ss, they 
were expected to tend to minimize e poten: 
tial persuasive impact of their disc: pant ad- 
vice rather than to change their : itudinal 
cognitions to be more consistent w h overt 
behavior. 

A high-incentive and a baseline dition 
were included in the experimental ign in 
order to replicate the dissonance-) . licted, 


f in- 
luate 


inverse relationship between amoun 
ducements and opinion change and to í 
the cognitive changes produced. 


Method 


Subjects 


Subjects were 101 female high school sti lents 
(from 16 to 18 years old) who volunteered to par- 
ticipate in an experiment on “group decision makiag.” 


Procedure 
The basic procedure of the present stud: was 
identical to that employed in the first expe nent, 
except for the fact that the confederates’ role ; xying 
and some of the instructions had been pre. ously 
videotaped and were played back to the s bject. 
There was no face-to-face interaction betwee. sub- 
jects and the confederates. Subjects simply tched | 
the preliminary instructions in a separate roois. The 
group’s final decision and the subsequent tas per- 
formance were also seen (supposedly) over losed- 
circuit television, whereas subjects’ discrepant. advice 
was communicated to the group by means of an 
intercom system. The final questionnaire was ex- 
tended to check on subjects’ perception of the degree 
of choice they had in performing the discrepant act 


and on their feelings of personal responsibility for 
the consequences of their discrepant advice, ihat is, 
for the confederates’ final group decision and for s$ 
their performing the task for a given amount of time. 

The baseline and high-incentive conditions pro- 
ceeded as in the first experiment, while the ev joyable 
condition was a replication of the low-incentive 
condition. The worthwhile condition was identical to 
the enjoyable condition, but the words pleasant and 
enjoyable were omitted from discrepant role playing. 
The standardized text of the discrepant advice read: 
“Well, after all, it is a rather simple task about 
which there is not so much to tell. But I find that 
it is really worthwhile to perform the task long 
enough. So, I propose to perform the task for 50 
minutes.” In the simple condition, the advice read: 
“Well, I have performed the task now, but there is 
not so much to tell about it. It is a simple task, 
but I propose to perform the task long enough. S0, 
I propose to perform the task for 50 minutes.” 

All remaining conditions were identical to the 
enjoyable condition, except in the following respects: 
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In the ement condition Experimenter 2, when 
handins questionnaire for recording each sub- 
ject’s | impressions on the task said: “As you 
have performed the task, I will give you a 
questic » for indicating your first impressions on 
it. Pl respond as honestly and sincerely as 
possib! now very well that the task is very 
boring monotonous.” Since the researcher said 
that ¢ ¿ was very dull, it should have made it 
diffict he subject to convince himself that the 
task her enjoyable. The subject thus should 
tend duce dissonance by changing some other 
relev nitions. 

In choice condition, subjects had no choice 
but t »rm the discrepant act. Experimenter 1 
said i in need of advice that deviates 
from liminary group decision (using the same 
ratio! in the other conditions) and instructed 
the s to tell his colleagues that the task was 
enjoy { to propose them to perform the task 
for 5 eS. 

In rtain influence condition, one confeder- 
ate ri » each subject’s advice in a way that 
left ar whether or not he was likely to 
chan ecision on the task performance, saying: 
“Sine xpert states that the task is enjoyable 
and I could perhaps perform the task for 
a lo ie, but I actually intended to go to 
swim ' Immediately after this reply, a ques- 
tionn handed to the confederates, and shortly 
after o the subject. The item concerning the 
perc ence of the expert’s advice on the final 
groug ion, however, was reformulated in terms 
of ex i influence. Each subject was also asked 
to mal guess on the group’s final decision. The 
video in this condition ended as soon as the 
que: e was handed out. The dependent mea- 
sures v then assessed after the questionnaire had 
been completed. 

In the uncertain performance condition, before en- 


gaging 
that it 


n the discrepant act, subjects were informed 
as not certain whether the group would 
perform the task. Experimenter 1 explained 
participants were invited to his group 
decision experiment than were actually needed for 
Experimenter 2’s research on the task. Experi- 
menter 2 therefore made random decisions as to 
whether a particular group would or would not have 
to perform the task. However, there was an agree- 
ment between Experimenter 1 and Experimenter 2 
not to inform Experimenter 1 and the participants 
until the group decision experiment had been com- 
Pleted and all questionnaires had been filled out. 
Otherwise the group might be influenced in its 
choice of a performance period that would of course 
spoil Experimenter 1’s research data. After discrepant 
advice had been presented and the group had made 
4 final decision, subjects were asked to estimate the 
likelihood that the group would have to perform the 
task. They were also asked to guess whether the 
Poa aerates would or would not have to perform 
eee The videotape in this condition also ended 
e moment this questionnaire was handed out. 
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In the 2o-aversive-consequence condition, subjects 
were informed that their experimental partners 
would definitely not have to perform the task after- 
wards. When Experimenter 1 induced discrepant role 
playing, he explained that there were more subjects 
participating in his group decision research than were 
needed for Experimenter 2’s research. Therefore, it 
was decided beforehand that some groups, including 
the present one, would not have to perform the 
task. Experimenter 1 further added that the group 
would be informed of this state of affairs only after 
they had made a final decision, in order not to affect 
Experimenter 1’s data. The videotape also showed, 
after the final group decision had been made and 
the subsequent questionnaire had been completed, 
that Experimenter 1 informed the confederates that 
they indeed did not have to perform the task. 


Results 


Data from five subjects were not included 
in the final analysis. Three subjects refused 
to accept the money (two in the worthwhile 
condition and one in the restatement condi- 
tion). One subject (in the no-choice con- 
dition) reacted so emotionally to the conse- 
quence manipulation that she was debriefed 
before the dependent measures had been 
assessed. The data from another subject (in 
the uncertain performance condition) were 
disregarded, since Experimenter 1 entered the 
room of this subject while Experimenter 1 
was still shown on television to be present in 
the group room. 


Effectiveness of the Manipulations 


As in the first experiment, all subjects 
(uncertain influence subjects excluded) per- 
ceived their speech to have been effective in 
changing the confederates’ choice of a per- 
formance period. The manipulation of the 
choice variable proved to be effective as well, 
since no-choice subjects reported to feel less 
free to decline Experimenter 1’s proposal for 
discrepant behavior than enjoyable subjects 
(t= 5.69, p< .005). Experimenter 2 had 
also been successful in restating the dull char- 
acteristics of the task, since restatement sub- 
jects rated the task as even less enjoyable 
than baseline subjects. 


Cognitive Changes 

The results, presented in Table 2, confirm 
the predictions made from dissonance theory. 
Enjoyable (low-incentive) subjects rated the 
task as less boring than high-incentive sub- 
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TABLE 2 
i MEANS FOR POSTTEST VARIABLES, EXPERIMENT 2 
i Feelings of ı al 
A y responsi E 
} Enjoi EC Bitachyenoss See Rest F a 
p Group task audience experimenter choice decision per ance 
h High incentive 11.07 +37.57 +35.29 7 12, 1 
} Enjoyable 17.75" +36.9 +41.5 11.7 13.8 ! 
f Worthwhile 22.25 +37.7 +45.8 11.2 12.5 1 
‘ Simple 18.58 +34.3 +43.6 11 13.5 1 
t; Restatement 7.3 +22 +39 7.2 13.6 13 
4 No choice 8.4 +16.6 +22.55 43 11.2 12 
No aversive consequence 11.6 = +38.4 7.8 9 
Uncertain performance 12.33 +34.56 +39 7.22 13.89 14 
Uncertain influence 17.58 +31.7 +46.8 12.2 13 12.6 


jects (¢ = 2.0, p < .05). The mean scores on 
the enjoyability index in the enjoyable, worth- 
while, and simple conditions were signifi- 
cantly more positive than in the baseline con- 
dition (¢ = 3.0, p < .005; £= 4.03, p < .005; 
and ¢ = 2.79, p < .01, respectively). 

Restatement subjects reported more felt 

obligation in delivering the discrepant state- 
ments (¢ = 3.22, p < .005) and expressed less 
positive feelings toward the recipients of the 
discrepant speech (£= 2.79, p< .01) than 
subjects in the enjoyable condition. No-choice 
subjects rated the task as less enjoyable than 
enjoyable subjects (t = 3.86, p < .005), and 
they were less attracted toward the experi- 
menter (¢= 2.84, p < .01). Quite unexpect- 
edly, however, they also rated the victims of 
their discrepant statements as less attractive 
(t = 3.73, p < .005). The less positive scores 
from no-choice subjects might thus be attrib- 
uted to their generalized negative feelings 
toward the entire experimental situation as a 
consequence of having been forced to mis- 
treat their colleagues. 

When subjects knew for certain that the 
confederates would have to perform the task 
for the time period they had decided upon 
(enjoyable and uncertain influence condition), 
they rated the task as more enjoyable than 
baseline subjects (¢ = 3.0, p < .005; £ = 2.84, 
Ż < .02, respectively). However, when sub- 
jects knew for certain that the confederates 
would not have to perform the dull task 
(no-aversive-consequence condition) or when 
this negative outcome of their behavior was 


Nole, For all groups, n = 10, except for the high-incentive group (» = 7) and the uncertain performance group (1 = 9), H: her 
numbers represent more positive evaluations, more perceived choice, and higher feelings of personal responsibility. 
S ^ These means are significantly different from that of the baseline group (M = 10.1). 


left unclear (uncertain performance conci- 
tion), they did not tend to bring their atu- 
tude cognitions in line with overt behavior. 

As previously noted, uncertain influence 
subjects may have found it difficult to de: y 
that their speech would not bring about av>:- 
sive consequences, since they played the role 
of expert when presenting the advice to ‘he 
other participants. The mean expectation of 
uncertain influence subjects about the final 
group decision, in fact, was that the con!«d- 
erates would decide to perform the task for 
31.5 minutes. They also expected that their 
advice was likely to influence the final «!eci- 
sion rather strongly (M = 10.7 on a 17 point 
scale on which 1 refers to “extremely weakly” 
and 17 to “extremely strongly”). Uncertain 
performance subjects, on the other kand, 
strongly tended to believe that the confeder- 
ates would not have to perform the task 
(M = 7.1 on a 17-point scale with 1 referring 
to “certainly not to be performed” and 17 to 
“certainly to be performed”). In addition, 
when asked to guess whether or not the group 
would have to perform the task, only one out 
of nine subjects said yes. 

No significant differences were obtained on 
the importance, hardness, and the attraction 
indexes toward the experimenter (no-choice 
subjects excluded) and toward the audi- 
ence (no-choice and restatement subjects ex- 
cluded).* The results of the perceived choice 


i 4The data from the no-aversive-consequence sub- 
jects were not included in the analysis, since the con- 


Sand perceived responsibility measures are 
discussed 
Behavior (hanges 
As in the first experiment, discrepant role 
playing « t produce behavior change. No 
reliable í nces were found among the ex- 
periment ıd the baseline conditions. Simple 


P 


i 


4 


discrepa ole playing is clearly ineffec- 
tive in ing a role player’s subsequent 
attitud: l overt behavior. 
rENERAL DISCUSSION 
The d om the present studies strongly 
add to validity of recent dissonance 
theorizir wut the combined influence of 
perceive: ice and postbehavioral aversive 
consequi n producing attitude change. 
Discrepa ole playing led to changes in 
subsequi aluative responses only when 
the subje ad some choice in whether or 
not to p m the discrepant act and when 
the beh produced the aversive conse- 
quence using some innocent colleagues 
to spen h time on a dull task. When 
| the con ices were made less aversive, as 
when the recipients of the speech deserved to 
be misled or when the subjects could compen- 
Sate the nnocent victims, they did not 
manifest onance reduction, Also, when it 
was made difficult for subjects to come to 
believe in the position overtly espoused, they 
reduced the attractiveness of the persons they 
had misled and enhanced their felt obligations 
in performing the discrepant act. These find- 
ings, taken together, offer quite strong sup- 
port for the notion that postcompliance cog- 


j 


A 


nitive changes are instrumental in justifying 
one’s free commitment to a consequence- 
Producing, discrepant act. They are also con- 
Sistent with the body of research on post- 
aggression dissonance reduction (e.g Brock 
& Buss, 1964; Davis & Jones, 1960; Glass, 
ce cf. Walster, Berscheid, & Walster, 


federates’ behavior on this videotape was not com- 

ted to their behavior in the other conditions. It 

aol has been found, however, that the pres- 

ated absence of aversive consequences does not 

(Verh a role player’s evaluation of the audience 
erhaeghe, in press). 
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Response contagion theory, on the other 
hand, does not readily account for the present 
findings. Postcompliance changes in attitude 
responses did not necessarily relate to previ- 
ously ongoing responses, nor did they neces- 
sarily increase with increasing levels of arousal. 

Response contagion theory and dissonance 
theory differ, however, in terms of the condi- 
tions they consider optimal—and perhaps 
necessary—for the operation of the processes 
they respectively postulate. Response conta- 
gion theory emphasizes the noncognitive ef- 
fects of previously ongoing responses upon 
immediately subsequent attitude responses 
(under conditions of relatively high levels of 
arousal). For the model to have predictive 
power, it might be necessary first, that the 
prior responses are frequently emitted so as 
to become the most dominant ongoing re- 
sponses, and second, that the subsequent atti- 
tude response is emitted in close temporal 
contiguity with the prior responses. Nuttin’s 
(1975) data obviously met these criteria, and 
they do suggest that in such instances, re- 
sponse contagion theory provides a more 
plausible explanation than dissonance theory. 
Dissonance theory, on the other hand, deals 
with the cognitive consequences of discrepant 
behavior that has aversive implications. The 
theory most likely applies when the dis- 
crepant act inevitably produces important 
negative consequences and when the time 
delay between discrepant behavior and the 
postexperimental attitude measures allows the 
subjects to accomplish the cognitive work 
presumably necessary for dissonance arousal 
and its reduction. The present studies were 
especially designed to meet these latter condi- 
tions, and they unequivocally support disso- 
nance theory. Hence, whether a dissonance or 
a response contagion effect is found may very 
well depend on the extent of temporal conti- 
guity between discrepant role playing and the 
dependent measures and/or the relative domi- 
nance of the responses emitted during role 
playing . Further research is needed, how- 
ever, to clarify the sufficient and necessary 
conditions for each model to apply. 

The final discussion relates to the sig- 
nificance of perceived choice and personal 
responsibility. The combined influence of 
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perceived choice and foreseeable negative con- 
Sequences on dissonance arousal has led re- 
searchers to think of personal responsibility 
as a basic concept in dissonance theory. Only 
if one feels personally responsible for one’s 
behavior and its negative implications would 
dissonance be aroused (Collins & Hoyt, 1972; 
Cooper, 1971; Worchel & Brand, 1972). The 
exact meaning of the personal responsibility 
concept is rather unclear, however, and vari- 
ous levels of responsibility have been de- 
scribed (Fishbein & Azjen, 1973; Heider, 
1958). Similarly, the specific circumstances 
under which a person is willing to accept re- 
sponsibility for the consequences of his behay- 
ior are as yet not well understood. It is likely 
that a person takes responsibility for his 
actions under conditions of high choice and 
low external justification. But in addition, 
for a person to feel responsible for the con- 
Sequences of his behavior, it may also be 
necessary that the consequence be foreseeable 
at the time of his choice and that he intended 
to effect that consequence. The degree of 
perceived responsibility for behavioral conse- 
quences may further depend upon the impor- 
tance of the consequence, the effort invested 
in producing the consequence, the temporal 
contiguity between act and consequence, the 
degree of personal involvement with the conse- 
quence, as well as the tendency to attribute 
responsibility to an actor rather than to the 
environment or to deny responsibility for con- 
Sequences that threaten one’s self-esteem, 
Most theorizing and research, however, sug- 
gest that a person’s acceptance of responsibil- 
ity for the consequences of his discrepant be- 
havior is mainly a joint function of decisional 
freedom and foreseeability. 

Our second experiment provided some data 
that bear directly on this issue. Because the 
behavioral consequences were foreseeable, the 
attitude measures were expected to covary 
with perceived choice and perceived respon- 
sibility. As can be seen in Table 2, the degree 
of choice reported by subjects is directly 
related to their attitudes. However, their re- 
ported feelings of responsibility for the con- 
sequences of their discrepant advice are not 
related in any systematic way to perceived 
choice and attitude change. 
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The relationship between per: 1 choi 
and attitudes is remarkably clea’ ond quil 
impressive. Subjects’ evaluation o! e task i 
an almost linear function of the ount oi 
freedom reported. Subjects who ev) ced sige 
nificant attitude change reported to .ve be 
rather free in accepting the ex; venter’s 
proposal for discrepant behavior, Subject: 
whose evaluation of the task wa t dif- 
ferent from the evaluation of baselin: -ubjeci 
claimed to have been rather forced, ad sub- 
jects who rated the task as more bo: og than 
baseline subjects reported to have be: » rather 


strongly forced to give discrepant 
Only the data of restatement subjects lo not 
fit the perceived choice-attitude relat onship 
very well. These subjects, it should be remem- 
bered, received a persuasive message about 


the dull nature of the task, and thr also 
tended to derogate the audience as a: addi- 
tional means of dissonance reduction. Caken 
as a whole, this pattern of results ) vides 
strong support for the notion that disso- 
nance arousal and subsequent attitude hange 
is a direct function of the amo at of 
choice experienced in committing ones: { to a 


discrepant act. 


Feelings of personal responsibility ‘or be- 
havioral consequences, on the other h: id, are 
unrelated to perceived choice and . ttitude 


change. All but the no-aversive-con quence 
Subjects reported equally strong fe: lings of 
responsibility for the consequences of their 
advice. This suggests that perceived respon- 
sibility for the consequences of one’s discrep- 
ant behavior is not crucial in the process of 
dissonance arousal and attitude change, It 
also suggests that freedom of choice is not 
necessary for a subject to accept responsibil- 
ity for the consequences of his discrepant act 
when those consequences are foreseeable, im- 
portant, and immediate. This lack of a rela- 
tionship between perceived responsibility on 
the one hand and perceived choice and attitude 
change on the other may, however, be due t0 
the specificity of the interpretation subjects 
gave to the personal responsibility notion. 
The level of responsibility crucial to the pro- 
duction of dissonance arousal and attitude 
change may actually require freedom of 
choice and foreseeability, but our subjects 


; 
4 


; 


interpreted personal responsibility 
lent to “causation” of a foreseeable 


may ! 


as eq 
conse’ -rice, What Heider refers to as level 
MI ( ler, 1958, p. 113). The present data 
thus í >t be taken as clear evidence against 
the | 1al responsibility formulation of dis- 
sonar heory. They do indicate, however, 
that notion of responsibility for behay- 
ioral equences and its role in dissonance 


aroi in need of clarification if dissonance 


theo is to make any further meaningful 

pro: 
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Learned Helplessness: The Result of Uncontrollable 
Reinforcements or Uncontrollable Aversive Stimuli? 


James S. Benson and Kevin J. Kennelly 
North Texas State University 


To determine whether learned helplessness results from exposure to uncon- 
trollable reinforcements, as suggested by Seligman’s theory, or exposure to un- 
controllable aversive outcomes, 100 university students, both male and female, 
were each randomly assigned to one of four pretreatments involving discrimi- 
nation problems: (a) soluble, in which each participant was presented soluble 
problems and response-contingent “correct” and “incorrect” feedback; (b) in- 
soluble-aversive, in which each participant was presented insoluble problems 
and response-noncontingent correct and incorrect feedback (i.e. received the 
responses “correct” and “incorrect” randomly) ; (c) always correct, in which 
each participant was presented response-noncontingent correct feedback (ie, 
received the response “correct”) to all solution attempts; and (d) control, for 
which no pretreatment was given. The insoluble-aversive group performed 
signiñcantly worse and the soluble group significantly better than the control 
and always-correct groups on the anagrams solution test task. The latter two 
groups did not significantly differ, The results supported the hypotheses that 


controllable aversive outcomes prod: 


uce learned “competence,” and uncon- 


trollable aversive outcomes—not uncontrollable reinforcements—produce learned 


helplessness. 


Inescapable aversive stimulation in one 
situation has been shown to interfere with 
escape-avoidance learning in a new situation 
(Overmeier & Seligman, 1967). This inter- 
ference phenomenon, called learned helpless- 
ness, has been demonstrated in rats, cats, fish, 
dogs, monkeys, and men (Seligman, 1974, 
1975), In man, helplessness has been induced 
and observed not only in instrumental learn- 
ing situations but in cognitive tasks as well; 
also, helplessness induced in an instrumental 
situation has been found to transfer to a cog- 
nitive situation and vice versa (Hiroto & 
Seligman, 1975). Learned helplessness is thus 
a very general transfer phenomenon. 

Seeman and his associates (Maier, 1970; 


Seli 1974, 1975; Seligman & Maier, 
1967 “man, Maier, & Solomon, 1971) 
have h sized that helplessness develops 
because uncontrollability of reinforce- 
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ment. When an organism is exposed to in- 
escapable aversive stimulation, he learns | hat 


his responding and reinforcement are i de- 
pendent (reinforcement is uncontrollable). An 
future 


expectancy develops, therefore, that in fi 
situations responding and reinforcemen: will 
also be independent. This expectancy tht re- 
inforcement is uncontrollable interferes with 
the subsequent learning of response-reinforce- 
ment contingencies. 

Since reinforcement can be the onset of a 
positive stimulus as well as the offset of a 
negative stimulus, Seligman and Maier (1967) 
suggested: “Learning that one’s own respond- 
ing and reinforcement are independent might 
be expected to play a role in appetitive situa- 
tions, Further, might learned ‘helplessness’ 
transfer from aversive to appetitive situations 
or vice versa?” (p. 8). Seligman (1975) has 
recently stated this position in stronger terms: 
“T am claiming, then, that not only trauma 
occurring independently of response, but non- 
contingent positive events, can produce help- 
lessness . . .” (p. 98). In spite of Seligman’s 
claim, tests of the effects of response-rein- 
forcement independence (uncontrollability of 
reinforcement) have typically been con- 
ducted under conditions where reinforcement 
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is equaed with the offset or the avoidance of 
aversive sitmuli or where both positive and 
nega events are delivered noncontingently. 


Th esent study is an attempt to an- 


swer following question: Will noncontin- 
gent controllable) positive events produce 
leary ielplessness or must noncontingent 
(un lable) aversive events be present? 
Selig (1975) noted that responding and 
reini ent are independent if the proba- 
bili reinforcement granting the occur- 


response is equal to the probability 
of r: cement granting the nonoccurrence 


of t esponse. This condition can hold 
und tinuous (probability of reinforce- 
meni ; one whether or not the response 
is m r intermittent (probability of rein- 
forc equals less than one but remains 


the s vhether or not the response is made) 


rein nt conditions. 

y rmittent reinforcement schedules, 
hoy difficulty arises in answering the 
abo stion. Nonreward in a context of 
rewa duces frustration, an aversive event 
(Am J . Consequently, with an inter- 
mit hedule of rewards (positive events), 
whe probability of reward remains the 
sam ‘ther or not any particular response 
is n the probability of aversive stimula- 
tion istration due to nonreward) also re- 
mair jual whether or not a particular 
response is made. Thus, noncontingent (un- 
controllable) rewards delivered on an inter- 
mittent schedule means that noncontingent 
(uncontrollable) aversive stimulation (frus- 
tration) is also occurring. 


This confounding of the uncontrollability 
of rewards and aversive stimulation is not 
present, however, when the schedule of re- 
wards is continuous and noncontingent. When 
the probability of reward is equal to one 
whether or not any particular response does 
or does not occur, responding and reinforce- 
ment are independent (reinforcement is un- 
controllable), but aversive stimuli, such as 
frustration, do not occur. 

In the search for an experimental situation 
that could be used to determine whether 
uncontrollable positive events of themselves 
Produce learned helplessness, Hiroto and 
Seligman’s (1975) cognitive-cognitive study 
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methodology served as a guide. In that study 
it was assumed that insolubility in a cognitive 
discrimination task is formally analogous to 
inescapability in an instrumental task, since 
in both the probability of reinforcement (the 
response “correct” and aversive tone offset, 
respectively) is independent of responding. 
Hiroto and Seligman (1975) found that those 
subjects pretreated by exposure to insoluble 
Levine-type (1971) discrimination problems 
and who received random correct and incor- 
rect feedback (i.e., were given the responses 
“correct” or “incorrect”) during solution at- 
tempts and a statement at the end of each 
problem, “that’s the wrong answer,” per- 
formed worse on a subsequent anagrams test 
task than control subjects and those who had 
been exposed to soluble Levine-type discrimi- 
nation problems and response-contingent cor- 
rect and incorrect feedback. In short, those 
who failed in one learning task, because it 
was insoluble, had trouble with a subsequent 
cognitive task. Hiroto and Seligman’s study 
was not, however, an adequate design for 
determining whether uncontrollable positive 
events alone produce learned helplessness, 
since subjects in their insoluble condition 
were receiving response-noncontingent aver- 
sive stimuli (i.e., “incorrect” and “that’s the 
wrong answer”) as well as response-noncon- 
tingent positive stimuli (i.e., “correct”). 

In the present study, four groups, three of 
which with some modifications constitute a 
replication of the Hiroto and Seligman (1975) 
cognitive-cognitive study, were compared on 
an anagrams test task similar to that used 
in the Hiroto and Seligman study. A com- 
pletely new group was added to answer the 
experimental question posed by Seligman’s 
theory: Can uncontrollable positive events in 


the absence of uncontrollable aversive events 
produce learned helplessness? This group re- — 
ceived uncontrollable reinforcement in the 


form of continuous noncontingent positive 


feedback to all discrimination-problem solu- 


tion attempts. This always-correct group was 
pretreated with the same Levine-type dis- 
criminations as used in the Hiroto and 
Seligman study but received noncontingent 
correct feedback (i.e., the response “correct”) 
on each trial and a statement, “that’s the 


pes a 


= jects response. Such noncontingent positive 
' feedback and statements can in no way be 
considered aversive, but they do generate a 
situation in which responding and reinforce- 
ment are independent; that is, uncontrollable 
positive events occur for each subject in the 
always-correct group without uncontrollable 
aversive events. 

If Seligman’s contention that uncontrollable 
positive events produce learned helplessness is 
correct, the always-correct group as well as 
the group receiving response-independent cor- 
rect and incorrect feedback (Hiroto and Selig- 
man’s insoluble group) should both be inferior 
in performance to the control and soluble 
problem groups on the anagrams test task. 


METHOD 
Subjects 


Subjects were 38 male and 64 female undergradu- 
ate psychology students at North Texas State Uni- 
versity who were obtained through the department 
subject pool, which is composed of students in fresh- 
man and sophomore psychology courses who volun- 
teer to participate in research for extra course credit. 
Two subjects were excused from the experiment, one, 
a foreign student, because of inadequate knowledge 
of the English language and the other because ofa 
break in procedure by the experimenter. Both of 
these subjects were replaced with the next partici- 
pants who were scheduled to appear in the labora- 
tory. Final analyses were made on the data from 
100 subjects. 


Design 


Four groups were tested on an anagrams test task 
immediately after the following differential pretreat- 
ments: (a) The soluble problem group was pre- 
treated with five soluble Levine-type (1971) dis- 
crimination problems. Each participant received 
response-contingent correct and incorrect feedback 
during solution attempts, and at the end of each 
problem he/she received response-contingent feedback 
to his/her statement of what he/she thought was the 
correct value. (b) The insoluble~aversive group was 
pretreated with five insoluble Levine-type problems. 
Each participant received random correct and incor- 
rect feedback to his/her solution attempts and at the 
end of each problem received negative feedback to 
his/her statement of what he/she thought was the 

correct value. (c) The alw. correct group was pre- 
treated with the same five insoluble problems that 
were used for the insoluble-aversive group except 
that instead of the feedback used that group, mem- 
bers of the always-correct group ived continuous, 
noncontingent correct feedbac! 1g solution at- 
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he/she 


ntrol 


tempts and at the end of each problen 
feedback for his/her statement of wha 
thought was the correct value. (d) Th 
group was given no pretreatment. 

The above design differs from the Hii and 
Seligman (1975) cognitive-cognitive studi: 
following respects: (a) The soluble grou; 
present study was exposed to five, rather thi 
or four, soluble problems. Also, in the preser 
subjects were not dropped from this condit 
failure to learn one or more of the soluble pr 


hree 
udy, 
for 
ms. 


(b) The insoluble-aversive group of the ent 
study, comparable to the insoluble group © the 
Hiroto and Seligman study, was exposed to ive, 
rather than three or four, insoluble problem. (c) 
The Hiroto and Seligman studies did not h a 
group comparable to the always-correct group o ‘he 


present study. (d) The control group of the pre ont 
study was given no pretreatment prior to the anagr ns 
test task. The Hiroto and Seligman procedure x- 
posed the control subjects to the stimuli used for ne d 
discrimination problems but required no respo! se 
from them. The present study’s control procedi re 
was used, since it more closely parallels the animal 
learned helplessness studies where naive anim ls 
served as controls. 


Apparatus and Materials 


Pretreatment. Both the soluble and insoluble p: >- 
lems were composed of a series of four-dimensi al 
stimulus patterns similar to those used by Hi: cto 
and Seligman (1975) and in previous discriminat >n- 
learning studies (Levine, 1971). Each of the {our 
dimensions had two values: (a) The border was 
either a solid or a dashed line, (b) the size of | ter 
was either large or small, (c) the letter was ¢ her 
an X or T, and (d) the letter color was either x d or 


black. Each problem was made up of 10 cards, cach 
card having two stimulus patterns on it. The cords 
were presented one at a time on 20.2 X 12.7 cm 


white index cards contained in a three-ring binder. 
A soluble problem was one that had one valuc on 
one of the dimensions (e.g., large letter) consistently 
correct for the 10 card problem. An insoluble prob- 
lem had no consistently correct value. Any value 
stated by the subject was correct for an alw ays- 
correct problem, 

Test task. The test task was the same for all 
subjects, a series of 20 five-letter anagrams similar 
to those used by Hiroto and Seligman (1975). 
Examples are the following: (a) B L o £ N, (b) 
R N U TB, and (c) B O AR 1L. The letter order for 
all anagrams was the same, 5-3-1-2-4. Once this 
pattern had been discovered, solution of the remain- 
ing anagrams was rapid. These anagrams, composed 
of .48-cm letters, were presented one at a time on 
20.2 X 12.7 cm white index cards that were contained 
in a three-ring binder, 

For both the pretreatment and test task portions of 
the experiment, each research participant was seated 
at a table on which there was only the binder con- 
taining the problems to be presented. Immediately 
behind the table was a screen that hid the experi- 
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menter fror view of the participant but that 
did not pr the experimenter from seeing the 
participant ice communication between experi- 
menter ant vant. Instructions for both tasks 
were given neans of an Ampex cassette tape 
recorder. A > watch was used to time the 
responses € ırticipants. 
Proceduri 

In each four conditions of the experiment, 
25 subject pated on an individual basis. Sub- 
jects were i to conditions according to their 
order of a c in the laboratory. A randomized 
running sc! jetermined which condition was to 
be run at Each subject in the soluble, 
insoluble-« and always-correct conditions re- 
ceived th. ims test task immediately after 
having cor the pretreatment phase. Each sub- 
ject in th | condition received the anagrams 
test task first having experienced a pre- 
treatment 

In the tment phase, after the subject 
had been « ibly seated, the tape recorder was 
switched « uctions concerning the discrimi- 
nation pro were presented. These instructions 
were pattern osely after those used by Hiroto 
and Seli $). After the instructions, a five- 


trial samp! m was completed by the subject. 
Any quest oncerning how to respond to the 
stimulus p of the discrimination problems were 
answered time. On each trial, the subject was 
required to verbally which of the two stimu- 
lus pattern the card contained the correct value, 
the one or left or the one on the right. Ten 
Seconds afl subject had begun to look at a card, 
if he/she had not made a decision, he/she was in- 


formed th she had 5 sec in which to make a 
decision. Thos the maximum time allowed for a 
decision on trial was 15 sec. 

In the soluble problem condition, as each partici- 
pant indica hich side he/she concluded contained 
the correct v he/she was given contingent feed- 
back of “correct” or “incorrect” by the experimenter. 


At the end of each problem, the subject was asked, 
What value did you choose?” Upon answering, 
he/she was told whether he/she was correct or 
incorrect, each participant in the insoluble-aver- 
Sive group responded with the side he/she believed 
Contained the correct value, he/she was given the 
following randomized correct (C) and incorrect (1) 
feedback used by Hiroto and Seligman (1975): For 
the 10 trials of Problem 1, C-I-I-C-C-I-I-C-C; 
i Problem 2, I-C-I-C-C-I-C-I-C-I; for Problem 3, 
SIIRA -C-I; for Problem 4, C-I-I-C-C- 
ane! ; for Problem 5, I-C-I-C-C-I-C-I-C-1, At 
the end of each problem after the participant had 
one a statement of the value he/she had chosen, 
¢/she was told, “That’s the wrong answer.” As each 
ae in the always-correct group responded 
al the side he/she chose as containing the correct 
Ria he/she was always told, “correct.” To his/her 
Bae of the value believed to be correct at the 

of each problem, he/she was told, “That’s the 
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right answer.” The control group was not exposed 
to the discrimination problems of the pretreatment 
phase but to the test task only. 

Immediately after the pretreatment phase for all 
conditions except the control, or immediately upon 
entering the laboratory for the control condition, 
subjects were given the test task instructions. The 
binder containing the anagrams was placed on the 
table in front of the subject, the tape recorder was 
switched on, and the subject heard the instructions 
for the anagrams test task. These instructions were 
taken verbatim from Hiroto and Seligman’s (1975) 
study. 

Time began for each anagrams problem when the 
participant was told to turn the card. The solution 
time for each anagram was recorded after each 
problem and before the participant was told to turn 
the next card. If the solution of any anagram had 
not been completed within 90 sec, the participant 
was informed that he/she had 10 sec to make a 
word. If the problem had not been solved within 
100 sec, the solution time was recorded as 100 sec, 
and the participant was asked to turn to the next 
card. 

After the completion of the anagrams phase, par- 
ticipants in the soluble, insoluble-aversive, and 
always-correct conditions were asked to indicate 
his/her feelings during the pretreatment phase of 
the experiment by answering the following five-choice 
question: “My progress on the discrimination prob- 
lems was mainly due to (1) the difficulty of the 
problems; (2) the ease of the problems; (3) luck; 
(4) skill; (5) experimenter control.” The research 
participant was then debriefed and dismissed. 


RESULTS 


As in the Hiroto and Seligman (1975) 
study, three dependent measures were used to 
assess helplessness. These measures were as 
follows: (a) The first was mean trials to 
criterion, with criterion being defined as the 
solution of three consecutive anagrams in less 
than 15 sec each, after which no failures to 
solve occurred. When this criterion has been 
reached, it indicates that the subject has 
caught on to the pattern of letters that is the 
same for all anagrams. (b) Mean response 
latency for the 20 anagrams constituted the 
second measure. (c) The third measure was 
mean number of failures to solve, with failure 
to solve being defined as a solution time equal 
to 100 sec. 

As predicted by Seligman’s theory, the 
interference phenomenon was produced in the 
insoluble—aversive group, but contrary to this 
theory, interference was not produced in the 
always-correct feedback group. “Competence 
was induced in the soluble problem group, a 
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TABLE 1 trollable was produced in the always correct 
MEANS AND STANDARD DEVIATIONS feedback condition as well as in the insoluble- 
For THE Four MEASURES aversive condition. This was verifie’ by an 
TENE analysis of the responses to the five choice 
Measure Insoluble “Correct” Soluble Control -tribution question given, after the comp i 
‘Trials to criterion tion of the anagrams task, to the = uble, 
x 18.60 15.608 11.88 Loe insoluble-aversive, and always-correc: sub- 
Be 3 Sg Ga Sue jects. Answer 4 on the measure, “My progress 
arego on the discrimination problems was mainly 

4 A 42.62 ri eras h j; 
35 sett sae eet ae due to skill,” indicated the perception o- con- 
Failures to solve trollability (i.e. Progress was perceiv:d as 
x 4.20 4.80 3.30 5.40 determined by something internal to the sub- 
SD 2.50 4.30 2.90 4.06 ject) and was assigned a score value of 1. 
Attribution Answers 1, 2, 3, and 5, “My progress « the 
ae pus ae sh discrimination problems was mainly dè to 
‘ č j the difficulty of the problems,” “the es © of 


a For the two measures producing significant Fs, trials to 
criterion, and attribution, the means having a common super- 
script are not significantly different at the .05 level by Newman- 
Keuls procedure. 


finding compatible with Seligman’s theory but 

not obtained in the Hiroto and Seligman 

(1975) studies. 

The mean and standard deviations for the 
trials-to-criterion measure and all other de- 
pendent measures are presented in Table 1. 
Analysis of variance of the trials-to-criterion 
measure produced a significant effect, F 
(3,96) = 6.28, p < .01. Individual compari- 
son of pairs of means by the Newman-Keuls 
method indicated that all pairs of means ex- 
cept the pair, always correct and control, were 
significantly different (all ps < .05). Those in 
the soluble problem group learned the letter 
pattern for anagram solution, as indicated by 
the trials-to-criterion measure, more readily 
than those in any of the other groups; only 6 
of these subjects failed to learn the pattern. 
Those in the insoluble-aversive group learned 
less readily than those in any of the other 
oups; 19 of this group failed to learn the 

‘tern. Those in the always-correct feedback 
gv ap learned it no more or less readily than 
thos in the control group. 

Analysis of variance of the latency and 
failures-to-solye measures indicated no signifi- 
cant differences in the mean performance of 
the groups; for the latency measure, F (3, 96) 

2.56, P>05; for the failures-to-solve 
measure, FG96)= 1.66, p > .05. 

As expected, the perception that feedback 
on tic cliscrimination problems was uncon- 


if] 
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the problems,” “luck,” or “experimenter con- 
trol,” respectively, indicated the perception of 
uncontrollability (i.e., Progress was perc: ved 
as determined by something external tc the 
subject) and were assigned a score value of 2. 
The means and standard deviations base’! on 
these scores for the conditions expose: to 
the discrimination problems are present: d in 
Table 1. Analysis of variance indical.d a 


significant effect due to condition, F(2, 72) 
= 8.05, p<.01. An individual comp json 
of pairs of means by the Newman „euls 
method indicated that there was no signi icant 
difference between the insoluble-aversive and 
always-correct groups. These groups both per- 
ceived their progress as significantly more 
uncontrollable than the soluble group (Ps 


<.01). With Answers 1 and 2, “d 
of the problems” and “ease of the problems,” 
respectively, collapsed together as one cate- 
gory, a chi-square computed using the sj vecific 
attribution choices of the subjects, x°(3) 
= 5.20, p > .10, indicated no significant dif- 
ference in patterns of attribution between 
the insoluble-aversive and the always-correct 
groups. 

Since Hiroto and Seligman (1975) had 
dropped several subjects from their soluble 
problem condition because of failure to solve 
one or more of the soluble discrimination 
problems, it was expected that some soluble 
condition subjects would fail to solve all five 
discrimination problems in the present study. 
Such was the case: 10 subjects solved all five 
problems, 11 subjects solved four problems, 
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4 subjects solved 


ferences, hi 
- anticipated 


T to criterion 
` for those wt 


lems were 


d three problems, and 1 sub- 


two problems. These dif- 
did not bring about the 
nce in performance on the 
sk between those who solved 
) solved only part of the pre- 
The means on the trials 
sure for anagrams solution 
\ved all and those who solved 
soluble discrimination prob- 

d 12.73, respectively. A £ 
test indic at there was no signiñ- 
cant differ: these means, ¢(23) = 1.61, 
p> 05. 


ject solved y 


anagrams í 
all and tho 
treatment | 


only part © 


The findit t the insoluble—aversive con- 
dition subj veloped a functional blind- 
hess to the vulus pattern that all of the 
anagrams sh eplicates the outcome of the 
second Hir id Seligman (1975) cogni- 
tive-cogniti dy and supports Hiroto and 
Seligman’s c ision that insolubility, de- 
fined as respose-noncontingent correct and 


incorrect fec 
inescapabili t 
tion subject 

faster than cor 
a significant | 
Seligman co; 

quite compa 
difference in re 
greater numbc 
number of sol 


is formally analogous to 
e finding that soluble condi- 
nized the anagrams pattern 
i subjects was not found at 
n either of the Hiroto and 
cognitive studies but is 
vith Seligman’s theory. This 
ilt may be due simply to the 
subjects or the greater 
le problems used in the 
Present study her possible reason for this 
difference, the difference in the control condi- 
tion procedure, is unlikely to be responsible, 
Since the conirol condition results for the 
present study appear comparable to those 
reported by Hiroto and Seligman in their 
Studies, 
The failure to find significant effects for the 
other two dependent variables analyzed, re- 


_ SPonse latency and failures to solve, suggests 


es actors other than the experimental treat- 
ba contribute heavily to an individual’s 
sible nee on the anagrams test task. Pos- 
Of this < ors to control for in future research 
is ge pe are anagrams solving ability and 
tion ae ntrol. Based on the control condi- 

S, subjects in experiments like this 
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differ widely in anagrams-solving ability. A 
pretest with a series of unpatterned anagrams 
would allow for a check on the comparability 
of treatment groups with regard to this factor. 
Also, Hiroto (1974) has provided evidence 
that learned helplessness is analogous to 
Rotter’s (1966) concept of internal-external 
locus of control. A pretest of the subjects’ 
loci of control would enable experimenters to 
control for individual differences in predis- 
positions to helplessness, that is, externality, 
or to determine how these predispositions 
interact with experimental treatments. 
Insofar as learned helplessness is considered 
a transfer phenomenon (i.e., to be tested for 
in a different situation or task than the one 
in which it was induced)—and this seems to 
be a consistent feature of learned helplessness 
experiments—the failure to find any evidence 
that the always-correct condition subjects de- 
veloped a functional blindness to the pattern 
common to all of the anagrams is damaging 
to Seligman’s (1975) assertion that uncon- 
trollable positive events produce learned help- 
lessness. The response “correct” seems an 
obviously positive event in this situation and 
was delivered independent of responding in 
the present study’s always-correct condition 
procedure. Also, based on the subjects’ re- 
sponses to the attribution question given at 
the end of the anagrams task, both the 
always-correct and insoluble-aversive subjects 
perceived feedback to their responses in the 
pretreatment phase as uncontrollable, that is, 
determined by something external to them- 
selves. However, only the insoluble-aversive 
subjects acted helpless on the anagrams task. 
Of course, it is possible that the always- 
correct subjects had a different reason for 
making an external attribution concerning the 
discrimination problems than did the in- 
soluble-aversive subjects. For example, the 
insoluble-aversive subjects may have made the 
external attribution because they perceived 
response-outcome independence, whereas the 
always-correct subjects may have made the 
external attribution because they felt it pre- 
sumptuous to claim that the perfect perform- 
ance resulted from their own skill. It should 
be noted, however, that the pattern of external 
attributions to task (ease or difficulty of the 
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problems), luck, or experimenter control did 
not significantly differ across these two condi- 
tions. It is reasonable to assume that if the 
reasons for making an external attribution 
varied across these conditions, the pattern of 
attribution to task, luck, or experimenter 
control would also have varied. 

These results suggest that subjects acquire 
in both an always-correct and an insoluble- 
aversive procedure an expectancy or set that 
responding and reinforcement are independent 
(reinforcement is uncontrollable by them) in 
that particular task or situation. The transfer 
of this expectancy to another task, such as 
the anagrams task of the present study, how- 
ever, apparently requires that uncontrollable 
negative events (i.e., the responses “incorrect” 
and “that’s the wrong answer”) be present 
in the original task. 

In a recent study, Dweck (1975) used a 
“success-only” treatment with helpless chil- 
dren that bears a similarity to the always- 
correct condition of the present experiment. 
Although Dweck hypothesized that continu- 
ous success might increase the children’s 
persistence by raising their expectations of 
success whereas Seligman’s theory suggests 
that the continuous uncontrollable success 
might have increased the helplessness of the 
children, Dweck found no significant change 
in the pre- to posttreatment helplessness of 
the children, a finding consistent with the 
result of our always-correct procedure. 

Insofar as learned helplessness is analogous 
to Rotter’s (1966) concept of internal-exter- 
nal locus of reinforcement control (Hiroto, 
1974), the present results are also consistent 
with two studies in the locus of reinforcement 

control literature. In a study of the determi- 
nants of locus of control, Yates, Kennelly, 
and Cox (1975) found that internal college 
students, those who perceived personal con- 
trol over reinforcements, remembered their 
parents as being more contingent in their pun- 
ishments than did externals, those who per- 
ceived reinforcements as under the control of 
some agency external to themselves. No sig- 
nificant relationships between perceived con- 
tingency of parental rewards and locus of 
reinforcement control were found. Also, Ken- 
nelly and Kinley (1975) found that for sixth- 
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grade boys, perception of their teaclo:s as 
contingently punitive was positively ited 
to an internal locus of reinforcemen: con- 
trol and academic achievement, but per- 
ceived contingency of teacher rewards w: not 
related to either locus of control or ac: mic 
achievement. 

Previous research (e.g., Hiroto, 1974 the 
instrumental-instrumental and instrum: al- 


m, 


cognitive studies of Hiroto and Seli 
ble 


1975) has demonstrated that uncontro 
aversive stimuli without the concomitant | °S- 
ence of uncontrollable positive events ca: 10- 
duce helplessness. The present study’s alw“ ys- 
correct results suggest that uncontrol! 
positive events without the concomitant |) es- 
ence of uncontrollable aversive events do iot 
induce helplessness. It should be emphasi ed, 
however, that a condition in which un 
trollable positive events are delivered or 
intermittent schedule implies uncontroll. ole 
negative events (due to frustration). U: 
most circumstances, therefore, uncontro!! le 
rewards could indirectly contribute to :he 
development of a sense of helplessness. 


Whether the positive events, the resp. ses 
“correct” and “that’s the right answer,” : on- 
tribute anything of themselves to the hel}: ess- 
ness induced when uncontrollable positiv: and 
negative events are intermixed, as in the 
insoluble-aversive procedure of the pi sent 


study, is unknown. If they do, then the cur- 
rent insoluble-aversive procedure should pro- 
duce as much helplessness as an always- 
incorrect procedure involving a continuous 
schedule of negative feedback. If they do not 
and it is simply the uncontrollability of 
aversive events that induces helplessness, then 
an always-incorrect procedure should induce 
greater helplessness. 

Since those soluble condition subjects who 
solved all of the discrimination problems did 
not significantly differ from those who solved 
only part of the discrimination problems On 
the anagrams test task, future researchers 
using the current or similar paradigms should 
not drop subjects from the soluble problem 
condition for failure to solve all of the 
problems, Such a procedure is apparently un- 
necessary and violates the assumption of com- 
parability of subject groups across conditions. 
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Stability of Adult Personality Traits: Fact or Fable? 


K. Warner Schaie and Iris A. Parham 
University of Southern California 


Theoretical positions for 
traits are examined. Resu 
over a 7-year period and wi 
suggest that stability within ge! 
bined with many differences bet 


ject to transient secular trends. A typology 
to account for the different developmental trends observed for the various 


personality factors. 


One of the more curious developments in 
the history of the psychology of aging has 
been that most investigators have generally 
assumed the validity of an age decrement 
model for learning, the intellectual functions, 
and psychomotor behaviors, but have con- 
trariwise assumed that personality traits and 
attitudes remain stable over the adult life 
course. Over the past two decades the first 
author and his associates have worked to 
demolish the general consensus on a decre- 
ment model for the intellectual processes (cf. 
Baltes & Schaie, 1974; Schaie, 1973, 1974; 
Schaie & Gribbin, 1975). In the past, we 
were able to demonstrate that much of the 
apparent age decrement was an artifact of 
data collected from cross-sectional studies 
comparing cohorts with differing base levels 
of ability. 

Tt occurs to us now that the assumed stabil- 
ity of adult personality traits (e.g., Reichard, 
Livson, & Peterson, 1962) might be equally 
artifactual, involving fortuitous combinations 
of generational differences in the expression 
of personality traits and attitudes, with secu- 
lar trends and ontogenetic trends operating in 

opposing directions. Further, just as it has 
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and against a stability model for adult personality 
lts of a sequential study of 19 personality factors 
th participants ranging in age from 22 to 84 years 
nerations appears to be the rule, albeit com- 
ween successive population cohorts and sub- 


of 13 possible models is offered 


age 
rence 

and 
nined 
ation, 
s, as 
vhich 
t are 
ch, It 
arize 
| and 
ne the 
of 19 


been profitable to hypothesize differ« 
trends for crystallized and fluid intel 
abilities, there may well be personali 
attitudinal constructs that are overdete 
by species-specific ontogenetic transfor: 
which should show true age patte: 
opposed to culturally determined traits 
might be subject to secular change |i 
not affected by ontogenetic events as 

is the purpose of this paper to sum 
briefly some of the related theoreti 
methodological issues and then to exan 
results of sequential analyses of a se 
personality and attitudinal factors at we 
have recently abstracted from our long-term 
studies with participants ranging in 9° from 
22 to 84 years for the 7-year tim: period 
from 1963 to 1970. 


Theoretical Basis of the Stability Model 


To no one’s surprise, the origin of the 
stability model of adult personality may well 
be attributed to the neglect of adult develop: 
ment by personality theorists. That is, the 
early dominance of psychoanalytic thought 
among child psychologists led to the notion 
that basic personality traits are indeed estab- 
lished in the first few years of life and that 
only modifications of such early traits occult 
during adulthood. Individual differences here 
are attributed to fixations at earlier stag 
and actual reversals are seen as special cases 
on the regression mechanism. Similarly, life- 
span models that involve growth, maturity: 
and decline in cognitive functions neverthe- 
less do not postulate a decline in personality 
traits; consequently, they portray adulthoo 
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as a period during which earlier achieved be- 
havior ige becomes stabilized rather than 
being su sjected to further significant modifica- 
tion (K erg, 1969; Kuhlen, 1959), Other 
writers early life experiences as oriented 
toward ceveloping traits that may have im- 
portance ater life, but that are affected by 
biobeh: | events that may not be develop- 
mental ure. Such writers argue for indi- 
vidual « `, but they also present evidence 
for a co ity model of personality (Livson, 
1973; M l, 1971; Neugarten, 1966, 1969). 

Furt} ilysis of the stability model sug- 
gests th: re may actually be a diversity 
of geno! models that would account for 
the phe of adult stability. The first 
possibili iid be a strictly hereditarian 
one. It v iold simply that individual dif- 
ferences sonality are genetically deter- 
mined hus that certain traits may 
indeed si ystematic differences between 
Sexes, race other heritable classifications). 
Further, i s model were appropriate, we 
would ha eason to postulate either onto- 
genetic < or generation differences. A 
second ry would argue that there is a 
critical | l early in life that determines 
the formation of personality traits and/or 
attitudes that the pattern then formed 
will persi iroughout life. The first two 
models coi distinguished only if there 
were cultural changes in child-rearing prac- 
tices that ld result in the detection of 
cohort di s, provided no ontogenetic or 
transitory r trends prevailed. 

A third would postulate systematic 
generational changes in child-rearing practices 
whose i we might fail to detect in adult- 
hood bec of the presence of opposing 
Ontogenetic trends. For example, there might 


`y for people to become less ag- 
gressive with increasing age; this trend might 
Not be detected if changes in child-rearing 
Practices led to successive population cohorts 
Showing lower aggressiveness, A fourth model 
Would be quite compatible with social learning 
theory (Ahammer, 1973) in prescribing that 
Individual differences in personality traits at 
any age are specific responses to individual 
environmental programs. Such differences, 
When averaged over populations, should in 
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general lead to a finding of stability in the 
adult personality, since age differences would 
be detectable only for those traits or attitudes 
that are significantly affected by societal 
stereotypes regarding the age appropriateness 
of certain role expressions (cf. Neugarten & 
Datan, 1973). 

Only the first two alternatives above are 
true stability models; the third is merely 
phenotypically stable, and the fourth may or 
may not be a stability model, depending upon 
whether stimulus situations follow an onto- 
genetic sequence or occur without reference 
to the age status of the organism. Our own 
studies thus far seem to suggest, interestingly 
enough, that the fourth model (which is really 
a pseudostability model) may be the most 
plausible with respect to personality (cf. 
Schaie & Parham, 1974). But in any event 
it would be desirable to differentiate the alter- 
native assumptions for the stability model, 
and in particular to differentiate between 
intergenerational shifts and transitory secular 
trends that seem to affect members of all adult 
cohorts over limited periods of time. We will 
return to this matter later when we turn to 
appropriate research paradigms. 


Adult Differentiation Models 


Although most theoretical models seem to 
argue for stability of adult personality traits, 
there is at least some evidence from cross- 
sectional studies (Chown, 1968; Neugarten, 
1973: Schaie & Marquette, 1972) that adult 
age differences can be observed on many per- 
sonality variables. However, such differences 
may also be a function of a number of differ- 
ent combinations of developmental events. 
Certain dynamic models (cf. Erikson, 1964; 
Kohlberg, 1973), which imply that adult dif- 
ferentiation occurs as a function of successive 
transformations to more advanced stages, 
would also seem to require corollary changes 
in the structure of personality. None of these 
models, however, requires that such transfor- 
mations occur according to an ontogenetically 
determined timetable. Indeed, Erikson’s sche- 
matic of the eight stages of development, 
each of which requires successful mastery of 
the preceding stage, permits a total of 15 per- 
sonality types, but there is no necessary pre- 
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scription as to the incidence of each type at 
any given life age. 

Tt might be argued that any model of aging 
that involves the concept of stages or differ- 
entiation may imply an ontogenetic pattern, 
which places such stages, whether mandated 
by the biological matrix or the social context, 
at a specific chronological age. Thus disen- 
gagement (Cumming & Henry, 1961) may 
occur only in old age (whether for some or all 
individuals) or the onset of profound rigidity 
in response style may occur in midlife 
(Schaie, 1962). But, alternately, such phe- 
nomena may also be the direct consequence 
of societal expectations that may change at 
different historical points in time for all indi- 
viduals or may be transformed gradually as 
a function of successive cohort membership. 

In the latter case we might observe disen- 

gagement shift to different life stages: Have 

we not seen the “disengagement” of the 
young in the decade of the 1960s? We might 
also note shifts in the occurrence of peak 
levels on various traits and attitudes which at 
first glance appear as most curious if we try 
to force the data into a matrix requiring linear 
developmental phenomena that disregard the 
nonlinear transformations so characteristic of 
sociocultural change (e.g., Schaie & Parham, 
1974). 
It is the interaction of changes in the socio- 
cultural matrix with the biologically deter- 
mined attributes of the life course that is so 
puzzling both to the developmentally oriented 
student of individual differences and to the 
social psychologist who is impressed solely by 
changes in the social matrix. Neither stability 
nor a series of regular transformations (e.g., 
changes in level or quality) seems to fit the 
data at hand. As always in situations such as 
these, the truth would seem to lie somewhere 
in between, but where it lies precisely may be 
difficult to ascertain. 


Three Types of Personality Traits 


For the purpose of studying changes in 
traits and attitudes over the life span, it may 
be well to distinguish three different kinds of 
traits. In our further discussion we will refer 
to the three groups of variables identified 
below as biostable, acculturated, and bio- 
cultural traits. First there is a class of bio- 
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etically 
event, 


stable traits, behavior that may be g 
determined or constrained (or in < 
shaped) by environmental influer- es 
occur early in life, perhaps during critical 
imprinting period. The most note 
tribute of these traits is that they 

tematic sex differences at all age le «ls, but 
few if any age differences, whether amined 
by cross-sectional or longitudinal me 

second set of acculturated traits app: 
versely, to be overdetermined by ens 

tal events occurring at different stag 
individual’s life as well as being si 
rather rapid modification by soci 
change. Such traits generally show no ystem- 
atic sex differences and have puzz! ig age 
difference patterns that can frequent!) be re- 
solved into generational shifts and/or :ecular 
trend components by sequential analy. s. 

A much more infrequently occurring ‘ype is 
the biocultural trait, which in cross-s. ‘tional 
studies shows an Age X Sex interactic . This 
kind of variable would seem to refle: onto- 
genetic trends whose expression is  odified 
either by generational shifts or by socio- 
cultural events affecting all age levo- in a 
similar fashion. We first encountered nis tri- 
chotomy in a study in which we com; ired 50 
retired college graduates with a mea age of 
76 years with a group of graduate iudents 
matched for field of study on the | wards 
Preference Schedule (Schaie & rother, 
1968). There was no opportunity at that time 
to trace this observation further, bu: we are 
now convinced that an orderly description of 
the adult development of personality requires 


l 
notation of groups of variables with different 


3, COn- 
mmen- 


life courses. 


Some Methodological Issues 


It may be apparent from the above discus- 
sion that there is disagreement in theory and 
interpretation of facts. Much of this disagree- 
ment, we think, just as was true in the field 
of intellectual changes with age, may be at- 
tributed to the common mistake of using te 
sults gained from cross-sectional studies 4% 
evidence for or against the stability model of 
personality. Moreover, undue inferences a! 
often based on rather unique samples sub 
jected to single-cohort longitudinal study 
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(e.g, F- y, 1955; Livson, 1973). A detailed 
account ` ihe methodological issues that have 
recenti en set forth in detail elsewhere 
(Schai: 73, 1974, 1976; Schaie & Gribbin, 
1975) not be given here. Instead, we will 
simply indicate some of the design re- 
quiren that we think need to be adhered 
to in to obtain interpretable data and 
then iescribe the results of collecting 
data Í ng these prescriptions, 

Cor ility of data. If one is concerned 
with development over a long time 
period : the issue arises whether person- 
ality sı re remains constant. This matter 
can b led most elegantly by construct- 
ing se ` instruments for each differenti- 
able cı and then engaging in boundary 
studie ross-instrument factor matching. 
Cattell 5) has attempted such an ap- 
proach limited success with children, We 
doubt, ver, whether this procedure is 
realistic adults, although it should per- 
haps | upted by constructing different 
instrui verhaps one for each age decade. 
A mori ed approach would be to identify 
a ma iverse of self-report items or de- 
scrip! e factored separately for differ- 
ent agi | cohorts. Again, this is an under- 
taking hould be attempted, but remains 
to be done A third approach, and one that 


we i en, is to administer a manageable 


set o! illy selected items to a population 
stratij z age over the major portion of 
the adul: life span, and most importantly, 
to sampile each age level for at least two dif- 
ferent cohorts. The latter constraint is im- 
portani io minimize the impact of transient 


secular events in overdetermining individual 
differences variance. 

Conirol for experimental artifacts. One of 
the important concerns in personality investi- 
gations using self-report material is that com- 
pletion of a questionnaire may actually serve 
to formulate individuals’ attitudes about 
themselves and the behaviors to be indexed. 
Tt is consequently necessary in designing a 
developmental study of personality both to 
look at repeated measures taken from the 
Same individuals and to compare independent 
random samples taken from the same cohorts 
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over the time interval during which the re- 
peated measurement sample has been followed. 

Sequential designs. In order to unconfound 
the sources of variance that might be mis- 
taken for adult personality change (or stabil- 
ity), it will be necessary to assess two or more 
cohorts at a minimum of two measurement 
points (cross-sequential method). In addition, 
we will want to assess individuals at two or 
more age levels at a minimum of two mea- 
surement points (time-sequential method). 
The first scheme is most appropriate if we 
assume the validity of the stability model and 
therefore wish to differentiate between genera- 
tional trends and current sociocultural events. 
The second scheme must be used if we have 
reason fo believe that there are indeed age- 
related developmental trends for some of the 
variables under study. A reasonable proce- 
dure (cf. Schaie & Parham, 1974) is to begin 
with an application of the cross-sequential 
method to a repeated-measurement sample, to 
check resulting data by looking at a compara- 
ble independent random sample, and finally 
to apply the time-sequential method to the 
latter sample to see whether changes in F 
ratios require questioning of the stability 
assumption. 


MertHop 


The literature does not contain work that will 
satisfy the above design criteria with the exception 
of a study by Woodruff and Birren (1972) limited 
to a single cohort over a 26-year period. Our own 
studies of cognitive behavior, however, contained 
material that upon proper reanalysis could meet our 
requirements at least in substantial part and thus 
shed some light on the questions raised. Throughout 
our study we have used a 75-item questionnaire 
(Schaie & Parham, 1975) that contains 22 subtle and 
9 obvious rigidity items as well as 44 social respon- 
sibility items. As is true with any questionnaire, this 
pool of items was selected not because it represented 
pure measures of rigidity or social responsibility, but 
simply because the test items showed high saturation y 
on relevant scales. Obviously each item contains ad- 
ditional variance, and what we have done here is 
sort out this additional variance as a first stab at 
the sequential study of personality. Our basis of 
analysis consists of approximately 2,500 question- 
naires collected during 1963 and 1970 on subjects 
ranging in age from 21 to 84 years of age, approxi- 
mately equally distributed by sex and age. 

We began by factoring the intercorrelations among 
the 75 items by the conventional principal axes 
method. Examination of eigenvalues by Kaiser’s cri- 
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TABLE 1 


REPEATED MEASUREMENT Cross-SEQUENTIAL 
"ANALYSIS OF VARIANCE: SIGNIFICANT F RATIOS 


Factor Cohort Time Sex 
A + Afiectothymia 3.35** 4.13* 
D TE abiy 4.16 8.01 
E + Dominance 
G + Superego strength 2.89** 
H — Threctia 16.48** 
I + Premsia 
J + Coasthenia 3.38** 6.24* 
L + Protension 11.60**  5.99* ic 
M — Praxernia 4.07%"  5.26* 9.66 
O — Untroubled 
adequacy : 15.28" 
1 — Conservatism ol 
p temperament 9.34** si 
2 — Group dependency  2.40** 20.04 
® — Low self-sentiment — 2,70** 
lonesty 2,81** 20.99** 
Jnterest in science 3.72** 70.09** 
Flexibility 
Financial support 
of society 10.39** 
Humanitarian concern 3.34% A id 
Community involvement 6.48** 


Nole, Degrees of freedom: cohort (7, 394); time (1, 394); 
sex (1, 394). The Cohort X Time interaction was significant 
for flexibility (F = 2,08, p <.05), humanitarian concern 
(F = 2.82, t <.01), and community involvement (F = 2.64, 
b <.01), The triple interaction of Cohort X Sex X Time was 
significant for humanitarian concern at p <.01 (F = 2.16), 


p <05. 
“p <0 


terion and Cattell’s scree method led to the accept- 
ance of 19 non-error factors. These factors were then 
rotated by means of the oblimin procedure to maxi- 
mum simple structure. Careful analysis of item con- 
tent of the resulting factors and comparison with 
existing taxonomies (Cattell, 1957) suggested that we 
ible to identify personality scales (albeit of 

length and possibly lower reliability) that 


estimate 1- of the personality factors identified by the 
Sixteen Personality Factor Questionnaire (16 PF): 
affectothymia (A+), excitability (D+), dominance 
(E+), superevo (Gt), threctia (H—), premsia (I+), 
coasthen'n (|-|-), protension (L+), praxernia (M—), 
untroubled adequacy (O—), conservatism of tempera- 
ment (Qi—), group dependency (Q2—), and self- 
sentiment (( . In contrast to the balanced nature 


of the 16 PF scales, however, our scales typically 


represent only one end of the trait continuum de- 
scribed by a given fa (cf. Cattell, Eber, & 
Tatsuoka,,19'). In add , We have identified six 
attitudinal stes that we ‘led flexibility, honesty, 


financial support of society, humanitarian concern, 
community involvement, and interest in science. 

We next computed factor scores on all 19 scales for 
our samples tested for the first time in 1963 and 
1970 and for the repeated measurements obtained for 
the 1963 sample in 1970. We consequently have two 
cross-sectional samples of subjects arranged in eight 
7-year age intervals, the youngest group ranging in 
age from 22 to 28 and the oldest from 71 to 77 
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years. In addition we have short-term longitudinal 
data for eight cohorts over the 7-year period, the 
youngest cohort being followed from mean age 25 
to 32 and the oldest cohort from mean age o 81. 


For hypothesis testing purposes we combined these 
data into three analyses: a cross-sequential rs cated 
measurement analysis to test the stability model 
with respect to cohort differences, sex differens and 
sociocultural change (1 =410); a cross-sevential 
analysis with independent samples to test wether 
results obtained in the first analysis would be main- 
tained when the effects of retesting are eli inated 
(n=1,556); and a time-sequential analy with 
independent samples, conducted to test the « ump- 
tion of the stability model and to differentia! onto- 
genetic changes from sociocultural change (” = 10). 
RESULTS 

Repeated Measurement Cross-Sequenti: 
Analysis 

Cohort differences significant at or bevond 
the 1% level of confidence were foun’ for 


nine of the personality factors and four of 


the attitude factors. Successive cohorts were 
found to be more outgoing, less exc \ able, 
more coasthenic (internally restrained), with 
higher protension (jealous suspiciousness), 
greater superego strength, lower praxernia 


(practical down-to-earth mindedness), in- 
creased conservatism, increased grou), de- 
pendency, and increased self-sentimen!. Suc- 
cessive cohorts also reported themselves to be 
more honest, more interested in science, and 
to have more interest in community involve- 
ment and less interest in humanitarian con- 
cerns (see Table 1). 

Three personality factors and two attitudi- 
nal factors showed significant time trends 
from 1963 to 1970. There was a drop in 
excitability and protension, with an increase 
in praxernia. There was also a decline in 
positive interest in financial support of society 
and in universal political concern, 

Sex differences significant at or beyond the 
5% level of confidence occurred on six of 
the thirteen personality factors and on two of 
the six attitude factors. Men received higher 
scores than women on affectothymia (out- 
going participation), threctia (threat reactiv- 
ity), coasthenia, praxernia, untroubled ade- 
quacy, honesty, and interest in science. 
Women scored higher than men on group 
dependency. 


Cohort x Time of Measurement interactions 


occurred only for community involvement and 
humanitsrian concern, For the former there 
appeared to be little change over time for the 
older an! middle cohorts, with increased in- 
volveme by the youngest cohort. In the 
case of humanitarian concern there was a 
marked by the oldest cohort, with a lesser 
| decline by the middle and younger cohorts, 
A signi triple interaction suggested a 
particu! noteworthy lessening of humani- 
tarian «< n by the oldest female cohorts. 
These lts offer strong support for the 


stability lcl, since ontogenetic trends in 


the cros ential analysis, if any, should 
appear irious time-of-measurement ef- 
fects. Be there are unlikely to be onto- 
genetic t for the attitudinal factors the 
results f ich seem so clearly in tune with 
our exp ns for this historical epoch, we 
are left the possibility of ontogenetic 
trends fo itability, protension, and prax- 
ernia, By ‘re we address ourselves to a 
further ch we must next consider the issue 
of possib founding of results in the longi- 
tudinal > by the impact of repeated 
| Measuren vifecting attitude formation and 
Self-description of personality traits. The 
proper nue of inquiry here is the com- 
Parison of within-cohort changes based on 
independ unples. 
Independent Samples Cross-Sequential 
Analysis 
Cohort ences significant at or beyond 
the 1% level of confidence were replicated for 


all factors for which such differences appeared 
in the repeated measurement study. Succes- 
Sive cohorts were also found to be more group 
dependent and to express more flexible atti- 
tudes. Significant time-of-measurement effects 
“Were also replicated, and additional effects 
Were detected at the 1% level of confidence 
for an increase in dominance and a decrease 
M untroubled adequacy. (Note that sugges- 
tive effects significant at the 5% level further 
“bpeared for increased conservatism, group 
dependency, and flexibility, and an increase 
Mm honesty. ) 

All sex differences detected in the repeated 
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TABLE 2 


INDEPENDENT SAMPLES Cross-SEQuENTIAL ANALYSIS 
Ov VARIANCE: SIGNIFICANT F RATIOS 


Factor Cohort Time Sex 
A + Affectothymia 21.02** — 4.60* 
D + Excitability 13.21" 8.17"  646'* 
E + Dominance 8.44" 
G+ Superego strength 18.85** 
H — Threctia 7.35%" 468.24** 
I + Premsia 6.18** 
J + Coasthenia 3.94** 27.48" 
L + Protension 32.14** 4.70" 
M — Praxernia 749%" 12,78" 28,420» 
O — Untroubled 

adequacy 2.45* 6.40" — 37,06" 
Q1 — Conservatism of 

temperament 60.26%" — 3,03* 
Q2 — Group dependency 7,62" — 4.26% 19.81%" 
Q3 — Low self-sentiment 14.18 8.17  5.61* 
Honesty 7.23** 5,23" 48.54%" 
Interest in science 9.65** 155.05** 
Flexibility 10.87**  4,23* 
Financial support 

of science 6.78" 

Humanitarian concern 17.69**  9.04** 
Community involvement 22,71% 6.75% 


Note. Degrees of freedom: cohort Mi 1,524); time (1, 1,524); 
sex (1, 1,524). The Cohort X Time Interaction was significant 
for flexibility ( = 2.50, p <.05). The Cohort X Sex inter- 
action was significant for affectothymia and threctia (both 
Fs = 2.16, p < 05). 

*p <05. 

> <01. 


addition differences significant at or beyond 
the 1% level of confidence occurred, with 
women showing higher excitability and men 
greater community involvement, On the other 
hand, none of the interactions reported for 
the repeated measurement data could be 
replicated at the 1% level of confidence (see 
Table 2). 


Independent Samples Time-Sequential 
Analysis 


It was noted earlier that the component of of 
sociocultural change identified in the cross- 
sequential ANOVA, if our assumption of the x 
stability of personality through adulthood 
does not hold, could be confounded with 
maturational change occurring over the 1-year 
period under investigation. This question can 
be approached with our data base by re- 
organizing the data into an Age X Time of 
Measurement design and applying a time- 5 
sequential ANOVA, This alternative design a) 
differentiates age levels and sociocultural 
change while confounding cohort effects. 
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TABLE 3 


INDEPENDENT SAMPLES TIME-SEQUENTIAL ANALYSIS 
OF VARIANCE: SIGNIFICANT F RATIOS 


Factor Age Time Sex 
A + Affectothymia 17.15%% 19.749" 
D + Excitability 17.58"* 5.04" 
E -++ Dominance 11.29%* 
G + Superego strength —17.36** 
H — Threctia 8.21°* 469.78** 
I + Premsia 2.26 3.97* 4.75* 
-+ Coasthenia 3.81% 6.57% 28.217 
-+ Protension 22.81% 31.86* 
M — Praxernia 8.74% 14.13%"  26.95** 
O — Untroubled 
adequacy FATA 12.10%” 35.89% 
Q1 — Conservatism of 
temperament 54.10" 45.91** 
Q2 — Group dependency 6.83** 8.24** 20.17** 
Q3 — Low self-sentiment 13.53** 6.99**  5.21* 
Honesty 12.08** 53.99** 
Interest in science 7.39°* 4.63* = 147.45** 
Flexibility 10,82°* 12.44** 
Financial support 
ety 5.77* 
Humanitarian concern 14.11%° 
Community involvement 25.43** 27.12**  8.89** 


Note, Degrees of freedom: age (7, 1578); Time (1, dork 
sox (i, yee The Age X Time interaction was significant 
(p <.05) for interest in science, (F = 2.08), universal political 
concern (F = 2,06), and community involvement (P = 2.03). 
Age X Sex was significant for superego strength (P = 2.15, 
tr < 730 es Ae Sex X Time was significant for threctia 

F m 2.49, 

p< 

*D< OL 


It should be noted that the 7-year aging 
effect for the time period monitored would 
appear in the cross-sequential analysis in the 
main effect for time of measurement rather 
than in the cohort level effect, which covers 
a span of 56 years. By the same token, the 
generational confound in the time-sequential 
analysis would be located in the main effect 
for aging covering the 56-year span rather 
than in the 7-year time-of-measurement ef- 
fect. Consequently, those time-of-measurement 
effects that remain significant can now be 
attributed unambiguously to sociocultural 
change, while those that disappear must be 
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ikewise, 
uivalent 
‘ios. for 
sis will 
ion dif- 
F ratio 
above- 
rement 
to con- 
ion. In 


attributed to maturational factors 

F ratios for age levels that are 

or lower than the comparable 

cohorts in the cross-sequential an. 
still need to be interpreted as gene! 
ferences. In those instances where t 
is increased markedly and where (' 
mentioned changes in the time-of-me: 
effect have been noted, we will hay 
sider revision of our stability assur 


addition, where a time-of-measuren: : effect 
now appears, even though it did no\ how in 
the cross-sequential analysis, it mu: ve con- 


sidered a confound for cohort d 
rather than evidence for sociocultur: 
which would otherwise appear in both 


rences, 
hange, 
ralyses 


(see Table 3). 
The evidence from the time-sequent’ analy- 
sis is quite clear: Within the domai: of fac- 


tors identified in our study, we can \ 


fidence support the stability mode! for all 
variables, with the exception of ex ability 


and humanitarian concern, both of 

crease with age (Table 4 provides me 
by age group for these factors). In 
to the sociocultural trends apparent 


idition 
r indi- 


viduals at all ages already identiñe in the 
cross-sequential analysis, we now fo! addi- 
tional time trends significant at o beyond 
the 1% level of confidence for ` creased 


conservatism and group dependency 


Cohort Personality Profiles 


Our analysis also allows us to describe 
characteristic profiles for each of the eight 
cohorts studied (see Table 5). Cohort 1 
(mean birth year 1889), for example, was 
found to be the least outgoing, most excitable, 
highest in praxernia, and lowest in self- 
sentiment. Notably, this cohort is most willing 
to accept responsibility for the financial sup- 


BLE 4 


Age 
(22-28) (29-35) (36-42) (43-49) (50-56) (57-63) (64-70) (71-17) 
45.69 4723 4830 49.18 48.86 5107 S159 5286 
7 43.01 4896 5038 5049 5028 S113 3246 Sl 
Huma 49.85 4883 4748 48.87 5002 5085 3277 SKIT 
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TABLE 5 
MEAN Scores or PERSONALITY Factors BY Conorts 
C8 tors CO p s` nas ERE 
Factor 1938 1931 1924 ir 1910 1903 15% Si 
A + 48.79 48.63 48.45 49.84 52.24 54.15 "56.00 
i) 8 46.41 49.08 49.18 49.97 51.42 51.97 53.19 
E 4 49.81 49,51 50.07 50.10 49.96 49.58 48.67 
G 4 51.32 51.25 52.22 50.26 48.59 46.36 45.77 
H - 51.55 50.04 50,27 49.98 49.88 47.80 48.32 
I + 49.45 50.13 50.98 50.01 49,91 48,92 49.97 
J +i 50.42 51.08 49,37 49.49 48.97 48,06 
L + 51.41 51.80 49.47 47.57 46.67 44.22 
M- 50.16 48.81 49.78 50.21 51.49 52.24 
Oo — adequacy 49,34 49.00 50.24 50,69 51.14 50.48 
Qi — m of temperament 52.44 52.66 49.06 4641 44.39 42.22 
Q2 — ndency 50.69 50,69 49.41 48.14 48.06 47.37 
Q3 — if-sentiment 49.28 48.97 50.37 52.12 53.25 53.67 
Hone 50.89 51.06 49.71 49,71 47,89 49.50 
Intere ience 49.98 50.59 49,98 48.52 46.68 46.31 
Flexibi 51.04 51.08 49,19 47.45 47.80 49,92 
Financi ort of society 50.46 49.64 50,26 50.90 50.14 51.18 
Humar concern 48.68 47.04 50.24 51.98 53.66 55.31 
Commi nvolvement 50.98 51,96 49.50 47.79 46.41 45.86 
Note € rhort 8 wines mean birth year is 1938), and =i for C7. ch. 7 
port ot ‘ty and consistent with this shows (jealous suspiciousness). Cohort 6 resembles 
the hi; humanitarian concern, Cohorts 3 and 4 in being high on dominance. 
Coh (mean birth year 1896) is quite Another instance of continuity between the 
similar short 1 in terms of high praxernia cohorts involves Cohorts 5 and 6, who are 
and | lf-sentiment. Cohort 2 has its comparably high on premsia. Cohort 6 is 
highes in score on untroubled adequacy; comparably high with Cohorts 7 and 8 on 
that i ibers of this cohort perceive them- superego strength, in marked contrast to the 
Selves e more self-assured, placid, or low scores for Cohorts 1, 2, and 3 on this fac- 
compla tor. A similar pattern is found for coasthenia 
Coh (mean birth year 1903) was also (internal restraint). Wes 
high troubled adequacy (similar to Cohort 7 (mean birth year 1931) is high 
Cohort .) and next highest only to Cohort 1 on financial support of society and, again, 
in finavial support of society. Cohorts 3 resembles Cohorts 6 and 8 in being high on 
and 4 {birth year 1910) are quite similar in protension. The youngest cohort, Cohort 8 
their h scores on dominance. In fact, the (mean birth 1936) is highest on conservatism 
latter coliort has the highest mean score for of temperament, group dependency, flexibility, ¥ 
all groups sampled on the dominance factor. interest in science, honesty, and community 
Cohort 5 (mean birth year 1917) scored involvement. * he } 
highest on threctia (threat reactivity) and There is remarkable similarity and continu- 


Premsia (tendermindedness). This cohort also 
has a high interest in science, second only to 
the youngest cohort, Note that Cohorts 4 
and 5 fall into a special group in regard to 
the importance of certain personality factors, 
for example, tendermindedness, threat reac- 
tivity, and dominance, which may reflect the 
Pressures of middle age such as anticipated 
retirement, menopausal change, and reduced 
Physical vigor, 

Cohort 6 (mean birth year 1924) resembles 
Cohorts 7 and 8 in being high on protension 


ity between adjacent cohorts on these person- 


ality factors; nevertheless, a clear break may 
be noted between the three oldest cohorts and 
the younger cohorts on protension, superego, 
and coasthenia. The cohort groupings found 
for these personality factors are quite similar 


to groupings previously noted for intelligence 


$ 


data (Schaie & Labouvie-Vief, 1974). 
DISCUSSION 


The results of our study point to the tre- 
mendous complexity of the task of describing 
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Ficure 1, Untroubled adequacy. 


the adult life course of personality traits. We 
find it necessary therefore to reexamine the 
trifold model suggested earlier and to present 
a revision that will do more justice to the 
data. In terms of our earlier analysis it 
would seem that of the factors included in our 
sequential study, 9 are of the biostable, 6 or 
7 of the acculturated, and only 1 or 2 of 
the biocultural variety. It is clear, however, 
that the trifold typology is far too simple, 
since it was formulated on the basis of cross- 
sectional data and consequently glosses over 
the full complexity of developmental phe- 
nomena. On the basis of the present analysis 
it is possible to describe at least 13 subtypes, 
examples for 9 of which may be given from 
our own data. These subtypes are derived by 
identifying the possible combinations of the 
joint occurrence or absence of sex differences, 
ontogenetic change or cohort differences, and 
sociocultural change. 


Biostable Traits 


These are traits that because of genetic or 
early environmenta! influences show sex dif- 


ferences that are reliably maintained through- 
out the a life course. Although cross- 
sectional sti es may show age differences, 
these are d by sequential analysis to be 
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cohort dificrenc 

_ Four subtypes 1 

1, Stable sex 
would appear to 
genetic variance pri 
chromosome. The o: 


this model in our sti 


sociocultural change. 

‘dentified: 

ces only. Such traits 
overdetermined by 
y located on the sex 
trait qualifying for 
is that of premsia 


(tendermindedness). Even here we mus: vacil- 
late, since that factor was significant +t the 
5% level of confidence also for time am! age 
(see Table 3), suggesting that upon replica- 
tion it might well be assigned to the bio- 
cultural traits. Indeed, it does not seen too 
likely that we could find personality traits 
that would fit the purely inherite: trait 
category without ambiguity. 

2. Time-of-measurement differences 
These traits would also appear to be 
determined by genetic variance, but ar 
fied in their expression by transient 


nly, 
ver= 
yodi- 
cio- 


cultural changes. In our analysis this type 
was represented by untroubled ade: șacy, 
which showed a decrease over the peri’ we 
monitored (see Figure 1). 

3. Cohort differences only. These are ~ able 
traits not subject to transient environ -ntal 
influences, but ones in which differen! pat- 
terns in early training or socialization pro- 
duce significant generational differences 1t is 
likely that many traits developed via -arly 
socialization follow this pattern. In our «tudy 
those falling into this category were thre t re- 
activity, coasthenia, expressed honesty. nter- 
est in science, and community inyolv: ment 
(see Figure 2). 

4. Time-of-measurement and cohort difer- 
ences. Some age-related traits develop in re- 
sponse to early socialization and appear to 
be subject both to cohort differences in such 
socialization practices and to transient socio- 


cultural impact. In our study these include 
praxernia and group dependency, both of 
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Fıcure 2. Threctia. 
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Ficure 3. Praxernia. 
which showed a time-of-measurement effect 
in t ross-sequential and time-sequential 
anal is well as no noteworthy difference 
bety he two analyses in F ratios for age 
and t (see Figure 3). 
Accu ed Traits 
Ti raits show no sex differences, but 
cross- tional studies typically identify age 
diffi It is assumed here that these are 
either culturally prescribed and age-related 
patt or that upon longitudinal inquiry, 
susy age differences are identified as 
either cohort or transient secular trends. Six 
diff patterns are possible for this type 
of trai 
5. Age changes only. Such traits reflect 


Social roles that might be determined by uni- 
Versals underlying a stage model of develop- 
ment and that would be rather impermeable 
to cohort differences or secular trends. Identi- 


fication of such traits changing in the ap- 
Propriate direction would, incidentally, be 
required as support for disengagement theory. 
Tn our study the only such trait was humani- 
tarian concern, which to the probable chagrin 
of Supporters of disengagement theory showed 
Increase rather than decrease with age (see 
Figure 4). 

6. Age changes and cohort differences. Traits 
here would be mediated by stage changes in 
universally determined life role, which in turn 
are modified by shifts in early socialization 
Practices. Lacking the data for a cohort- 
Sequential analysis, we had to infer that our 
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Ficure 4, Humanitarian concern. 


data did not contain such traits. Examination 
of relative F ratios, however, suggests that 
affectothymia, protension, and low self-senti- 
ment could conceivably fit this pattern in- 
stead of the cohort-differences-only pattern to 
which the present study assigns them (see 
Subtype 8). 

7. Age changes and time-of-measurement 
differences. Such traits would also be medi- 
ated by universally determined life roles and 
would be impermeable to early socialization 
practices, but would be subject to transient 
influence at all ages. Frankly, it is difficult to 
imagine such a trait, and we did not identify 
any fitting this pattern in our study. 

8. Cohort differences only, These traits are 
not subject to ontogenetic change but instead 
are ones in which the specific pattern of early 
socialization might mediate the particular 
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Ficure 5. Superego. 
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Ficure 6. Dominance. 


for affectothymia, superego strength, proten- 
trait expression as an adult. This pattern held 
sion, and low self-sentiment (see Figure 5). 

9. Time-of-measurement differences only. 
Such traits are non-age-related, apparently in- 
dependent of early socialization practices, but 
subject to transient modification by socio- 
cultural change at all ages. Dominance and 
financial support of society fit this pattern, 
with the time-of-measurement effect remain- 
ing significant in both the cross-sequential 
and time-sequential analyses (see Figure 6). 

10. Cohort differences and time-of-measure- 
ment differences. Non-age-related traits that 
are modified by specific patterns of early 
socialization as well as by transient socio- 
cultural change affecting all ages belong in 
this subtype. Note that the F ratios for cohort 
in the cohort-sequential analysis show little 
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difference from the F ratios for age » the 
time-sequential analysis; there is also -ù in- 
crease in the time-of-measurement efi t in 


the time-sequential analysis. Our study den- 
tified the flexibility factor as falling in this 
group (see Figure 7). 
Biocultural Traits 

These traits, which appear to be over «ter- 
mined by genetic variance and conseq: tly 
show significant sex differences, are also sub- 
ject to modification in their expressio: be- 


cause of universally experienced life age 
expectancies. Moreover, some of these ` aits 
may be affected by early socialization e> eri- 
ences or sociocultural change. Three su! 
are possible: 

11. Age changes only. These traits ve 
clear ontogenetic “programs” and appe: to 
be impermeable to cohort differences or © i0- 
cultural change affecting all ages. Our aly 


example of this type was excitability, v ich 
increased with age, with the time-of-mea. re- 
ment effect appearing in the cross-sequ:. ‘ial 


analysis (see Figure 8). Although proba. y a 
rare type, a wider sampling of person- lity 
traits would no doubt identify several overs 
(e.g., surgency). 

12. Age changes and cohort differc:ces. 


Modification of cohort level through curly 
experience would here be required. Alt! ough 
our design makes it difficult to detect such 


traits (lacking a cohort-sequential analysis), 
it is unlikely that such traits exist. 


62 
to 
s 
s 
sa 
ue read | 
s S 
8 6 a 
a 
E s 
“6 lea 
§ y 
“4 
3 ee mws 
2 
O---0 Females 
40 
28 
T T 7 T T T ees! 
at Aare a o r N 
Mean Age 


Ficure 8. Excitability. 
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13. Ag: changes and time-of-measurement 
diferenc:. Here we would expect modifica- 
tion of ti- ontogenetic “program” for all ages 
due to iñc environmental interventions, 
We did identify any such trait at the 
1% lev: confidence, but we could have so 
classifie msia had we accepted the 5% 
level of dence. 

CONCLUSIONS 

The odological approach used here, 
that is, ‘pplication of sequential designs, 
has pre ly been successfully employed in 
assessin ‘llectual changes as a function of 
age. In intellectual domain (cf. Schaie, 
1974) i been demonstrated that the uni- 
versal ment model for cognitive pro- 
cesses i the most useful. The decrement 
model h ven supported primarily by arti- 
factual from cross-sectional studies that 
had c d cohorts with differing base 
levels ility. The present paper now 
demons the utility of applying the same 
method to the area of personality trait 
develo; with age. We have shown that 
stabilit personality traits is the rule rather 
than t} eption, but that such stability, 
nevert! cannot be attributed to lack of 
change #‘ier adolescence, as most traditional 
person: ‘heorists might believe. Rather, in 
the 19 onality traits studied here much 
change was observed. The change, however, 


is a function of specific early socialization 
experiences, commonly shared generation- 
Specific environmental impact, and particular 
socioculiural transitions that may affect indi- 
viduals at all ages. For those variables that 
are overdetermined by biological constraints 
or early socialization there seems indeed to be 
life-long stability, It is only the unusual trait 
of excitability (biocultural) that shows clear 
Ontogenetic “programs” and appears to be 
Unaffected by cohort differences or socio- 
cultural change. This trait exhibited increment 
With age. 

The conceptualization suggested here allows 
4 more sophisticated examination of the de- 
velopment of personality traits, Within the 
three major types of personality trait that 
Were suggested on the basis of cross-sectional 
Studies of personality (biostable, acculturated, 
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and biocultural), further breakdowns into 
subtypes of personality traits are now pos- 
sible. These subtypes have been derived by 
identifying all possible combinations of the 
joint occurrence or absence of sex differences, 
ontogenetic change, cohort differences, and 
sociocultural change, Thus, for example, the 
trait of expressed honesty can now be iden- 
tified as a biocultural trait that is not subject 
to transient environmental influences, but is 
affected by cohort differences produced by 
different patterns of early training or sociali- 
zation. Determination of the specific develop- 
mental status of a given personality trait can 
move our understanding beyond the level of 
simple categorization. 

It may then become possible to conduct 
more precise analyses on the developmental 
course of different types of personality traits 
as well as to determine the points, whether 
in childhood or adulthood, where intervention 
may or may not be practicable. 
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| Some Perceptual Determinants of Perceived Similarity, 
Liking, and Correctness 


Virginia Hensley and Shella Duval 
University of Southern California 


The present study extends several principles of adaptation level theory to the 
perception of social others. In preparation for a group discussion, subjects 
were informed of the opinion positions of other participants in the experiment 
with regard to two specific issues, These opinion positions were represented by 
two distinct groups, The positions of one group, Group S, were moderately 
similar to the subject’s opinions and remained constant across conditions. The 
extent to which the opinions of a second group, Group O, differed from the 
subject’s opinions was varied across five levels. Measures of the subject’s per- 
ceived similarity between self and other participants were obtained. As pre- 
dicted by adaptation level theory, increasing dissimilarity between the opinions 
of the subject and Group O (a) increased perceived similarity between the 
subject’s opinions and those of Group S, (b) decreased perceived dissimilarity 
among the opinion positions of Group S, and (c) decreased perceived dis- 
similarity among the opinion positions of Group O, Further, measures of the 
subject’s liking for Groups S and O and perceived correctness of the opinions 
of Groups S and O were taken to test a set of predictions derived from the 
postulated role of perceived similarity in mediating attraction and correctness. 
As predicted, we found that as dissimilarity between the opinion positions of 
the subject and Group O increased, (a) liking for members of Group S in- 
creased, (b) liking for members of Group O decreased, (c) the extent to which 
the subject perceived his own opinions as correct increased, (d) perceived 
correctness of the opinions of members of Group S increased, and (e) perceived 
correctness of the opinions of Group O decreased. These patterns of data were 
obtained using both scale and behavioral measures of the subject’s perceptions 
and attitudes, 


test stimuli increases, the extremity of evalu- 
ation of the stimulus will also increase. Stim- 


The role of contextual stimuli in deter- 
mining a person’s perception of a specific 


stimulus object is defined particularly well in 
the psychophysical literature (e.g., Appley, 
1970; Koch, 1959). Helson (1948, 1959, 
1964, 1970) interprets these effects in terms 
of a theory of adaptation level. This theory 
postulates that properties of stimuli are eval- 
uated relative to the prevailing adaptation 
ievel (AL), or region of stimulation, that 
elicits a neutral or indifferent response. Stim- 
uli equal in intensity to the AL will receive 
a moderate, or neutral, evaluation from an 
organism. As distance between the AL and 


The authors accept equal responsibility for the 
contents of this paper and would like to thank 
Norman Cliff, Jerry Jellison, and Norman Miller 
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preparation of the final manuscript. 

Requests for reprints should be sent to Virginia 
Hensley or Shelley Duval, Department of Psychol- 
ogy, University of Southern California, University 
Park, Los Angeles, California 90007. 


uli above AL will receive an evaluation 
opposite that assigned to stimuli below AL. 
The AL operant at any moment is deter- 
mined by the simultaneous and successive 
pooling of values of residual stimuli (i.e., past 
stimulation), focal stimuli (i.e., the stimuli 
presently being judged), and all stimuli 
present at the time of the judgment which 
serve as context or background for the focal 
stimuli. Conceptualized in this manner, the 
prevailing AL for any stimulus dimension, 
and, consequently, the evaluation of a stimu- 
lus on that dimension, will vary according to 
the particular values of the residual, focal, 
and background stimuli. ; 
With the AL determined by the interaction 
of these three broad classes of stimulation, 
effects of context on the evaluation of focal 
stimuli can be interpreted as the result of 
background-induced shifts in AL. The pres- 
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ence of background stimuli that differ from 
focal stimuli in intensity produces a shift 
in AL in the direction of the background 
value. If the value of the background stimuli 
is more intense or extreme than that of the 
focal stimuli (e.g., heavier), evaluation of the 
focal stimuli will be shifted in the direction 
opposite to the background values (¢.g., 
lighter). This constitutes a contrast shift. 

The magnitude of the shift in evaluation 
of the focal stimuli is a function of the degree 
of the difference between the background and 
focal stimuli. However, shifts in AL toward 
the value of the background stimuli are pos- 
tulated to be of lesser magnitude than the 
objective increments in the difference between 
the background and focal stimuli (Helson, 
1948). Therefore, with increasing objective 
differences between the background and focal 
stimuli, the magnitude of concomitant con- 
trast shifts in evaluation of the focal stimuli 
will be curvilinear in form. 

In addition to explaining contrast effects 
produced by dissimilar background stimuli, 
AL theory predicts that these shifts in evalu- 
ation of focal stimuli will be accompanied by 
decreased discrimination among members of 
the focal series, This prediction is based on 
the assumption that the “zone of maximal 
discrimination” for any stimulus dimension 
is centered at the prevailing AL. When the 
AL is shifted away from focal stimuli by the 
presence of a dissimilar background, the dis- 
tance between the focal stimuli and this 
zone of maximal discrimination also increases 
(Helson, 1948, 1964). Hence, the degree of 
discrimination among focal stimuli is reduced 
as the difference between the value of these 
stimuli and the AL increases. In addition, 
since shifts in AL toward background stimuli 
are curvilinear in nature, we can predict that 
movements of background away from focal 
stimuli will also be accompanied by decreased 
discrimination among the background stimuli. 

Although there is no evidence regarding 
this effect on discrimination among back- 
ground stimuli, previous research (Dinner- 
stein, 1958, 1965; McGarvey, 1942-1943; 
Rogers, 1941) has demonstrated that dis- 
crimination among focal stimuli decreases 
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following presentation of a dissimi!ar back- 
ground stimulus. 

Principles of AL theory have been applied 
to judgments in the social as wel! as the 
psychophysical field. Most relevant o the 
present research is a study by Maso: > and 
Graves (1973). After exposure to a inger 
who expressed attitudes on politica!  cliefs 
scales which were similar to the jects? 
attitudes at either the 90%, 50%, 0 10% 
level, the subject was asked to rate ec- 
ond, 50% similar stranger (neutral © <et) 
on perceived similarity and attraction © les. 
Results show an inverse relation betwee: oer- 
ceived similarity of the neutral target p -on 
and the degree of attitude similarity bt n 
the subject and the first stranger. The ad: a- 
tion level prediction of “perceptual con 
is clearly applicable to these results, 

The present study, involving a furthe v- 
plication and extension of the principle vf 
AL theory to social perception, differs © n 


Mascaro and Graves (1973) in several 
nificant respects. First, Mascaro and Gr s 
used the method of single stimuli presen 
successively (successive pooling). Hel 
(1959) has also postulated that the valuc 
AL is determined by the interaction of 
stimuli confronting the organism at a giv: 
moment (simultaneous pooling). This impli 
that AL theory is applicable to a social situ 
tion in which the subject is exposed to muli 
ple social stimuli at the same moment in tin 
(i.e., a group interaction context). To det: 
mine whether AL theory is applicable to thi- 
situation, the stimuli designated as focal an 
background are presented simultaneously an: 
are made up of two groups of item values 
Opinion positions of seven individuals, re 
ferred to as the subject’s group or Group S, 
constitute the focal stimuli. The background 
stimuli are referred to as the other’s group, or 
Group O, and consist of the opinion positions 
of two additional persons. 

Second, to determine whether perception 
of the focal stimulus is shifted to the direc- 
tion opposite to the value of the background 
stimulus, Mascaro and Graves represented 
the background stimulus as either less simi- 
lar, equally similar, or more similar to the 
subject than was the focal stimulus. AL the- 
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ory aiso predicts that when the direction of 
i-background difference is held constant, 
itude of the contrast shift increases 
2 ihe magnitude of the background-focal dif- 
îi- cence increases, To provide a test of this 
> -»othesis, the opinion positions of Group S 
r nain constant across conditions, whereas 
t ose of Group O are systematically varied 
ugh five levels of increasing dissimilarity 
the subject’s opinions. 

Finally, as previously discussed, AL theory 
vostulates that the relative value of back- 
ound stimuli will affect perception of both 
ocal and background stimuli. Mascaro and 
Graves (1973) and most research derived 
from AL theory has concentrated on verifying 
only one of these two hypotheses. To obtain 
data relevant to both hypotheses, subjects in 
the present experiment are asked to make 
judgments regarding both focal and back- 
ground stimuli, This procedure is a departure 
from the conventional method in which ex- 
plicit judgments of only one of the two sets 
of stimuli (i.e., focal stimuli) are required, 
but it does not require modifications in the 
predictions derivative from the principles of 
AL theory." 

Given the parameters of the experimental 
design, the following predictions concerning 
perceived similarity can be derived from AL 
theory, As the degree of dissimilarity between 
the subject’s opinions and the opinions of 
Group O increases, (a) the degree to which 
the subject perceives the opinions of members 
of Group § as similar to his own should in- 
crease, (b) the subject’s discrimination among 
the opinion positions of members of Group S 
should decrease, thereby increasing perceived 
similarity among members of Group S, and 
(c) the subject’s discrimination between the 
opinion positions of members of Group O 
should decrease, thereby increasing perceived 
similarity within this group. 

Experimentation has found contrast shifts 
using (a) attitude-attitude comparisons (Ager 
& Dawes, 1965), (b) a “detailed” scale 
(Campbell, Hunt, & Lewis, 1959), (c) abso- 
lute numerical judgments (Helson, 1970), 
(d) extra-experimentally anchored reference 
points (Manis, 1967), (e) an unlimited re- 
sponse language (Upshaw, 1969), as well as 
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(f) equal-interval scales. These data indicate 
that the contrast shift is not a scale artifact 
(Upshaw, 1968, 1969) and suggest that 
contrast shifts in the judgment of focal 
stimuli represent a reliable change in per- 
ception of the focal stimuli rather than a 
simple shift in the language or categories 
used to describe those stimuli. In a further 
series of experiments, Dawes, Singer, and 
Lemons (1972) devised methods (other than 
degree of rank-order correlation) to compare 
the predictive validity of the category usage 
versus the perceptual shift interpretation of 
contrast shift. Their results indicate that the 
subject’s own attitudes produced contrast 
shifts in the comparative rating of the extrem- 
ity of pairs of attitude statements, irrespective 
of use of categories, and in the spontaneous 
generation of statements describing the atti- 
tudes supposedly held by disagreeing others. 
These results clearly favor the perceptual 
shift hypothesis and support Helson’s (1964) 
contention that the contrast phenomenon re- 
flects an actual “re-structuration” of the per- 
ceptual field. Given this, background shifts in 
the perceived similarity of focal and back- 
ground stimuli should be accompanied by 
changes in variables which are causally re- 
lated to perceived similarity. Based on the 
data which indicate the presence of a causal 
relationship between perceived similarity and 
attraction (eg., Byrne, 1961; Jones & 
Daugherty, 1959; Schachter, 1951; Smith, 
1957), Mascaro and Graves (1973) found 
that context-produced shifts in perceived simi- 
larity of the neutral-target person were, in 
fact, accompanied by shifts in liking for the 
neutral-target person. These results, coupled 
with the present extensions of AL theory to 
multiple stimuli simultaneously presented, 
imply the following hypothesis. As the degree 
of dissimilarity between the subject’s opinions 
and the opinions of Group O increases, liking 
for members of Group S should increase, 


1 Requiring subjects to make judgments of both 
groups means that Group S and Group 0 alternate 
in the role of background and focal stimuli, How- 
ever, Group S is referred to as “focal” and Group o 
as “background” because the values of Group S do 
not to change over conditions, whereas the values of 
Group O vary according to condition, 
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whereas liking for members of Group O 
should decrease. 

Social comparison theory (Festinger, 1954) 
suggests that perceived similarity between self 
and others is causally related to perceived 
correctness of self. To the extent that a per- 
son perceives others’ opinions as similar to 
his own, he will evaluate his opinion as cor- 
rect (Festinger, Gerard, Hymovitch, Kelley, 
& Raven, 1952). Combining social compari- 
son theory and AL theory, we can make the 
following prediction. As dissimilarity between 
the subject’s opinions and the opinions of 
Group O increases, the extent to which the 
subject perceives his own opinion to be cor- 
rect should increase. Also, it seems reasonable 
to assume that the subject’s degree of per- 
ceived correctness in his own opinions will 
generalize to the opinions of others, with the 
extent of generalization positively related to 
the degree of perceived similarity between the 
subject’s opinions and the opinions of others. 
This assumption would yield the following 
predictions. As the degree of dissimilarity be- 
tween the subject’s opinions and the opinions 
of Group O increases, the perceived correct- 
ness of the opinions of members of Group S 
should increase, while the perceived cor- 
rectness of the opinions of Group O should 
decrease. 

The present experiment, conducted in the 
context of a group discussion, afforded an 
added opportunity to measure the subject’s 
tendency to direct communication to members 
of either Group S or Group O. For example, 
data from Festinger et al. (1952) indicate 
that group members who held a similar atti- 
tude and were in a majority position tended 
to direct communication to the deviant mem- 
bers of the group rather than to one another. 
Festinger and Thibaut (1951) found that the 
greater the attitudinal dissimilarity between 
a majority and members of a deviant group, 
the greater the amount of majority-member 
communication directed to the deviants. These 
data, coupled with the application of AL 
theory to the opinion similarity dimension, 
yield the following predictions. As the degree 
of dissimilarity between the subject’s opinions 
and the opinions of Group O increases, the 
tendency for subjects to direct communications 
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to members of Group O should increase, and 
the tendency for subjects to direct communi- 
cation to other members of Group 5 should 
decrease. 


METHOD 
Design 


The extent to which one group of opinio 
tions (Group O) deviated from the subject’s 
and the opinions of the members of Group 
varied across five levels in a single-factor 
Measures of perceived similarity, liking, cor 
and amount of communication 
group were obtained. 


Josi- 
ions 
was 
ign, 
tSS, 
directed to cach 


Subjects 


The subjects were 70 undergraduate males 10 
volunteered to participate in partial fulfillmc vf 
an introductory psychology course requirement 
jects were randomly assigned to one of five ex 
mental conditions, and 10 subjects were run or 
session. All conditions were represented in 1 
session. 


Procedure 


A male experimenter greeted the subjects 
seated each one at least 8 feet away from 
nearest neighbor. To further ensure the privacy 
the experimental materials, subjects were request 
to remain silent until instructed otherwise. T 
experimenter then explained that the purpose of t 
experiment was to investigate patterns and conte 
of intragroup communication and that the grou 
discussion portion of the experiment would be ca 
ried out via note writing, Subjects were told th 
they would be asked to read an actual case histor 
involving the arrest and sentencing of two high 
school youths so as to provide a common topic fc 
the group discussion, The experimenter continued b 
saying that subjects would then be asked to fill ou 
a short questionnaire regarding their opinions about 
the incident described in the case history and that 
each subject would subsequently receive information 
about the opinions expressed by all members of 
the group. The experimenter justified this procedure 
by explaining that he wanted each person to know 
the relative position of every group member prior 
to commencement of the discussion, 

At this point, a short essay describing the case 
of two boys who were caught stealing car stereo 
equipment was distributed. One youth was described 
as being from a wealthy family, while the other was 
from a poor one. The account of the arrest, book- 
ing, and court procedures clearly indicated that the 
boy with wealthy parents received better treatment 
than his indigent friend. After reading this essay 
subjects were given a questionnaire consisting of fhe 
following two statements: (a) “Equal justice pre- 
vailed for both youths,” and (b) “The criminal 
Justice system does not always discriminate in favor 
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of individuals with money and against those without 
Each statement was accompanied by a 
oint scale, with “extreme disagreement” and 
> agreement” as the end anchors. Subjects 
ed to indicate their position on these issues 
ling the dot on each scale which corresponded 
their opinion. Upon completion of these ques- 
‘onnaires, the experimenter stated that he was going 
plot, in graph form, the opinion positions ex- 
essed by the subjects. He instructed subjects to 
gn their questionnaires so that a code letter could 
: assigned to each person that would correspond 
o the letter representing that person’s opinion posi- 
ion on the graph. Justifying the use of code letters, 
he experimenter explained that he did not want 
fic individuals associated with their particular 
pinion positions and that the purpose of the graph 
vas simply to give each person an idea of where 
he stands relative to other group members with 
regard to the discussion topic. Reminding the sub- 
jects to remain quiet, the experimenter left the room 
“to draw up and duplicate the grap! eas 
Tn approximately 5 minutes, the experimenter 
returned and distributed a single sheet of paper to 
each subject. This sheet was separated into a 
vertical dimension by the placement of the word 
disagree at the top center of the page and agree at 
the bottom center; a horizontal dimension was 
constituted by placement of the word disagree at 
the left center of the page and agree at the right 
center position. Ten small, equal-sized circles, 
each labeled with a letter of the alphabet, ap- 
peared on the sheet. The experimenter explained 
that the position of each of the circles with reference 
to the vertical and horizontal agree-disagree dimen- 
sions roughly corresponded to each group member's 
level of agreement or disagreement with the two 
questionnaire statements. Subjects were then told 
that the code letter assigned to each person was 
printed on the back of that person’s graph, All 
‘subjects were given sheets marked with the letter G. 
Description of the stimulus sheet. The circles 
labeled A, B, C, F, G, H, I, and J, which constituted 
Group S and the subject’s position, were clustered 
within the boundaries of the upper left-hand quad- 
rant. The position of each of these circles and the 
differences between them remained constant across 
conditions; the differences between G and the other 
members of this group ranged from 1 to 1.9 inches. 
To decrease the possibility that subjects could make 
an exact comparison between the position of the 
circle assigned them and their response to question- 
naire items, the vertical and horizontal agree-dis- 
agree dimensensions were described by the place- 
ment of the anchors only. G was positioned to 
imply fairly strong disagreement on the horizontal 
dimension and a neutral position on the vertical 
dimension. Placement of G was dictated by pre- 
testing, which indicated that subjects exposed to the 
previously described essay typically expressed strong 
disagreement with the first questionnaire item and 
a neutral response regarding the second. 
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The difference between the positions of the circles 
labeled D and E (Group O) was constant across 
conditions (1 inch); the differences between the 
positions of these circles and G varied according to 
condition. For each condition, the position of D 
and E on the agree-disagree dimensions and the 
mean distance from G in inches are as follows: 
Condition 1—moderate disagreement on the hori- 
zontal dimension and slight agreement on the vertical 
(2.0 inches from G); Condition 2—mild disagree- 
ment on the horizontal dimension and mild agree- 
ment on the vertical (4.0 inches from G); Condi- 
tion 3—a neutral response on the horizontal dimen- 
sion and moderate agreement on the vertical (6.0 
inches from G); Condition 4—mild agreement on 
the horizontal dimension and agreement on the 
vertical (8.0 inches from G); Condition 5—agree- 
ment on the horizontal dimension and strong agree- 
ment on the vertical (10.0 inches from G). 

Dependent measures, After exposure to the graph, 
subjects were asked to fill out a second question- 
naire. The experimenter justified this request in 
terms of an investigation of possible relationships. 
between certain attitudes and the pattern of intra- 
group communications. This questionnaire consisted 
of three specific questions about each member of 
the group: (a) “To what extent are the opinions 
of the other group members similar or dissimilar to 
your own opinions?” (b) “To what extent do you 
think the group members’ opinions (including your 
own) are correct?” (c) “On the basis of this opinion 
survey, to what extent do you think you would like 
the other members of the group?” Each question 
was accompanied by a listing of all group members, 
starting with Person A and continuing alphabetically 
through Person J. Each of these listings was accom- 
panied by a 15-point scale. These scales were 
anchored by “very similar” and “very dissimilar,” 
“completely correct” and “completely incorrect,” and 
“very much” and “not at all” for Questions 1, 2, 
and 3, respectively. After collecting this question- 
naire, the experimenter indicated that it was time to 
begin the communication portion of the experiment. 
He distributed blank note cards to the subjects and 
stated that, during the 5-minute period allotted for 
this portion of the experiment, each person could 
write as many notes as he wished to whomever he 
wished and that there were no restrictions on the 
length or content of the notes. Extra note cards 
were readily accessible to each person. Subjects were 
instructed to begin each note with the code letter 
of the person for whom the communication was 
intended and to end with his own code letter. At the 
end of the 35-minute period, the experimenter col- 
lected the notes, checked for suspicion, and explained 
the purpose of the experiment and experimental 
procedures. Of the 70 subjects, 8 expressed some 
suspicion concerning the experimental procedure, and 
their data were eliminated from the analysis. How- 
ever, subjects dropped from the analysis were fairly 
equally distributed across conditions: three from 
Condition 1, two from Condition 4, and one cach 
from Conditions 2, 3 and 5. 
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TABLE 1 


Mean RATINGS or PERCEIVED SIMILARITY or EACH 
Group TO G AND MEAN RANGE oF SIMILARITY 
Ratincs Witrin Eacn Group 


Con- Similarity to Gs Range within 
isan; GroupS GroupO GroupS Group O 
1 13 54.31 5.39 8.92 3.37 
2 12 46.17 11.25 8.58 2.92 
3 14 36.86 26.57 6.00 :12 
4 12 32.58 28.08 4.42 08 
5 11 30.36 28.18 3.64 18 


^ The higher the mean score, the less the perceived similarity 
between the subject and Group S and Group O. 
A » The possible range of similarity for both Group S and Group 
was 0-14. 


RESULTS 


An analysis of subjects’ responses to the 
two questions concerning equality of justice 
indicated no significant differences between 
conditions for Question 1 (F < 1) or Ques- 
tion 2, F(4,57) = 1.26. The overall mean 
response to Question 1 was M = 13.62, indi- 
cating that subjects felt that equal justice 
did not prevail for both youths in the hypo- 
thetical example. The overall mean response 
to Question 2 was M = 6.63, indicating that 
subjects were slightly positive with regard to 
the statement that the justice system does 
not always discriminate in favor of wealth, 
These data indicate that the actual attitudes 
of the subjects included in the analysis 
closely approximated their assigned position 
on the graph. 

Subjects’ responses on the perceived simi- 
larity questions were divided into four sepa- 
rate measures: (a) the degree to which the 
subject perceived the opinions of A, B, C, F, 
H, I, and J as similar to his own opinions, 
(b) the degree of perceived dissimilarity 
among the opinion positions of Group S calcu- 
lated in terms of range, and (c) the degree 
of perceived dissimilarity between the opinion 
positions of D and E also calculated in terms 
of range. As a check on the manipulation of 
similarity between Group O and the sub- 
ject, measures of Perceived similarity between 
Group O and the subject were also calculated, 

The means for perceived similarity of 
Group O and Group S to self are presented 
in Table 1, which shows that the manipula- 
tion of perceived similarity between the sub- 


VIRGINIA HENSLEY AND SHELLEY DUVAL 


jects and Group O was successful, (4, 87) 
= 95.13, p < .001. Furthermore, the means 
for all three measures concerning pe 
similarity are in the predicted di 
dissimilarity between the subject’s opini 
and the opinions of D and E increased per- 
ceived similarity of Group S to the 
increased, F(4,57) = 9.47, p< .001 
of perceived dissimilarity within G+. S 
decreased, F(4, 57) = 10.55, p< .O 
range of perceived dissimilarity within 
O decreased, F (4, 52) = 12.65, p < © 


Liking and Perceived C. orrectness 


Subjects’ responses to the question 
cerning liking and perceived correctness vere 
separated into five measures: (a) liki 
A, B, C, F, H, I, and J, (b) liking D 
and E, (c) perceived correctness of ow: 
ions, (d) perceived correctness of the oj 


ins 
of A, B, C, F, H, I, and J, and (e) perc ved 
correctness of the opinions of D and As 
can be seen from Tables 2 and 3, the ins 
for liking and perceived correctness are i the 
predicted directions. As the degree of lis- 


similarity between the opinions of the si ‘ect 
and those of D and E increased, likin, fi 


? As can be seen in Table 1, Condition 
average rating for a member of Group S is » her 
than for a member of Group O. This is acco. ied 
for by the fact that subjects consistently rated 
of the distant members of Group S (I, as 


as more dissimilar to self than other memb of 
either group represented on the graphs as beir p- 
proximately the same distance from the Subject. © his 
effect could have been due in part to the faci at 
the positions of I, H, and J were Closer to the ¿ph 
anchors, “disagree—disagree,” whereas subjects’ al 


Opinions regarding the second attitudinal que 
in greater agreement with 


of these possible influences on Subjects’ perceptions 
H, and J would be consistent across conditions 
in the subjects? perceptions 
y the changing values of 
because the Positions of I, H, and J were 


or X ed, lee * Ae 
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TABLE 2 
VIAN RATINGS OF LIKING ror Each GROUP 


Measures of liking for 


Taaa 
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TABLE 4 


SUMMARY OF UNIVARIATE TREND ANALYSES FOR 
MEASURES OF PERCEIVED SIMILARITY, LIKING, AND 
PERCEIVED CORRECTNESS 


n n Group S Group O 
1 13 47.00 7.54 
2 12 36.25 9.00 
3 14 32.64 19.07 
4 12 31.25 16.00 
5 11 36.73 19.91 


As the magnitude of the means increases, liking de- 


‘roup S increased, F (4, 57) = 2.53, p < .05, 
king for Group O decreased, F(4,57) = 
38, p < .001, perceived correctness of own 
pinion increased, F'(4, 57) = 5.32, p < .001, 
perceived correctness of Group S increased, 
F(4,57) = 3.92, p < .01, and perceived cor- 
rectness of Group O decreased, F(4, 57) = 
18.86, p < .001. 


Trend Analysis 


Patterns of the means for each of the nine 
dependent measures suggest the presence of 
curvilinear effects. A multivariate trend anal- 
ysis evidences a significant quadratic trend, 
F(8, 50) = 17.28, p < .001. Table 4 contains 
a summary of the univariate trend analy- 
ses, indicating the presence or absence of 
quadratic trends within the dependent vari- 
ables. Coupled with the pattern of means for 
each measure, we can conclude that the gen- 
eral relationship between the dependent vari- 
ables and the experimental manipulation is, 
as predicted by AL theory, curvilinear in 
nature, 


Communication 


The final hypotheses tested in the present 
experiment concerned the direction of com- 


TABLE 3 


MEAN RATINGS or PERCEIVED CORRECTNESS or EACH 
Group AND OF G 


asa Perceived correctness of 
Condi- 


tion n Group S Group O G 
1 13 52.69 6.54 4.31 
2 12 50.33 13.33 4.08 
3 14 37.93 23.43 3.36 
4 12 38.75 21.67 2.00 
5 11 33.18 23.64 1-73 


Note. As the magnitude of the means increases, perceived 
correctness decreases, 


Univariate F tests* 


Variable MS F p? 
Similarity of Group S 805.70 6.19 .020 
Similarity of Group O 1,719.89 112.51 .001 
Similarity : range of 

Group S 17.73 2.74 100 
Similarity : range of 

Group O 6.88 2.95 .100 
Liking for Group S 724.86 3.73 060 
Liking for Group O 651.56 15.00 .001 
Correctness of Group S 842.51 3.78 .060 
Correctness of Group O 1,159.31 30.33 .001 
Correctness of G 1,33 AL ns 


adf = 1,57. 


munication. As the degree of dissimilarity be- 
tween the opinion positions of the subject 
and of Group O increased, it was predicted 
that the amount of communication directed 
to members of Group O would increase and 
that the amount of communication directed 
to members of Group S would decrease. As 
shown in Table 5, the number of notes that 
subjects addressed to members of each sepa- 
rate group supports these predictions, x°(4) 
= 349.04, p < .001. 


Behavioral Measures 


In the introduction, the controversy be- 
tween the category usage and perceptual re- 
structuration interpretation of the contrast 
shift was discussed. Although the research 
cited clearly favors the latter interpretation, 
it may be wisest to buttress data obtained 
through the use of scales with data from 
more behavioral measures. Of course, the 
frequency of communication is a behavioral 


TABLE 5 


NUMBER AND PERCENTAGE OF COMMUNICATIONS 
Direcrep TO Each GROUP 


Number and percentage directed to 


Condition Group $ Group O 
24 (,86) 4 (14) 

2 9 (.56) 7 (44) 

3 7 (.27) 19 (.73) 

4 6 (.25) 18 (.75) 

5 4 (.20) 16 (.80) 


Note. Numbers in parentheses are percentages: 
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TABLE 6 
Jupces' Mean Ratencs or Supjects’ 

Communications TO Grour S ann Group O 
Condition Similarity Correctness Hostility 
1 z 

Group S 10.40 10.50 8.95 

Group O 6.25 7.50 8.25 
2 

Group S 9.58 9.50 7.16 

Group O 9.23 10.60 11.10 
s 

Group S 5.44 5.66 5.00 

Group O 10.10 12.00 12.15 
4 

Group § 6.83 6.00 5.66 

Group O 11.72 12.82 12.25 
5 

Group S 5.21 5.50 4.76 

Group O 12.63 13.40 13.00 

Nole As tle magnitude of the means increases, dissimilarity, 


incorrectness, and hostility increase. 


measure and, within the context of Festinger 
et al. (1952) and Festinger and Thibaut 
(1951), implies that perceived similarity of 
Group S was actually shifted as Group O be- 
came more dissimilar. However, to obtain 
further data based on subject’s spontaneous 
behavior, two independent judges were asked 
to rate the contents of Group S and Group O 
communications on three dimensions. These 
dimensions were (a) the extent to which the 
content of each note stated or implied that 
the writer of the note perceived the intended 
recipient as holding similar opinions on the 
two issues, (b) the extent to which the con- 
tent of each note stated or implied that the 
writer perceived the intended recipient as 
holding correct opinions on the two issues, 
and (c) the extent of hostility expressed by 
each note (this latter measure is a direct 
evaluation of the immediate hostility felt by 
the writer for the intended recipient and an 
indirect indicator of the writer’s liking for 
the intended recipient). The code letters of 
the writer and intended recipient were blocked 
out on each note, and the order in which the 
judges read and rated each note was random- 
ized. The rating scales used by the judges 
were 7-point scales, anchored at each end by 
“very dissimilar—very similar,” “very incor- 
rect-very correct,” and “very hostile-very 
favorable,” respectively. 

Judges’ ratings on each item were compared 
for reliability. The results of this analysis 
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indicated high interjudge reliability on each 


item (similarity, r= .93; correctness, y= 
.93; hostility, r = .94). Given these levels of 
agreement, judges’ ratings of each not- were 
summed to provide overall measures oî per- 
ceived similarity, correctness, and ility 
expressed by the contents of the note- The 
means for Groups § and O, divided by ondi- 


tion, are presented in Table 6. As can b» cen, 


the patterns of the means for each din) sion 
and each group are in the directions pre «ted 
by AL theory and the notion of re-str ra= 
tion (Group S: similarity, F(4,46) =€ p 
< 001; correctness, F(4,46) = 7.73 < 


001; hostility, F(4,46) = 5.72, p< 5; 
Group O: similarity, F(4, 60) = 27.79, < 
.0O1; correctness, F (4, 60) = 9.56, p < i 
hostility, F(4,60) =8.47, p<.001). ic- 
cause the content of the notes represent: he 
spontaneous responses of subjects and the is 
close correspondence between the scale od 
behavioral data (although no behavioral: a- 
sure of within-group discrimination is 
available since subjects usually did not © le 
notes to all members of each group), e 
can be little doubt that the data obtainc n 
the study represent the effect of social cor t 
on subject’s actual perceptions of and 
tudes toward other group members.° 


Discussion 


Although the present data regarding 
ceived similarity is in general agreement v h 
predictions derived from AL theory, an 
ternative approach that provides a fairly c 
plete explanation for these data can be 
rived from Gestalt principles of unit fori 
tion. Heider (1958) notes that the forma: 
of a unit relationship between two or m 
stimuli is dependent not only on the deg: « 
of similarity among these stimuli but also on 
the degree of dissimilarity between these 


k è The differences in content in the various condi 
tions were quite striking, For example, in Condi 
tion 1, a subject writes to J, “You are too extrems 
Either this is a bad day for you or you don’t believe 
that criminal justice does have some good connola 
tions attached to it.” On the other hand, in Condi- 
tion 5, a note addressed to the same member of 
Group S reads, “Seeing as how our opinions fell 
so close together, T'm interested in how you came 
about your conclusion,” i 
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uli and the surrounding, contextual stim- 
Vor example, the unit relationship be- 
n two Ethiopians surrounded by people of 
ifferent nationality should be stronger 
n the unit relationship between the same 

) Ethiopians surrounded by their fellow 

untrymen. Further extension of this rela- 

onship implies that, holding the level of 

milarity between two or more stimuli con- 
stant, increased dissimilarity between those 
stimuli and their context should increase the 
perceived unit relationship among members 
of both sets of stimuli, Applied to the per- 
ceived similarity data obtained in the pres- 
ent study, this principle suggests that the 
effects of increased dissimilarity between the 
two groups were due to an increase in the 
degree of unit relationship perceived among 
the members of each group. This interpre- 
tation has the advantage of simplicity in that 
it dispenses with the notions of an adapta- 
tion level and zones of maximal discrimina- 
tion. The major weaknesses of this approach 
involve a lack of predictive precision compa- 
rable to that of AL theory and an absence of 
theoretical explanation as to why unit forma- 
tion should obey one set of principles rather 
than another. 

Turning to the correctness data, it was 
found that increases in opinion differences 
between the subject and Group O also in- 
creased the subject’s perceived correctness of 
his own opinions, increased perceived cor- 
rectness of the opinions of members of Group 
S, and decreased perceived correctness of 
Group O's opinions. Considering that the 
measures of perceived similarity between the 
ubject’s opinions and the opinions of Group 

and the subject’s perceived correctness of 

wn opinions were highly correlated, the 

in the subject’s perceived correctness 

aÍ his own opinions can be adequately ac- 
counted for by social comparison theory. In- 
terpretc\ions of the second and third find- 
ings are more ambiguous. The generalization 
of correctness notion advanced in the intro- 
duction can account for these results, but so 
can 4 number of other theories. For example, 
balance theories in general predict that un- 
desirable traits, perhaps including incorrect- 
hess, are attributed to dissimilar persons. The 
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more articulated version of this principle 
(Festinger, 1957) states that a person may 
derogate others to the extent that they hold 
opinions that differ from his own. Given that 
the degree of derogation is related to the ex- 
tent of opinion discrepancy, the decreases in 
perceived correctness of Group O in the pres- 
ent study could represent the subject’s dero- 
gation of D and E. The operation of this 
dissonance-reducing mechanism could also ac- 
count for increases in the perceived correct- 
ness of Group S. As the perceived similarity 
of Group § is increased by contrast effects, 
the amount of derogation of members of 
Group S would decrease, On a correctness 
measure, decreases in derogation could ap- 
pear as increases in perceived correctness. 
Finally, we would like to note the possibil- 
ity that a substantial number of perceptions 
and attitudes that are characteristic of ethno- 
centric behavior may be mediated in part by 
perceptual contrast effects caused by the 
presence of a dissimilar group in the social- 
contextual field. This shift in level of inter- 
pretation is justified (a) by the resemblance 
between the experimental setting and an in- 
group/out-group configuration and (b) by 
the degree of similarity between the sub- 
jects’ perceptions of Groups S and O and 
perceptions of the in-group and out-group 
which are regarded as ethnocentric (LeVine 
& Campbell, 1972). For example, less dis- 
crimination of the opinion differences present 
within Group S could correspond to in-group 
solidarity; greater liking for Group S$ could 
parallel cooperative relations between in- 
group members and low degrees of social dis- 
tance among in-group members. Greater per- 
ceived correctness of own opinions and the 
opinions of Group S could correspond to see- 
ing in-group standards and values as uni- 
versally true. Increased dislike for Group o 
could correspond to the hatred of out-group. 
members and the high degree of social dis- 
tance which is maintained between the in- 
group and out-group. Decreased differentia- 
tion between the opinion differences present 
within Group O could reflect the same pro« 
cesses which result in stereotyping of out 
and greater perceived in- 


group members. ed i 
correctness of Group O could parallel viewing 
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out-group members as contemptible, immoral, 
and generally inferior to the in-group. 
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The political fate of 


revolutionary leaders was examined in relation to their 


skill in adapting to the information-processing demands of the environment. 
Specifically, it was hypothesized that long-range success for a leader would be 
associated with low conceptual complexity during the phase of revolutionary 


struggle (when it is desirable to have 


a categorical, single-minded approach to 


problems) and a change to high complexity during the poststruggle consolida- 


tion phase (when, as leaders of a government in power, 


jsts would need a relatively graduated, 


materials, originating from 19 leaders in five successful revolutions, 
Changes in the predicted 


for conceptual complexity. 


the former revolution- 
, flexible, and integrated view). Archival 
were scored 
direction were found for 


11 historical figures who retained power until voluntary retirement or natural 


death; no such changes occurred amo. 


ng eight leaders who lost their positions 


after the victory of their movement. These findings demonstrate the usefulness 
of conceptual complexity theory and methodology as a framework for analyzing 


historical and other archival materials 
to political psychology and psychohist 


Studies on the relationship between person- 
ality and politics can be divided into two 
broad categories, work on individual political 
Jeaders and investigations into the character- 
istics of large populations (Sigel, 1970). The 
first of these two modes is primarily repre- 
sented by general biographical studies. Fre- 
quently based on a psychodynamic point of 
view (Bychowski, 1948; Erikson, 1958, 1969; 
Freud & Bullitt, 1967), these studies have 
been criticized on the grounds of selective 
attention to events and variables. Further, 
many critics feel that such treatments can be 
used to explain any event after it occurs 
md are thus incapable of being refuted (see 

lad, 1973). 

\nalyses based on other theoretical frame- 

} ve also been proposed. For example, 
(1972) categorization of United 
5 presidents into character types differ- 
ihe two orthogonal dimensions of 
egativity and positiveness—-negative- 
enabled him to relate certain presi- 
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and support the interactionist approach 
ory. 


dential decisions and actions to characterol- 
ogy. However, some of the same criticisms are 
relevant. In particular, the rules for deriving 
explanations from the construct are vague, 
and the data used to assign individuals to 
one or another of the categories are open to 
biased selection. So far, the theory has not 
been used to make actual predictions of 
future behavior (Katz, 1973). 

Approaches that attempt to analyze and 
predict behavior from a particular relevant — 
personality trait or characteristic can be less 
open to distortion. In an excellent recent 
review, Knutson (1973) listed authoritarian- 
ism, level of activity, anomie and alienation, 
power motivation, and dogmatism among the 
characteristics that have been studied in- 
this context. 

One problem in the empirical study of p 
sonality characteristics of political leader: 
the dependence on tests that require coope 
tion from potential subjects. As one of the 
authors i 
study has recently pointed out, “the 
with research on authoritarianism 1s 
has been research on The Authorit 
Personality” (Sanford, 1973, p: 163), th 
on the rationale, characteristics, and im| 
tion of the California F Scale. There are 


= 
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ous problems here: problems of language, era, 
availability, and cooperation. Political leaders, 
even of the researcher’s own time and culture, 
may perceive social scientists as potential 
tools of ideological and political opponents 
and their questionnaires as impertinent or at 
best irrelevant. This, of course, is one reason 
why many psychohistorians prefer the general 
biographical approach and why empirical, 
quantitative studies deal much more fre- 
quently with individuals such as college stu- 
dents (including their political attitudes, 
adherences, and activities) than with actual 
politicians. Even those researchers who suc- 
cessfully recruit subjects outside the campus 
typically restrict their work to followers 
rather than to leaders (Knutson, 1973). 

Another, more basic problem with the psy- 
chological analysis of political behavior is the 
tendency of workers in this area to take a 
personological approach without due consid- 
eration for environmental influences. The 
study of political leaders has tended to em- 
phasize consistencies within the individual to 
explain his political behavior, The field has 
largely ignored criticisms (e.g., Endler, 1975; 
Mischel, 1968) that point out the variability 
of behavior across situations. For example, 
Mischel’s defence of social behavior theory 
advocates that efforts should be directed 
toward the identification of those changing 
conditions that lead to changes in behavior 
rather than toward the demonstration of 
cross-situational consistency, One would think 
that the personologist working in the area of 
political psychology would be particularly 
sensitive to this issue, since it is intuitively 
obvious that changing political conditions 
might sometimes require drastic alterations 
in the behaviors necessary for success. 

The present study combines the use of a 
personality variable that can be scored from 
generally available archival materials with an 
emphasis on the relevance of this trait to 
changing environments, The characteristic 
measured was conceptual complexity (Harvey, 
Hunt, & Schroder, 1961; Schroder, Driver, 
& Streufert, 1967). Conceptual complexity 
theory takes as its major focus the cognitive 
differentiation and integration of information, 
According to this theory, individuals progress 
and may become generally fixated at various 
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levels along the dimension of sin licity— 
complexity of information proce . Indi- 
viduals at the simple end are char: erized 
by rigid evaluations of stimuli, the reiection 
of dissonant information, submissive=-ss to 
authority and prestige suggestion, . At 
the complex end of the dimension, vidu- 
als are shown to exhibit flexible a: pen 
cognitive systems; the use of many men- 
sions in an integrated, combinatorial on; 


a search for novelty and further infor: ion: 


and the ability to consider multipl ints 
of view simultaneously. 

Measures of this characteristic inclu: the 
semiprojective Paragraph Completion “est 
and Impression Formation Test (Schro: et 
al., 1967), the somewhat more struc’ ved 
This I Believe Test (Harvey, 1966), anc be 
multiple-choice Interpersonal Topical Ir >n- 


tory (Tuckman, 1966). The relationshi, of 
these tests to other measures of cogni ve 
style and the relationship among the S 
of conceptual complexity themselves rem `n 
somewhat uncertain (Suedfeld, Tomkins 4 
Tucker, 1969; Vannoy, 1965). However, 
Paragraph Completion Test is among th 
most frequently used and most thoroug 
validated and has the unique advantage 

a scoring system that can be applied to 
wide range of verbal material. 

Levels of conceptual complexity are relat 
to a variety of personality traits that ha 
been discussed by other authors. For examp|: 
the conceptually simple individual exhibit 
tendencies toward authoritarianism and do; 
matism. This relationship is not merely theo 
retical; it has been empirically demonstrate: 
by a correlation of —.25 to —.55 betweei 
complexity and the California F Scale, and up 
to —.4 between complexity and the Dogma- 
tism Scale (Schroder et al., 1967), 

One of the important aspects of this theory 
is its concentration upon structure rather 
than content (cf, Adorno, Frenkel-Brunswik, 
Levinson, & Sanford, 1950; Rokeach, 1960). 
It is content free: that is, unbiased either 
for or against any particular political philos- 
ophy (Schroder et al., 1967; Russell & Sandi- 
lands, 1973). It is just as easy to be concep- 
tually simple about the necessity for everyone 
to be completely free as about the necessity 
for everyone to be completely submissive to 
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a tv; it is just as possible to be complex 

he reasons for radically transforming 

is about the reasons for maintaining 

similarly, it is possible to be either 

le or complex about a myriad of issues. 

is, the implications of the theory do not 

nge, whether the case in point is a bour- 

or a proletarian party, a 17th or 20th 

natury movement, an intellectual aristocratic 

ier or a guerilla soldier of peasant origins. 

The data are not clear as to whether con- 

ptual complexity is indeed an ability or a 

reference, that is, whether it involves a 

1aximal capacity to function at a given level 

if complexity (parallel to the maximal capac- 

ity to function at a given level of intelligence) 

or a habitual or preferred style that individu- 

als can change when they desire to do so or 

when the environment appears to require such 

change. Some research has indicated that 

training programs of varying degrees of inten- 

sity can affect behavioral complexity (Lee, 

1968, cited in Schroder, 1971; Suedfeld, 

1968). Thus, complexity may be a some- 

‘what flexible, situation-specific response style 

rather than a pervasive, stable, consistent trait. 

The proximate concern of the study reported 

here is the fate of revolutionary leaders after 

the revolution takes and consolidates power. 

It is a truism that revolutions tend to con- 

sume their own sons (de Lamartine, 1848): 

Many prominent revolutionary leaders fall 

victim to other members or factions of their 

= own movement. While this can be explained 

on the basis of a continued power struggle 

ər in terms of chaos and nihilism reinforcing 

vemselves, another viewpoint is that the 

racteristics needed to be a successful 

ler before and during the phase of struggle 

erent from those needed for continued 

; a member of the government in 
loffer (1958) has stated: 


power, As 


ent is pioneered by men of words, ma- 

wed ınatics and consolidated by men of 
action 
It is 


perhaps a p 


usually an advantage to a movement, and 
equisite for its endurance, that these 
‘ould be played by different men succeeding 
each other as conditions require. When the same 
son or persons (or the same type of persons) 
is a movement from its inception to maturity, 
isually ends in disaster. . . , There is of course 
possibility of a change in character, The man 


roles 


per 


the 
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of words might change into a genuine fanatic or 
into a practical man of action. (p, 134) 


This taxonomy is very much like that of 
Lasswell (1930/1960), who divided political 
types into the following three categories; the 
theorist, the agitator, and the administrator. 
Of the latter two categories with which we 
are concerned, the agitator or fanatic is de- 
scribed by Lasswell as being primarily ori- 
ented toward arousing and sharing high levels 
of emotion. In fact, the characteristics of this 
type of individual sound like an excerpt from 
the description of the conceptually simple 
person: He exaggerates the differences be- 
tween alternatives, sees all opponents as evil 
or cowardly, subordinates all other considera- 
tions to the ideas of correct principles, and is 
confused and frustrated by the demands of 
the administrative process, The political ad- 
ministrator, in contrast, may be either an 
innovative leader or a competent performer 
of important and difficult, but routine, tasks. 
Lasswell’s conceptualization is somewhat more 
complex than that of Hoffer in that there is 
a recognition that mixed types exist and that 
the major categories can be further sub- 
divided. It is unfortunate that the empha- 
sis on psychodynamic determinants and on 
the links between various types of political 
behavior and psychopathology (Lasswell, 
1930/1960, 1948, 1966) are so pervasive and 
salient as to reduce chances for an objective 
empirical test of the theory. 

Our hypothesis was that the “fanatic,” or 
successful pretakeover revolutionist, is char- 
acterized by a conceptually simple level of 
functioning: He depicts the enemy as all evil 
and the revolution as the ultimate good; he — 
evaluates all individuals, classes, and nations — 
as either enemies or friends, depending on 3 
their attitude toward the revolution; his ulti- — 
mate court of appeal is the authority of the $ 
revolution; and he directs all energies, faith, 
and struggle toward the success of the move-" 
ment. Prominent pretakeover individuals whee 
show less than this complete single-minded- 

à ived by other revolu- 
ness are probably percelv š 
tionists as weak, KNOL disloyal, which would ' 
danger their Jater careers, } j 
tend: tore the agitator, who functions 

In contrast to the agitator, re 

at the simple level, a government in power 


must function at a complex level in order to 


i 
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solve the numerous and complicated problems 
that confront it. Generally, there is no longer 
a single overriding enemy, various factions 
must be reconciled and conciliated, policies 
must be based on diverse considerations in 
complex interactions, and both ideology and 
practice must be flexible and adaptable to 
dynamic events. Leaders who remain too 
fanatic in this phase are unable to cope with 
the new set of requirements. 

We therefore hypothesized that individuals 
who showed themselves to be successful both 
before and after the success of the revolu- 
tionary movement should demonstrate a low 
level of conceptual complexity before the 
movement takes power and a high level of 
complexity after its success. It was predicted 
that leaders who did not show such changes 
would be unable to remain successful in the 
long run. For the moment, we leave moot 
the question of whether successful individuals 
are conceptually complex people who tempo- 
rarily function at a lower level in order to 
meet the requirements of the pretakeover 
phase or conceptually simple persons who 
develop higher levels of complexity when the 
posttakeover environment demands it. 


METHOD 


The second author, who was blind to both the 
hypothesis of the study and the criteria by which 
the material was to be scored for conceptual com- 
plexity, obtained samples of published materials by 
individuals who were prominent in various revolu- 
tionary struggles. The only requirements were that 
the material be originally in, or translated into, 
English; that successful revolutionary movements 
from a variety of eras and countries be included ; 
that for each movement, some representatives be in- 
cluded who remained successful after the revolution 
took power as well as others who fell from leader- 
ship; and that each sample of material include at 
the minimum five paragraphs. There was no restric- 
tion as to the original mode of presentation (e.g., 
spoken or written). The final sample included 
sources ranging from letters to colleagues, through 
theoretical treatises, to a plea for the defence at a 
treason trial. All material identifying names, dates, 
nations, etc., was eliminated to the maximum extent 
possible when the material was duplicated, so that 
in no instance was the scorer (the first author) 
able to ascertain the author of the material, and in 
Many cases he could not even identify the move- 
ment or the phase (that is, pre- or posttakeover) 
during which it was generated. 

Unfortunately, a great deal of difficulty was en- 

_ countered in obtaining appropriate samples, Many 
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historically remote revolutions had no reliable sur 
viving archives, some leaders had very little if any 
surviving materials that were dated before their 
success, and in some cases the documents on fallen 
leaders were impossible to obtain. The final sample 
consisted of five revolutionary movements in five 
countries and 19 leaders of whom 11 were judged 
to be successes and 8 failures (see Table 1). A suc 
cessful leader was defined as one who was prominent 
in the pretakeover phase and who also held impo 
tant public office in the posttakeover government 
until (a) natural death or (b) the end of a constitu 
tional term, followed by voluntary (as far as could 
be ascertained) retirement. The period of time spent 
in office after the takeover was ignored, since 
retirement and natural death could occur at any time, 
This led to a few apparent anomalies: for example, 
Lenin was categorized as successful and Lin Piao 
as a failure, although the former remained in power 
for a much shorter period of time. Biographies, 
autobiographies, and historical reference books were 
used to determine the assignment of individuals to 
the successful or unsuccessful category. 


Scoring 


The basic scoring system for the Paragraph 
Completion Test (Schroder et al., 1967) was used. 
Although the scoring system was developed for the 
particular material of the test, it is equally appro- 
priate for any other connected verbal material. The 
first author, who scored the material, had been 
trained in an intensive workshop and had obtained 
an interjudge reliability with the authors of the test 
of approximately .90, Each Paragraph was inde- 
pendently scored. While the paragraphing of the 
original source was generally followed, in some cases 
Paragraphing was changed to conform to contempo- 
rary English usage. In some cases, long paragraphs 
were divided into several shorter ones, while in other 
cases extremely short paragraphs were combined into 
one longer one. 

In addition, each paragraph was assigned to one 
of the three categories developed by Claunch (1964) 
for scoring unstructured essay materials. The three 
categories are polarized contrast (PC; when two 
points of view are treated as polar opposites), 
qualified contrast (QC; when two points of view are 
compared or contrasted with the implication that 
differences are not simple and absolute opposites) , 
and integrative comparison (IC; when a common- 
ality between two divergent points of view is stated 
or implied). 

The data from this measure were combined into 
a polar contrast index by the following formula: 

fares Se Oa, 
PC+QC+IC 
High scores on this index identify relatively low 
levels of conceptual complexity, 

Unscorable paragraphs were omitted from both the 
complexity measure and the polar contrast index: 
however, any source that had unscorable paragraphs 
amounting to more than 10% of the total was rejected. 
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RESULTS 


cores on complexity and polar con- 

tr ex are shown in Table 2. Complexity 

i ured on a 1-7 scale, with higher 

ers representing greater complexity; the 

ontrast index can hypothetically range 

) to 1.00, with larger numbers indicating 

igher proportion of polar contrast (i.e., 
ple) paragraphs. 

\n unweighted-means analysis of variance 

w complexity scores showed a significant 
nain effect for condition, with posttakeover 
samples having higher complexity, F(1,17) 
= 9.20, p < .01, and a significant interaction 
effect, F(1,17) = 12.37, p < .005. Successful 
posttakeover samples were considerably higher 
in complexity than any of the other three. 
Comparing the pre- to posttakeover differences 
among successful and unsuccessful leaders, we 
found the change toward increased complex- 
ity to be much greater among the successful 
sample (M = 1.98 vs. —.25), #(17) = 14.82, 

-p< 107. 

For the polar contrast index, unweighted- 
means analysis of variance showed a significant 
interaction effect, F(1,17) = 6.88, p < .05. 
As can be seen in Table 2, successful leaders 
showed a decrease in polar contrast in the 
posttakeover sample, while among the failures 

_ there was an increase in polar contrast from 
pre- to posttakeover. The mean change was 
—.34 for successful and +.10 for unsuccessful 
leaders, (17) = 3.40, p < .03. To assess the 
usefulness of the polar contrast index as a 
test of complexity, a product-moment cor- 
relation coefficient was computed between pre- 
to posttakeover changes on the two measures. 
Che rationale of the index was supported 
(r = —.73), t(17) = 4.29, p < 001. 

Because of the relatively good documenta- 
tion that was easily available on figures of 
the American Revolution, this group was 
somewhat overrepresented in the total sample. 
However, the results do not seem to have 
been unduly biased in consequence: Among 
successful leaders, 83% of Americans showed 
changes in the predicted direction as opposed 
to 80% of non-Americans, while the pattern 
of changes was identical among American and 
non-American failures, Incidentally, changes 
among the successful leaders were consider- 


ably larger than those among the failures. To 
check the effect of type of document, scores 
derived from letters to specific individuals 
were compared with scores derived from 
writings for a general audience, using the six 
subjects who had both types of materials 
included in this study. The mean complexity 
scores were 2.7 and 3.4, respectively, not a 
significant difference. 


Discussion 


The results showed the predicted relation- 
ships, with a high degree of reliability. The 
pretakeover samples included a range of com- 
plexity scores regardless of time or culture of 
origin. Those individuals who were relatively 
complex at this stage joined those who were 
simple and remained simple after the take- 
over in the category of failures. Successful 
leaders were indeed those who exhibited a 
high degree of single-mindedness and com- 
plete dedication (i.e., conceptual simplicity) 
during the revolutionary struggle, with a 
change to more complex functioning after 
grasping power. With very few exceptions in 
our sample, this pattern was consistent re- 
gardless of the revolution in which the indi- 
vidual participated. This lends support to the 
hypothesis, which was based on the changing 
demands of the environment as one progresses 
from rebel to ruler, and on the necessity for 
different problem-solving characteristics to 
meet these changing demands. 

One could derive a more detailed hypothe- 
sis as to the basis for this change. That is, 
it may be that both simple and complex in- 
formation processing are necessary in both 
phases of the revolution, to the degree that 
rallying the masses behind the revolutionary 
forces and the new government calls for a 
fairly single-minded approach, while negotia- 
tions and decision making among the inner 
group of leaders must remain complex in both 
stages. This hypothesis would argue that the 
real shift is in the relative emphasis on these 
two activities, with successful leaders perceiv- 
ing the need to concentrate on internal and 
policy problems after the takeover. Unfortu- 
nately, this hypothesis could not be tested 
with the present data base, because of the 
difficulty of getting a wide enough sample of 
documents and because a high proportion of 
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TABLE 1 
LEADERS INCLUDED IN ANALYSES 


Activities and offices held 


Name Pretakeover Posttakeover End of public career 


English Civil War 


i Member of Parliament; Member of House of _ Died of malaria while Sı 
pe cone eminent criticof church Commons; first chair- still in power. 
ritualism; captain to man of the Council of 
lieutenant-general State; Commander in 
during Civil War; Chief of the army; 
member of Parliament Lord Protector of the 
and 2nd in command Commonwealth. 


of New Model Army; 
chairman of series of 
meetings on revising 
form of government, 


ilburne and editor; Continued career as Imprisoned 1653-1655, Fall 
proma nipare ater of Leveller writer and speaker; when he was released 
faction in Parliament; repeatedly arrested, upon giving security 
captain to lieutenant- banished, and im- for good behavior, 
colonel in Civil War, prisoned. 


U, S. Revolution 


Author, orator, in- Minister to Court of St. Defeated for reelection 
Sa aes fluential separatist; James's; leader of and retired, em- 
member of Con- Federalist party; bittered, to private 
tinental Congress; Vice President and life. 
member of committee then President of the 
to draft Declaration of United States, 


Independence; min- 
ister plenipotentiary 
to negotiate peace 


treaty with Great 
Britain. 
2, Alexander Hamilton Writer against British Delegate to Constitu- After being repeatedly Failu 
policy in North tional Convention; passed over for nom- 
America; Washington's first Secretary of the ination for the presi- 
aide-de-camp and Treasury; a founder dency, he supported 
liaison officer between and leader of Federal- the opposition candi- 
Washington and ist party; Inspector date (Jefferson) in 
French generals and General of the Army; 1800 and lost the 
admirals during influential author and leadership of the 
Revolutionary War; advisor. Federalists, ending his 
member of Con- public career. Killed in 
tinental Congress, a duel with old 
political and personal 
enemy, 
3. Patrick Henry Orator; n Active in Virginia Reelected to Virginia Succes 
Virginia House of _ legislature; responsible State legislature but 
Burgesses; influential for drafts of major died before he could 
radical leader in proportion of the Bill take his seat. 
several committees and of Rights; declined a 
congresses; commander series of high govern- 
of Virginia forces; ment posts because of 
governor of Virginia. family responsibilities 
and ill health, 
4. John Jay Radical leader of colonial Member of Congress; Retired to private life Ces 
cause; member of first U.S, Secretary for after ister af pucte 
Continental Congress; Foreign Affairs; first governor. 
drafted resolution for Chief Justice of the 
New York's approval United States; chief 
of the Declaration of negotiator of contro- 
Independence; Minis- versial treaty with 
ter plenipotentiary to Great Britain; gover- 
Spain; first Chief nor of New York 
ustice of New York; 1795-1801. 
President of Con- 
tinental Congress; one 
g of commissioners to 
make peace with 
Great Britain. 
5. Thomas Jefferson Member of Virginia Member of Congress; i ashi: ig 
House of Burgesses head of committee to “yawn Washington's Success 
í and several revolu- draft Treaty of Peace sible reelection toe ar 
. tionary conventions; > with Great Britains eames nat 


term and retired to 


leading member of private life. 


Continental Congress; 
rimary author of 


Minister to France; 
Secretary of State; 
leader of Democratic 


eclaration of Inde- Republican party; 
pendence and of ice President and for 
Virginia State Con- two terms President of 
aut i governor of the United States. 
'irgini; 
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TABLE 1—(Continued) 
Activities and offices held 
Category 
Name Pretakeover Posttakeover End of public career assignment 
Richard Henry Lee Delegate to first Con- President of Congress; Retired to private lifeat Success 
tinental Congress; one of first U.S. age 60. 
introduced resolution Senators for 
for independence; 
member of Virginia 
House of Delegates 
and the Congress. 
7. George Mason Member of Virginia Member of Constitu- Declined appointment Success 
House of Burgesses; tional Convention; nator from 
resolutions accepted originator of several and retired to 
by Virginia, later by amendments that be- e life, 
Continental Congress; came Pay of the U.S, 
author of Virginia Bill of Rights; worked 
Constitution and Bill out plan to resolve 
of Rights. territorial conflict be- 
tween Virginia and 
federal government. 
8. George Washington Commander of Virginia Retired to private life Refused to accept 3rd Success 
troops; honorary brig- for four years after the term as President and 
adier general; member war; delegate to Con- retired to private life. 
of Virginia House of gtitutional Conven- 
Burgesses; member of tion; unanimously 
first provincial con- chosen to be first 
vention supporting President of the 
Boston's revolt under United States and 
British rule; leading reelected, 
radical member of 
Continental Congress; 
commander-in-chief of 
the military forees of 
the colonies during 
Revolutionary War. 
Russian Revolution 
1. N. I. Bukharin Active revolutionist from Member of Politburo; Opposed Stalin's policie: Failure 
1906; Bolshevik mem- influential in economic expelled from Comin- 
ber of Central Com- policy, called by Lenin tern and Politburo; 
mittee of Russian “the most valuable executed in 1938. 
Social Democratic and greatest theo- 
Workers Party; editor retician of the Party”; 
of Pravda; radical General Secretary of 
member of executive Comintern. 
committee of Com- 
intern. 
2. V. 1, Lenin Active revolutionist from Chairman of Soviet Died from a series of Success 
1887; journalist and government and strokes while in 
pamphleteer; leader Teader of the Com- power, 
of Bolshevik branch munist Party and 
of the Russian Social government ol 
Democratic Workers USSR, 
Party almost from its 
inception. 
3. J. V. Stalin Active socialist and Commissar of Nationali- Died of cerebral hem- Success 
Marxist from youth; ties; head of Workers morrhage while in 
propagandist and and Peasants Inspec- power. 
organizer of guerilla torate; commander of 
fighting groups; parti- the defence of Petro- 
cipated in several grad in 1919; Secre- 
party congresses; tary General of Com- 
member of Central munist Party; Prime 
Committee of the Minister of U.S.S.R. 
Bolshevik Party; and commander-in- 
editor of Pravda. chief of armed forces 
during and after 
World War I1. 
4. L, D. Trotsky Revolutionary from People's Commissar for Successive loss of in- Failure 
youth; leader of 1905 Foreign Affairs; fluence and sup- 
revolt; repeatedly, ex- leader of delegation porters; ousted from 
iled or jailed; leading negotiating peace ost as Commissar for 
organizer of 1917 treaty with Germany; var and eventually 
Bolshevik revolution. Commissar for Wari from the Communist 
founder and com- Party; exil 
mander of the Red Soviet Union; assas- 
Army during Civil sinated in Mexico. 
War. 
Chinese Revolution n Fail 
1, Liu Shao Ch'i Labor organizer; mem- Vice-chairman of Central Rem wed from all post, ailure 
ber of Central Com- Government Council; tions Eg ty. 
mittee of Communist chairman and chief-ol- expel Ciy after ol ae 
Party; Head, Central state of the People’s reported, Mao Te 
China Bureau; com- Republic. ung: 


missar of New Fourth 
Army; author of 
theoretical texts. 


(table continued) 
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TABLE 1—(Continued) 


Activities and offices held A 
Name Pretakeover Posttakeover End of public carcer ignment 
9 i. ale Premier; mem- Accused of having Failure 
ye ne See erry aoe Deuy Politburo; plotted to take power 
Long March; served Marshal of the © reported killed in 
i with the Red Army in Feon aes pee re Peat 
5 M ia. ter of National De- em. t e 
ane EA officially desig- Soviet Union. 
nated as successor to 
2 Mao Tse Tung. 
r 3. Mao Tse Tung Trade union organizer; Chairman of Communist Still in power, ccess 
one of the founders of Party and until 1959 
Chinese Communist (when he resigned) | 
Party; guerilla com- Chairman of People's 
mander and chairman Republic. Acknowl- 
of the Communist edged to be the ulti- 
Party. mate ruler of the 
government. 
Cuban Revolution 
à 1, Fidel Castro Revolutionary organizer Prime Minister and Still in power, 'eBg 
from youth; leader of ruler since takeover, 
several revolts and of 
successful revolution 
of 1958, 
2, Ernesto "Che" Guevara Communist revolution Minister of Industries; Disappeared from public re 
ary leader from youth; powerful and in- view after reported 
leading guerilla com- uential leader in a disagreements with 
mander during revolu- variety of areas. Castro; killed leading 
tion, a small and unsuccess- 
ful revolutionary 
movement in Bolivia, 
the documents used could not be easily classi- came from periods long before the fina de- 
fied as to target. Some were ostensibly letters nouement, while the subjects still held er 
to private individuals, but with the clear and high office, this does not seem to a 
P f y pi ? 
implication that their contents were to be plausible basis for interpreting the rigidi of 


communicated to a larger body; in other the unsuccessful men, 

Cases, speeches and reports to groups of fellow As we have previously indicated, ther: re 
leaders were equally obviously intended for two alternative explanations for the apparont 
dissemination through mass communication increase in complexity of successful lead vs. 


media, They may be complex people who, ei er 
It should be noted that the major change explicitly or implicity recognizing the r-ed & 
occurred for the successful rather than the for simplicity during the struggle, perform at 


unsuccessful leaders. Among the latter group, a level lower than their maximum abil 
changes in complexity tended to be quite 


small, which supports the idea that unsuc- TABLE 2 
+ cessful leaders were unwilling or unable to MEAN COMPLEXITY AND POLAR 
adopt more appropriate information-process- Conrrast SCORES 


ing approaches as circumstances changed. The ~ i 

majority of posttakeover materials evaluated Condition 

in this study originated in the period between Son Pretake--Posttake- Overa ae 
the success of the revolutionary movement seg iat ae: 

and the fall of the individual from power, so Complexity 


that the subject’s final failure did not aff Bucci rene oa 
i j e ect i 2 3 
his complexity score. It is difficult to say Poe ae 196 505 en 
whether some of these leaders had a feeling Polar Contrast 

that failure was imminent and whether, if Successful 08 33 50 
that were the case, their complexity was af- Failures S7 69 63 


fected ; however, since much of the material Greg aS eS 


REVOLUTIONARY LEADERS 


wi ircumstances become more favorable, 
tł resume their previous high level of 
€ tion, Or it may be that these individu- 
so are truly fanatic and simple in concep- 
tool structure during the struggle phase but 
distinguished from their colleagues in 

ing flexible enough to reach new and higher 
els of complexity when these are required 

v the environment. 

The original theory (Harvey et al., 1961) 
osits that conceptual structure is formed 
hrough childhood experiences and does not 
»rovide for major changes in response to en- 
vironmental pressures once the individual has 
reached adulthood. If we accept this assump- 
tion, our results should be interpreted as 
showing changes in the utilization of complex 
thought rather than in the development of 
a greater ability to be complex. However, 
there are no data bearing on the assumption, 
since no attempts have been made to induce 
long-term change in general complexity with 
adult subjects. One way to answer the ques- 
tion would be to score the productions of 
our subjects from periods long before they 
became involved with revolutionary move- 
ments; unfortunately, very little such mate- 
rial was found, partly because such material 
was not preserved and partly because some of 
our subjects had been revolutionaries from 
adolescence, Thus, some of the practical and 
theoretical applications of our findings remain 
to be elucidated. 

At another level, the results support the 
usefulness of conceptual complexity theory 
and of the Paragraph Completion Test in the 
analysis of nonlaboratory events, Most of the 
research with this technique has been based 
m the traditions of experimental social psy- 
chology and has drawn data from laboratory 
or field experiments in psychology and educa- 
tion. It now appears that the approach can 
be used with archival material and that dif- 
ferences in historical era and in original lan- 
guage do not seem to invalidate its utility. 
Obviously, it is a nonreactive measure (Webb, 
Campbell, Schwartz, & Sechrest, 1966) and 
avoids the problems of equipment, facilities, 
subject recruitment, etc., and their concomi- 
tant artifacts. Thus, it enables the researcher 


to ayoid some of the temporal, geographic, 
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and cultural restrictions and the problem of 
obtaining cooperation. 

The usefulness of conceptual complexity as 
a variable in political psychology can be 
further explored. In a previous study (Sued- 
feld & Epstein, 1973), it was found that a 
group of peace demonstrators showed higher 
conceptual complexity than a group of army 
reserve officers and a mixed control group; 
while these data were obtained by a specific 
test of complexity, there is no reason why 
textual analyses of a wide variety of mate- 
rials cannot be made to estimate group 
levels of conceptual complexity in somewhat 
the same way that McClelland and his co- 
workers had studied general levels of the 
achievement, affiliation, and power motives 
(Atkinson, 1958; McClelland, Atkinson, Clark, 
& Lowell, 1953). It is also possible to predict 
behaviors and outcomes by scoring currently 
available material and letting history test the 
hypothesis. By proper timing (e.g., shortly 
before an election), one could even avoid an 
unduly long latency period. 

Of course, there is no reason why the use 
of this technique must be restricted to the 
political context. If conceptual complexity is 
indeed a factor that broadly affects cognition 
and information processing, then science, edu- 
cation, business, war, foreign policy, and in 
fact responses to an unlimited range of prob- 
lems would offer an adequate data base. How 
the complexity demands of situations change 
and why some people can meet these demands 
more flexibly than others are questions that 
have important applied as well as theoria 
relevance. 
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Equity and the Use of Overreward to Motivate Performance 


Gerald S. Leventhal 
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Jerald Greenberg 
Case Western Reserve University 


Two studies were conducted in which subjects played the role of industrial 
consultants who distributed bonus pay to groups of hypothetical workers. The 
subjects were told either to give workers what they deserved based on perform- 
ance (maintain equity) or to use the reward to motivate the workers. Subjects 
who were told to motivate better performance gave higher reward to poor 
performers than subjects who were told to maintain equity. In some instances, 
the tendency to overreward poor performers was so strong that subjects who 
tried to motivate the workers violated the equity norm by giving higher pay 
to workers with lower inputs. Thus, in Study 1, subjects who attempted to 
raise workers’ performance gave higher reward to members of failing groups 
than to members of successful groups. In Study 2, male subjects who attempted 


to motivate better performance gave higher reward to lazy workers than to 
well-motivated workers. 

This research investigates the ways an in- allocators use to motivate substandard per- 
dividual uses reward to influence the behav- formers to improve their work and the extent 
ior of other people. An allocator’s distribution to which they deviate from equity when 
of reward may often be designed to elicit using such strategies. Two general strategies 
better performance from receivers or to keep for motivating poor performers may be iden- 

~ them from feeling dissatisfied (Leventhal, tified. First, an allocator may elect to punish 
1976b; Leventhal, Michaels, & Sanford, 1972; poor performers by reducing their rewards. 
Leventhal & Whiteside, 1973). Allocators’ Alternatively, the allocator may pursue a 
decisions are also influenced by their desire more liberal strategy and overreward sub- 
to behave fairly and follow appropriate jus- standard performers. 
tice rules (Leventhal, 1976a, 1976b). For There is reason to believe an allocator may 
example, allocators often maintain equity by often prefer a strategy of giving high reward 
giving higher reward to better performers to motivate poor performers. For example, 
(eg, Lane & Messé, 1971; Leventhal & past research suggests that a receiver’s per- 
Michaels, 1971). formance often improves following overpay- 

s The present paper examines the interplay ment (e.g., Adams & Jacobsen, 1964; Adams 


between an allocator’s desire to behave fairly 
and his or her desire to control receivers’ 
behavior in situations in which good perform- 
ance is important. We examine the strategies 
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& Rosenbaum, 1962; Garland, 1973; Kessler 
& Wiener (1972) or prepayment (e.g., Doob, 
Freedman, & Carlsmith, 1973; Doob & 
Zabrack, 1971; Kephart & Bressler, 1958; 
Watson, 1965). Consequently, past experi- 
ence probably leads most adults to believe 
that overreward or prepayment are effective 
methods for stimulating recipients to improve 
their performance. Furthermore, results from 
several studies suggest that allocators often 
do prefer to use a generous strategy of over- 
reward rather than a punitive strategy of 
reducing rewards when dealing with recipients 
whose work is poor (Bankart & Lanzetta, 
1970; Leventhal & Whiteside, 1973; Roth- 


bart, 1968). 
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We have suggested that allocators whose 
goal is to motivate better performance may 
overreward substandard performers to stimu- 
late better work. In contrast, allocators whose 
primary concern is to distribute rewards 
fairly will be less likely to inflate the rewards 
of inferior performers. Their primary concern 
is to evaluate the worth of the receivers’ 
work inputs. They are less concerned about 
the effect of their allocation decision on re- 
ceivers’ future performance and motivation. 
Thus, an allocator who is instructed to main- 
tain equity is likely to give lower reward to 
poor performers than an allocator who is 
instructed to motivate them. 

The present studies compare the allocation 
Strategies used by subjects whose goal is to 
raise workers’ performance with those used 
by subjects whose goal is to maintain equity 
by rewarding workers in accordance with per- 
formance. The studies seek to demonstrate 
the phenomenon of overreward by showing 
that subjects who try to motivate failing per- 
formers give higher reward than subjects who 
try to be fair. 


Stupy 1 


Subjects were asked to allocate bonus pay 
to two two-man work groups in a hypothet- 
ical company. Each work group consisted of 
an above-average and a below-average worker, 
In addition, subjects were told that one of 
the groups would probably fail to meet an 
important deadline, whereas the other group 
would probably succeed. Some subjects were 
told to give the workers what they deserved 
based on performance (equity condition). 
Others were told to allocate bonus pay so as 
to motivate the workers to meet the deadline 
(motivate condition). It was hypothesized 
that subjects who were told to motivate the 
workers would give higher reward to a failing 
group than subjects who were told to allocate 
rewards equitably. In other words, subjects 
trying to elicit high performance would over- 
reward failing performers. In contrast, when 
subjects were distributing rewards to workers 
in a successful group, those who were told 
to motivate the workers would probably see 
little reason for using overreward to elicit 
better performance, Instead, they might be 
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reluctant to place extra pressure on the 
recipients for fear of disrupting their work, 
Method 

Subjects. The subjects were 64 male and female 
volunteers from introductory psychology asses at 
Wayne State University who fulfilled a irse Te- 
quirement by participating. They were ted in 
groups ranging in size from 4 to 9 individ 

Alleged purpose of the experiment. Eac subject 
was seated at a separate desk and giv \ test 
booklet. The subjects were told that were 
taking part in a study of the effect of tr g and 


background on an individual's effectiven: is an 
industrial consultant. More Specifically, th peri- 
menter said, 


We're trying to find out what’s the best ining 
for industrial consultants. A consultant's c tive- 
ness is often influenced by his age an ick- 
ground. So, we want to see how person: lif- 
ferent age and background differ in th vice 
they give about certain business problems lay 
we'll ask you to make some decisions abou ius 
pay for workers. Setting pay rates an or- 
tant job, of course, and it’s one that in ial 
consultants often have to deal with. You m) not 
be familiar with this type of problem but ts 
okay. Consultants often have to deal wit! 1b- 
lems they never encountered before. 

The experimenter then described a hypo! cal 
case that involved the allocation of bonus pi to 
groups of workers who worked for the same n- 
pany. The job was described as follows: 

We'll ask you to judge some typical workers ind 

decide how much bonus pay each one shoul et, 

The workers in these Cases are punch press op- 

erators. This is a kind of factory job wł a 

worker makes Parts at a machine. A person ng 

this task generally works at his own pace, Ww 
fast he works depends Primarily on his wn 


effort and interest in doing a good job. 


The effect of bonus payments on the 
weekly income was described as follows: 


worl: ors” 


Bonuses, of course, are given in addition to : 
lar pay. The workers you'll be judging do gei a 
regular weekly salary. Your decision doesn’t affect 
that. You'll just decide how much additional 
each worker will get as a bonus. 


pay 


The subjects were further instructed to assume 
that the hypothetical company would be penalized 
if it failed to meet the terms of a contract by 
failing to complete a special project by a specific 
deadline. More specifically, they were told, 


In the cases we'll deal with, I want you to as- 
sume that a company has accepted a special con 
tract and must meet a deadline. In other words, 
the management and the workers in the company 


J to complete an important job in a short 
d of time, This kind of situation can create 
yf problems and advice from consultants is 
ı sought in this type of situation. 


nus pay scale, The experimenter then explained 
rocedure for making bonus pay allocations. 
subjects looked at a sample response scale, 
described the payment schedule as follows: 


A bonus can range anywhere from zero to one 
hundred dollars. Fifty dollars is considered me- 
dium bonus pay and one hundred dollars is 
extremely high. 


Che bonus pay scale consisted of 21 points ranging 
ji from $0 to $100 in $5 increments. Descriptive verbal 
| inchors were provided as follows: $0, extremely low 
i bonus pay; $25, moderately low bonus pay; $50, 
Ų medium bonus pay; $75, moderately high bonus 
{ pay; $100, extremely high bonus pay. 

Individual performance and group success. Two 
two-man groups were then described, Individual 
performance was manipulated by telling subjects 
that each group contained a worker whose perform- 
ance was above average and a worker whose per- 

i formance was below average. In each group, each 
| worker was represented by a separate box which 
contained a concise description of his or her per- 
formance. In addition, the four boxes contained 
spaces in which the subjects would write their rec- 
s ~ ommendations for bonus pay. 

Group success was manipulated by telling sub- 
jects that one two-man group had a poor chance of 
meeting the deadline, whereas the other had a good 
chance. For the failing group, the experimenter said 
the workers had 


a poor chance of meeting the deadline. . . - if 
the workers continue to work at the same rate, 
their combined productivity will be much lower 
than the amount required by the contract. So, 
the group of workers will surely fail to meet the 
deadline if Workers A and B continue to produce 
at that rate. 


or the successful group, the experimenter said the 
rkers had 


n excellent chance of meeting the deadline. . . > 
the workers continue to work at the same rate, 
eir combined productivity will be much higher 
in the amount required by the contract. So, 
ie group of workers will surely succeed in meet- 
z the deadline if Workers A and B continue to 
oduce at that rate. 


bjects were told that they would have to allo- 

onus pay four times, namely, to an above- 
aver ge and a below-average worker in a successful 
grow and to an above-average and a below-average 
worker in a failing group. The experimenter’s com- 
ments about group success were reiterated by brief 
summary statements printed in the subjects’ test 
booklets, 


cat 
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Order of presentation of group success—failure. 
The manipulation of group success employed two 
different orders of presentation. In half of the test 
booklets, the two workers in the successful group 
were shown on the upper half of the page, whereas 
those in the failing group were shown on the lower 
half. In the remaining booklets, the order was re- 
versed. 

Allocator’s goal. Subjects were then asked to study 
printed instructions concerning their goals as allo- 
cators and make their decisions about bonus pay- 
Half of the subjects of cach sex were assigned to an 
equity condition in which printed instructions told 
them to 


pay each worker the amount of bonus pay that he 
deserves based on his productivity. It’s important 
that you keep this in mind when you make your 
decisions. You will have to give each worker the 
amount of bonus pay that he deserves based on 
his productivity. 


The remaining subjects were assigned to a motivate 
condition in which the instructions told them to 


pay each worker the amount of bonus pay that 
will motivate him to meet the deadline. It's im- 
portant that you keep this in mind when you 
make your decisions. You will have to give each 
worker the amount of bonus pay that will moti- 
vate him to meet the deadline. 


To permit simultaneous testing of subjects in these 
two conditions, no spoken instructions were given con- 
cerning the allocator’s goal. Instead, a separate page of 
written instructions describing one goal or the other 
was included in each experimental booklet. The 
manipulation was reinforced by a brief summary 
statement on the page on which subjects actually 
recorded their allocation responses. 

Subjects were given approximately 2 minutes to 
read the allocator’s goal instructions (either equity 
or motivate) and make their decisions about bonus 
pay. After completing their allocation decisions, 
subjects filled out a questionnaire that attempted to 
assess their perception of the probable impact of 
their responses.’ Subjects were then asked to main- 
tain secrecy about the experiment, thanked for their 
help, and dismissed. 

Experimental design and randomisation pro- 
cedures. A five-factor mixed design was employed. 
The three between-subjects variables were allo- 
cator’s goal (equity or motivate), sex of the allo- 
cator (male or female), and order of presentation 
of the group success factor (success-failure or fail- 
ure-success), The two within-subjects variables were 
group success (successful group or failing group) 
and individual performance (above-average worker — 
or below-average worker). Four different versions 


1 Due to a serious systematic error in formulating 
the questions, the items were ambiguous and un- 


interpretable. Consequently, prior to data analysis 
we decided to disregard the data from the question- 
naire, 
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of the test booklet were used, the versions differing 
in the between-subjects variables of allocator’s goal 
and order of presentation of the group success fac- 
tor. Prior to the experimental sessions, the four 
versions of the test booklet were randomly ordered 
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TABLE 2 


MEAN Bonus Pay (IN DOLLARS) Given to EACH 
WORKER AS A FUNCTION OF ALLOCATOR Goat, 
Sex or ALLocATOR, Group Success, «wp 
INDIVIDUAL PERFORMANCE 


with a separate order being used for each sex. In SS 
this manner, the conditions were randomized across Group pipe and in ual 
sexes and testing sessions, eae = 
Failing group group 
Bel Abov Abov 
Results Allocator's KEEN eA EA 
ey i goal and sex worker worker orker 
The subjects’ allocation of bonus pay was sya m 
analyzed with a five-factor analysis of vari- Equity 
ance for designs with repeated measures on Man Hee ae Re 4 
; H a Female D. s „63 2.6! 
two factors, and appropriate specific compari- Combined 19.38 4477 2.66 41 
Motivate 
TABLE 1 Male 43.13 57.50 30.31 84 
9 ae i re 2 Female 54.38 58.5 46.22 75 
SIS Or AOTER Or V ARANCE TOE Combined 48.75 58.05 39.77 wò 
Sour of varians Ay MSF dele, Boa pay aloetons could range from #0 «00 ig 
~ — cells, n = 64.) 
Between subjects 
S i Been ee sons. Results from the analyis of varian: are 
AXB 1673.63 <2 summarized in Table 1. Table 2 show the 
NESA (C) l 312.50 <1 mean bonus pay given to each worker a 
BSC 1 i- function of the four major independent v ia- 
AXBXC 1 bles.* 
see within groups 120 1,344.38 Subjects in the motivate condition re 
eas ee Rs eats expected to utilize the strategy of overre rd 
x B coRE AD i ae Soe to elicit better performance, that is, they > re 
p S D 1 2,719.53 3.06 expected to inflate the rewards of membe: of 
ORS D 5 ere = a failing group. The result shown in T. le 
AXCXD 1° 361.13 <1 3 are consistent with this assumption, a is 
te x sx 2 1 Sar 2.66 the significant Goal x Group Success in r- 
D X subjects within aE <T action. Further tests were performed to de r- 
; ie ; 120 888.44 mine whether differences between adjacent 
aeo ual performance e con a cells in Table 3 were greater than zero. S: »- 
AXE 1 8613.28 1754+ jects in the motivate condition gave higher 
XE 1 634.57 <2 ri s S ili 
ASAS HS Se RS ae ewards to members of the failing group than 
CSE 1 "10332. 21 subjects in the equity condition, F(1, 120) 
AXCXE 1 38.28 <1 = 26.08, p < 01. However, the two con 
BXCXE 1 548.63 <2 ti did G : ; 
AXBXCXE ih sp Oo ions did not differ in the amount given to 
E X subjects within members of the successful group, F < 1. 
Pean a8 ne cae Results in the bottom row of Table 3 re- 
AXDXE 1 343. <1. veal that, in the motivate condition, sub- 
p XDXE 1 361.13 4.42* 
a 5 meen XE i 19.53 <1 2 Order of presentation has been disregarded in 
AXCXDXE 4 ee Bens Table 2 because it was of interest only insofar as its 
BROXDSXE i 33056 He interaction with other variables might restrict inter- 
AXBXCXDXE i 2813 <1 pretation of other effects. One interaction term that 
Dx E X subjects included order of presentation was significant (Goal 
within groups 120 81.64 X Order X Group Success X Individual Performance). 
=~ A v However, inspection of the cell means for this term 
mp i indicated that it does not affect the interpretation 


of other significant effects, 


F 
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is? tendency to overreward members of 
failing group was 50 strong that they ac- 
ally violated the equity norm. They gave 
ivher reward to members of the failing group 
han members of the successful group, F(1, 
120) = 4.18, P< 05. Thus, subjects violated 
he norm of equity in their attempt to elicit 
better group performance. Conversely, sub- 
jects in the equity condition allocated bonuses 
on the basis of workers’ inputs and gave 
lower reward to members of the failing group 
than to members of the successful group, F (1, 
120) = 16.12, p < .01. 

The significant main effect of individual 
performance reflects a general tendency for 
subjects to give higher reward to the above- 
average performer in each work group. How- 
ever, the significant Goal X Individual Per- 
formance interaction indicates that subjects 
in the motivate condition did tend to over- 
reward below-average performers. Further 
tests suggest that the tendency to over- 
reward the below-average worker occurred 
primarily in the failing group. Compared with 
subjects in the equity condition, subjects in 
the motivate condition gave higher pay to 
the below-average performer in the failing 
group, F(1, 120) = 23.20, p < 01. However, 
they did not give significantly more to the 
below-average performer in the successful 
group, F < 1. There was a further indication 
that subjects told to motivate the workers 
made greatest use of an overreward strategy 
when allocating to inferior performers in the 
failing group. Compared with subjects ìn the 
equity condition, subjects in the motivate 
condition gave higher reward to both the 
below-average performer, F(1,120) = 19.68, 

01, and the above-average performer, 

120) = 4.02, p< .05, in the failing 
g However, the magnitude of the in- 
significantly greater for the below- 
performer than the above-average 
r, F(1,120) = 11.81, 2x0: 


crease wa 
aver 


perform 


Discussion 

As expected, subjects who were instructed 
gave higher reward to 
above-average performers and successful 
groups. These results are consistent with 
previous findings that subjects allocate re- 
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TABLE 3 


Mean Bonus Pay (iN Do. 
ACH WORKER AS A F 
ALLOCATOR’S GOAL AND GROUP 


s) GIVEN TO 


Success 
Group success 
Allocator's es 
goal Failing Successful 
Equity 07 47.03 
Motivate 45.78 


Note. In each cell, # = 64. 


ward to recipients in accordance with their 
contributions. In contrast, subjects instructed 
to motivate high performance tended to raise 
the rewards of groups and individuals with 
poor performance. There was no indication 
that subjects were using a punitive tactic of 
reducing the rewards of workers with poor 
performance. In fact, when subjects sought to 
motivate a failing group, their tendency to 
overreward poor performers was so great that 
they actually violated the equity norm and 
gave higher reward to members of a failing 
group than members of a successful group. 
Supplementary analyses reported by Green- 
berg (1974) indicate that the surprising re- 
versal of the equity effect in the motivate 
condition is not an artifact resulting from a 
few subjects with extreme scores. The re- 
sponse of giving more to the failing group was 
the response preferred by exactly 50% of the 
subjects in that condition, Apparently, many 
of them believed that giving high reward to 
members of the failing group was the best 
way to stimulate better work, and they were 
willing to violate the equity 
purpose. However, it should 
that a sizable portion of the subjects in the 


norm for that — 
also be noted — 


motivate condition (43.7%) did give some- 


what higher reward to 
In spite of their strong tendency to inflate 
the rewards of failing performers, they 
stopped short of violating equity. 
The results indicate that allocators are 
i iscriminating in 
T e r performance. Subjects 


reward to el 
in the motivate condition tended to focus 


their use of overreward primarily on the be- 


low-average member in the failing group. 


de greatest use of the strategy when 
They th the group member whose work 


the successful group. — 


“+ 
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was most deficient. Furthermore, there was 
little indication that subjects were using an 
overreward strategy when they responded to 
members of the successful group. Although 
they were directed to motivate the workers, 
subjects in the motivate condition may have 
felt it was unnecessary to take special steps to 
influence the members of the successful group. 
This finding provides further evidence for 
the assumption that allocators are quite dis- 
criminating in their use of the tactic of over- 
reward. They use it only when necessary. 


Stupy 2 


Study 2 focused on the manner in which 
an allocator’s readiness to overreward failing 
performers is affected by the performers’ 
motivation. All recipients were described as 
members of failing groups. However, the re- 
cipients’ substandard performance was at- 
tributed either to exceptionally low or excep- 
tionally high motivation. In the low-motiva- 
tion case, recipients’ poor performance was 
attributed to their laziness and lack of con- 
cern about meeting an important deadline. In 
the high-motivation Case, recipients’ poor 
performance was attributed to their having 
become worried and discouraged when they 
realized they might fail to meet the deadline. 
As in Study 1, subjects took the role of con- 
sultants and were instructed to allocate bonus 
pay to workers in a hypothetical company. 
Half of the subjects were instructed to allo- 
cate reward based on performance (i.e., to 
maintain equity). ‘The temaining subjects 
were instructed to allocate reward in a way 
that would motivate the workers to improve 
their performance. 

It was assumed that subjects would view 
exceptionally high motivation as a positive 
contribution even though the workers were 
failing, It was therefore predicted that sub- 
jects who were instructed to maintain equity 
would give greater reward to worried and dis- 
Couraged performers (who were concerned 
about doing well) than to lazy and uncon- 
cerned performers. It was also expected that 
subjects who were told to motivate better 
performance would use a strategy of Over- 
rewarding poor performers to stimulate better 
work. It was therefore predicted that sub- 
Jects in the motivate condition would give 
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higher reward to the workers than ubjects 


in the equity condition. However, it was less 
certain how subjects in the motivat: condi- 
tion would respond to differences in orkers? 


motivation. One possibility was tha‘ ubjects 


in the motivate condition would di y the 
same pattern of response as subject: in the 
equity condition, that is, they woi give 
higher reward to worried and dis: raged 
performers than to lazy and unconce: per- 
formers. However, there was also son ossi- 
bility subjects in the motivate conditi uld 
give higher reward to lazy perform than 
to worried performers. Workers who lazy 
and unconcerned about doing well ı be 
seen as requiring strong incentives © rod 
them into action. Thus, subjects who a con- 
cerned about motivating such worker ight 
believe it necessary to give them a | ‘ive 
extrinsic reward to stimulate them. the 
same reasoning, subjects might seo ‘ttle 
need to provide a lucrative reward fo: vor- 
ried workers who are already highly oti- 
vated. Since the workers who are worri and 
discouraged are already concerned abi. do- 
ing well, a large incentive might not re- 
quired to stimulate them. 

We have suggested two possible vari of 
the overreward strategy, namely, (a) ing 
greater reward to worried workers th izy 
workers and (b) giving greater reward izy 


workers in order to stimulate them \ ha 


strong incentive. There was reason to © cct 
that in the motivate condition, mal and 
female subjects might differ in their ref- 
erence for these two types of response. | ales 
may be more willing than females to sive 


high reward to lazy workers to stir the to 
action. This suggestion is consistent with vork 
which indicates that males are highly con- 
cerned about achieving task success 


one 


Bardwick, 1971; McClelland, Atki ison, 
Clark, & Lowell, 1953; Uesugi & Vinacke, 
1963; Leventhal, Note 1). Their overriding 


concern with successful performance may 
predispose them to deviate markedly from 
equity in order to achieve success, Conse- 
quently, males may be more likely than fe- 
males to inflate the rewards of lazy 


Method 


Subjects. The subjects were 100 male and 100 fe- 
male volunteers from introductory Psychology classes 


workers. 


| 


$ 


i 


wyne State University who fulfilled a course 

ement by participating. They were tested in 

ranging in size from 4 to 10 participants. 

„cedure, The experimental procedures were gen- 
ily similar to those of Study 1. As before, sub- 
its were told to take the role of an industrial 
ultant and recommend bonus pay to workers 
ho were trying to meet a deadline. The workers’ 
sb was described in the same terms, and the same 
anus pay scale was used. 

Unlike Study 1, subjects allocated bonus pay only 
o a failing group. In addition, they rewarded the 
group as a whole instead of giving specific amounts 
to individual workers. Furthermore, the subjects 
were given no information about the performance 
of individual workers and were not even told how 
many workers were in the group. They knew only 
that the group was failing. In addition, both the 
deadline and group performance were described in 
somewhat different terms than in Study 1. ‘These 
changes in procedure appeared in the experimenter’s 
comments as follows: 


Your task will be to decide how much bonus pay 
to give to a group of workers, Bonuses, of course, 
are given in addition to regular pay. The group 
you'll be judging does get a regular weekly salary. 
Your decision doesn’t affect that. You'll just 
decide how much additional pay the group will 
get as a bonus. 

Tn the particular cases we'll deal with, you are 
to assume that the deadline that the group has to 
meet is a very reasonable one and that deadlines 
ind contracts of this type have often been handled 
effectively in the past. However, we want you to 
assume that in spite of the reasonableness of the 
deadline, the group you are judging is doing 
omewhat more poorly than most groups. So, 
there is a definite chance that the group will fail 
to meet the deadline if the workers continue to 
produce at the same rate. If the workers do not 
improve, their productivity will be too low and 
the group may fail to meet the deadline. 


Subjects were then instructed to allocate bonus 
two times, once when the group members were 
ied and discouraged and once when they were 

nd unconcerned, The experimenter explained 
rocedure as follows: 


vitants often know the reasons why work 
ire having difficulty and this affects the 
nis’ recommendations about bonus pay. So, 
make your recommendations we want 
asider several different reasons why the 
vy be having difficulty. We want to get 
pinion about two reasons for poor per- 
nance. So, we'll ask you to give bonus pay two 


when you 


independent variables. Worker motivation was 
oanipulated by providing two different reasons for 
‘oup failure. In one case, the group's poor perform- 
ance was attributed to the workers’ fear of not 
meeting the deadline (worried condition). Subjects 
were told that 
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the workers became worried and discouraged be- 
cause they felt they might not be able to get the 
job done. Their feelings of worry and discourage- 
ment have lowered their performance. 


In the second case, the group's poor performance 
was attributed to the workers’ laziness and lack of 
concern (lazy condition). Subjects were told that 


the workers are lazy and don’t care about get- 
ting the job done. Their feelings of laziness and 
Jack of concern have Jowered their performance. 


The subjects were then asked to study printed 
instructions concerning their goal as an allocator 
and then make their decisions about bonus pay. 
The manipulation of allocator’s goal was carried out 
exactly as in Study 1. Half of the subjects were 
assigned to an equity condition and half to a moti- 
vate condition, (A separate random order was 
used for subjects of each sex. 

Subjects recorded their responses about bonus pay 
jn two separate boxes, one representing the worried 
and discouraged workers and the other representing 
the lazy and unconcerned workers. The bonus pay 
scale was the same as in Study 1. After making their 
allocation decisions, subjects completed five question- 
naire items which measured their perception of the 
probable effect of bonus pay on the workers’ per- 
formance. The response scales used with these items 
consisted of a row of numbers ranging from 1 to 
25 with four equidistant verbal labels to define the 
points. After completing the questionnaire, subjects 
were asked to maintain secrecy about the experi- 
ment, thanked for their help, and dismissed. 


Results 


Allocation data. The subjects’ allocation of 
bonus pay was analyzed with a three-factor 
analysis of variance for experimental designs 
with repeated measures on one factor. The 
analysis is summarized in Table 4. The effect 
of information about worker motivation is 
assessed in the within-subjects portion of the 
table because both levels of that variable were 
administered to each subject. 

Table 5 shows the mean bonus pay sub- 
jects gave to the work group in each cell of 
the design. As expected, there was a signifi- — 
cant main effect of allocator’s goal. Subjects — 
of both sexes gave higher reward when they — 
were told to motivate the workers. Planned — 
comparisons revealed that this was true 
whether the workers’ poor performance was 
attributed to feelings of worry and discour- — 
agement [for males, F(1, 196) = 5.00, p< 
05: for females, F(1, 196) = 19.32); Dia 
.01] or to laziness and lack of concern [for _ 
males, F(1, 196) = 75.94, p< 0l; for fe- 
males, F(1, 196) = 27.27, p < 01]. ę 
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TABLE 4 


SUMMARY OF ANALYSIS OF VARIANCE FOR 
ALLOCATION RESPONSES 


Source of variance df MS F 


Between subjects 


Goal (A) 1 54,639.06 68.37* 
Sex (B) 1 1,105.56 <2 
AXB 1 217.56 <1 
Subjects within groups 196 799.16 
Within subjects 

Worker Motivation (C) 1 9,168.06 38.01* 
AXC 1 6,847.56 28.30% 
Bc 1 5,076.56 21.04* 
AXBXC 1 4,064.06 16.85* 
C X Subjects within 

groups 196 241,23 
*b <.01. SNS hers 


Subjects of both Sexes used overreward to 
motivate poor performers, However, the sig- 
nificant three-way interaction term in Table 
4 suggests that males were more strongly 
inclined to Overreward lazy workers, whereas 
females were more strongly inclined to over- 
reward worried workers, Further tests re- 
vealed that males and females differed sig- 
nificantly in the extent to which they raised 
the pay of lazy workers. The net increase was 
$39.50 for males and $23.80 for females, F(i, 
- In the case of wor- 


The significant three-way interaction in 
Table 4 also reflects the fact that in the 
males and females re- 
sponded similarly to information about moti- 
vational factors responsible for group failure 


group than the lazy 
196) = 30.30, p < 01; 
= 35.85, 


tion, females also gave higher reward to the 
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equity condition. Males in the mot ate con- 


dition gave higher reward to lazy workers 
than to worried workers, F(1, 196) = 15.43, 
É < .01. Thus, male allocators se ! willing 
to violate equity for the sake of n imizing 
productivity. Frequency counts rey ed that, 
in the motivate condition, 76% of males 
gave higher reward to lazy worker an to 
worried workers. In Contrast, 76% of the 
females in that condition gave high “ward 
to the worried workers. The mark; liffer- 
ence between males’ and females’ re onses 
in the motivate condition is further ected 
in the fact that males in this condit gave 
significantly more than females to | lazy 
workers, F(1, 196) = 16.09, p< .01 ad- 
dition, the difference between male and 
females’ responses to the worried kers 
reached a borderline level of signii nce, 
F(1,196) = 3.64, p < .10. Thus, mal ind 
females both used the tactic of overy. ud, 
but they differed in their response to i for- 
mation about the motivational factor re- 
sponsible for failing performance. 
Questionnaire data. One questionnaire om 
concerned the necessity of using mon ry 
incentives to improye worker perform 
The item asked, “In each case, to wha: ox- 
tent do the workers need an incentive ch 
as bonus pay to help them improve their \ irk 
and meet the deadline?” Subjects respo: ed 
on two separate scales, one for lazy wor ers 


and one for worried workers, The respcse 
scales ranged from “not at all” (1) through 
“to a great extent” (25), 


TABLE 5 


MEAN Bonus Pay (ty Dotrars) GIVEN TO TH 
ROUP AS A FUNCTION OF ALLOCATOR’s 
Goat, SEX or ALLOcATOR, AND 
Worker Motivation 


Motivational state 
causing poor 
performance 


Allocator's 
goal and sex Lazy Worried 
Equity 
Male 23.00 40.10 
Female 20.40 39.00 
Motivate 
Male 62.50 50.30 
Female 44.20 59.00 


Note. Bonus Day allocat 


ti 7 
$5 steps. (In each cell, n ao range from $0 to $100 in 


n responses to this item are presented 
: 6, Responses were analyzed with a 


three-factor analysis of variance for experi- 
mental designs with repeated measures on one 
fector. Only the effect of worker motivation 


y significant, F(1, 196) = 31.38, p < 01. 

Ail subjects believed it was more necessary to 

use monetary incentives for lazy workers than 

for worried workers. Thus, male and female 
subjects both believed it was necessary to use 

a lucrative extrinsic reward to motivate lazy, 

‘concerned workers. However, only male 

subjects actually gave higher reward to lazy 

workers than to worried workers in the moti- 

vate condition (see Table 5). 

Four questions asked subjects to estimate 
the probable effect on worker performance of 
giving high or low bonus pay. The questions 
formed two pairs of items with one item in 
each pair asking subjects to estimate the 
effect of giving high bonus pay and the other 
item asking subjects to estimate the effect 
of giving low bonus pay. The primary pur- 
pose of these questions was to measure sub- 

» jects’ beliefs about the relative superiority of 
the overreward tactic. 

In one pair of questions the high-pay 
item asked, “In each case, what would be the 
probable effect of giving the workers very 
high bonus pay?” Subjects responded on two 
separate scales, one for lazy workers and one 
for worried workers. The response scales 
ranged from “very high bonus pay would 
decrease their performance greatly” (1) to 
“very high bonus pay would increase their 
performance greatly” (25). The low-pay item 
was similar in all respects except that the 


TABLE 6 


PERFORMANCE 


Mot ional state 
causing poor 
performance 


Allocator's 

1 sex Lazy Worried 
Equity 

Aale 18.26 15.00 
Female 17.36 14.26 

Motivate 
Male 20.18 15.10 
Female 18.50 13.98 


„ Note, Higher scores indicate greater necessity. (In each cell, 
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TABLE 7 
Estimatep Errect or Hicn anp Low Bonus Pay 


Stated level of pay and motivational 
state causing poor performance 


Allocator's High pay Low pay 
goal and 3 
sex Lazy Worried Lazy Worried 

Equity 

Male 16.32 17.28 9.78 11.98 

Female 16.24 18.34 7.58 12.84 
Motivate 

Male 17.80 17.16 9.30 13.24 

Female 16.98 16.36 7.712 11.84 


a le values greater than 13 indicate improved per- 
formance; those below 13 indicate poorer performance, (In 
each cell, n = 50.) 


term low was substituted for Aigh in the ques- 
tion and in the response scales. For purposes 
of statistical analysis, the high-pay and low- 
pay items were treated as two levels of a 
single independent variable with repeated 
measurement. The results were analyzed with 
a four-way analysis of variance for experi- 
mental designs with repeated measures on two 
factors, namely, stated level of pay (high-pay 
item versus low-pay item) and worker moti- 
vation (lazy versus worried). 

Mean responses to this pair of items are 
shown in Table 7. The main effect for the 
high-pay versus low-pay item was significant, 
F(1, 196) = 375.47, p < .01. All subjects be- 
lieved that giving high pay would elicit better 
performance. Mean responses to the high-pay 
item were in the vicinity of 17, a scale value 
which corresponded to the verbal label stat- 
ing that “very high bonus pay would increase 
performance somewhat.” 

Comparable results were obtained on the 
second pair of questions in which the high- 
pay item asked: “In each case, to what 
extent would the workers be likely to im- 
prove their performance if they were given 
very high bonus pay?” Response scales for 
the lazy workers and for the worried workers 
ranged from “not at all” (1) through “to a 
great extent” (25). The low-pay item was 
similar in all respects except that the term 
low was substituted for high in the question 
and in the response Seales: Mean responses 
to this pair of items are presented in Table 8. 
AJl subjects believed that giving high pay 

mprove workers’ perform- 


likely to i ; 
e ats Jow pay [for the main effect 
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TABLE 8 


PERCEIVED LIKELIHOOD oF IMPROVEMENT As A 
Function E 


Stated level of pay and motivational 
state causing poor performance 


Allocator’s High pay Low pay 
goal and 
sex Lazy Worried Lazy Worried 

Equity 

Male 11.86 3.86 8.26 

Female 12.46 6.46 9.88 
Motivate 

Male 15.80 16.40 36 10.74 

Female 14.66 16.14 5.20 10,32 


Nole, Highe dres indicate greater likelihood of improve- 
ment, (In each cell, n = 50.) 


of high-pay versus low-pay, F (1,196) = 
296.66, p < 01]. 

The analyses of 
7 and 8 yielded 
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bonus pay constituted a more effective , 
method for eliciting improved performance 
than giving low bonus pay. 

In the equity condition, subicsts gave 
higher reward to worried worl nan lazy 
workers. Apparently, the worried workers? 
greater concern about doing well mage them 
seem more deserving. This result consis- 
tent with other research Which indic ies that 
supervisors give higher reward to wor} rs who 
hold favorable attitudes toward + jobs 
(Goodstadt & Kipnis, 1970; Kipnis ‘osen- 
tino, 1969). Tt is also consistent wit} udies 
which show that allocators’ use of p rd is 
often based on their attribution of vers? 
motivation (Lanzetta & Hannah 969; 
Weiner & Kukla, 1970). 

In the motivate condition, the ects’ 
responses were more complex, Both m and 
females believed an incentive such a mus 
Pay was more necessary to motivay lazy 
workers than worried workers, Presu ably, 
all subjects felt one must increase lazy ork- 
ers’ extrinsic motivation in order to ffset 
their low level of intrinsic motivation, ow- 
ever, in the motivate condition, only < ales 


actually gaye higher reward to lazy w 


gave higher reward to the worried wo: srs, 
One factor which may underlie this diffe ice 
between male and female allocators ir ihe 
motivate condition is g sex difference in 
achievement orientation, An extensive |). ya- 


often place hi 


1953). In the present 


fore more willing to deviate from equity in 
order to achieve success, Males? Overridi 


3 Two additional significant effects were present 


only in Table 8. There was a significant main effect 
of allocator's goal, F(1, 196) = 7.04, p< -01, and one 
Significant three-way interaction effect, Allocator’s 
Goal X Worker Motivation X Stated Level of Pay, 
EC, 196) = 5.67, ’<.05. i 


AN 
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ac g success took precedence over the 
r nents of fairness. Females, on the 
€ hand, being less concerned about 
ving success, were less willing to vio- 
the equity norm in order to elicit better 

k, For females, the requirements of fair- 

s took precedence over the requirements 

task success, and they were not willing to 
ve more reward to lazy workers than to 
orried workers. 

Another possibility is that the difference 
‘etween the males’ and females’ use of the 
verreward tactic was due to a more nurtu- 
vant orientation on the part of females. In 
the motivate condition, compared with males, 
females gave higher pay to worried workers. 
This trend may reflect an attempt by females 
to calm worried workers and alleviate their 
feelings of worry and discouragement. By giv- 
ing high reward, a female allocator could 
demonstrate her concern about the workers’ 
welfare and show that she was not going to 
punish them. A female allocator might assume 
that such reassurance would have a thera- 
peutic effect on worried workers and reduce 
the emotional responses that were disrupting 
their performance, This line of reasoning is 
consistent with the view that females use 
rewards to comfort individuals who feel badly 
about doing poor work (Leventhal, Popp, & 
Sawyer, 1973) and with studies that indicate 
that females tend to be more nurturant than 
males (e.g, Bardwick, 1971; Kagan, 1964; 
Terman & Tyler, 1954). 


l 
3 
l 


GENERAL DISCUSSION 


We have investigated the strategies that 
locators use to motivate better performance. 
he results indicate that college students 
prefer the strategy of overreward to a more 
punitive tactic of reducing reward. Question- 
naire data provided direct evidence that sub- 
jects believe overreward is a more effective 
method than reward reduction for motivating 
failing performers. In addition, subjects who 
were told to motivate failing performers gave 
them higher reward than subjects who were 
told to maintain equity. Under certain condi- 
tions, the tendency to overreward was s0 
strong that subjects in the motivate condition 
clearly violated the equity norm. Thus, in 
Study 1, subjects who attempted to motivate 
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the workers gave more to members of a fail- 
ing group than to members of a successful 
group. In Study 2, males who attempted to 
motivate failing performers gave more to 
lazy workers than to highly motivated work- 
ers, In each of these cases, the requirement 
of eliciting better performance clearly took 
precedence over considerations of fairness, 
and subjects gave higher reward to performers 
with lower task inputs. 

Our results indicate that allocators are 
selective in their use of overreward to moti- 
vate good performance. For example, subjects 
in the motivate condition of Study 1 concen- 
trated their use of overreward primarily on 
the below-average member of the failing 
group. In addition, they did not overreward 
members of a successful group. 

An allocator may hesitate to overreward 
receivers with adequate performance because 
he or she wants to sustain their interest in 
future rewards. Giving too much reward too 
soon could undermine the value of that re- 
ward (Blau, 1964; Gewirtz, 1967; Homans, 
1961). A principle of diminishing marginal 
utility may operate such that the more re- 
ward receivers obtain, the less value each 
additional unit of that reward has for them. 
To the extent that allocators recognize this 
possibility, they may hesitate to give too 
much reward to good performers because of 
the desire to preserve their power to control 
receivers through promises of additional re- 
ward, 

Our results suggest that many allocators 
prefer to treat a failing performer generously 
because they believe overreward can stimu- 
late better performance. However, it is likely 
that an allocator will use the overreward 
strategy only for relatively short periods and — 
switch to other tactics if the failing performer — 
shows no improvement. If a failing performer 
is unresponsive to overreward, the allocator 
would probably conclude that the overreward 
tactic was ineffective under existing condi- 
tions, The allocator might then become con- 
cerned that further overreward would only 
reinforce and strengthen an existing pattern 
of poor performance. Consequently, the allo- 
cator might then resort to other methods for 
controlling the performer’s behavior. In fact, 
research by Greenberg (1975) indicates it is 


i 
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then that the allocator would adopt the puni- 
tive tactic of reducing the performer’s re- 
wards. 


REFERENCE NOTE 
1. Leventhal, G. S. Reward allocation by males and 
females. In L. Messé (Chair), Individual difer- 
ences in equity behavior. Symposium presented 
at the meeting of the American Psychological 
Association, Montreal, August 1973. 
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A field experiment was conducted to assess the effects of enhanced personal 
responsibility and choice on a group of nursing home residents. It was ex- 
pected that the debilitated condition of many of the aged residing in institu- 
tional settings is, at least in part, a result of living in a virtually decision-free 
environment and consequently is potentially reversible. Residents who were 
in the experimental group were given a communication emphasizing their re- 
sponsibility for themselves, whereas the communication given to a second group 
stressed the stafi’s responsibility for them. In addition, to bolster the commu- 
nication, the former group was given the freedom to make choices and the 
responsibility of caring for a plant rather than having decisions made and the 
plant taken care of for them by the staff, as was the case for the latter group. 
Questionnaire ratings and behavioral measures showed a significant improve- 
ment for the experimental group over the comparison group on alertness, active 
participation, and a general sense of well-being. 


The transition from adulthood to old age 
is often perceived as a process of loss, physio- 


~ logically and psychologically (Birren, 1958; 


Gould, 1972). However, it is as yet unclear 
just how much of this change is biologically 
letermined and how much is a function of the 
environment, The ability to sustain a sense of 
personal control in old age may be greatly in- 
luenced by societal factors, and this in turn 
may affect one’s physical well-being. 
Typically the life situation does change in 
aye. There is some loss of roles, norms, 
and reference groups, events that negatively 
influence one’s perceived competence and 
«ling of responsibility (Bengston, 1973). 
‘ption of these changes in addition to 
| physical decrements may enhance a 
seuce of aging and lower self-esteem (Lehr & 
Pu 


old 


, Note 1), In response to internal 
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developmental changes, the aging individual 
may come to see himself in a position of les- 
sened mastery relative to the rest of the 
world, as a passive object manipulated by the 
environment (Neugarten & Gutman, 1958). 
Questioning whether these factors can be 
counteracted, some studies have suggested 
that more successful aging—measured by 
decreased mortality, morbidity, and psycho- 
logical disability—occurs when an individual 
feels a sense of usefulness and purpose 
(Bengston, 1973; Butler, 1967; Leaf, 1973; 
Lieberman, 1965). 

The notion of competence is indeed central — 
to much of human behavior. Adler (1930) — 
has described the need to control one’s per- 
sonal environment as “an intrinsic necessity 
of life itself” (p. 398). deCharms (1968) has 
stated that “man’s primary motivation pro- 
pensity is to be effective in producing changes 
in his environment, Man strives to be a 
causal agent, to be the primary locus of, 
causation for, or the origin of, his behavior; 
he strives for personal causation’ (p. 269). 

Several laboratory studies have demons 
strated that reduced control over ‘aversive 
outcomes increases physiological distress ace 

5 i & Gatchel, 1970; 
anxiety (Geer, Davison, sical percep 
Pervin, 1963) and even a nonveridicat p 


192 ELLEN J. LANGER AND JUDITH RODIN 


tion of control over an impending event Te- important Psychological factors hould pe 
duces the aversiveness of that event (Bowers, linked to health and survival. Th a series of "ee 
1968; Glass & Singer, 1972 ; Kanfer & Seid- retrospective studies, Schmale ang S associ. li 
ner, 1973), Langer, Janis, and Wolfer (1975) ates (Adamson & Schmale, 196: 

found that by inducing the Perception of con- 1958; Schmale & Iker, 1966) fou: 
trol over stress in hospital patients by means Cerative colitis, leukemia, cervical ç 
of a communication that emphasized poten- heart disease were linked with a 
tial cognitive control, subjects requested helplessness and loss of hope expe 
fewer pain relievers and sedatives and were the patient prior to the onset of tł 


seen by nurses as evidencing less anxiety, Seligman and his co-workers hay: 
Choice is also a crucial variable in enhanc- matically investigated the learning help. 
ing an induced sense of control, Stotland and lessness and related it to the clir syn- 
Blumenthal (1964) studied the effects of drome of depression (see Seligman, 75). 
choice on anxiety reduction, They told sub- Even death is apparently related to trol- 
jects that they were going to take a number relevant variables. McMahon and | dick 
of important ability tests. Half of the sub- (1964) found a relationship between res~ 
jects were allowed to choose the order in sion or hopelessness and death. The nost 
which they wanted to take the tests, and half graphic description of this association nes 
were told that the order was fixed. All sub- from Bettelheim (1943), who in his m ysis f 
jects were informed that the order of the of the “Muselmanner,” the walking « ses 
tests would have no bearing on their Scores. in the concentration camps, described . em 


They found that subjects not given the choice as: 


Sweating. In another study of the effects of Prisoners who came to believe the repeated ite- 
fa (1970) told their ments of the Suards—that there was no ho or 


Dee $ them, that they would never leave the cam { 
subjects that there Were two conditions in cept as a corpse—who came fo feel that > 


the experiment, each of which would be sig- environment was one over which they could , 
naled by a different light. In one condition cise no influence whatsoever, |.. Once his 


ently by increasing Rea EOL AI Danih Smitty followed and, according | 
though using a very different paradigm, Bettelheim, “a 


Portance of choice. Tn that study it was found [survival] depended on one’s ability to arrange to 

that the exercise of choice in a chance situa- Preserve some areas of independent action, t 

tion, where choice Was objectively inconse. ne Son ECOL Oe eio aportant aspects of one's l 
‘ i e despite an environment that seemed over . 

quential, nevertheless had Psychological con- whelming and total, 

Sequences manifested jn increased confidence 

and risk taking, Bettelheim’s description Teminds us of Rich- 


and a Positive role in Sustaining life” (P. the environment and by intrinsic changes that 
424). It is not Surprising, then, that these occur with increasing old age—may contri- 


sychological withdrawal, physical 
and death. In contrast, objective 
id feelings of mastery may very 


we! ‘jbute to physical health and per- 
SOF ncacy. 

ı study conceived to explore the effects 
of Ussonance, Ferrare (1962; cited in Selig- 


m 1975; Zimbardo & Ruch, 1975) pre- 
se ied data concerning the effects of the 
al iity of geriatric patients to control their 
place of residence, Of 17 subjects who an- 
s: cred that they did not have any other 
alternative but to move to a specific old 
e home, 8 died after 4 weeks of residence 
ind 16 after 10 weeks of residence. By com- 
parison, among the residents who died dur- 
ing the initial period, only one person had 
answered that she had the freedom to choose 
other alternatives. All of these deaths were 
classified as unexpected because “not even 
insignificant disturbances had actually given 
warning of the impending disaster.” 

As Zimbardo (Zimbardo & Ruch, 1975) 
suggested, the implications of Ferrare’s data 


_are striking and merit further study of old age 


home settings. There is already evidence that 
perceived personal control in one’s residential 
environment is important for younger and 
noninstitutional populations. Rodin (in 
press), using children as subjects, demon- 
strated that diminished feelings of control 
produced by chronic crowding at home led 
to fewer attempts to control self-reinforce- 
ment in the laboratory and to greater likeli- 
hood of giving up in the face of failure. 
The present study attempted to assess di- 
rectly the effects of enhanced personal respon- 
sibility and choice in a group of nursing home 
patients. In addition to examining previous 
results from the control-helplessness literature 
in a field setting, the present study extended 
the domain of this conception by considering 
new response variables. Specifically, if in- 
creased control has generalized beneficial 
effects, then physical and mental alertness, 
activity, general level of satisfaction, and 
sociability should all be affected. Also, the 
manipulation of the independent variables, 
assigning greater responsibility and decision 
freedom for relevant behavior, allowed sub- 
jects real choices that were not directed 
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toward a single behavior or stimulus condi- 
tion. This manipulation tested the ability of 
the subjects to generalize from specific choices 
enumerated for them to other aspects of their 
lives, and thus tested the generalizability of 
feelings of control over certain elements of 
the situation to more broadly based behavior 
and attitudes, 


METHOD 
Subjects 


The study was conducted in a nursing home, 
which was rated by the state of Connecticut as 
being among the finest care units and offering qual- 
ity medical, recreational, and residential facilities. 
The home was large and modern in design, appear- 
ing cheerful and comfortable as well as clean and 
efficient. Of the four floors in the home, two were 
selected for study because of similarity in the resi- 
dents’ physical and psychological health and prior 
socioeconomic status, as determined from evaluations 
made by the home’s director, head nurses, and social 
worker. Residents were assigned to a particular floor 
and room simply on the basis of availability, and on 
the average, residents on the two floors had been at 
the home about the same length of time. Rather 
than randomly assigning subjects to experimental 
treatment, a different floor was randomly selected 
for each treatment, Since there was not a great deal 
of communication between floors, this procedure was 
followed in order to decrease the likelihood that the 
treatment effects would be contaminated. There were 
8 males and 39 females in the responsibility-induced 
condition (all fourth-floor residents) and 9 males 
and 35 females in the comparison group (all second- 
floor residents). Residents who were either com- 
pletely bedridden or judged by the nursing home 
staff to be completely noncommunicative (11 on the 
experimental floor and 9 on the comparison floor) 
were omitted from the sample, Also omitted was 
one woman on each floor, one 40 years old and the 
other 26 years old, due to their age. Thus, 91 ambu- 
latory adults, ranging in age from 65 to 90, served 
as subjects. 


Procedure 


To introduce the experimental treatment, the 
nursing home administrator, an outgoing and 
friendly 33-year-old male who interacts with the 
residents daily, called a meeting in the lounge of 
each floor. He delivered one of the following two 
communications at that time: 


[Responsibility-induced group] I brought you 
together today to give you some information 
about Arden House. I was surprised to learn that 
many of you don’t know about the things that 
are available to you and more important, that 
many of you don’t realize the influence you have 
over your own lives here. Take a minute to think 
of the decisions you can and should be making. 


$ 
fs 
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For example, you have the responsibility of caring 
for yourselves, of deciding whether or not you 
want to make this a home you can be proud of 
and happy in. You should be deciding how you 
want your rooms to be arranged—whether you 
want it to be as it is or whether you want the 
staff to help you rearrange the furniture. You 
should be deciding how you want to spend your 
time, for example, whether you want to be visit- 
ing your friends who live on this floor or on other 
floors, whether you want to visit in your room 
or your friends’ room, in the lounge, the dining 
room, etc., or whether you want to be watching 
television, listening to the radio, writing, reading, 
or planning social events. In other words, it’s 
your life and you can make of it whatever you 
want. 

This brings me to another point. If you are 
unsatisfied with anything here, you have the in- 
fluence to change it. It’s your responsibility to 
make your complaints known, to tell us what you 
would like to change, to tell us what you would 
like. These are just a few of the things you could 
and should 


now. 

We're thinking of instituting some way for air- 
ing complaints, suggestions, etc. Let [nurse’s name] 
know if you think this is a good idea and how 
you think we should go about doing it, In any 
case let her know what your complaints or sug- 
gestions are, 


select a plant.] The plants are 
take care of as you'd like. 
One last thing, I wanted to tell 


you that we're 
showing a movie two nights next 


[Comparison 
today to give 
Arden House. 


For example, 
the other floors and to use the 
lounge on this floor for visiting as well as the 
dining 
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proud of and happy in, and we w 
can to help you. 

This brings me to another po; 
any complaints or suggestions 
[nurse’s name] know what they arc 
how we can best help you. You s 
you have free access to anyone o. 


we will do the best we can to proy individual. 
ized attention and time for you j 
Also, I wanted to take this opporion ty to giy, 


you each a present from the Arder use, 


nurse walked around with a box of and each. 
patient was handed one.] The plani yours to 
keep. The nurses will water and for them 
for you. 

One last thing, I wanted to tell y iat we're | 
showing a movie next week on Th lay and 
Friday. We'll let you know later which .’,y you're 


scheduled to see it, 


The major difference between the two « 


tions was that on one floor, the emphasis on the 
residents’ responsibility for themselves, \, eas on 
the other floor, the communication sty. sed the 
Staff’s responsibility for them, In additic several 
other differences bolstered this treatment: 'esidents 
in the responsibility-induced group were = sked to 
give their opinion of the means by wh com- 
plaints were handled rather than just b g told 
that any complaints would be handled staff 
members; they were given the Opportunity select 
their own plant and to care for it themsely rather 
than being given a plant to be taken Care of by 


someone else; and they were given their < e of 


ticu- 
lar night, as was typically the case in the 1 age 
home, However, there was no difference the 
amount of attention paid to the two groups 
Three days after these communications h en 
delivered, the director visited all of the resic in 
their rooms or in the corridor and reiterate art 
of the previous message. To those in the re i- 
bility-induced group he said, “Remember I 
said last Thursday, We want you to be happy at 
this like your own home and make all the dec 1s 


To the residents of the comparison floor, he d 


the same thing omitting the statement about 


: decis 
making, 


Dependent Variables 


Questionnaires, Two 
designed to assess the 
bility. Each was administered 1 Week prior to and 
3 weeks after the Communication, The first 
administered directly to the residents 
research assistant 


Was 
by a female 
who was unaware of the experi- 
experimental! 


munica- 
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TABLE 1 
MIKAN SCORES FOR SELE-REPORT, INTERVIEWER RATINGS, AND Nurses’ RATINGS FOR 


EXPERIMENTAL AND 


Comparison GROUPS 


Com- 
sa 
Responsibility induced (» = 24) Comparison (n = 28) zon of 
change 
Change: Change: cores 
e responses Pre Post Post-Pre Pre Post Post-Pre (p<) 
2 bei 
516 5.44 28 490 478  —.12 05 
RH 207 427 20 300 262 —1.28 ‘01 
} ceived Control 
326 342 16 3.02 403 Al on 
Want Sos 3 80 E 440 457 7 = 
Interviewer rating 
3 502 5.31 29 iy Ge SER 025 
Nt ratings 
General improvement aor 45:04. 3.97 42.69 4032  —2.39 005 
spent 
siting patients 13.03 19.81 6.78 e E .005 
EERS 1150 1375 244 1738 821 «= —4.16 ‘05 
Walking to staff 831 1643. EERI] O11 1071 1.61 ‘01 
Watching staff 678 46 2.14 696 1160 4.64 ‘05 


view, 
8-point 


aire was responded to by 


The second questior 
the nurses, who staffed the experimental and com- 
parison floors and who were unaware of the experi- 


mental treatments. Nurses on two different shifts 
completed the questionnaires in order to obtain two 
ratings for each subject. There were nine 10-point 
scales that asked for ratings of how happy, alert, 
dependent, sociable, and active the residents were 
as well as questions about their eating and sleeping 
habits. There were also questions evaluating the 
proportion of weekly time the patient spent engaged 
in a variety of activities. These included reading, 
watching television, visiting other patients, visiting 
outside guests, watching the staff, talking to the 
staff, sitting alone doing nothing, and others. 

Behavioral measures. Since perceived personal con- 
trol is enhanced by a sense of choice over relevant 
behaviors, the option to choose which night the 
experimental group wished to see the movie was 
expected to have measurable effects on active par- 
ticipation, Attendance records were kept by the 
occupational therapist, who was unaware that an 
experiment was being conducted, 

Another measure of involvement was obtained by 
holding a competition in which all participants had 
to guess the number of jelly beans in a large jar. 
Each patient wishing to enter the contest simply 
| wrote his or her name and estimate on a piece of 

paper and deposited it in a box that was next to 
the jar.t 

Finally, an unobtrusive measure of activity was 
taken. The tenth night after the experimental treat- 
ment, the right wheels of the wheelchairs belonging 


to a randomly selected subsample of each patient 
group were covered with 2 inches (.05 m) of white 
adhesive tape. The following night, the tape was 
removed from the chairs and placed on index cards 
for later evaluation of amount of activity, as indi- 
cated by the amount of discoloration. 


RESULTS 


Questionnaires. Before examining whether 
or not the experimental treatment was ef- 
fective, the pretest ratings made by the sub- 
jects, the nurses, and the interviewer were 
compared for both groups. None of the differ- 
ences approached significance, which indicates 
comparability between groups prior to the 
start of the investigation. 

The means for responses to the various 
questionnaires are summarized in Table 1. 
Statistical tests compared the posttest minus 
pretest scores of the experimental and com- 
parison groups. 

In response to direct questions about how 
happy they currently were, residents in the 


1We also intended to measure the number of 
complaints that patients voiced. Since one often does 
not complain after becoming psychologically help- 
less, complaints in this context were expected to be 
a positive indication of perceived personal control, 
This measure was discarded, however, since the 


nurses failed to keep @ systematic written record. 
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responsibility-induced groupexeported signifi- 
cantly greater increases in happiness after 
the experimental treatment than did the com- 
parison group, ¢(43) = 1.96, p< .052 Al- 
though the comparison group heard a com- 
munication that had specifically stressed the 
home’s commitment to making them happy, 
only 25% of them reported feeling happier 
by the time of the second interview, whereas 
48% of the experimental group did so. 

The responsibility-induced group reported 
themselves to be significantly more active on 
the second interview than the comparison 
group, (43) = 2.67, p < .01. The interview- 
er’s ratings of alertness also showed signifi- 
cantly greater increase for the experimental 
group, ¢(43) = 2.40, p < .025. However, the 
questions that were relevant to perceived 
control showed no significant changes for the 
experimental group. Since over 20% of the 
patients indicated that they were unable to 
understand what we meant by control, these 
questions were obviously not adequate to dis- 
criminate between groups. 

The second questionnaire measured nurses? 
ratings of each patient. The correlation be- 
tween the two nurses’ ratings of the same 
patient was .68 and .61 (ps < .005) on the 
comparison and responsibility-induced floors, 
respectively.’ For each patient, a score was 
calculated by averaging the two nurses’ rat- 
ings for each question, summing across ques- 
tions, and subtracting the total pretreatment 
score from the total posttreatment score 
This yielded a positive average total change 
score of 3.97 for the responsibility-induced 
group as compared with an average negative 
total change of —2.37 for the comparison 
group. The difference between these means is 
highly significant, #(50) = 5.18, p < .005. If 
one looks at the percentage of people who 
were judged improved rather than at the 
amount of judged improvement, the same 

pattern emerges: 93% of the experimental 
group (all but one subject) were considered 
improved, whereas only 21% (six subjects) 
f the comparison group showed this positive 
nge (x? = 19.23, p < ,005). 
e nurses’ evaluation of the proportion of 
ubjects spent engaged in various inter- 
ind noninteractive activities was ana- 


active 
lyzed by comparing the average change scores 


tim: 
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(post-precommunication) for all of the nurses 
for both groups of subjects on each activity. 
Several significant differences were found. The 
experimental group showed increases in the 
proportion of time spent visiting with other 
patients (for the experimental group, X = 
12.86 vs. —6.61 for the comparison group), 
t(50) = 3.83, p < .005; visiting people from 
outside of the nursing home (for the experi- 
mental group, X = 4.28 vs. —7.61 for the 
comparison group, ¢(50) = 2.30, p< .05; 
and talking to the staff (for the experimental 
group, X¥ = 8.21 vs. 1.61 for the comparison 
group), #(50) = 2.98, p < .05.° In addition, 
they spent less time passively watching the 
staff (for the experimental group, X = —4.28 
vs. 9.68 for the comparison group), ¢(50) = 
2.60, p < .05. Thus, it appears that the treat- 
ment increased active, interpersonal activity 
but not passive activity such as watching tele- 
vision or reading. 

Behavioral measures. As in the case of the 
questionnaires, the behavioral measures 
showed a pattern of differences between 
groups that was generally consistent with the 
predicted effects of increased responsibility. 
The movie attendance was significantly higher 
in the responsibility-induced group than in 
the control group after the experimental 
treatment (z = 1.71, p < .05, one-tailed), al- 
though a similar attendance check taken one 
month before the communications revealed 
no group differences.’ 


2 All of the statistics for the self-report data and 
the interviewers’ ratings are based on 45 subjects (25 
in the responsibility-induced group and 20 in the 
comparison group), since these were the only sub- 
jects available at the time of the interview, 

3 There was also significant agreement between the 
interviewer’s and nurses’ ratings of alertness (r= 
65). 

4 Since one nurse on the day shift and one nurse 
on the night shift gave the ratings, responses to the 
questions regarding sleeping and eating habits were 
not included in the total score. Also, in order to 
reduce rater bias, patients for whom there were 
ratings by a nurse on only one shift were excluded 
from this calculation, This left 24 residents f:om the 
experimental group and 28 from the comparison 
group. 

5 This statistic is based only on the responses of 
nurse on duty in the evening, 

8 Frequencies were transformed into arc sines and 
analyzed using the method that is essentially the 
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A FIELD EXPERIMENT IN AN INSTITUTIONAL SETTING 


In the jelly-bean-guessing contest, 10 sub- 
jects (21%) in the responsibility-induced 
group and only 1 subject (2%) from the 
comparison group participated (x° = 7.72, p 
< .01). Finally, very little dirt was found on 
the tape taken from any of the patients’ 
wheelchairs, and there was no significant dif- 
ference between the two groups. 


Discussion 


It appears that inducing a greater sense of 
personal responsibility in people who may 
have virtually relinquished decision making, 
either by choice or necessity, produces im- 
provement. In the present investigation, pa- 
tients in ‘the comparison group were given a 
communication stressing the staff’s desire to 
make them happy and were otherwise treated 
in the sympathetic manner characteristic of 
this high-quality nursing home. Despite the 
care, provided for these people, 71% were 
rated as having become more debilitated over 
a period of time as short as 3 weeks. In con- 
trast with this group, 93% of the people who 
were encouraged to make decisions for them- 
selves, given decisions to make, and given 
responsibility for something outside of them- 
selves, actually showed overall improvement. 
Based on their own judgments and by the 
judgments of the nurses with whom they 
interacted on a daily basis, they became more 
active and felt happier. Perhaps more im- 
portant was the judged improvement in their 
mental alertness and increased behavioral in- 
volyement in many different kinds of activi- 
ties. 

The behavioral measures showed greater 
active participation and involvement for the 
experimental group. Whether this directly 
resulted from an increase in perceived choice 
and decision-making responsibility or- from 
the increase in general activity and happiness 
occurring after the treatment cannot be as- 
sessed from the present results. It should also 
be clearly noted that although there were 
significant differences in active involvement, 
the overall level of participation in the ac- 
tivities that comprised the behavioral mea- 
sures was low. Perhaps a much more powerful 


same as that described by Langer and Abelson 
(1972). 
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treatment would be one that is individually 
administered and repeated on several occa- 
sions, That so weak a manipulation had any 
effect suggests how important increased con- 
trol is for these people, for whom decision 
making is virtually nonexistent. 

The practical implications of this experi- 
mental demonstration are straightforward. 
Mechanisms can and should be established 
for changing situational factors that reduce 
real or perceived responsibility in the elderly. 
Furthermore, this study adds to the body of 
literature (Bengston, 1973; Butler, 1967; 
Leaf, 1973; Lieberman, 1965) suggesting that 
senility and diminished alertness are not an 
almost inevitable result of aging. In fact, it 
suggests that some of the negative conse- 
quences of aging may be retarded, reversed, 
or possibly prevented by returning to the 
aged the right to make decisions and a feeling 
of competence. 


REFERENCE NOTE 


1. Lehr, K., & Puschner, I. Studies in the awareness 
of aging. Paper presented at the 6th International 
Congress on Gerontology, Copenhagen, 1963. 
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y, A study of the perceptions of typical social episodes was carried out in two 

m different subcultural groups, housewives and students. First, a sample of rep- 
4 y resentative episodes was obtained using an open-ended questionnaire. A second 
’ sample of subjects from each group was required to perform a similarity- 
sorting task of the most frequently mentioned social episodes. A measure of 
psychological relatedness, based on the sorting task, was used as input to a 
multidimensional scaling and a hierarchical clustering analysis of the episode 
space. In the scaling solution, perceived intimacy and subjective self-confidence 
were the two most important attributes differentiating episodes; the clustering 
solutions also highlighted the nature of the relationship to the interaction 
Á partner. The results are discussed in terms of the differences between the ob- 
jective social environments of the two subsamples and the relative adequacy 

of the dimensional and categorical solutions. The utility of analyzing subjective 

and consensual representations of social episodes as a basis for a taxonomic 


vy 
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system is also discussed. 


In recent years, dissatisfaction with atom- 
istic and often irrelevant experimentation in 
social psychology (Argyle, 1969; Armistead, 
1974; Harré & Secord, 1972) led to growing 
interest in “natural” units of social behavior, 
such as social episodes. Although the term so- 
cial episode has been widely invoked in differ- 
ent descriptive and analytic works concerned 
with social behavior, it has not yet found its 
way into empirical research, This is probably 
due to the considerable conceptual ambiguity 
surrounding the term and the absence of an 
acceptable taxonomy which would enable in- 
vestigators to classify interaction episodes on 
the basis of empirically quantifiable character- 
istics. Not surprisingly, calls for taxonomic 
studies of situational characteristics have in- 
tensified in recent years, both from personality 
theorists (Bowers, 1973; Mischel, 1973) and 
social psychologists (Argyle & Little, 1972; 
Frederiksen, 1972; Magnusson, 1971; Moos, 
1973). The aim of the present study is to pro- 
pose a definition of social episodes as cultural 
representations of recurring interaction se- 
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quences, and to explore an empirical method 
of classifying social episodes within a defined 
social and cultural milieu on the basis of in- 
dividuals’ perceptions of them. 

Social episodes are interaction sequences 
which constitute natural units in the stream 
of behavior (Dickman, 1963; Harré & Secord, 
1972) and are distinguishable on the basis of 
symbolic, temporal, and often physical bound- 
aries, More importantly, however, there is a 
shared, consensual representation in the given 
culture about what constitutes an episode and 
which norms, rules, and expectations apply. 
This definition of episodes as cultural objects 
is rooted in the symbolic interactionist tradi- 
tion (Stone & Farberman, 1970). It is also 
closely related to some other definitions of 
interaction units, such as Goffman’s (1963) 
“focused interaction”, Watson and Potter’s 
(1962) “episode”, Barker and Wright’s 
(1955) “behavior units”, Bjerg’s (1968) 
“agons”, and Scheflen’s (1964) idea of a 
“presentation”. 

The present definition of social episodes im- 
plies that individual members of a given 
cultural background should have an internal 
representation of the social episodes practiced 
in their environment. The term episode struc 
ture will be used to describe the perceived p; 
terned relationship between different kin 
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social encounters within the cultural milieu. 
Our task in deriving a meaningful taxonomy 
of episodes is thus to explicate subjectively 
perceived differences between episodes com- 
posing the episode structure, 

A perceptual taxonomy of episodes has im- 
portant advantages when contrasted with 
some more traditional approaches (Ekeham- 
mar, 1974), such as a priori category systems 
(Sells, 1963; Sherif & Sherif, 1969), classifica- 
tion systems developed in terms of need con- 
cepts (Rotter, 1954) and in terms of single 
response variables, such as anxiety (Ekeham- 
mar, Magnussion, & Ricklander, 1974), or the 
three-dimensional individual-situation-behay- 
ior data matrices proposed by Frederiksen 
(1972). All these approaches are either too 
atomistic in their reliance on observable be- 
havior as criteria or are unsuited to the ex- 
plication of the cultural content of episodes 
as defined here. 

A few studies in recent years have at- 
tempted the scaling of individuals’ perceptions 
of social situations. Magnusson and his co- 
workers (Ekehammar & Magnusson, 1973; 
Magnusson, 1971; Magnusson & Ekehammar, 
1973) used factor analysis to extract the di- 
mensionality of perceptions of a range of ad 
hoc situations, both solitary and social, Mag- 
nusson (1971) argued that “individuals dis- 
criminate among situations along cognitive 
dimensions, and . . , situations can be regarded 
as related to each other in cognitive space” 


(p. 853). The problem of stimulus sampling- 


is crucial in such studies, and Magnusson 
states that “no systematic attempt has been 
made to solve it in this study” (1971, p. 854). 
Further, the lack of external validity in these 
studies, due to the small number of subjects 
used and the intuitive interpretation of the 
factor analytic results, is a serious shortcom- 
ing if an empirical taxonomy is attempted. 

A different line was taken by Pervin (Note 
1), who scaled single individuals’ perceptions 
of their significant encounters, Although the 
idiographic orientation of this study makes it 
inappropriate for a cultural taxonomy, its 
findings are highly interesting. In a series of 
studies, Wish (Note 2) used multidimensional 
sealing to study communication episodes, but 
the representative sampling of episodes was 
not attempted. It is the aim of the present 
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study to show that the scaling of individual 
perceptions of social episodes can be accom- 
plished in such a way that the resulting 
taxonomy reflects a degree of consensual 
agreement in a given subculture. A nonmetric 
multidimensional scaling (MDS) procedure 
appears most appropriate for such a task be- 
cause it allows data collection methods that 
make the complex cognitive comparisons in- 
trinsically meaningful, without the imposition 
of external constructs by the experimenter: 
“Each subject determines which aspect of the 
stimulus . . . he will judge” (Jones & Young, 
1973, p. 109). The interpretation of the solu- 
tion can also be accomplished empirically, by 
fitting separately scaled hypothesis dimensions 
to the multidimensional space (Rosenberg & 
Sedlak, 1972). 

However, if subjects’ perceptions of the 
episodes are in terms of a finite number of 
discrete categories rather than in terms of 
gradual changes along different continua, the 
MDS solution will at best only approximate 
the actual cognitive representation of the 
episode space (Torgerson, 1963). By apply- 
ing an alternative procedure more sensitive to 
categorical structures, such as cluster analysis, 
to the same set of data (Jones & Ashmore, 
1973), the adequacy of the two representa- 
tions can be compared, 

Finally, the culture specificity of episodes 
as defined here offers the possibility of some 
potentially interesting comparisons between 
subcultures. Are the cognitive maps of epi- 
sodes structurally different in different sub- 
cultures? Do the same episodes occupy similar 
or different structural positions in different 
subgroups? If there are structural differences, 
could these be related to the objective cir- 
cumstances of the different groups? In order 
to provide at least some initial insight into 
problems such as these, it was decided to 
carry out the present investigation on two 
substantially different subcultural samples 
simultaneously. 

In summary, the present study had three 
objectives: (a) to explore the possibility of 
constructing an empirical taxonomy of social 
episodes based on individuals’ perception of 
them, (b) to compare the effectiveness of di- 
mensional and categorical representations of 
the same data using MDS and cluster analy- 


THE PERCEPTION OF SOCIAL EPISODES 


sis, and (c) to contrast differences between 
different subcultural groups in their repre- 
sentations of social episodes. 


METHOD 


Subjects 


The first population group was defined as married, 
middle-class housewives living in the Oxford area, 
who were 20-30 years of age and had completed 
secondary school, These subjects were drawn from 
the subject pool of the Department of Experimental 
Psychology, University of Oxford. In Stage 1, 25 
subjects drawn from this population were used, and 
26 subjects participated in Stage 2 of the study. 

The second population group was defined as under- 
graduate University of Oxford students of both sexes, 
who were 18-25 years of age. Subjects from this 
group were recruited either through the subject pool, 
as described above, or by way of notices displayed 
in public places in the department. Stages 1 and 2 
of the study employed 23 and 25 students, respec- 
tively. All subjects were paid for their participation. 


Stimulus Sampling 


The selection of a representative range of episodes 
is crucial for the adequate representation of the re- 
spondent’s cognitive structure (Pervin, Note 1). Past 
studies have tended to bypass this problem by using 
‘a selection of sundry ad hoc situations on an intuitive 
basis. Stage 1 of the present study was designed to 
overcome this difficulty. 

Subjects from both samples were given a question- 
naire that asked them to (a) give a detailed account 
of their interactions during the past 24 hours in the 
form of a diary, and (b) list their recurring activi- 
ties which did not occur within this period. Subjects 
were also required to give at least two descriptive 
adjectives for each of the interactions they nominated. 
These adjectives were collected as sources for the 
hypothesis dimensions to be used in the interpreta- 
tion of the MDS configurations. For the housewives 
subsample, a total of 75 social episodes were nomi- 
nated by at least two respondents. The 23 most fre- 
quently nominated episodes were retained for further 
use. For the student subsample, a similar procedure 
was followed, and again the top 23 episodes were 
retained, In each case an additional 2 episodes were 
added which would not have qualiñed in terms of 
the consensus criterion but which appeared to be 
interesting in their own right, and were nominated 
by at least five people in the respective subsamples 
(see Table 1, Items 7 and 23 for housewives; Items 
23 and 25 for students). These 25 stimulus episodes 
were used throughout the main study. 


Procedure 


A categorization task, giving a basic datum of ele- 
ment co-occurrence, was used. On arrival subjects were 
given an instruction sheet explaining the task as 
follows: 
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You are asked to compare different situations with 
each other, and after careful consideration place 
them into categories on the basis of their similarity. 
You will be given an envelope with 25 labels in it, 
each of them having the name of a typical every- 
day situation written on it. Look at each of the 
labels carefully, and try to imagine the episode it 
describes, Then sort the labels into distinct groups 
on the table, on the basis of their similarity. You 
may consider any aspect of the episodes in deciding 
whether they are similar or not, You may create 
as many groups as you wish, although 5 to 7 
groups would be preferable. If you have any labels 
left over which you feel you cannot assign to any 
of the existing groups, you can create a miscel- 
Janeous group, and put them there. When you have 
finished, go through all the groups again to make 
sure that they do not contain any episodes which 
in your opinion do not fit, and which you may 
want to re-assign. When you have finished, please 
record your choices on the record sheet provided 
by putting down the numbers of the episodes in 
each group in a separate column. 


Index of Psychological Relatedness 


The basic input datum required for both the MDS 
and the clustering analysis is an index of relatedness 
between each pair of stimuli. In the present study, 
the raw datum is the number of times a pair of 
episodes was put into the same category, a measure 
invplving the aggregation of data over subjects. This 
measure, as Rosenberg, Nelson, and Vivekananthan 
(1968) pointed out, does not take into account 
mutual cooccurrence between pairs and third ele- 
ments. An index of dissociation, taking both direct 
and indirect ties into account, was proposed by 
Rosenberg et al.: 


25 


8,4 = LD (Sie — Sja)’. 


k=l 


This index reflects the direct cooccurrence of the two 
elements, i and j, as well as a measure of indirect 
co-occurrence with third elements, k. This index, 
which contains useful new information and was 
shown to yield superior configurations with better 
interpretability (Rosenberg & Jones, 1972; Rosenberg 
et al, 1968; Rosenberg & Sedlak, 1972), was cal- 
culated for each possible pair of episodes for both 
subsamples and was used as the input datum for 
both MDS and clustering analyses. 


Unidimensional Scales 


The interpretation of MDS configuration can be 
accomplished by the fitting of unidimensional hy- 
pothesis scales: These scales, on which the elements 
were independently rated, represent a priori hy- 
potheses as to the identities of the dimensions of 
episode structure. These scales were derived from 
the analysis of free-response adjectives elicited in 
Stage 1 of the study in terms of three criteria: (a) 
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TABLE 1 


Tue 25 Most FREQUENTLY MENTIONED SOCIAL EPISODES USED AS STIMULI ror EACH SUBSAMPLE 


Housewives (n = 25) 


Students (n = 23) 


m 


. Having a short chat with the house delivery man 
(15) 

. Playing with your children (18) 

. Your husband rings up from work to discuss some- 
thing (17) 

. Having a short chat with the shop assistant while 
shopping (16) 

. Having dinner with your family (18) 

. Shopping on Saturday morning with your husband 
at the supermarket (16) 

. Attending a wedding ceremony (6) 

. Having a drink with some friends in the pub (14) 

. Washing up dishes after dinner with family help 
(16) 

. Chatting over morning coffee with some friends (14) 
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f Heading and talking in bed before going to sleep 
17) 

12. Chatting with an acquaintance who unexpectedly 
gave you a lift (13) 

13. Watching TV with your family after dinner (16) 

14. Having a short chat with an acquaintance whom 
you unexpectedly met on the street (15) 

15. Going to the pictures with some friends (16) 

16. Discussing the events of the day with your husband 
in the evening (18) 

17. Talking to other customers while queueing in a 
shop (17) 

18. Talking to a neighbor who called to borrow some 
household equipment (14) 

19. Having guests for dinner (18) 

20. Visiting a friend in hospital (11) 


21. Chatting with others while waiting for your washing 
in the coin laundry (14) 
22. ey to a neighbor through the backyard fence 
16) 
23. Playing chess (8) 
24. Going to the bank (18) 
25. Visiting your doctor (13) 


. Having morning coffee with people in the depart- 
ment (18) 

. Having a drink with some friends in a pub (15) 

. Discussing an essay during a tutorial (19) 


. Meeting an acquaintance while checking your 
pigeonhole for mail in college (10) 

. Going out for a walk with a friend (13) 

. Shopping on Saturday morning with a friend at the 
supermarket (14) 

. Acting as a subject in a psychology experiment (15) 

. Going to the pictures with some friends (17 

. Having a short chat with the shop assis 
shopping (15) 

. Getting acquainted with a new person during 
dinner in hall (11) 

11. Going to JCR meetings (13) 
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12. Chatting with an acquaintance before a lecture 
begins (14) 

13. Discussing psychology topics with friends (12) 

14. Meeting new people at a sherry party in college (16) 


15. Visiting your doctor (11) 

16. Chatting with an acquaintance who unexpectedly 
gave you a lift (11) 

17. Visiting a friend in his college room (17) 


18. Going to see a play at the theatre with friends (15) 


19. Going to the bank (18) 

20. Having an intimate conversation with your boy/ 

» girl friend (17) 

21. Having a short chat with an acquaintance whom 
you unexpectedly met on the street (19) 

22. Chatting with others while waiting for your washing 
in the coin laundry (14) 

23. Attending a wedding ceremony (6) 

24. Watching TV with some friends (17) 

25. Playing chess (9) 


Note, The numbers in parentheses after each episode indicate the frequency of nomination. 


salience, as indicated by overall frequency of usage 
across subjects and episodes, (b) productivity, or 
diversity of usage by individual subjects across epi- 
sodes, and (c) independence, or lack of co-occurrence 
between any two adjectives to describe an episode. 
From a pool of 146 adjectives, 10 were identified as 
best conforming to these criteria; 4 of them formed 
natural antonyms (friendly-unfriendly; pleasant—un- 
pleasant), while the remaining 6 were paired with 
appropriate antonyms to create the following dimen- 
sions: active-passive, involyed-uninvolved, intimate- 
nonintimate, occasional-regular, organized-disorga- 
nized, simple-complex. In addition, some of the scales 
proposed in speculative taxonomies in the literature 
were also included: formal-informal, cooperative- 
competitive (Harré & Secord, 1972), very much at 
ease-very ill at ease (Argyle & Little, 1972), and I 
know exactly how to behave in this situation-I don’t 
know how to behave in this situation (Bryant & 
_ Trower, 1974). These 12 unidimensional scales, repre- 


senting a priori assumptions about the identities of 
the dimensions underlying the episode structure, were 
used to independently rate each of the 25 episodes 
in both subsamples, The same subjects were used as 
for the episode categorization task, but ratings were 
taken at a separate session, 


Resutts 
Multidimensional Scaling 


This analysis was carried out using the 
TORSCA 9 nonmetric multidimensional scal- 
ing program, based on Kruskal’s (1964) 
method. 

One-, two-, three-, four-, and five-dimen- 
sional solutions were obtained for input data 
from both groups, using varimax rotation. In 
deciding the optimum number of dimensions, 
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TABLE 2 
MULTIPLE CORRELATION COEFFICIENTS BETWE! 


Hypornesis SCALES AND EACH or Five 


MULTIDIMENSIONAL CONFIGURATIONS FOR Bo! 
Housewives Students 
Scale 1 2 3 4 5 1 2 3 4 5 
Tnyolved-uninvolved 702* .790* .835* .834" .856* 338.395 607" .551  .789* 
Simple-complex "463 586% .609 .126* .757* .059  .488* ‘875.522. 708" 
Active-passive “600* .604* .624* .632 .718* 095 -252 549.393. 518 
Pleasant-unpleasant "272 336 607 .579 538,314 "682" .761% .764* .791* 
Intimate-nonintimate 1698% .709* .764* .780* .819* 349 623* .801*- .803* .829* 


Very much at ease-very ill at ease 449 486, 597, 635, 618 428 747" -.725* 867" .879* 


T know how to behave-don’t know "389 .682* .709* .743* .813* 304 570"  .616* .662* .080* 
Friendly-uniriendly '554* .621* .658* .738* .737* "540* .760* .864* .873* .880* 
(97 onal-regular “507* .604* .685* .700* .093* 094 432 446 .628 .O41 
Organized-disorganized "569* .629* .805* .825* .830* 238 A605 562.618 620 
Cooperative-competitive 315.371 388 614 .837* .293 .550 538 .596 569 
Formal-informal TS a179 aAA ROATA .789* 364 "850" .877* .925* .929* 
Stress 13% 4.5% 2.8% 1.8% 15% 23% 10.5% 6.2% 3.8% 2.6% 


*p < 001, 


two main criteria can be used: (a) the mini- 
mum number of dimensions with a satisfac- 
tory level of fit can be selected, and (b) the 
interpretability of the different solutions can 
be taken into account. The TORSCA 9 pro- 
gram automatically calculates a goodness-of- 
fit measure, or stress, for each solution. In 
terms of the baseline values for stress sug- 
gested by Spence and Ogilvie (1973), the 
stress values for all solutions were consider- 
ably better than could be expected by chance 
(see Table 2). The degree of decrease in stress 
with increasing dimensionality is also an im- 
portant indicator, and in terms of this cri- 
terion, the two-dimensional configuration for 
housewives and the three-dimensional con- 
figuration for students appear optimal. 

The second criterion for deciding the opti- 
mum dimensional solution is interpretability. 
To gain some insight into the meaningfulness 
of the different solutions, a series of multiple 
regression analyses was carried out, using 
median ratings on the 12 hypothesis scales as 
dependent variables and the coordinates of the 
25 episodes in different dimensional configura- 
tions as independent variables (Rosenberg & 
Jones, 1972). The number of significant multi- 
ple correlation coefficients is indicative of the 
potential interpretability of the different solu- 
tions. From Table 2 it seems that for house- 
wives, optimal interpretability is achieved at 
the two-dimensional level, whereas for stu- 
dents, a noticeable further improvement at the 


three-dimensional level is apparent. In con- 
clusion, the interpretation of the two-dimen- 
sional configuration for housewives and the 
three-dimensional configuration for students 
will be attempted before higher dimensionali- 
ties are considered. 


Interpretation of the Two-Dimensional 
Configuration—Housewives 


This was accomplished by the fitting of the 
independently scaled univariate property di- 
mensions to the multidimensional space, with 
significant and approximately orthogonal axes 
used as interpretative frameworks. 

From Table 2 it can be seen that 8 of the 
12 unidimensional scales are significantly re- 
lated to the two-dimensional configuration and 
are thus available for the interpretation of 
this space. These eight hypothesis dimensions 
were fitted to the two-dimensional space using 
the ratio of the regression weights as the slope 
of a line through the origin (Rosenberg & 
Jones, 1972). The two-dimensional configura- 
tion and six of the eight fitted unidimensional 
scales are shown in Figure 1. (Two scales, 
friendly—unfriendly and organized-disorga- 
nized, located between the involvement and 
intimacy scales, could not be shown in the 
space available). This episode space appears 
to be interpretable in terms of two general at- 
tribute clusters: the perceived intimacy, in- 
volvement and friendliness of episodes, and 
the subjective self-confidence or competence 
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The two dimensional confi 


of the actors related to the regularity of the 
episodes. The exact relationship between prop- 
erty dimensions is shown in Figure 1, 


Interpretation of the Three-Dimensional 
Configuration—Students 


Seven hypothesis scales were found to be 
significantly related to 
Solution 


the three-dimensional 
at the .001 level (see Table 2). A 


TABLE 3 


FITTED AXES FOR UNDIMENSIONAL 

IGNIFICANT AT THE 001 LEVEL 

(THREE-DIMENSIONAL SOLUTION, 
STUDENTS SAMPLE) 


Scale 1 2 3 4 5 6 


1. Involved = — 

2. Pleasant 69 — 

3. Intimate 54 4 
4. At ease 
5. Know how 94 77 72 44 
6. Friendly 65 42 44 4g 67 — 
7. Informal 81 4 39 36 53 


guration of 25 episodes (see Table 
hypothesis scales fitted (housewife sample) 
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occasional 
664 


BS 


complex 
586 
involved 


active intimate .790 


-604 .709 


1), with 6 significant 


multiple regression analysis of the three MDs 
dimensions as predictor variables and the 
Seven univariate scales as dependent variable: 


scales as interpretati 
mately orthogonal framework is 


three property dimensions: 
pleasantness, and know how to 


defined by 
involvement, 
behave. The 


constructs. 
Cluster Anal ysis 


This analysis was carried out using An- 
drew’s (Note 3) CLUSTER program for the 


THE PERCEPTION OF SOCIAL EPISODES 
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Ficurr 2. The three-dimensional configuration of 25 episodes (see Table 1), 


with 3 hypothesis 


scales fitted (student sample). 


hierarchical clustering and a sequel program, 
DENDRO, which produces a dendrogram 
graph on the computer plotter. This program 
uses the distance between input elements as 
the criterion for partitioning the elements into 
increasingly heterogeneous, nonoverlapping 
clusters. The 8 index of dissociation was used 
as input both for the housewife and the stu- 
dent samples (Jones & Ashmore, 1973), The 
dendrograms for both subsamples are shown 
in Figure 3. 

The interpretation of hierarchical clustering 
configurations can be accomplished either by 
an intuitive analysis of the shared character- 
istics of elements within the same cluster or 


by a more accurate empirical procedure re- 
cently proposed by Rosenberg and Jones 
(1972), which is based on the fitting of un- 
idimensional hypothesis scales to hierarchical 
clustering solutions using an analysis of vari- 
ance technique. In principle, however, the 
intuitive interpretation is “acceptable when 
the underlying bases for partitioning are clear- 
cut and obvious” (Rosenberg & Sedlak, 1972, 
p. 270). As is apparent from Figure 3, the 
present solutions form clear and homogeneous 
clusters for both subsamples, which are easily 
interpretable and will be further discussed 
later, Accordingly, it was decided not to use 
Rosenberg and Jones’ (1972) more sophisti- 
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Ficure 3, Dendrogram of 25 episodes (see 


cated labeling technique for the present solu- 
tion. 


Discussion 


main attributes of social episodes emerging 
from this study, involvement or intimacy and 
are promising as 


1972, p. 118). Similar attributes were found 
to be relevant not only in person Perception 
(Rosenberg et al., 1968) but also in some 
other studies of social situations (Pervin, Note 
1; Wish, Note 2), 
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STUDENTS 


136370 151923922101425 4 22116218 4 8 


Table 1) for each subsample, 


tions of the stimulus Space, and to com; 
the difference between the cognitive re; 
Sentations of the two subsamples, 


The Dimensional Solutions 


The structure of the 25 common episodes a 


Provides few encounters of this type. 


THE PERCEPTION OF SOCIAL EPISODES 


The dimensional representation of the epi- 
sode structure of students was best accom- 
plished in three-dimensional space (see Figure 
2), with the addition of an evaluative dimen- 
sion to the basic involvement-self-confidence 
framework. It seems an intriguing possibility 
to account for this either in terms of the 
greater cognitive complexity of students or in 
terms of their wider and more varied stimulus 
field. 

The involvement dimension mainly differ- 
entiates encounters with friends and acquaint- 
ances on the basis of the duration and 
intimacy of the episode. The evaluative 
(pleasant—unpleasant) dimension seems to re- 
flect the degree of situational constraint im- 
plied by different episodes (Price & Bouffard, 
1974), with open and uncontrolled interac- 
tions seen as pleasant (e.g., Items 19, 16, and 
5), and controlled and restrictive activities, 
such as tutorials, acting as a subject in an ex- 
periment, and attending formal meetings, be- 
ing negatively evaluated. 

The function of the self-confidence (“know 
how to behave”) dimension in differentiating 
episodes is interesting because it relates to 
previous work on anxiety responses to differ- 
ent situations (Bryant & Trower, 1974; 
Ekehammer & Magnusson, 1973). Episodes 
involving prolonged interaction with strangers 
(ie, the getting acquainted process) elicits 
the highest level of apprehension (Items 10, 
14, and 25), thus supporting Bryant & 
Trower’s (1974) finding with a similar sample 
and showing that substantial numbers of stu- 
dents find episodes such as parties rather dif- 
ficult to cope with. 

Students seem to regard episodes involving 
entertainments and socializing with friends 
with great self-confidence (Items 2, 8, 18, and 
24), in strong contrast to housewives, who 
viewed nearly identical episodes with a lack 
of self-confidence. The very different subjec- 
tive definition of these interactions, involving 
nearly indistinguishable activities and objec- 
tive characteristics, suggests that a classifica- 
tion of episodes in terms of objective factors 
may not tap the psychologically meaningful 
differences. While “socializing with friends” 
for students is a natural, self-selected enter- 
tainment, for housewives it may be a more 
formal, organized affair, with an element of 
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self-presentation and potential loss of face 
(Goffman, 1955). 

Thus, both dimensional representations pro- 
vide a picture of the episode space which is 
(a) readily interpretable and meaningful, (b) 
appears to be sensitive to the cultural differ- 
ences between the samples, and (c) tends to 
differentiate episodes primarily in terms of 
their perceived subjective, connotative rather 
than denotative, characteristics, It was sur- 
prising to note that the activities which make 
up an episode, the physical aspects of the 
environment, and other objective factors were 
of little consequence in determining this 
structure. 


Categorical Representations 


The two hierarchical clustering configura- 
tions shown in Figure 3 provide some im- 
portant additional cues to the interpretation 
of the episode space, 

For housewives, the three large primary 
categories, which could be labeled as “casual,” 
“family,” and “social,” divide into subclusters 
in terms of the relationship to the interaction 
partner. Thus, family episodes can be differ- 
entiated by whether they involve the husband 
(Items 3, 11, and 16) or the whole family 
(Items 2, 5, 6, 9, and 13). Casual encounters 
can involve strangers (Items 1, 4, 17, and 21) 
and friends and neighbors (Items 12, 14, 18, 
and 22), It appears that the underlying dimen- 
sions of involvement~self-confidence closely 
match the relationship to the interaction 
partner, 

For students, the primary division of epi- 
sodes is in terms of the categories of people 
they involve: friends, acquaintances, strang- 
ers, people in their official capacity. When 
compared with the categorical structure of 
episodes for housewives, a reversal of the im- 
portance of differentiating criteria is apparent. 
Housewives differentiate their social episodes 
primarily in terms of which area of their lives 
they relate to (family, social, or casual), and 
the interaction partner is only a secondary 
criterion, whereas for students their relation- 
ship to people they encounter is of primary 
importance, and further subdivision is ap- 
parently in terms of intimacy, self-confidence, 
area of life, and so on. 
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In summary, the categorical representations 
are closely related to the MDS configurations 
and can be regarded as alternative ways of 
interpreting the same episode structure. Most 
of the characteristics mentioned in the inter- 
pretation of the cluster analysis were ob- 
viously overlapping and complementary to 
the basic involvement-self-confidence framé- 
work derived from MDS. It would be futile 
to regard either the categorical or the dimen- 
sional representations as “correct.” In light 
of the complex nature of social episodes, it 
seems that the question of dimensional versus 
categorical representations is- ill-conceived. 
Elements of both would have to be considered 
if crucial attributes of episodes are to be 
identified and understood. 

The use of the present methods to ex- 
plicate episode structure has potentially im- 
portant implications for further interaction- 
oriented research. They can be used to test 
the relevance of the numerous speculative 
taxonomies in given cultural milieus. From 
the present data, it is clear that the often pro- 
posed distinctions between formal-informal 
and cooperative-competitive episodes (Harré 
& Secord, 1972) are of little relevance to the 
present sample in their construal of their so- 
cial world, The present method is eminently 
applicable to the study of cross-cultural and 
subcultural differences in social perception. 
In our case, the structure of episodes strongly 
reflects the socialization experiences, lifestyles, 
and values of the two subsamples. It also pro- 
vides a meaningful and quantified “map” of 
a sample of episodes, which can thus be in- 
corporated into models concerned with the 
situational determination of behavior. 
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Attitude change in group discussion was analyzed in terms of information inte- 


gration theory. Subjects in groups of tw: 
about past presidents of the United S 


o or three received different paragraphs 
tates that were transmitted via group 


discussion. According to the molar unitization hypothesis, each paragraph and 


each group member’s discussion can be 


treated as molar units of information 


that are averaged to determine attitudes, This averaging model received strong 
support using subject’s self-estimated parameters in exact tests of goodness of 
fit. Recent work in risky shift and group discussion was examined from the 
integration—theoretical view, Overall, the results indicated the usefulness of in- 


formation integration theory in solving 


This article applies information integration 
theory to the study of attitude change in 
group discussion. Theoretically, this problem 
is viewed much like any other kind of atti- 
tude change (Anderson, 1959, 1971, 1974b). 
Each group member is exposed to various in- 
formational influences in the discussion, and 
his (her) attitude develops through the 
integration of this information. 

An informational view of group discussion 
has also been employed by many investi- 
gators of “risky shift,” and that view now 
seems to be dominant in the area (Myers & 
Lamm, 1976). A number of investigators 
have contributed to this view, but two lines 
of work, represented by the articles of 
Vinokur and Burnstein (1974) and Bishop 
and Myers (1974) have primary relevance 
here. Not only has this work provided cogent 
evidence for the importance of informational 
factors, but it has progressed toward a deeper 
theoretical analysis of the informational ap- 
proach to group discussion. The present paper 
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many problems of group dynamics, 


is critical of certain details of thei tk, 
but it is very much in the same con ual 
spirit. 

Two fundamental problems face it- 
tempts to develop an informational a is 
of group discussion, The first concer he 
identification of the informational unit he 
second concerns the rule that governs the 


integration of this information into the i- 
tude structure of the individual group mer: ver, 


Definition of Informational Units 


The first fundamental problem, specifying 
the informational units, presents great di^i- 
culties because the informational flow in 
group is complex and largely uncontrolled, 
The impact of any group member on the 
others is spread out in time, conditione 
by his (her) motivations to influence and t 
conform, interlinked with the others’ com- 
ments and silences, dependent on prio: 
knowledge and on expressive factors from 
clarity of thought to eye contact and personal 
attractiveness. Thus, the attitude of each 
group member reflect: 


S an ongoing, time- 
dependent process in which the informational 


components seem close to unknowable. 
Nevertheless, any informational analysis 
must come to grips with the problem of spec- 
ifying the effective information. The way in 
m is handled determines how 
far the theoretical analysis can be taken, Two 


general approaches, the molecular and the 
molar, can be employed. 


ee Soe 


5 


GROUP ATTITUDES 


Molecular argument listing. Vinokur and 
Burnstein (1974) had individual subjects list 
pro and con arguments for standard choice 
dilemmas, and these arguments were screened 
on redundancy and rated on persuasiveness. 
The net balance of pro and con arguments 
correlated with attitude change produced by 
group discussion in previous work, Also, other 
individuals exposed to these same arguments 
changed in the same way. These results, to- 
gether with related work by other investi- 
gators (e.g., Bishop & Myers, 1974; Ebbesen 
& Bowers, 1974; Kaplan, Note 1), clearly 
demonstrate that the informational content 
of group discussion is one important determi- 
nant of member’s attitudes. 

Nevertheless, there are severe limitations 
on the usefulness of argument listing. Given 
that these arguments change attitudes, the 
manner of their transmission must also be 
important. One person may phrase the same 
thought more persuasively than another. Clar- 
ity and forcefulness of expression, tone of 
voice, and social warmth must all play a role. 
Yet verbal content analysis necessarily omits 
such source effects. This is a critical short- 
coming for certain purposes, in particular, for 
testing integration models, 

Molar unitization. The present approach is 
based on the theoretical concept of molar unit- 
ization. This allows a valid, simple represen- 
tation of complex informational units. Within 
averaging theory, an aggregate of elementary 
pieces of information can theoretically be 
treated as a unit whose weight is the sum of 
the weights and whose scale value is the mean 
value of the elementary units (Anderson, 
1971, pp. 197-198). In previous work, ex- 
tended prose messages have been treated in 
this way. In the present work, an attempt 
is made to treat each group member as a 
molar unit. 

The theoretical significance of molar uniti- 
zation deserves emphasis. Molecular analysis 
will almost necessarily omit important compo- 
nents of a complex informational field. Molar 
analysis, on the other hand, provides a sum- 
mary that includes all of the molecular ef- 
fects. Where this approach is valid, a complex 
message, or the aggregate of influence from 
another member, can be represented by two 
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parameters, a weight and a scale value. This 
molar representation gives a simple bypass of 
the molecular detail but still allows for valid 
and exact analyses of the observed attitudes. 

Selj-estimated parameters. In molar-level 
analysis, one is still faced with the problem 
of estimating parameters and testing goodness 
of fit. For this purpose, a procedure of self- 
estimated parameters was employed. In one 
condition, for example, each group member 
was asked to judge the attitude of each other 
group member, together with the impact of 
that member on his (her) own thinking. 
Each group member is treated as a molar 
unit, therefore, and the two judgments are 
used to obtain estimates of the weight and 
scale value of that unit. 

This method of parameter estimation dif- 
fers from the usual practice in integration 
theory in which parameters are derived 
through the use of factorial design. With self- 
estimated parameters, however, an analysis 
can in principle be done for a single experi- 
mental condition. Certain interactions, such 
as discounting of a deviant member’s atti- 
tude, could also be handled in this manner. 
If this method works, even under restricted 
conditions, its flexibility will make it most 
useful. 


Integration Models 


The second fundamental problem concerns 
the way in which information is integrated 
into each member’s attitudes. The present 
analyses are based on the averaging model de- 
veloped in previous work on integration theory. 

Algebraic models are also considered by 
Bishop and Myers (1974) and by Vinokur 
and Burnstein (1974). Although a complete 
discussion of these models cannot be given 
here, they face certain difficulties that deserve 
brief mention. One limitation of both models 
is that they predict for the group oF mean 
group, but not for the individual member, 
Also, both models take an all-or-none view in 
which new arguments may be learned, but 
old, already known arguments may not 
be strengthened. Moreover, as Vinokur and 
Burnstein (1974, P- 314) explicitly noted, 
their approach disallows “simple persuasion,” 
or source effects and bare assertions (Ander- 


son, 1971, P- 198). 
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Furthermore, neither model provides a test 
of goodness of fit. The correlations that are 
presented suffice for the authors’ principle 
conclusion that informational processes are 
important in group discussion. As model tests, 
however, correlations are usually meaningless, 
a point that is illustrated in the Results 
section. 

This article presents two experiments de- 
signed to test the integration-theoretical model 
for attitudes using the technique of self-esti- 
mated parameters for molar units. To provide 
needed control over the informational stimuli, 
a method of information priming was used in 
which each group member received a differ- 
ent piece of information to contribute to the 
discussion. 


METHOD 
Procedure 


Subjects in groups of two or three were seated 
around a table and given a booklet that contained 
the stimulus paragraphs as well as pages for the 
various responses. They were told that each of them 
had received different information about the same 
past president, that they were to exchange and dis- 
cuss this information and form opinions of how 
well the president had done his job. This pro- 
cedure was motivated by referring to similar pro- 
cedures that actually have been used in teaching 
history at certain universities. Subjects then read a 
one-page historical survey intended to place the 
slavery issue in historical perspective, and to bring 
out the importance of the tariff problems that have 
faced nearly every president (Anderson, Sawyers, & 
Farkas, 1972, p. 178). Subjects seemed to find the 
task quite interesting, in part because of the then 
current concern with the Watergate affair. 

Attitude ratings were made at four successive 
stages in both experiments. The first rating, based 
on preexperimental knowledge, was not analyzed, 
since such preexperimental knowledge was typically 
negligible. The remaining three ratings were as follows. 

Preexchange attitudes were given after each sub- 
ject had read the one or two paragraphs in his 
(her) own booklet, prior to any group interaction. 

Postexchange attitudes were given after the in- 
formation exchange. In making their exchange, sub- 
jects were instructed to give a verbal paraphrase 
of the specific information in their own paragraphs, 
xcluding any personal opinion or prior information, 

le one member was paraphrasing his (her) para- 

s, the other members took notes. 
liscussion attitudes were given after a group 
Each member first spoke in turn, saying 
at his (her) previous rated attitude had 
»w he (she) had formed that attitude, 
pertinent preexperimental knowledge. 
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Then there was a general discussion, followed by the 
final rating. 

The experimenter paced the subjects through the 
procedure for a practice president. Each group, 
lightly monitored by the experimenter, then worked 
at its own speed for the rest of the presidents. Re- 
sponse sheets at appropriate places in the booklets 
contained 10-cm graphic rating scales on which 
subjects recorded their attitudes and made self- 
estimates of parameters. The digits 0 to 10 were 
placed at 1-cm intervals below the scale in order to 
facilitate expressions of own attitude during the 
discussion. However, the scale was treated as a con- 
tinuous response and read to the nearest mm. 


Self-Estimates of Parameters 


Immediately after giving their postexchange atti- 
tudes, subjects in both experiments estimated the 
weight and scale value for each paragraph, For 
weight estimates, the subject rated how important 
each paragraph was in forming his (her) post- 
exchange attitude. For scale value estimates, each 
subject indicated what his (her) attitude would 
have been if he (she) had heard only that paragraph 
alone. In making these estimates, subjects were 
instructed to refer to their own paragraphs and to 
their notes on the others’ paragraphs, 

Additional self-estimates were made in Experi- 
ment 2, at the postdiscussion stage. After each group 
member had expressed his (her) attitude in turn, 
all group members estimated what their own atti- 
tude would have been if all they knew about the 
president was what the speaking member had just 
said. These ratings were used as estimates of the 
scale values of each member’s discussion. After giv- 
ing his (her) postdiscussion attitude, each subject 
rated the importance of the paragraphs and the 
importance of each separate member’s discussion in 
the formation of his (her) own final attitude. These 
estimates were used as weights in predicting the 
postdiscussion attitude. 


President Paragraphs 


The stimuli were taken from a standardized set 
of paragraphs about past presidents of the United 
States (Anderson et al., 1972). Each Paragraph con- 
tains information about a president that is either 
very favorable, mildly favorable, mildly unfayor- 
able, or very unfavorable, denoted by H, M*, M- 
and L, respectively. One very unfavorable Gane 
graph about John Tyler illustrates the general length 
and quality of the paragraphs: s 


John Tyler 1841-1845 
Popular feeling was strong against John Tyler 


when his term as President drew to a close. His 
stubborn refusal to submit to the will of the 
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position was not changed by these demonstrations 
of displeasure. One of his last official acts as 
President reflected his strong advocacy of slavery 
by laying the groundwork for the addition of yet 
another slave state to the Union. 


Experiment 1 


Each subject in a group of two received two 
paragraphs about past presidents. The two para- 
graphs were either an HM* or an LM- pair. In- 
cluding the order in which subjects spoke in ex- 
changing information, there were four combinations 
of paragraphs: HM*tHM*, HM*LM, LM-HM*, 
LM-LM^. One subject always spoke first for the 
first four presidents, second for the second four 
presidents. This speaking order had no effect and is 
ignored. 

Each paragraph combination was used to describe 
two presidents, so that each dyad judged eight 
presidents. These presidents were as follows: Chester 
A. Arthur, Benjamin Harrison, Rutherford B. Hayes, 
James Madison, Franklin Pierce, James Knox Polk, 
John Tyler, and Martin Van Buren. The practice 
president was Grover Cleveland, described by HHLL 
paragraphs. 

Presidents, paragraph combinations, and serial 
position were balanced through the use of two 4 X 4 
Greco-Latin squares. 

Subjects were 18 female and 14 male undergradu- 
ates fulfilling a requirement in introductory psy- 
chology at the University of California, San Diego. 
Sixteen dyads composed of same-sex subjects were 
separately run in sessions lasting about 2.5 hours, 
including a 10-minute break. 


Experiment 2 


Each subject in a group of three received one 
paragraph about a president, so that the total in- 
formation provided was a set of three paragraphs, 
Four combinations of paragraphs were used in this 
experiment; HHH, M'M-M', M-M'‘M-, and LLL. 

To provide end anchors for the stimulus range, 
half of the triads received the HHH paragraphs with 
the first president presented, and the LLL para- 
graphs with the second; for the other half of the 
triads, this order was reversed. The M'M M+ and 
MM'M- paragraph combinations were presented 
with either the third or fourth president. Thus, 
there were four presentation sequences for the para- 
graph combinations. 

Four triads were run through each of the presen- 
tation sequences. All triads received paragraphs about 
the same four presidents, but the presidents were 
rotated through each serial position in each presen- 
tation sequence. Thus, each subject discussed four 
presidents described by one of four paragraph com- 
binations. The four presidents were Grover Cleve- 
land, Andrew Jackson, James Monroe, and William 
Howard Taft. The practice president was Thomas 
Jefferson, described by HM*L pargraphs. 

Subjects were 24 male and 24 female undergradu- 
ates fulfilling a course requirement in introductory 
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TABLE 1 
Mean ATTITUDES ror Each COMBINATION or 
PARAGRAPHS AT EACH STAGE or Discussion, 
EXPERIMENTS 1 AND 2 


Post- 


Pre-ex- Post- discus- 
Paragraphs change Change exchange Change sion 
Experiment 1 
HM*HM* 6.87 66 7.53 Al 7,64 
LM-HM+ 481 4.47 13 4.60 
HM*LM- 481 417 —.16 401 
LM-LM- 2.50 VIS m12- 163 
Experiment 2 
HHH 7.68 49 817 08- 3.250 
MIM M+ 5.67 42 6.09 I6 i6: 
M-M*M- 477 —.08 469 —.48 4 %Ua- 
LLL 2.74 —77 197 —.30 1.Q0wS 
ua- 


Note. H, M+, M5, and L = very favorable, mildly favora) 


mildly unfavorable, and very unfavorable, respectively, ONAL 


psychology at the University of California, 3 for 
Diego. Sixteen groups of three same-sex subjeelf. 
were separately run in sessions lasting about 2 hours, 


Resutrts 
Attitude Data 


Mean attitudes at each successive stage in 
the group interaction are in Table 1. These 
data show the changes in attitude produced 
by the information exchange, as well as by 
the subsequent group discussion. 

The information exchange between pre- 
and postexchange stages produced substantial 
changes in attitude. Preexchange attitudes 
were based only on the one or two para- 
graphs available to each group member in 
his (her) own booklet. During the information 
exchange, each group member described the 
contents of his (her) paragraphs to the other 
members. Thus, the postexchange attitudes 
were based on all the paragraphs available to 
the group. This added information produced 
the changes in attitude listed in the second 
data column, 

Statistical analysis of these change scores 
requires that the group mean attitude be 
treated as the unit score because the atti- 
tudes of the members are not independent. 
This yields a repeated-measurements analysis 
with four experimental conditions and 16 
“subjects” or groups. The change scores were 
significantly different across pargraph combi- 
nations, F(3,45) = 16.55 and 8.22, for Ex- 
periments 1 and 2, respectively. 
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POST- DISCUSSION 
EXP. 2 


IOP POST-EXCHANGE 
EXP. | 


POST-EXCHANGE 
EXP. 2 


ATTITUDE 


MEAN 


O EAA ae 
n SMM REEN MIN MEE B ss Me MPH 
ethod’ MEAN VALUE OF PARAGRAPH INFORMATION 


1. Observed and predicted attitudes as a 

f value of paragraph information. (Solid 
= esent theoretical predictions from integra- 
. Dashed curves represent alternative 
theme.) 


kquent discussion produced small 
pts “int additional changes in attitude. 
These changes, from the postexchange to 
the postdiscussion stage, are listed in the 
fourth data column of Table 1. The differ- 
ences among the four change scores were sta- 
tistically significant, F(3,45) = 2.84 and 
4.36, for the two respective experiments. The 
role of this statistical test deserves notice. 
The changes in attitude are small, and most 
of the single change scores would not be sig- 
nificantly different from zero by themselves. 
The F test has greater statistical power be- 
cause it is based on the full set of data. 

Both sets of change scores exhibit the 
“group polarization effect” (see Discussion 
section). Attitudes that are initially positive 
change to become more positive; attitudes 
that are initially negative change to become 
more negative. Only one exception appears in 
the 16 change scores of Table 1. 

The polarization produced by the informa- 
tion exchange is essentially the well-known 
“set-size effect” in attitudes. On the average 
across each group, the additional information 
has the same value as the initial informa- 
tion. Nevertheless, this additional information 
acts to make the attitudes more extreme, in 
agreement with previous work (e.g., Ander- 
son, 1959, 1967; Myers & Kaplan, 1976). 


Model Predictions 


The model prediction scheme can be most 
easily explained for the postexchange attitude 
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in Experiment 2. At that point, each subject 
has received three pieces of information, his 
own paragraph, and one paragraph communi- 
cated through each of the other two mem- 
bers. The first step is to compute the weighted 
average of the self-estimated parameters given 
by Member A to obtain a provisional pre- 
dicted attitude: 


Pa! = (@aRa + ÛsRs + WceRc)/ 
(ùa + p+ ùc). (1) 


Here Ry, Ry, and Ro denote A’s judgment of 
what his (her) attitude would be if based 
only on his (her) own paragraph, or only on 
the paragraphs communicated by B and C, 
respectively. The “hat” notation is used to 
denote A’s corresponding  self-estimated 
weights. Thus, Pa’ is just a weighted average 
based on A’s self-estimated parameters. 

Values of Py’ and Po’ were obtained for the 
other two members of the group in the same 
way, using each member’s own self-estimated 
parameters. These three P’ scores were then 
averaged to yield a mean P’ score for each of 
the four paragraph combinations for each 
group. 

The P’ scores are an interval scale of the 
proper theoretically predicted values, but they 
have a different zero and unit. Accordingly, 
a linear transformation of the P’ scores is 
required to obtain the final predicted values. 
These final values are discussed here, and 
the rationale for the linear transformation 
is taken up in the next section. 

The theoretical predictions are indicated 
by the solid line in the center panel of Figure 
1, the data being represented by filled circles. 
The fit to the data is quite good. The mean 
absolute discrepancy between observed and 
predicted is only .02. The analysis of variance 
test of the discrepancies did not approach 
significance, F(2,45) = .03. 

Similar analyses were applied to the other 
two sets of data. For the postexchange stage 
of Experiment 1, there were actually four 
terms to be averaged, corresponding to the 
four informational paragraphs, but the pro- 
cedure was otherwise the same. These predic- 
tions are compared with the data in the left 
panel of Figure 1. Again the fit is good. The 
mean absolute discrepancy is only .12, The 


eee 


GROUP ATTITUDES 


analysis of variance test of the discrepancies 
was not significant, F(2,45) = .58. 

For the postdiscussion data of Experiment 
2, six terms were used in the prediction. 
Three of these represented the three para- 
graphs, and the other three represented the 
discussions of the three members. Otherwise, 
the procedure was the same. These predic- 
tions are compared with the data in the right 
panel of Figure 1. Here again a good fit was 
obtained. The mean absolute discrepancy is 
only .06. The analysis of variance test of the 
discrepancies did not approach significance, 
F(2,45) = .25. 

Overall, therefore, it appears that the self- 
estimated parameters, together with the aver- 
aging model, yield a good account of the data. 
This provides joint support for the model and 
for the validity of the self-estimated param- 
eters. 


Details of Model Analysis 


The model analysis involves one problem 
which, though readily resolved,ødoes require 
detailed explanation. The problem arises be- 
cause, theoretically, the self-estimates are 
biased estimates of the true scale values. In 
making these judgments, the subject was 
asked to say what his (her) attitude would be 
if based on the given information alone. The- 
oretically, this judgment is an average of 
the true scale value of that information and 
the initial attitude: š 


Ra = (wolo + wzs:)/(wo+ wz), (2) 


where wy and J, refer to the initial attitude, 
w, and Sp to the given piece of information. 
Thus, the self-estimate R, does not equal the 
true scale value Ss. 

However, Equation 2 indicates that R, is 
a linear function of Ss, and vice versa. Under 
the equal-weighting assumption that all 
pieces of information have equal weight, it 
is possible to obtain the true scale value, Sz 
by applying a linear transformation to the 
observed self-estimate, Rz. 

Consider the theoretical expression for the 
attitude of Member A at the postexchange 
stage of Experiment 2. It is the weighted 
average of A’s initial impression, A’s own 
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paragraph, and the paragraph information 
transmitted by the other two members: 


Pa = (Wolo + wasa + wasp 
+ wese)/(wa + wy + we). (3) 


It is desired to show that this true theo- 
retical value is a linear function of the obsery- 
able P4’ value calculated from the self-esti- 
mates in Equation 1. That can be done by 
assuming that the self-estimated ®’s are un- 
biased estimates of the true weights, and by 
substituting the theoretical expressions for 
Rx, Rg, and Re from Equation 2 into Equa- 
tion 3. Straightforward algebra then shows 
that the theoretical value of Pa from Equa- 
tion 3 is a linear function of the provisional 
predicted value Ps’ of Equation 1. 

A similar linear relation also holds for 
Members B and C, based on their own self- 
estimates. Accordingly, a linear relation also 
holds for the mean attitude of the three 
group members: 


P= b+ bP’, (4) 


where the,overbars denote the group means, 
The cons{nts bọ and bı have complicated 
theoretical expressions, but their values are 
estimated directly from a regression analysis. 
This is the justification for the linear trans- 
formation used in the previous section. 

In practice, the transformation is obtained 
from a linear regression that maximizes the 
fit between the overall observed means for 
the four paragraph combinations and the 
mean P’ values calculated from Equation 1 
using the self-estimated parameters, An inci- 
dental advantage of this approach is that wo 
and Jo, the parameters of the initial attitude 
in Equation 3, are absorbed into the regres- 
sion constants of Equation 4 and do not re- 
quire separate estimation. 

The test of goodness of fit of the model is 
on the deviations between these predicted 
and observed values. The transformation was 
actually performed on the overall means, and 
the sum of squares for discrepancy was di- 
rectly calculated from the four overall mean 
discrepancies. This sum of squares has only 
two degrees of freedom, however, because 
one degree of freedom is used up in estimat- 
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TABLE 2 


Mean SELF-ESTMATES or WEIGHT AND VALUE FOR 
Eacu LEVEL or PARAGRAPH 


Weight Attitude value Experi- 


ment 2 

Para-  Experi- Experi- Experi- Experi- discus- 
graphs ment1 ment2 menti ment 2 sion 
H 6.6 7:2 6.9 7.6 7.6 
M+ 5.2 6.4 6.3 5.6 5.9 
M- 61 5.6 2.9 4.8 4.4 
L 7.0 7.0 2.5 2.6 2.9 

he right-most data column represents value estimate 


ion of other group members, H, M*, M~, and L = very 
favorable, mildly favorable, mildly unfavorable, or very unfa- 
yorable, respectively. 


ing bı in Equation 4. The degree of freedom 
used up in estimating bo corresponds to the 
grand mean discrepancy, which is set to zero 
by the transformation. 

The null hypothesis in this test is that all 
four mean deviations are equal to one an- 
other, and hence also equal to zero because 
the grand mean is zero. This test is superior 
to making separate tests of each of the four 
discrepancies. It makes proper allowance for 
the two degrees of freedom used up in the 
linear transformation and, at the same time, 
has greater power to detect discrepancies 
from the model. 

The error mean square for this test was 
taken directly from the analysis of the un- 
transformed scores of Equation 1. However, 
it must be multiplied by the square of the 
slope bı in Equation 4 in order to correct for 
the difference in range of predicted and ob- 
served. 

Two or three additional comments need 
mention. The validity of the linear transfor- 
mation depends on the assumption that Ws 
in Equation 2 is the same for all relevant in- 
formation for a given subject. This would not 
be the case with differential weighting, if, 
for example, one paragraph was very strong 
and another very weak. In the present experi- 
ments, this equal-weight assumption is rea- 
sonable in view of previous work using the 

president paragraphs, and in view of the ac- 
tual values of the self-estimated weights. Ac- 
tually, a moderate degree of differential 
weighting might not cause much difficulty, al- 
though this matter has not been checked in 
detail. 

Means of the self-estimated parameters are 
listed in Table 2. Value estimates follow the 


normative levels of the pargaraphs, both for 
the paragraphs themselves, and in Experi- 
ment 2 for the corresponding member discus- 
sions. Weight estimates are somewhat larger 
for the more extreme H and L paragraphs. 

In the predictions for the postdiscussion 
attitudes of Experiment 2, the paragraph in- 
formation of each member is highly corre- 
lated with his (her) discussion. The inclusion 
of both in the prediction introduces redun- 
dancy. Since this redundancy can be assumed 
to be homogeneous across group members, it 
is not a problem here. That would not al- 
ways be true, however, and differential re- 
dundancy is in general a serious problem for 
model analysis. In some cases, the problem 
might be resolved by considering the re- 
dundant components, in this case the para- 
graph and the discussion of a member, as a 
single molar unit. 

Finally, there is a question why the sub- 
jects were not asked to self-estimate the true 
scale values of the information instead of 
what their attitudes would be based on that 
information alone. It would certainly seem 
simpler to have an estimate of the true scale 
value than of the derivative attitude. Sev- 
eral considerations are involved. First, an 
attempt to use this alternative in an applica- 
tion of integration theory to McGuire's inoc- 
ulation paradigm met with mixed success 
(Farkas & Anderson, 1976). Second, the task 
as given seems more natural and understand- 
able. In analogous work on person percep- 
tion, it is difficult to explain to the subject 
the difference between the true likableness 
value of a trait adjective and the likableness 
of a person described by that adjective. 
Third, the present method avoids the need 
for self-estimates of the parameters of the 
initial attitude. 


Alternative Model 


An alternative theoretical approach would 
be to use the self-estimates as scale values 
without transformation. The above analysis 
rests on the averaging hypothesis and on the 
concept of the initial attitude. Alternative 
theories, based on linear or summation mod- 
els, for example, might well require a different 
analysis. 
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Some light on this matter comes from con- 
sidering the P’ values of Equation 1 calcu- 
lated directly from the self-estimated param- 
eters. These values are plotted as the dashed 
curves in Figure 1. The fit does not seem too 
bad, but there is a clear pattern of discrep- 
ancy, with the observed values being more 
extreme than the dashed curves. This pat- 
tern of discrepancy, incidentally, is predicted 
by averaging theory. The initial attitude was 
near neutral so that R, is less extreme than 
Sx in Equation 2, When R, is used in place of 
J», therefore, the predicted value is less ex- 
treme than it should be. 

Two general methods have been in use for 
comparing observed and predicted attitudes, 
one based on correlation coefficients, the 
other on analysis of variance. The correla- 
titns between the observed attitudes and 
thosé@predicted by the alternative model are 
high: .998, .999, and .999 for the three pan- 
els of Figure 1 from left to right, respec- 


. tively. Nevertheless, the analysis of variance 


demonstrates that the discrepancies are highly 
significant, (3,45) = 7.84, 5.46, and 19.47, 
respectively, This contrast provides another 
illustration of the grim fact that extremely 
high correlations can be obtained from seri- 
ously incorrect models (Anderson, 1969, 
al971, p. 193; Birnbaum, 1973). 

This methodological issue is important in 
general attitude theory. A prime example is 
Fishbein’s summation model (Fishbein & 
Ajzen, 1975), for it rests almost entirely on 
the evidence of correlation coefficients. 

Of course, any alternative theory that could 
justify a linear transformation of the P’ 
values would also be able to account for the 
data.* The methodological issue remains, how- 
ever, that theories that build on correlation- 
type statistics may well be building on sand 
(Anderson, Note 3). An adequate test of the 
model requires a test of the discrepancies 
from prediction, such as is given by the 
analysis of variance, 


Comments on Self-Estimated Parameters 


The success of the present theoretical pre- 
dictions confers some support on the method 
of self-estimated parameters. However, this 
support should be taken tentatively. The 
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success of integration models in other appli- 
cations implies that subjects’ responses to 


1 Even with a linear model, the common practice 
of correlating observed values with values predicted 
from self-estimates or other parameters estimated 
independently from the given data is valid only 
under severely restricted conditions. To be concrete, 
suppose that information is given about a presidential 
aspirant on two dimensions, personal (integrity, 
charisma, etc.) and political (liberal-conservative). 
The weight and value parameters are denoted by w 
and s for the personal dimension, w and ¢ for the 
political dimension. Suppose that the subject’s atti- 
tude A obeys a linear, additive, or summation rule 
so that, in terms of the true parameters, A = ws + ut. 

In terms of the independent estimates, de- 
noted with “hats,” the predicted attitude becomes: 
A = ûs + úl. Suppose that the estimates are on in- 
terval scales. This means that they are linear func- 
tions of the true values, where slope and intercept 
correspond to zero and unit of the interval scale, In 
general, therefore, 4 4A. However, because of the 
linear relation between the estimates and the true 
parameters, it might be thought that a linear ad- 
justment of the 4 values would solve this problem, 
That is the assumption implicit in the regression 
equation that underlies the correlation coefficient 
between A and A, 

However, it can be shown that this linear adjust- 
ment is valid only under three stringent conditions: 
(a) The weight estimates must be on a ratio scale— 
with the same unit for all variables; (b) the scale 
value estimates must be on interval scales—with the 
same unit for all variables; and (c) either these 
interval scales must be ratio scales, or else different 
values of a given variable must have equal weight. 
Unless the parameter estimates meet these three 
conditions, A and A will not be linearly related even 
though the model is true, 

The requirement of common scale unit seems espe- 
cially problematical. In the example, this would 
require that one measured unit on the personal 
dimension be equivalent to one measured unit on 
the political dimension, which seems probably false, 
and certainly difficult to test. Unless the stimulus 
variables are directly comparable, and estimates ob- 
tained in comparable terms, as in the present experi- 
ments, they can hardly be expected to have a com- 
mon unit. It may be added that extra parameters 
can be introduced into the regression equation to 
allow for arbitrary zeroes and units, In addition, the 
two assumptions of common unit can be avoided by 
using a two-variable regression, 

Apparently this problem has not been previously 
noted. At any rate, published studies of attitude 
models that rely on correlations between predicted 
and observed seem unaware of the problem. Such 
analyses may be inherently invalid. Unless the spec- 
ified assumptions about the parameter estimates are 
satisfied, the model will necessarily fail the test of 
goodness of fit even when it is correct. 
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single stimuli can often be treated as an in- 
terval measure of the true scale values, valid, 
that is, up to zero and unit. For importance 
estimates, however, the evidence is mixed 
(Anderson & Alexander, 1971; Farkas & An- 
derson, 1976). 

Two other approaches to  self-estimated 
parameters may also be noted. In decision 
theory, a number of studies have found that 
subjective estimates of importance do not 
agree with beta weights from regression anal- 
ysis (Slovic & Lichtenstein, 1971). However, 
these comparisons are not generally meaning- 
ful because each beta weight is confounded 
with the scale unit of the corresponding varia- 
ble, and the scale units are not ordinarily 
determinable. 

In attitude theory, attempts to use direct 
estimates of importance include the norma- 
tive ratings of persuasiveness obtained by 
Bishop and Myers (1974) and Vinokur and 
Burnstein (1974) and measures of belief in 
certain studies of Fishbein’s summation 
model (Fishbein & Ajzen, 1975). The valid- 
ity of these measures as true parameter val- 
ues rests completely on the associated model, 
but the cited work rarely if ever provides a 
test of goodness of fit. Other evidence, how- 
ever, suggests that these models are not cor- 
rect (Anderson, 1971, 1974b). 


Discussion 


Averaging Model for Group Discussion 


Both experiments provide good support for 
averaging theory of attitudes formed in group 
discussion. Since this is an initial applica- 
tion to groups, these results must be viewed 
with more than usual caution, and with at- 
tention to the three main theoretical assump- 
tions, 

Of these, the averaging model itself rests on 
a solid empirical foundation, of which atti- 
tude experiments are only a part. This pre- 
vious work with the averaging model provides 
important support for its present application. 

The molar unitization hypothesis has also 
received support, especially in work with at- 
titudes (Anderson, 1959, 1973; Anderson & 
Farkas, 1973; Farkas & Anderson, 1976; 
Sawyers & Anderson, 1971; see also Lane & 
Anderson, 1976). The paragraph-type ma- 


terials used in these experiments present a 
complex informational field, yet it appeared 
that they could be treated as molar units, 
representable by two model parameters, 
weight and scale value. 

The present extension to consider the other 
group members also as molar units seems 
reasonable. This approach has the significant 
advantage of providing a completely valid 
representation of a complex stimulus that 
could seldom be analyzed into its effective 
components. 

The status of self-estimated parameters, 
especially weights, is on less certain ground, 
Strong assumptions are required, as discussed 
above. The method seems reasonably prom- 
ising for scale values and could be quite use- 
ful in that respect. However, it would be 
exceptionally yaluable if it could be estab- 
lished for weights. This would allow simple 
analyses of interactive effects, such as dis- 
counting of deviant members, that would be 
difficult to analyze in any other way. 


Group Polarization 


Group discussion tends to polarize pre- 
existing opinion (Myers & Lamm, 1976), A 
mean group attitude that is positive before 
discussion will typically become more positive 
after discussion; similarly, a negative initial 
attitude will tend to become more negative, 

In the simplest informational view, group 
discussion essentially serves a transmission 
function, so that information initially held 
by one member will be made common to all 
members. The amount of information pos- 
sessed by each member will increase, and it 
seems plausible to expect each member’s atti- 
tude to become more positive or negative 
according to whether the initial mean atti- 
tude is positive or negative. That is the es- 
sential idea of the “partially shared argu- 
ments” hypothesis advanced by Vinokur and 
Burnstein (1974) and by Bishop and Myers 
(1974). At an empirical level, therefore, 
group polarization may be no more than a 
manifestation of the well-known set-size ef- 
fect, as has been emphasized by Kaplan 
(Note 1) and Myers and Kaplan (1976), 

Support for this informational yiew js 
easily obtained when the initial distribution 
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of information is experimentally manipu- 


lated, as was done here, and in the cited 


work by Kaplan. However, more definite 
evidence is needed when the initial distribu- 
tion of information is only a reflection of 
individual differences. The work of Vinokur 
and Burnstein and of Bishop and Myers pro- 
vides such evidence and so plays an impor- 
tant role in the informational interpretation 
of group polarization. 

Nevertheless, two important questions re- 
main, It may seem obvious that adding new 
information of the same polarity will increase 
the polarity of the attitude; in fact this is 
often not the case (eg., Anderson, 1973, 
1974a, Figure 2, 1974b, Section 2). Amount 
of information presents a fundamental theo- 
retical problem that does not yet have a com- 
pletely convincing explanation. The first ques- 
tion, therefore, relates to theoretical models 
for information integration. A second question 
concerns possible operation of processes in 
group discussion over and above mere trans- 
mission of information. 

Averaging model. The averaging model of 
information integration theory has given a 
good account of the set-size effect and related 
phenomena in individual attitudes. It can 
account for group polarization in a parallel 
manner. In addition, it implies that amount 
of polarization will increase with group size 
(Teger & Pruitt, 1967) and will be an in- 
verted-U function of magnitude of initial atti- 
tude (Myers & Lamm, 1976), 

However, the averaging model does present 
a theoretical puzzle. It predicts no change if 
the information transmitted by each member 
has the same scale value as his (her) atti- 
tude, at least if all members have equal 
weight. To account for group polarization, 
therefore, would require that the information 
that a member transmits be more extreme 
than his (her) attitude, But this will be the 
case, according to averaging theory, if each 
member simply transmits the information on 
which his (her) attitude is based, as in the 
present experiment. By Equation 2, his (her) 
attitude includes the neutral Jọ component 
and so is ordinarily less extreme than the 
scale value of the information, The trans- 
mitted information is self-redundant, of 
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course, and need not polarize the given mem- 
ber’s attitude. 

Other models. The difficulty faced by the 
averaging model in accounting for amount of 
information also faces the model considered 
by Vinokur and Burnstein (1974). They cal- 
culated a prorisk index as a weighted average 
persuasiveness of prorisk arguments, and an 
analogous procaution index. The difference 
between these two was used as an index of 
group shift. 

But suppose that the number of prorisk 
arguments was increased, keeping the argu- 
ments otherwise equivalent. Then the prorisk 
index of Vinokur and Burnstein’s Equation 4 
would be unchanged and no polarization 
would be predicted, This is contrary to the 
substantial set-size effect observed in the 
present data. Some change in the model might 
resolve this difficulty, but it faces other prob- 
lems noted in the introduction. 

The model used by Bishop and Myers 
(1974, p. 97; see also Myers & Lamm, 1976) 
has some similarities to the present formula- 
tion. They assumed that each argument pro- 
duces a shift in group opinion equal to its 
persuasiveness multiplied by the proportion of 
group members initially unfamiliar with that 
argument. Total shift in group opinion was 
assumed to be the sum of the shifts produced 
by the individual arguments. 

The Bishop and Myers model implies that 
group shift will be a linear function of amount 
of equivalent information, This is contrary 
to the present data. In Experiment 2, for 
example, the three high (or low) paragraphs 
that enter into the postexchange attitude 
produce much less than three times the effect 
of the one high (or low) paragraph that en- 
ters into the preexchange attitude. More gen- 
erally, some skepticism may be voiced over 
the use of a summation rule when the evi- 
dence from general attitude theory so strongly 
favors an averaging rule. 

A self-perception approach. Ebbesen and 
Bowers (1974) showed that the ratio of risky 
to conservative arguments in the group dis- 
cussion is an important determinant of mem- 
ber opinion. They interpreted this result in 
terms of social attribution and Bem’s (1972) 
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self-perception theory: 


The more often other people in the group argue 
that the high-risk alternative is valuable, the 
more likely it is that the subject sees these argu- 
ments as being caused by a property of the choice 
dilemma and, therefore, the more likely he should 
be to assign a high value to the risky alternative 
(p. 317). 


This self-perception analysis seems strained. 
It ignores the semantic content of the argu- 
ments, which is presumably the major deter- 
minant of their effect. And although it allows 
for source effects, in contrast with the models 
of Bishop and Myers (1974) and of Vinokur 
and Burnstein (1974), it seems to make no 
allowance for redundancy. In general, the 
self-perception approach seems inadequate 

because it is not aimed at the basic informa- 
tional determinants. 

Ebbesen and Bowers (1974) did not con- 
sider a formal model, but their Experiment 3 
provides a test of the present integration 
model. Individual subjects listened to realistic 
but acted group discussions deliberately con- 
structed to contain specific proportions of 
risky to conservative arguments. The shift in 
opinion was approximately a linear function 
of the argument proportion, a result that 
accords with integration theory (Anderson, 
1971, p. 191). The free-discussion data in 
their Figure 2 are perhaps interpretable in 
the same way. 

The technique of constructed discussions 
used by Ebbesen and Bowers has considerable 
potential. It provides control over informa- 
tional variables yet still retains some es- 
sence of group discussion. Many of the basic 
issues in person perception and attitude 
theory could be directly extended to group 
discussion in this manner, 

Active effects of group discussion. The 
formulations that have been considered above 
are passive in the sense that the only func- 
tion of the group discussion is to allow trans- 
mission of information. But other, more ac- 
tive processes could also be operative. For 

example, the discussion task may evoke a 
constructive process that contributes to the 
development of each member's own attitude 


(Anderson, 1974a, p. 89). Seeking for con- 
tributions to the discussion, each member may 


generate arguments that are effectively new ~ 
to himself (herself) and thereby polarize his 
(her) own attitude, independent of any in- 
formation received from the other members. 

Further, there is a very real possibility 
that the discussion task induces polarization 
within the flow of information itself so that 
weaker arguments, or those for a minority or 
unpopular position, are suppressed or lost. 
Thus, members who are motivated to per- 
suade others may concentrate on their 
stronger arguments. Members who have more 
extreme attitudes may tend to dominate the 
discussion. Members who lean toward a mi- 
nority position may be motivated to conform 
to the majority. : 

But although there is no shortage of possi- 
ble mechanisms for an informational inter- 
pretation of group polarization, there is a 
shortage of evidence, Indeed, there seems to 
be little or no evidence to distinguish among 
the several mechanisms, active and passive, 
that have just been mentioned. It would be 
hard to disagree with the assumption of My- 
ers and Lamm (1976) that discussion argu- 
ments are influenced by social context. Yet 
the only evidence they present, that discus- 
sion arguments are more polarized than writ- 
ten arguments, is not too convincing, as in- 
deed they carefully pointed out. 


Two Suggestions on Method 


An informational view of group interaction 
has been advocated by many writers, espe- 
cially in the area of risky shift (see also re- 
views by Kelley & Thibaut, 1969; Myers & 
Lamm, 1976). However, this view has yet to 
be effectively coupled to experimental analy- 
sis. The most rudimentary questions, concern- 
ing amount and repetition of information, for 
example, have barely begun to be studied. In 
part, that is because the nature of group dis- 
cussion makes it difficult to measure and 
monitor the flow of information, 

However, it also seems to be true that 
current experimental procedures are no 
suited to implementing an informational 
analysis. This point echoes Cartwright’s 
(1973) criticism that the tisky shift area has 
steered away from the “concrete Processes 
actually responsible for the shifts in observed 
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test scores” (p. 229). Accordingly, two sug- 
gestions for a change in experimental empha- 
sis will be briefly summarized here (Ander- 
son & Graesser, Note 2). 

The first suggestion is for major emphasis 
on attitudes and impressions of the individual 
group members. This calls for a shift away 
from the problem-solving orientation that has 
dominated much of the research on group 
theory. Equally, it calls for a shift away 
from the study of group product, toward the 
study of individual behavior within the group. 
A prime experimental corollary is a shift 
away from the consensus requirement that 
has been dominant in group research. 

To reach consensus, the individual group 
members will typically have to compromise 
their own attitudes or preferred positions. 
Thus, a consensus requirement introduces 
forces beyond the informational and per- 
suasive influences that form the basis of indi- 
vidual attitudes. Although such consensus 
forces have direct interest in problem solving 
and bargaining, they largely confound the 
analysis of the influences on individual atti- 
tudes. 

The distinction between these two sets of 
forces can be represented as a two-stage 
process. The first stage is one of attitude 
formation by the individual members. Theo- 
retically, the first stage should obey averaging 
theory, as illustrated in the present experi- 
ment. The second stage is one of compromise 
among the group members. Theoretically, the 
second stage should obey the social averag- 
ing theorem (Anderson & Graesser, Note 2). 

These two stages will not always be sepa- 
rable, of course, In particular, the consensus 
requirement may affect individual attitudes. 
Nevertheless, the two sets of forces seem to 
be qualitatively different, and they deserve 
study in their pure form. 

The second suggestion is for more extensive 
union of experimental control with natural- 
istic group interaction. This is an old idea, 
going back to the early studies of conformity 
and of communication networks. It may seem 
gratuitous to bring it up, since it will obvi- 
ously be impossible to unravel the tangled 
skein of multiple causation and correlation in 
group dynamics without some measure of ex- 
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perimental control. Nevertheless, the risky 
shift literature has largely neglected this 
point. 

One much-used method in conformity studies 
rests on confederates to present specified 
positions. That would allow, for example, a 
simple test of Vinokur and Burnstein’s (1974) 
hypothesis of no effect when one member 
presents an argument with which another 
member is already familiar. Various manipu- 
lations of source credibility could also be 
studied within the discussion context much 
as they have been studied in attitude theory. 
Individual differences in persuasiveness and 
persuasibility could be similarly studied. 

The method of information priming can 
also provide valuable experimental control, 
Its use in model analysis has been illustrated 
in the present experiments. Another use would 
be in combination with time-series analysis 
of discussion content. Different specific pieces 
of information would be given to different 
group members, and the path traced out by 
each piece of information as it circulates 
through the group could provide important 
clues about the interaction process. This kind 
of content analysis could be related in a di- 
rect, causal way to member attitudes, yield- 
ing results more relevant to substantive ques- 
tions of group theory than the typology of 
Bales (1950). Further, this suggested ap- 
proach would combine well with Bavelas’ 
(1950) communication networks, since they 
can provide experimental control over the 
flow of information. This shift of emphasis 
away from group product to member attitude 
could lead to new applications of the network 
paradigm. 
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A conditioned emotional response paradigm was applied to human subjects. It 
ri was hypothesized that the introduction of a person who had been previously 
l * paired with an unpleasant event would produce response decrements in the 
) 
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performance of subjects engaged in a simple repetitive task. The hypothesis 
l was confirmed. The results of a control group ruled out the interpretation that 
\ the decrement in performance might be due merely to the distracting effect of 


, Infrahuman studies of conditioned suppres- 
sion have consistently shown that a neutral 
stimulus (e.g., light or noise) that had been 
paired with shock will suppress the response 
rate when introduced by itself during a free- 
responding situation. The hypothetical mecha- 
nism thought to underlie the effect is a 
loosely defined “conditioned emotional re- 
sponse” (CER), which if sufficiently strong 
when elicited by the conditioned stimulus 
_ (CS), disrupts ongoing behavior. Attempts 
to establish conditioned suppression in hu- 
mans have used various sensory stimuli, also 
to signal shock, but have generally failed to 
obtain the same reliable response decrements 
found with lower organisms. For example, 
Kanfer (1958), using verbal rate as a base- 
line, found that CS onset facilitated rather 
than suppressed responding. Of two studies 
employing rate of lever pressing as the depen- 
dent measure, one (Sachs & May, 1967) 
found no change in response rate as a func- 
tion of CER procedures; the other (Mulder, 
Lyon, & Pott, Note 1) reported that out of 
25 subjects, 9 refused to continue (as a result 
of the fear of painful shocks), and of the 16 
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t ‘ the stimulus person, The implications of one person acting as a conditioned 
j suppressor for another person were discussed. 


that remained, 4 suppressed. Using a very 
complex matching to sample task, Sachs and 
May (1969) reported that out of 6 subjects, 
3 suppressed in the presence of the CS, 1 
showed facilitation, and 2 showed no change. 

The equivocal success of CER procedures 
with humans was always obtained in studies 
in which the CS was a simple sensory stimu- 
lus like a tone or light and the unconditioned 
stimulus (UCS) also a simple sensory event, 
electric shock. While the same class of stim- 
uli may effectively suppress behavior in lower 
organisms, it may be less effective in sup- 
pressing human behavior. In fact, the notion 
of matching the stimulus situation with the 
species under test dates back to Thorndike’s 
(1935) concept of “belongingness,” which is 
just now beginning to be reckoned with in 
the field of animal learning and, in particular, 
toxicosis research (Garcia, Kimeldorf, & 
Koelling, 1955). In a variety of experiments it 
has been shown that the nature of the stimuli 
that are most readily associated is highly 
dependent upon species (Garcia & Koelling, 
1966; Wilcoxin, Dragoin, & Koal, 1969). The 
application of this concept to the conditioned 
suppression paradigm would necessitate the 
selection of a stimulus situation that is an 
integral part of the subject’s biological niche 
and one that can be incorporated easily into 
a Paylovian conditioning procedure. It would 
intuitively seem that the stimulus situation 
most meaningful to a human subject would 
be a social situation involving other individ- 
uals. The social psychological literature, for 
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example, offers examples of overt human be- 
havior being markedly affected by social stim- 
uli in the form of critical comments or reports 
of failure (McClelland & Apicella, 1945; 
Schalon, 1968). The intention of the present 
investigators is to examine the feasibility of 
incorporating social stimuli into the condi- 
tioned suppression paradigm, 

The CS was a person whose presence was 
paired with an unpleasant social situation 
that was assumed to be functionally equiv- 
alent to a UCS. It was hypothesized that 
while an experimental subject was steadily 
performing a simple response task, the intro- 
duction of the CS person would produce a 
greater response rate decrement than would 
the introduction of the same person before a 
control subject whose previous experience with 
the CS person had been only in a “neutral” 
social setting. 


METHOD 
Subjects 


The subjects were 18 female students at Con- 
necticut College enrolled in freshman and sophomore 
psychology courses who had volunteered for two 
short, supposedly unrelated experiments. Nine of the 
subjects were randomly assigned to the control 
condition and nine to the experimental condition. 


Procedure 


Subjects were individually run. Phase 1 (Experi- 
ment 1, from the subject’s perspective) was designed 
to establish one experimenter (Experimenter 1, a 
male graduate student) as a Pavlovian, conditioned 
“aversive” stimulus for an experimental subject and 
as a “neutral” stimulus for a control subject. In this 
phase (described to subjects as an “interest inven- 
tory”), a person was asked by Experimenter 1 to 
complete two columns of the answer sheet of the 
Kudor Interest Inventory Scale. The pin ordinarily 
used to punch out the answer was available but 
partly concealed by its placement in the lower right 
corner of the test booklet cover, A pen was left in 
plain view on the desk, in the hopes that the sub- 
ject would mistakenly use it for punching the an- 
Swer sheet, and in so doing, provide a realistic 
Opportunity for presentation of the noxious event in 
association with the experimenter. Details of the 
procedure were as follows. 

Experimenter 1 told each subject that she had 
exactly 10 minutes to read the test instructions (the 
Standard instructions for the Kudor scale) and to 
Complete the first two columns of the test. She was 
further told that subjects’ data were being fed into 
a computer once every 15 minutes, that the experi- 
Ment was on a very tight schedule, and that if for 
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| 
some reason this schedule could not be mei, the 
whole day's work would “be shot.” The exper! enter 
then left the room. 
After 10 minutes, he returned, With the peri- 
mental subjects, he pretended to be shock 


vith | 


what he found and said: 


Oh no! You used the pen rather than 
that was provided right here. The machi 
rates these will never take ink. This me: 
the whole computer schedule has been 
Oh, now I'll have to cancel the rest of th 
subjects. Look, the second experiment thi 
signed up for is probably ready, so you'd bet 


Control subjects were given the same test 
instructions, but when the experimenter ro ned 
after 10 minutes he said, “O.K., thanks. Why Vt) 
you proceed to the second experiment that a} 


signed up for.” 
The second phase of the experiment (an unr d 
experiment as far as the subject was concerned x 
designed to provide a simple response task ago 
which the effects of Phase 1 could be meas 
The subject was brought into a different roon 
a second experimenter (Experimenter 2, a fen 
graduate student) and seated at a desk. In front 
the subject were 200 index cards and an answ 
sheet. On each card was a subtraction example con 
sisting of two three-digit numbers. Experimenter 2 ¢ 
informed a subject that the experiment was a test 
of speed and accuracy in doing subtraction examples * 
and that her speed would be recorded with a stop- Í 
watch. The subject was told not to stop working = 
when the stopwatch clicked or for any other reason{ 7 
The experimenter then recorded the time it took a ~ 
subject to complete each successive set of five sub 
traction examples. A response unit of five samples 
was chosen beforehand by considering the probable 
speed of an average subject and the experimenter’s 
dexterity. A subject was given four sets of Warm-up 
examples (20 examples) to allow for acclimation to 
the new situation and a wearing off of the residual 
effects of the previous phase. The time it took to 
complete each of the next three sets (Sets 5, 


to be looking for something in a book that had been, 
planted on another desk, Experimenter 2 continued 4 


and did the same for each of three more sets 
formed after Experimenter 1 had left, to provide $ 
a postreatment measure, ; 


The procedure for the second phase of the ex; 


ment was identical for experimental and control 
subjects. 


At the conclusion of the second 
was thoroughly debriefed. A subject in the experi- 
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t she had not disrupted any aspect of the 
nt by using the pen rather than the pin, 


RESULTS 


it took a subject to run through 
cessive set of five subtraction ex- 
was recorded. The mean time spent 
last three pretreatment sets (base- 
variable number of treatment sets, 
hree posttreatment sets were com- 
entered into a mixed-design analysis 

c. The data are shown in Figure 1. 
ost striking feature of the figure is 
¿d departure from baseline perform- 
experimental subjects relative to 
ıt the treatment condition. The 
eflects an increase in the amount 
juired by the experimental subjects 
te a set of subtraction examples in 
nce of Experimenter 1 (the condi- 
ersive stimulus). The reliability of 
‘yation is supported by the presence 
oups (experimental, control) X Ses- 
iseline, treatment, posttreatment) in- 


m, F(2,32) =3.72; p<.05, in the 


urthermore, a simple-effects analysis 
nce performed over sessions for each 
\dividually indicates that the observed 
in response time over sessions are 
nt only in the experimental group, 
i = 4.03, p < .05. Response time in- 
from baseline to treatment conditions 
out of nine experimental subjects. 
n to baseline following treatment was 
in seven out of nine experimental 


Discussion 


-esults confirm the hypothesis that a 
med suppression paradigm using a per- 
viously associated with an embarrass- 
produce 
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experimenter had previously been associated 
pie a neutral situation. 
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TIME PER SET (SEC) 


BASELINE TREATMENT POST TREATMENT 


Ficure 1. Mean subtraction example solution 
times before (baseline), during (treatment), and 
after (posttreatment) presentation of the conditioned 
stimulus person to experimental and control subjects. 


On the one hand, this capacity of a CER 
to interfere with human performance is in- 
consistent with Sachs and May’s (1969) 
conclusion that perhaps “neither suppression 
nor facilitation is a characteristic reaction of 
humans to CER procedure” (p. 1007). On 
the other hand, the results may support the 
“belongingness” concept: that conditioned 
suppression may be more apt to occur in hu- 
mans or in any other species if the condition- 
ing and testing procedures are adapted to the 
biological niche of the organism being tested. 
Whether a belongingness hypothesis is indeed 
appropriate for these results, however, de- 
pends upon its success in competing’ with 
other reasonable explanations. The most ap- 
parent of these is based on stimulus intensity. 
Since the speed of conditioning frequently 
corresponds to US intensity, perhaps the 
greater degree of behavioral suppression ob- 
tained here was the result, simply, of an in- 
sult of greater subjective intensity than that 
of the particular shock Jevels used in previous 
studies. To attribute the present results to 
stimulus intensity per se, it would be neces- 
sary to show (a) that on some scale of aver- 
siveness our insult occupied a higher ordinal 
position than did any of the shock levels 
previously employed and (b) that insults in 
general are not more aversive to people than 
shocks. If the initial intuition and raison 
d'être for this experiment were correct, how- 
ever, then Assumption b would be incorrect, 
and indeed negative social events would, as a 
rule, be shown to have greater behavioral ef- 
fect on people than would simple shocks. In 


226 


this case, belongingness would be the theory 
of choice. Because US intensities have not 
been compared in this manner, it is impos- 
sible to rule out explicitly an argument based 
purely on differential stimulus intensity. We 
Must point out, however, that what is meant 
by stimulus “intensity” or “potency” requires 
clarification, for the issue is more complex 
than is at first apparent. The shocks used in 
previous CER studies were clearly intense 
and aversive. Indeed 9 of 25 subjects in the 
Mulder et al. (Note 1) study were affected 
enough to drop out of the experiment mid- 
way. By this measure alone, the shocks in 
question could be shown to be more aversive 
than our insult, which Produced no such at- 
trition rate in our subject pool. On the other 
hand, this “intense” level of shock was in- 
capable of suppressing the ongoing response 
rate, whereas our “milder” insult was in this 
regard quite effective. Tt would seem to us 
that a simple comparison of qualitatively dif- 
ferent USs on a single quantitative scale is 
not a meaningful comparison for the kinds of 
behavioral effects that we have been trying 
to explain. Perhaps for the moment it is suf- 
ficient to Say that in humans, anticipation of 
intense physical stimuli (e.g., shocks) does 
not lead to the same kinds of behavioral con- 
sequences as does the anticipation of socially 
meaningful events, Specifically, the former may 
yield a tensing and flinching of the affected 
limb; the latter, brooding, self-evaluation, and 
preoccupation. Whereas tensing and flinching 
may not interfere with the rate of perform- 
ance in a task not explicitly dependent on 
that limb, brooding and the like will most 
assuredly interfere with any task requiring 
some degree of concentration. 

Perhaps the most noteworthy result of the 
present approach was that reliable response 
suppression was obtained after only one pair- 
ing of the CS person with the aversive event. 
Reports of one-trial Pavlovian learning are 
rare, and its occurrence in this study may 
be an index of the power of the present 
“matching” procedures. In contrast, there are 
two less attractive interpretations that would 
attribute the success of the present CER pro- 
cedure either to the difficulty of the particular 
task employed or to the possible sensitizing 
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effects of the treatment. While the firs; 
Pothesis is readily testable, it is never tli¢ 
doubtful in view of the far more com)! x 
tasks that have remained unsuppressed jr 1e 
face of direct Pavlovian procedures (Sac i 
May, 1969). The second hypothesis is «4. 
siderably weakened by the adaptation periad 
data. Tf sensitization and not stimulus 

trol were responsible for suppression, then 
considerable suppression should have h n 
generated by 
Experimenter 2 in Phase 2 of the experiment, 


the prebaseline warm-up period at the outset kL.” 
of Phase 2, Response rate, however, was. no 

control 
subjects during these Seven warm-up sets 
(£< 1.0). A more complete demonstration of 
control, however, 
if experimental 
to a number of 
neutral stimulus Persons who varied in degree 
of similarity to the experimenter. The steep- 
ness of the generalization gradient so obtained 
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be understood from the verbalized reports of 
the subjects. This by no means implies that 
the basis of the human CER would be the 
same as for the rat but rather that with the 
judicious use of postexperimental question- 
naires or subject selection based on various 
scales of emotionality, personality, etc., a 
closer view of the psychological dynamics of 
the CER (i.e., as limited to humans within 
human social situations) might be possible. 
In the present experiment, an informal post- 
experimental interview with each subject re- 
vealed that experimental subjects tended to 
feel upset and often embarrassed in the 
presence of Experimenter 1 during the test 
phase, whereas control subjects generally 
claimed to be unaffected. This would seem 
to support those theories identifying the CER 
‘in humans with a conditioned form of anxiety 
- (Watson & Raynor, 1920). 


CONCLUSION 


The implications of this study may be of 
theoretical as well as practical import. Theo- 
retically, the study offers experimental evi- 
dence in addition to that provided by taste- 
aversion research (i.e., toxicosis) suggesting 
that “belongingness” may be an important 
independent variable in the study of learning 
and performance processes. Moreover, a po- 
tentially powerful experimental technique is 
offered for the study of the processes involved 
in conditioned suppression, as well as the 
effects of various personality variables and 
social psychological conditions on behavioral 
suppression. From a more utilitarian perspec- 
tive, the relative ease with which conditioned 
suppression was demonstrated in a human 
social situation impresses one with the likeli- 
hood of the phenomenon occurring frequently 
in normal social interaction. If a subject sup- 
presses in the presence of a CS person after 
a single conditioning trial, how much more 
so does the CS teacher suppress test per- 
formance speed of the student; the CS fore- 
_ man, the efficiency of the worker, and so on 


where many conditioning trials are likely to 
have occurred? From this perspective, it 
would seem worthwhile to examine generali- 
zation gradients in the human CER, espe- 
cially in relation to personality type and age 
group. Furthermore, studies investigating pos- 
sible methods of CER extinction or reversal 
might have practical value in clinical research. 


REFERENCE NOTE 


1. Mulder, D. W., Lyon, D, O., & Pott, E. W. Con- 
ditioned suppression in humans. Paper presented 
at the meeting of the Midwestern Psychological 
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The role of nonverbal communication 


of Kansas 


in interpersonal learning was analyzed 


via an extension of the double agent research procedure. The new procedure 


permitted detection of the subtle use 
positive and negative 
information was used by learners. Fi 


dyads were subgrouped into four treatments v: 
verbal feedback and its necessity for learning. 


feedback to learners anı 
‘orty randomly constructed teacher—learner 


of nonverbal behaviors by teachers as 
d also the degree to which this 


arying in the legitimacy of non- 
Major results were that teachers 


provided substantial nonverbal information, apparently without awareness, re- 


gardless of its necessity for learning or 


its permissibility; nonverbal cues were 


more subtle in form and slower to appear when not legitimized; students failed 
to use nonlegitimized nonverbal information except when privately informed 
of its necessity for performance. Implications of the results for testing and 
training in nonverbal skill were discussed, 


This report describes studies employing a 
new procedure for the elucidation of subtle 
social learning processes that are assumed to 
operate with varying efficiency in a wide 
range of natural face-to-face encounters. A 
fundamental implication of social learning 
theory is that the behavior of each participant 
in a social interaction will come under the 
control of the reinforcing outcomes of the 
other participants. Thus, over time, the behav- 
iors of each should consist of an increasingly 
larger proportion of those responses that are 
more socially reinforcing. To verify this model, 
it is necessary to demonstrate that it accounts 
for behaviors that are actually employed in 
social interaction. 

Early procedures for testing the applica- 
bility of reinforcement principles to the analy- 
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sis of interpersonal behavior were more con- 
cerned with demonstrating the susceptibility 
of social behavior to operant conditioning 
than with accounting for the details of natural 
interpersonal influence processes (Kanfer, 
1968). The prototype was the verbal condi- 
tioning method of Greenspoon (1955) in 
which the social behavior of the subject was 
restricted to emitting single nouns, one arbi- 
trary form of which was differentially at- 
tended to by a preprogrammed experimenter. 
A more interactional approach (Sidowski, 
1957) permitted each participant to push one 
of a pair of buttons that happened to de- 
liver rewarding or punishing outcomes to the 
other person. 

While both procedures confirmed that so- 
cial behavior has operant properties, neither 
permitted determination of the degree, man- 
ner, or conditions of usage of social reinforce- 
ment in situations where participants are 
free to employ responses from their natural 
behavioral repertoire, In addition, many such 
studies were criticized because their apparent 
conditioning effects could alternatively be in- 
terpreted as mere conformity by subjects to 
obvious reinforcement contingencies that they 
assumed the experimenter wished to be ef- 
fective (Spielberger & DeNike, 1966). 

To overcome these limitations, Rosenfeld 
and Baer (1969, 1970) devised the double 
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agent procedure in which the experimenter 
posed as a subject and the real subject was 
provided with an ineffective method of de- 
livering points, This interpersonal reinforce- 
ment paradigm distracted subjects from 
awareness of the experimenter’s efforts to in- 
fluence their behavior and also gave them 
regular opportunities to respond verbally to 
the experimenter. Through the double agent 
procedure, the experimenter was able to subtly 
provide verbal reinforcers to selected cate- 
gories of the subject’s verbal behavior. 

An extension of the double agent paradigm 
was subsequently devised by Rosenfeld and 
Gunnell (Note 1) to assess objectively the 
forms of verbal response naturally used by 
a subject to reward or punish the confederate 
experimenter. Prior to the conditioning phase, 
the experimenter performed desirable and un- 
desirable verbal responses on a random sched- 
ule to determine whether any verbal re- 
sponses of the subject were reliably associated 
with one class of the experimenter’s behavior. 
If a form of response by the subject con- 
sistently followed only desirable responses of 
the experimenter, it was considered to be 
used as.a reinforcer; if it followed undesirable 
responses, it was assumed to be used as a 
punisher., > 

Although the raridom responses of the ex- 
perimenter were necessary to determine ob- 
jectively which forms of the subject’s re- 


sponses were naturally used as attempted re- 


wards and punishments, the procedure tended 
to have the additional effect of soon ex- 
tinguishing those very response forms. That 
is, the continued random behavior of the ex- 
perimenter failed to reinforce the categories 
of behavior used by the subject to influence 
the experimenter’s behavior, One apparent 
outcome of the latter effect was that when 
the subject’s preferred forms of verbal in- 
fluenee were ineffective, they tended to shift 
to Tess likely forms, But due to the built-in 


. extinction, sometimes a subjects response 
‘form would be abandoned before it occurred 


sufficiently often to be reliably identified as 
a reward or punishment. 

. In contrast, if another naive subject had 
been substituted for the randomly program- 
med confederate, then the original subject’s 
response forms would more likely be rein- 


forced, Such strengthening of the usage of 
normally functional response forms within a 
truly interactive relationship has been dem- 
onstrated (e.g., Rosenfeld, 1966, 1972), The 
occurrence of new forms of response in such 
contexts has been referred to as “social 
emergents” (Lindsley, 1966). 

To permit the emergence of subtle influence 
processes, the present study introduces an 
adaptation of the double agent procedure in 
which both members of dyads are permitted 
degrees of freedom in response selection, On 
the basis of substantial evidence that ele- 
mentary forms of positive and negative reac- 
tions are most ubiquitously and reliably com- 
municated by nonyerbal activities of the face 
and head (see Rosenfeld, in press), a social 
behavior setting was constructed in which 
only face and head behavior was available as 
an alternative to experimentally provided 
forms of influence. 

Dyads were given a learning task in which 
the goal of both the teacher and the learner 
was to increase the performance of the 
learner. The correct and incorrect perform- 
ances of the learner provided occasions on 
which the teacher could employ his nonverbal 
reactions as reinforcers or punishers. How- 
ever, in_all but one condition no instructions 
were given to the teacher concerning the usage 
of nonverbal behaviors; in fact, such usage 
was implicitly prohibited. Nor was the learner 
instructed to strengthen a particular non- 
verbal response of the teacher, as he would 
have been in the original double agent pro- 
cedure. The emphasis in this study was not 
operant conditioning of responses but rather 
identification of the self-generated forms of 
behavior that persons employ to influence 
others when ineffective means are provided. 
Both usage and consequences of nonverbal 
expression were free to vary. 

Due to the structure of the situation and 
task, nonverbal expressions used functionally 
in the task were expected to be primarily in 
the form of what Ekman and Friesen (1969) 
described as “emblems.” An emblem is a non- 


- verbal response that symbolizes a message 


that could be verbally stated in a word or 
phrase. Emblems are often used when verbal 
communication is inconvenient or impossible. 
The emblem has shared meaning for a 
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group of observers and is generally readily 
understood, In this study, a face-head em- 
blem, successfully employed, would act as a 
positive or negative social reinforcer of cor- 
rect and incorrect performances of the 
learner.* 

In the procedure to be described, the 
teacher was instructed to use a nonsocial 
(mechanical) form of feedback to influence 
the performance of the learner. In some con- 
ditions, the mechanical form of feedback was 
caused to be ineffective without the knowl- 
edge of the teacher. Several hypotheses con- 
cerning the role of nonverbal communication 
in the social learning process were testable 
within the research design. Generalizing from 
prior double agent research on verbal be- 
havior, it was predicted that when the me- 
chanical feedback failed to facilitate learn- 
ing, the teacher would employ nonverbal 
behavior as feedback on performance. The ef- 
fectiveness of the nonverbal emblems in pro- 
moting learning was expected to be greater to 
the degree that learners were made aware of 
their potential functional usage by teachers. 
These hypotheses about effects on learning 
were tested in Experiment 1. 

In addition, the provision of nonverbal 
feedback by teachers was expected to be less 
subtle to the degree that nonverbal signaling 
was legitimized as a means of influence. 
Evidence pertaining to this hypothesis and 
subsidiary questions concerning nonverbal 
communication were obtained by additional 
procedures in which videotaped records of be- 
havior in the experiment were coded by small 
groups of independent observers. The coding 
studies are reported as Experiments 2, 3, 
and 4. 


EXPERIMENT 1 


Unacquainted subjects were paired in 
sine of a “teacher” and a “learner.” The 
ruited from introductory courses 
ent and education at the Uni- 
he learners were recruited from 
-tions and psychology. Subjects 

y to three conditions. A fourth 

1) was conducted at the be- 

z semester to aid in the in- 

ilts of one of the original 

n 2), In all, 46 dyads took 

; of which were excluded 


part in the ¢ 
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among the four conditions. Thi 
tions 1-3 were run in random order. 
in Condition 4, although run consecutively, were re- 
cruited from populations that were apparently 
equivalent to those from which subjects were drawn 
in the first three conditions. 

Apparatus. The teachers and learners communi- 
cated between separate laboratory rooms over closed- 
circuit television (see Figure 1). The equipment per- 
mitted a semblance of a face-to-face interaction 
including eye contact, while maintaining the separa- 
tion necessary for the task. The setting was similar 
to that employed by Gunnell and Rosenfeld (Note 
2). The video image of each subject was reflected 
from a monitor in the other’s room onto a one-way 
window that acted as a mirror. If the teacher were 
looking at the learner’s image as reflected on the 
window, a camera in a darkened room behind the 
window that was aimed at the level of the image of 
the learner’s eyes would transmit the image of the 
teacher to the monitor in the Jearner’s room. The 
image of the teacher would be reflected from the 
monitor in the learners room onto the one-way 
window in front of the learner. When the learner 
looked at the image of the teacher, the teacher ap- 
peared to be looking directly at him, The same pro- 
cedure was followed to transmit the learner’s image 
to the teacher. Neither subject could see the cameras. 
The instantaneous transmission between subjects 
permitted a continuous interaction as well as main- 
taining eye contact. The separation of the subjects 
permitted a split-image videotape recording of the 
full face of each subject to be made for later analy- 
sis and transcription. It also allowed information 
from the experimenter to be delivered privately to 
either subject. The direction of scanning in each 
television monitor was reversed so that letters and 
numbers would appear normal when reflected off of 
the one-way windows. 


Procedure. The procedure was as follows: 


E Task: For each dyad, a word-discrimination 
task was given by the teacher. In each trial, the 


1 To comply with the present definition of emblem, 
a nonverbal response need only be reliably inter- 
pretable as meaning “right” or “wrong.” We do not 
assume that the responder intended to elicit that re- 
action. Thus positive or negative facial expressions 
might be included among emblems in the present con- 
text, even though they could also be considered “af- 
fect displays” in Ekman and Friesen’s classificatory 
system. Similarly, head nods or shakes could be em- 
blems even though they might also function as 
“regulators” of response flow. We consider an emblem 
to have been functionally employed in this study 
when the user reliably emitted it after a correct or 
incorrect response and effectively employed when the 
receiver used this information to enhance his per- 
formance. 


NONVERBAL REINFORCERS IN DYADIC LEARNING 


Teacher's Room 


Learner's Room 
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One-way windows 


S P TV cameras 
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Ficure 1. Schematic diagram (nonscale) of experi- 
mental setting and: major apparatus. (Teacher and 
learner communicated between separate rooms over a 
two-way closed-circuit television system (teacher's 
subsystem is shaded). Each participant’s video image 
was reflected from a television monitor in the other's 
room onto a one-way window that served as a mir- 
ror, Cameras at eye level in the darkened room of 
the experimenter permitted the impression of eye-to- 
eye contact between participants. A suspended set of 
two-choice verbal stimulus cards and foot pedals for 
giving positive and negative points were located be- 
fore the teacher's chair, Points delivered appeared on 
learner’s monitor. All transactions were recorded on 
videotape.) 


teacher was to present two words to the learner, 
who was to choose the correct word. The words were 
written, one above the other, on stimulus cards 
hanging on rings from a bar in front of the teacher 
(see Figure 1). The correct words were written on 
the backs of stimulus cards for the teachers. From 
a counter that was visible only to the learner, the 
teacher was to use foot pedals to add or subtract 
points for right and wrong responses. 

The points, a cumulative count of trials, and the 
number of correct responses per block could be dis- 
played in an upper corner of the television picture 
received by the learner, The numbers were seen only 
by the learner, enabling the investigators to prevent 
transmission of points given by the teacher without 
his knowledge. The procedure also helped to orient 
the learner’s attention to the televised image of the 
teacher’s face. The counters appeared above the 
image of the teacher’s head, and the stimulus cards 
for the word pairs appeared at the level of the 
teacher's chest. The learner would at least receive a 
“tachistoscopic” impression of the teacher as he 
looked from the cards to the counters. 

In three of the four conditions, unbeknownst to 
the teacher, the learner did not receive the points 
controlled by the teacher, In these three conditions 
learning the task could reflect only the effect of the 
teacher’s nonverbal reactions on the learner. To give 
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learners an apparent, although ineffective, means of 
learning, they were told that the task involved con- 
cept formation. 

To create a real but unusable concept, 50 word 
pairs were constructed from the word frequency list 
of Thorndike and Lorge (1944). One word in each 
pair occurred about 10 more times per 100,000 than 
the other. Zajonc (1970; Zajonc & Rejecki, 1969), 
using word pairs in which the frequency difference 
was well over 100, found that the more frequent 
member of a pair is likely to be preferred in a 
choice task, although the preference for the more 
frequent word decreased as the frequency difference 
decreased, To work against preference in the present 
task, the concept of “the less frequent word in each 
pair” designated the correct choice. The 50 word 
pairs were pretested on a group of 50 introductory 
sociology students. The students were asked to check 
1 word in each pair as preferable. From these data, 
10 pairs, in which each word was chosen by half of 
the subjects, were selected for use in the task, The 
entire procedure was aimed at creating a list of word 
pairs having a common underlying concept while 
minimizing the chances that the concept would ever 
be used to choose the correct words. 

The teacher presented the word pairs to the 
learner one at a time, repeating the same sequence 
of 10 word pairs for each repetition of the task, The 
sequence of pairs was the same for all dyads. In the 
analysis of data, each presentation of the set of 10 
word pairs constituted a block of trials, If a learner 
reached criterion (all choices correct within a block), 
the session was terminated one block following the 
criterion performance. Of the subjects who reached 
criterion, 82% also repeated it in the subsequent. 
block, indicating that the criterion scores were gen- 
erally valid. A minimum of 5 blocks was provided 
for all subjects, Time permitting, dyads who did not 
reach criterion within the 5 blocks were allowed to 
continue for additional blocks for heuristic purposes. 
Dyads in Conditions 1-3 ranged from 5 to 16 blocks, 
and all dyads in Condition 4 were allowed up to 16 
blocks. 

2. The teacher: Teachers were recruited to act as 
experimenters in a procedure for reinforcing dis- 
crimination learning, In written instructions, the 
teachers were informed that the investigator was 
exploring the way people used patterns of reward 
and punishment and which patterns would be most 
effective to control behavior. The discrimination task 
was explained, and the teachers in all conditions were 
instructed to teach their “subject” the correct words 
by awarding, withholding, or withdrawing points. 
Foot pedals (see Figure 1) were provided to add or 
subtract points on a counter only visible to the 
learner, The teachers were told that the investigator 
would control a second counter to show the learner 
the number of correct choices he made on each block 
of trials. Only in Condition 1 did the learner actually 
receive the points given him per trial by the teacher, 
In all other conditions a cumulative count of trials, 
controlled by the investigator, was substituted for 


the points. 
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TABLE 1 
INSTRUCTED METHODS FOR ACCOMPLISHING TASK 


Subject 
Condi- 
tion Teacher Learner 
k Gives points Receives points and 
concept formation 
2 Gives points Concept formation 
3 Gives points and Concept formation 
nonverbal cues 
4 Gives points Attends to nonverbal 


cues 


In Condition 3, teachers were given additional in- 
structions following the directions for giving points: 


In addition to the points, you have at your dis- 
posal some very effective means of reward and 
punishment, namely, your facial expression, your 
gestures, and attitude. The possibilities are end- 
less, ranging from more subtle changes in expres- 
sion, to more obvious ones. Use your face and 
gestures as you would in any situation to express 
approval or disapproval. 


Before beginning the experiment, a practice period 
was used to acquaint the dyads with the task and 
situation, A separate set of 10 word pairs was used 
for practice. Procedural difficulties and questions 
were handled following the practice period. In addi- 
tion, with the exception of Condition 3, the in- 
vestigator stressed that the teacher’s communicative 
behavior was to be limited to asking the learner for 
his next answer, 

In Condition 1, where the points given by the 
teachers were delivered to the learner, it was of in- 
terest to see whether facial emblems would be used 
by teachers in spite of the fact that they were not 
necessary for success in the task, Teachers in Con- 
ditions 2 and 4 received the same implicit prohibi- 
tion against the use of nonverbal and verbal feed- 
back as they did in Condition 1 but were expected 
to have less regard for the prohibition because non- 
verbal communication was their only available means 
to succeed in the task. Essentially, these three con- 
ditions were tests for what has been described as 
the experimenter effect (Rosenthal, 1966). Condition 
3, in which it was suggested that teachers use non- 
verbal feedback, was intended to serve as a standard 
for evaluating the usage of uninstructed nonverbal 
expressions in the other three conditions and their 
effect on the learning curves of the learners, It was 
expected that the nonverbal expressions in Condi- 
tion 3 would be less subtle than those displayed 
under implicit prohibition, The instructions for the 
teachers and the learners in the various conditions 
are summarized in Table 1. 

3. The learner: In the first three conditions, the 
learner’s written instructions described his task as 
subliminal concept formation. The learner was told 
that the teacher was an experimenter who would 
Present a series of paired words to him. Learners 
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were told that there was a concept common to every 
correct word but that it was subtle. Through sub- 
liminal concept formation, the learners were to 
choose the correct word for each pair presented to 
them. The learners were told that they might not be 
sure of the exact concept, but after repeated ex- 
posure to the pairs they should begin to use it sub- 
liminally in making their choices, Emphasis was 
placed on the number of correct responses rather 
than the discovery of the concept. If the learner did 
not understand what was meant by subliminal con- 
cept formation, the investigator elaborated, using as 
an example the way a child learns language without 
the knowledge of the rules of grammar, 

In Condition 4 the learners were told that their 
task involved social skills, The word pairs were de- 
scribed as a simple discrimination task in which the 
correct word had been arbitrarily chosen. The 
learners were informed that their only cues to the 
right and wrong answers would be through the non- 
verbal reactions of the teacher. Before the practice 
pericd the investigator stressed that the learners were 
not to ask the teacher to give more or clearer re- 
actions, 

In Condition 1 learners received points for each 
response from the teachers to help them to learn 
the words and to develop the “concept.” In the 
other three conditions, learners expected their counter 
to record only the ordinal number of each trial, 
Learners in all conditions also received the investi- 
gator’s summary count of the number of correct 
choices made in each block, following each block, 
This count was meant to reduce ambiguity where 
points were not received rather than to promote 
learning.2 

The learning curves obtained from the learners for 
the task served to measure the amount of useful in- 


tions, Learning was expected to be more rapid in 
Condition 3 than in Condition 2 due to the instruc- 


4. Postexperimental questionnaire: Following the 
experiment, the teachers and learners were given 
questionnaires to assess their awareness of the true 
nature of their particular condition and their at- 
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tributions about the study. The investigator re- 
viewed answers with the subjects to ensure that they 
were understandable and complete, No subject ex- 
pressed a hypothesis that explained his condition.® 
One learner did propose that subliminal concept 
formation was a “sham” to cover an interest in the 
effects of television on learning. This was the only 
hypothesis given that deviated from the situation 
and instructions as presented to the subjects. 

All subjects signed a release for the limited use of 
the data collected in the experiment and agreed not 
to discuss the conditions of the experiment until the 
study had been completed. Subjects subsequently 
were debriefed and informed of the results of the 
learning task by mail. 


Results 


Percentage scores for learning were cal- 
culated for each learner on the first 5 sequen- 
tial blocks of 10 trials. The analysis was 
limited to the first 5 blocks in order to have 
complete data for all conditions. The scores 
were subjected to a 4 X 5 analysis of variance 
to assess the effect of the four experimental 
conditions on learning over the 5 blocks (see 
Table 2. The Fmax test for heterogeneity of 
variance was nonsignificant. 

The results of the analysis of variance 
showed that there were significant effects of 
conditions (p < .001) and blocks (p < .001) 
and that there was a significant (p < .01) 
Blocks X Conditions interaction. Analysis of 
the interaction, shown in Figure 2, revealed 
that the learners in Conditions 1 and 3 sig- 
nificantly improved in performance, while 
those in Conditions 2 and 4 did not, The fact 
that the learners in Condition 3, who needed 
to use the facial expression of the teacher in 
order to learn, did not significantly differ from 
learners in Condition 1, who had the points 
available, indicated that virtually all teachers 


TABLE 2 


5 OF VARIANCE OF LEARNING Sco 3 
OF CONDITIONS AND TRIAL BLOCKS 


Source MS df F 
Between subjects 
Conditions (A) 3,540.50 3 6.88** 
„Error 514.83 36 
Within subjects 
Blocks (B) 2,330.75 4 14.87** 
AXB 12 2.20* 
Error 156.78 144 
*p < 0l. 


** p < .001. 
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Ficure 2. Learning curves for discrimination task. 
(The ordinate scale is condensed to emphasize rela- 
tive differences in magnitude between conditions.) 


were capable of employing nonverbal em- 
blems as a means of teaching the discrimina- 
tion and that the learners were capable of 
effectively using those cues. 

However, closer comparison of the results 
of Conditions 1 and 3 did indicate that points 
were more effective than instructed nonverbal 
feedback, Of the 10 learners in Condition 1, 
9 reached criterion performance (Mdn = sixth 
block; range = third to eighth blocks), 
whereas only 5 learners achieved criterion in 
Condition 3 (Mdn = sixth block; range = 
fourth to tenth blocks). 

In Condition 2, where learners were given 
from 7 to 16 blocks of trials, with a median 
of 11 blocks, none reached criterion, In Con- 
dition 4, where all learners were allowed up 
to 16 blocks, two reached criterion by the 
seventh block and three more did so by the 
fifteenth block. Thus there was evidence of 
learning through nonverbal cues in Condi- 
tion 4, although learning was achieved more 
slowly than in Condition 3. 


EXPERIMENT 2 


In order to answer some of the questions 
concerning the presence or absence of usable 
emblematic information in the nonverbal be- 


8 While “awareness” was not a central concern of 
the current study, it is of interest to note that the 
present adaptation of the double agent procedure ap- 
pears to be as effective as its predecessors in distract- 
ing subjects from attention to subtle influence pro- 
cesses. 
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havior of the teachers, judges drawn from 
populations similar to those of the learners in 
Experiment 1 were asked to try to determine 
from the videotaped behavior of the teacher 
whether the learner made a right or wrong 
response. This consensus method for identify- 
ing the emblematic meaning of a facial ex- 
pression was in accord with the definition of 
an emblem given by Ekman and Friesen 
(1969): An emblem is a symbolic nonverbal 
behavior that has shared meaning among a 
group of observers. In this study, then, it was 
possible to use concordance among judges to 
estimate the potential meaning that the teach- 
er’s facial expression had for the learner. 
Meaningful expressions may or may not have 
been intended or employed appropriately by 
teachers or used by learners. 

A judge was said to agree with criterion 
when he judged correctly, from the teacher’s 
behavior, that the learner’s answer was right 
or wrong on the preceding trial. The greater 
the number of trials in which judges concur- 
rently agreed with criterion, the more the 
teacher’s nonverbal behavior was considered 
functional within the context of the task. 
Consensual disagreement with criterion in- 
dicated disfunctional behavior by teachers. 


Method 


Subjects. Four subjects served as judges in the 
study. They were recruited from introductory psy- 
chology courses, so as to be comparable with the 
subjects who served as learners in the original dyads. 
The judges were recruited for two sessions. 

Procedure. At the beginning of the first session, 
the judges were given a brief outline of the original 
experiment. They were told that there were different 
conditions but not how these conditions differed. 
They were merely told to watch the teacher’s non- 
verbal response and to try to judge whether the 
answer was right or wrong. 

The judges viewed two edited videotapes, one at 
each session, Each of the dyads from the learning 
task was represented on each tape by a block of 10 
trials. The first of the 2 blocks for each dyad had been 
campled from the first half of the blocks 

d before learning, and the second block, 


random! 
that occu 


from the second half of the blocks that occurred 
before lea g. The temporal sampling provided a 
means f ng changes in the teacher’s behavior 


mples were selected to provide an 


over ti 

equal num í teacher reactions to correct and in- 
correct c students, If the block chosen. in 
the random was not within the limits of from 


ct choices, the block that came 
his criterion was chosen. If the 
he block sampled was of poor 


three to sevo 
closest to mect 
videotape pictur 
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technical quality, the same procedure was followed 
to sample a new block, The two sample blocks were 
randomly assigned, without regard to temporal order, 
to the two edited videotapes viewed by the judges. 

It should be noted that because the rate of learn- 
ing differed for the four conditions, the blocks sam- 
pled from each condition differed in their absolute 
temporal position within the teaching process. The 
average ordinal positions of the blocks sampled for 
viewing by judges were as follows: Condition 1, first 
and second blocks; Condition 3, second and fourth; 
Conditions 2 and 4, fourth and seventh, 

The judges sat in individual viewing carrels, each 
containing a monitor connected to a common video- 
tape playback unit. The tape was played at real 
time, and the judges were not permitted to review 
segments when they were unsure of their decisions. 
To minimize possible extraneous cues, the audio por- 
tion of the videotapes was not presented to the 
judges, and the portions of the judges’ monitors on 
which the images of the learner and the counters ap 
peared were covered. An investigator announced thc 
number of the trial as the learner answered. The 
judges were not to make a judgment until the in- 
vestigator had signaled that the learner’s responsc 
had been made. 

Each judge marked the letters R or W on answer 
sheets to indicate judgments of right or wrong for 
each trial. The typical block of 10 trials lasted nearl) 
a minute, with 6 sec allotted for making a judgment 
between trials. Judgments were collected on 80 
blocks of trials, 40 at each session. Rest periods were 
given midway through each session. 


Results 


A consensual and accurate set of judgments 
of a trial indicated that the nonverbal be- 
havior of the teacher was functional, The de- 
gree to which functional emblems could serve 
as effective social reinforcers in the task de- 
pended on the preponderance of functional 
emblems over nonfunctional or inappropriate 
emblems occurring in the same period. 

The frequency with which judges agreed 
with criterion for each sample block in each 
of the four conditions is shown in Figure 3. 
The graphs describe the degree of functional 
usage of nonverbal emblems, ranging from 
functional (all four judges correctly indicat- 
ing the student’s performance on a trial) to 
dysfunctional (consensual inaccuracy), with 
intermediate levels of consensus indicating 
ambiguous or nonemblematic usage. A 

The ability of each judge to detect the 
nonverbal emblems of the teachers was com- 
puted for the earlier and later samples taken 
from each condition. Assuming that 50% cor- 
rect judgment reflected random information 
the significance of the obtained percentage of 


y 


—— a 


NONVERBAL REINFORCERS 


Trials Judges 


Judges Agreeing with Criterion 


Condition 3 


Condition 2 


Frourr 3. Consensus on judgment of functional 


(shaded area) and “wrong” (nonshaded) by teacl 


each condition. ( 
that is, correctly judged whether the learner had 


indicates the number of trials on which each level of consensus was 0} 
nctional usage of nonverbal emblems, the left side indicates 


each five-column histogram reflects fu 
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Condition 3 


Condition 4 


1 employment of nonverbal emblems of “right” 
hers in the first and second blocks sampled of 
The abscissa shows the number of judges, out of four, who agreed with criterion, 


made a right or wrong response, The ordinate 
btained. The right side of 


dysfunctional usage, and the central area shows nonemblematic usage.) 


correct judgments by each judge among the 
100 trials in each sample was calculated by 
the critical ratio test. The early sample in 
Condition 3 was judged significantly correctly 
(p < .01) by each of the four judges, whereas 
the early samples in the other conditions were 
judged significantly correctly in only 1 of 12 
tests (four judges over three conditions). The 
mean percentage of early trials judged cor- 
rectly in Condition 3 was 68, while in other 
conditions the corresponding means varied 
from 53% to 55% over judges. In the second 
block of trials sampled, all four judges in 
Condition 3 were again significantly accurate 
(p < 01, M = 75%); in each of the remain- 
ing three conditions at least three of the four 
judges were also significantly accurate (p< 
.05, Ms ranging from 62% to 63%). 

In an analysis of variance of the judgments, 
the second block sampled was judged more 
accurately than the first block ($ < 01), in- 
dicating that teachers increased their func- 
tional usage of nonverbal behavior over time. 
However, no Condition X Sample Block inter- 


action was obtained, in contrast with the im- 
plications of the above analysis of the per- 
formance of individual judges, The analysis 
of variance also revealed significant individual 
differences between teachers in both overall 
accuracy of conveying nonverbal information 
(p < 001) and in relative accuracy of con- 
veying “right” versus “wrong” through their 
nonverbal behavior (p < .001). 


EXPERIMENT 3 


The learner’s amount of gazing at the 
teacher was measured as an indication of his 
opportunity to be influenced by any non- 
verbal information provided by the teacher. 
To learn, the responders in Condition 1 did 
not need to look at the teacher, only at the 
points. In the other three conditions learning 
could only take place if the learner attended 
to the teacher, assuming that the teacher was 
using his face informatively. Gaze was de- 
fined in terms of the degree to which the 
learner looked at the teacher. 


TABLE 3 


MEAN MEASURES or GAZE DIRECTION or LEARNERS 
FOR EXPERIMENTAL CONDITIONS 


o Length Frequency 
Condition occurrence (in sec.) (per block) 
1 14.15 75 10.98 
2 22.13 1.12 11.29 
3 19.19 95 12.64 
4 40.96 2.00 12.17 


Nolte. In the analysis of variance, the main effect of conditions 
on both percentage of occurrence and length was significant at 
P <.01. In both cases Tukey's b test (Winer, 1962) indicated 
that Condition 4 was significantly different from each other 
condition at p < 01. 


Method 


Subjects. Two undergraduate students were re- 
cruited to serve as coders, 

Procedure, The two observers independently coded 
the occurrence of gaze direction from the videotape 
recordings of the learners by depressing a key on an 
Esterline-Angus event recorder whenever the learner 
looked at the teacher. The videotape recordings of 
the learner had been made full face, with simulated 
eye contact, which made the learner appear to be 
looking directly ahead when looking at the teacher, 
The observers were instructed to preview each tape 
before they coded it until they could define three 
eye positions: up, toward the points; down, toward 
the cards; and ahead, toward the teacher, 

Each observer was randomly assigned tapes of 20 
of the 40 learners, five from each condition. They 
viewed the tapes at real time so that the eye con- 
tact they perceived would be comparable to that 
seen by the teachers, Generally, there were about 15 
minutes of tape to code for each learner. Three 
measures of gaze, discussed by Kendon and Cook 
(1969), were obtained from the records of each ob- 
server in each block: mean frequency of gaze, mean 
length of gaze, and the percentage of trial time spent 
gazing. 

Reliability. To sample reliability, eight learners, 
two from each condition, were randomly chosen to 
be coded independently by both observers, Relia- 
bility was measured by aligning the event recorder 
Paper records of the two observers and determining 
the percentage of agreements Over occurrences of 
gaze behaviors, The number of agreements between 
observers was determined by counting the number 
of 4-sec units in which both observers recorded the 


Sampled, 
Results 


The gaze direction of the learner was coded to 
estimate the extent to which the learners in each 
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condition were exposed to, and thus 
tunity to use, any information that 


the oppor- 
have been 


available through the nonverbal er is of the 
teachers, Each measure of gaze was | within a 
4X2 analysis of variance for effects to condi- 
tions and observers. The fact that i were no 
significant differences between obser, n any of 
the measures is a further indication o cliability 
of observations. The similarity o; »nditions 
among the block means for frequency = 10.98 
to 12.64) shown in Table 3 indicates learners 
averaged about one gaze per trial no which 
condition they were in. Significant cfí condi- 
tions were found on the measures th and 
percent of gaze (p < .01), 

Tukey’s b (Winer, 1962) was used mpare 
condition means for both length and | age of 
gaze, On both measures of gaze, the me `ondi- 


tion 4 were significantly greater than : Í any 


other condition (p < .01), Learners in ion 4 
had been instructed to watch the tea facial 
expression, 
EXPERIMENT 4 

To determine what differences in itent 
of nonverbal emblems were generate: the 
different treatments in Experiment | om- 
prehensive content analysis of teac re- 
sponses was required, The scope of ı an 
analysis was judged by the investiga! to 
be too broad to be included efficiently within 
the present report, Instead, the data reu ired 
for present purposes were obtained within a 


more complex additional Study concerned 
with deriving representative samples of 
verbal emblems to serve as testing and ! 
ing devices, Details of the procedure ar: 
scribed in Rosenfeld, Shea, and Greenbaum 
(Note 3). They are only briefly reviewed 
here. The results to be presented here are 
also limited to the presentation of nonverbal 
emblems of teachers that most clearly distin- 
guished between conditions of Experiment 1. 


Method 


n- 


| 
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sed lips (92%), fallen face or slackened 
1%), upturned corners of mouth (92%), 
r ] face (92%), which referred to a lack of 
ment and a closed mouth, 
her’s data were retained for analysis only 
) at least 80% of her sampled behavior had 
ı consensually and correctly judged per block as 
icating “right” or “wrong” in Experiment 2 and 
each block contained expressions of both right 
l wrong. Individual trials from these teachers 
cre omitted if they lacked consensual validation in 
ixperiment 2 or reliable coding in the current ex- 
periment. On these bases, coded data from 432 epi- 
odes were retained, which represented 28 of the 40 
teachers. The number of episodes coded and retained 
per condition in Experiment 1 ranged from 90 to 131. 


) 


Results 


The major comparison of interest was be- 
tween Condition 3, where teachers were en- 
couraged to use nonverbal expressions, and 
the remaining three conditions, where such ex- 
pressions were implicitly discouraged. Table 
4 shows that three behavioral categories were 
more significantly prevalent in Condition 3 
than in the other conditions combined (p < 
.05); these were head nods, head shakes, and 
pressed lips, The fallen face, although rare, 
was almost exclusively limited to Condition 2, 
where the greatest subtlety of expression was 
expected. (In both Conditions 2 and 4, teach- 
ers had no legitimized reinforcers available, 
and their nonverbal behavior was implicitly 
prohibited, but those in Condition 2 had the 
further disadvantage that their students were 
less likely to elicit or reinforce nonverbal 
feedback through attentional responses.) In 
Condition 3, where nonverbal expression was 
emphasized in instructions, there were sig- 
nificantly fewer neutrals than in the other 
conditions (p < .05). 


GENERAL DISCUSSION 


The set of studies described in this report 
enabled us to answer a set of coordinated 
questions about the functions of nonverbal in- 
formation in the social learning process. First, 
was the task learnable on the basis of positive 
and negative reactions? The answer is clearly 
affirmative: The teachers promoted learning 
in virtually every subject in Condition 1, al- 
though their usage of points was confounded 
with their emblematic accompaniments. 

Was the task learnable from conventional 
nonverbal behavioral indicators of positivity 
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TABLE 4 


MEAN OCCURRENCES OF SELECTED NONVERBAL 
COMPONENTS OF CONSENSUALLY CODED 
SUBJECTS PER CONDITION 


Nonverbal response class 


Condi- 
tion Nod Shake Press Smile Fall Neutral 
1 2.83 .67 2.00 83 33° 3.33 
2 3.17 33 217 2.00 1.00 3.00 
3 7.00* 3.88* 6.75* 5.00 -00 .63* 
4 2.00 163 3.13 4.75 00 1,88 


Note. In Conditions 1 and 2, 6 subjects were coded per condi- 
tion; for Conditions 3 and 4, 8 subjects each were coded. The 
number of responses per condition ranged from 90 to 131, 

* Condition 3 differed significantly from grouped ur ked 
conditions, p <.05, by two-tailed nonparametric (median) 
test. 


and negativity? Again, the results support an 
affirmative answer. Significant learning oc- 
curred in Condition 3, where teachers were 
encouraged to employ nonverbal expression 
along with their points, which were not trans- 
mitted to the learners. 

Was nonverbal information made available 
to subjects under the more subtle conditions 
in which teachers were instructed only to use 
points but in which the points were not de- 
livered? Consensual observations about the 
information available from teachers? behav- 
iors demonstrated that teachers provided sig- 
nificant nonverbal feedback in both Condi- 
tions 2 and 4, although not as much as in 
Condition 3. 

Was the nonverbal information that was 
available to learners in the more subtle condi- 
tions actually used by them? In general, the 
answer is negative. There was no evidence of 
learning in Condition 2, even in extended 
blocks of trials. Nor did learning occur in the 
first five blocks of Condition 4. However, the 
fact that several subjects in Condition 4 
learned to criterion during extended trials in- 
dicates that learning from subtle nonverbal 
information can occur. Requisites for utiliza- 
tion appear to be both a mental set in learners 
to seek nonverbal cues from teachers and 
relatively lengthy exposure to such cues. 

Were nonverbal cues provided by teachers 
in accord with the double agent hypothesis, as 
subtle substitutes for more legitimate but in- 
effective means of influence? The present 
studies failed to confirm this expectation. Al- 
though teachers increasingly provided poten- 
tial information through nonverbal emblems 
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in Condition 2, where points were not effec- 
tive, at least as much was made available by 
teachers in Condition 1, where points were 
effective. Nonverbal cues were apparently un- 
intended concomitants of the teachers’ reac- 
tions to successful and unsuccessful perform- 
ances of their learners, Thus, we cannot 
determine whether or not their usage in Condi- 
tion 2 represented efforts to compensate for 
ineffective teaching techniques. 

Finally, were nonverbal cues more subtle 
in content when not legitimized? The avail- 
able evidence in the present and a supplemen- 
tary study (Rosenfeld, Shea, and Greenbaum, 
Note 3) supports an affirmative answer. In 
Condition 3, where nonverbal responding was 
encouraged and presumably purposefully em- 
ployed, the characteristic forms were head 
nods, head shakes, and pressed lips. Nods and 
shakes almost universally referred to positive 
and negative performance, respectively. A 
parsimonious interpretation is that nods and 
shakes are conventional, stereotyped emblems 
of positivity and negativity in this society. 
Pressed lips may function as amplifiers of 
either reaction. More subtle cues were more 
characteristic of the other three conditions. 

The most subtle active cue—fallen face— 
never occurred in the behavior samples coded 
from Condition 3. It was most characteristic 
of the most subtle experimental treatment 
(Condition 2), where it consistently referred 
to negative performance. Perhaps the simplest 
verbal translation of the fallen face is “dis- 
appointment.” Its form may have derived 
from the universal affects of surprise and sad- 
ness, described by Ekman (1971). Neutral or 
nonexpressive reactions by teachers were most 
strongly associated with Condition 1. Neu- 
trality was employed as a probabilistic refer- 
ent to negative performance possibly because 
of its contrast to the more active nonverbal 
responses that tended to follow positive per- 
formance. 

Finally, it is noted that upturned mouth 
corners—a major component of smiling—was 
almost as common in Condition 4 as in Condi- 
tion 3. The extended gazing experimentally 
elicited in learners in Condition 4 may have 
been interpreted by the teachers as an affilia- 
tive cue (Exline, 1971) and also perhaps as 
a request for additional feedback, thereby 
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prompting the teachers to provide at least a 
minimal smile. 

A variety of practical applications have de- 
rived from the current studies, The studies 
were initiated primarily to extend the double 
agent research paradigm so as to be more 
sensitive to the detection of the communica- 
tive functions of natural variations in behav- 
ior. A major variation introduced was that the 
criterion variable by which functional usage 
of behavior (by teachers) was validated—the 
correct or incorrect performance of the learner 
in this case—was free to vary rather than 
preprogrammed as in the earlier double agent 
studies, While some control over the criterion 
was sacrificed in the present version, it has an 
important complementary advantage: It al- 
lows functional usages (by the teacher) to be 
reinforced (by the success of the learner), 
whereas the prior preprogrammed version 
tended to extinguish functional usage by per- 
mitting only random performance. 

Relatedly, the current version also permits 
the study of the effects of information avail- 
able from the behavior of one member of a 
social unit on the target behaviors of other 
members. In other words, it permits study of 
both the “referential” function of behavior 
(here, the relationship of the form of the 
teacher’s behavior to the prior success or 
failure of the student, as in the prior pro- 
cedures) and its “inferential function” (here, 
the inference of success or failure attributed 
to the teacher’s behavior by the learner or 
other observers of the teacher). When the two 
functions are in agreement, the behavior 
serves a “communicative” function (cf. the 
definition of communication promoted by 
Wiener, Devoe, Rubinow, & Geller, 1972). 
The results of the present studies clearly vali- 
date the usefulness of the new procedure for 
detecting these functions. It has virtually 
limitless possibilities for the study of various 
behavioral forms in different populations as 
well as the employment of criterion variables 
other than correctness of performance. 

A potential spin-off of the procedure is that 
the data it generates on functional behavior 
can be used for deriving representative forms 
of communication in a given population. From 
the present data, two equivalent sets of com- 
municative responses of the teachers have 
been derived from the videotaped records, 
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each set including representative examples of 
both subtle and more blatant forms of non- 
verbal emblems of “right” and “wrong” 
(Rosenfeld et al., Note 3), The derived ex- 
amples, in turn, can be used as standard, 
normative stimulus materials for testing com- 
munication skills of individuals or groups. For 
example, randomly edited forms of the video- 
tapes have been presented to various groups 
of children to determine their abilities to de- 
code the cues that adults were found to com- 
municate with high accuracy in the current 
studies. Informative results have been ob- 
tained from groups of normal middle-class 
children (Rosenfeld, Shea, & Greenbaum, 
Note 4), educable mentally retarded children 
(Shea & Rosenfeld, Note 5), and deaf chil- 
dren (Rosenfeld & Sussman, Note 6). The 
stimulus items can also be employed for train- 
ing subjects to improve their abilities to rec- 
ognize and use particular communicative 
behaviors. 
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ESP and ESB: Belief in Personal Success at Mental Telepathy 


Fred Ayeroff and Robert P. Abelson 
Yale University 


Subject pairs undertook 100 trials in each of which the sender tried to transmit 
mentally one of five possible symbols to the receiver in another room. After 
each trial both members indicated whether they beiieved a hit had been scored. 
These hunches or “extrasensible beliefs’ (ESB) were compared across two ex- 
perimental variations. For half of the pairs (choice and involvement condition) 
the members chose the symbol set to be used, and the sender shuffled and dealt 
the symbol deck. Orthogonally, for half of the pairs there was a warm-up 
period (communicative experience condition) during which members discussed 
their telepathic technique. It was predicted and strongly confirmed that both 
of these variations would independently increase ESB. It was also found that 
senders had higher ESB than receivers. These belief variations were entirely 
independent of actual telepathic success (ESP), which was no different from 
chance levels. The results are interpreted within the framework of Langer’s 
“illusion of control” theory that people behave in chance situations as though 
in skill situations, to the extent that skill-related cues are situationally present. 


There has been a recent resurgence of 
popular interest in a variety of occult phe- 
nomena. Psychologists have begun to feel it 
appropriate to study ESP, after several dec- 
ades of treating this topic as taboo. Almost 
all of this recent effort has gone into exploring 
the nature and extent of such ESP effects as 
might really exist (Child, 1973; Layton & 
Turnbull, 1975; McBain, Fox, Kimura, 
Nakanishi, & Tirado, 1970; Moss, Chang, & 
Levitt, 1970; Moss & Gengerelli, 1967). From 
a social psychological standpoint, however, 
what may be even more interesting is the 
question of how people come to believe in 
ESP, in general, and in their own experience. 
It is the aim of the present paper to illumi- 
nate a set of factors which may increase the 
tendency toward belief in ESP. It is possible, 
of course, to believe that one is exercising 
extrasensory powers whether or not objective 
evidence supports such a belief. We call the 
belief itself “extrasensible belief” (ESB), that 
is, belief beyond the grounds of reason and 
understanding (“I can’t defend it, but I feel 
that it’s true”). We report data on both ESP 
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and ESB within a common mental telepathy 
setting. Our analysis is intended to be general 
across different forms of ESP, but we particu- 
larize to mental telepathy experiments þe- 
cause they are experimentally flexible and 
social psychologically compelling. 

Individuals approach new situations with a 
repertoire of experience from past situations, 
even if the new situation is presented as 
highly unusual, exploratory, and uncertain. 
When subjects are brought into a laboratory 
to serve in an experiment on mental telepathy 
(by an experimenter who tells them he is 
“open-minded”), they are confronted by a 
great deal of ambiguity. How do they know 
how to carry out the attempt, and how can 
they judge how well they are doing? 

Our theoretical point of departure is 
Langer’s (1975) analysis of how people be- 
have when playing games of chance. People 
seem to have great difficulty appreciating ob- 
jectively the character of chance events (cf. 
Goffman, 1967; Kahneman & Tversky, 1973; 
Ward & Jenkins, 1965). They often appear 
to act superstitiously, as though chance events 
can be controlled. Strickland, Lewicki, and 
Katz (1966), in an indirect demonstration of 
superstitious control, showed that subjects 
placed more daring bets before rolling dice 
than after rolling them (with the outcome 
hidden). 
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In an effort to organize such phenomena in 
the psychology of chance, Langer hypothe- 
sized a carryover of skill-appropriate attitudes 
and behaviors into chance situations; the 
more similar a chance situation to a skill 
situation, the more carryover. In her experi- 
mental tests of this hypothesis, Langer (1975) 
manipulated a number of variables calculated 
to give chance situations such as lotteries 
some of the components typically found in 
skilled tasks. Two of these, for example, were 
choice and active involvement. Subjects given 
a choice of lottery ticket were much more 
optimistic about winning the lottery than 
subjects simply given a ticket (objectively 
comparable) without a choice. In another 
study, active involvement was manipulated 
in the play of a chance-determined electronic 
game, The subject was given the opportunity 
to make a rewarded motor response, rather 
than the experimenter making the equivalent 
response for the subject. Subjects making 
their own responses were more optimistic 
about the potential outcomes of further plays 
of this chance game. 

Langer’s analysis can be extended to vari- 
ous ESP situations, in particular, to mental 
telepathy experiments. On the face of it, it 
‘is difficult or impossible for a subject to tell 
whether or not he or she can exercise tele- 
pathic skill.* In laboratory studies there is 
typically a chance baseline for correct “hits,” 
and the statistical null hypothesis is that the 
subjects are scoring randomly rather than 
systematically. From the subjects’ point of 
view also, either chance or skill could explain 
actual or potential hits, because the mediating 
mechanism is not accessible to conscious 
appreciation. 

In the present study, we manipulated three 
independent variables calculated to make the 
telepathic situation seem more skill prone. 
The first two were choice and active involve- 
ment, which were manipulated concurrently. 
The third factor was not considered by 
Langer but arises here because the skills (if 
any) which telepathy requires seem to be 
social skills. Indeed, telepathy is phenomeno- 
logically not unlike social persuasion. One 
factor (of course not the only one) leading 
to “good vibes” in social interaction is com- 
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municative experience, the past opportunity 
for the pair of individuals to have conversed 
and worked together cooperatively. 

Pairs of individuals were brought to a lab- 
oratory setting involving a sender and a re- 
ceiver in separated soundproof rooms. Choice 
and involvement were manipulated by allow- 
ing the pair to select the card symbols to be 
used in the task and allowing the sender to 
deal the symbol deck directly. Communica- 
tive experience was manipulated by providing 
a warm-up session in which the pair discussed 
their technique over an intercom. The depen- 
dent variable was the relative frequency with 
which the pair members thought they had 
scored hits, The prediction was that belief 
in success (ESB) would be positively af- 
fected by both the choice-and-involvement and 
the communicative experience manipulations. 
These predictions imply the possibility (in 
the most skill-oriented conditions) of per- 
ceived success far in excess of real success—a 
kind of euphoria of overoptimism. 


METHOD 
Subjects 


Subjects were 32 undergraduates, unselected by 
sex, from a general course requirement pool who 
signed up in response to sheets listing “Mental Te- 
lepathy” as an available experiment. Views expressed 
casually by subjects to the experimenter prior to the 
experiment ranged from abject skepticism to a dedi- 
cated faith in the possibility of mental telepathy, 
with a fairly uniform distribution over this range. 
Subjects were run in pairs; the members were 
screened by the experimenter to avoid pairs with 
any prior speaking acquaintance with each other. 


Trials 


An experimental session consisted of 4 blocks of 
25 trials, each trial consisting of the transmission of 
one of five alternative symbols, a reception or “call” 
of the presumed symbol, and signals to the experi- 


1A curious feature of the parapsychological litera- 
ture is that many effects are claimed which are con- 
sistent with ESP as a willed exercise of skill. There 
is a monotony effect, in which performance declines 
over time; various mood effects, which may interact 
with personality; a confidence effect, in which people 
do better if they believe in the phenomenon; and 
reported correlates of ESP ability, such as artistic 
talent, stress tolerance, and extroversion (Child, 
1973, pp. 113-119). The skill analogy, in other 
words, is attractive to ESP experimenters as well 
as to subjects. 
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menter by both sender and receiver of whether or 
not they were confident that a hit had been scored. 
Symbol decks were reshuffled after each block of 
25 trials, and the sender and receiver roles were 
reversed after 50 trials. 


Experimental Layout 


Two small soundproof rooms separated by a nar- 
row hallway were equipped so that when the experi- 
menter (stationed in the hallway) flipped a switch, 
the sender and receiver could hear one another, Each 
room contained a chair and a desk. On the desk was 
a box with two toggle switches labeled “confident” 
and “not confident”; these switches lit correspond- 
ing lights on the experimenter’s desk in the hall. In 
addition, on the sender’s desk was a button which 
illuminated signal lights on the experimenter’s desk 
and in the receiver's room. The sender was instructed 
to press this button only during the 3 or 4 seconds 
of each symbol transmission. 

All subjects were instructed in the use of these 
signals before the start of the experiment. 


The Target Cards 


Target cards were arranged in 20 little piles on 
a table in the sender’s room. Each pile consisted of 
six 3 X 5-inch (7.62 X 12.70-cm) cards with a simple 
crayon sketch of a particular concrete object, such 
as a top hat, a knife, a fried egg, a ladder, a honey 
bee, or a Volkswagen. In addition to the sketch, each 
target card had the appropriate word printed beneath 
the picture to make its identity unambiguous. 

These heterogeneous objects were used as symbols 
rather than the geometric shapes of standard ESP 
decks to give subjects the opportunity to exercise 
choice among a large, interesting set. Some sender— 
receiver pairs were allowed to select their favorites 
from the array of symbols, whereas others were 
forced to select particular target cards (see below). 


Experimental Conditions 


Four pairs of subjects were run in each cell of a 
2% 2 design. Elaborate precautions were taken to 
ensure that the experimenter remained blind to both 
of the independent variables. These two variables were 
choice and involvement (CI) and communicative 
experience (EX). 

Choice and involvement. To produce the CI ma- 
nipulation, the sender and receiver found, upon con- 
sulting their instruction booklets (randomly pre- 
shuffled so that the experimenter would be blind), 
that they were jointly to select from the piles of 
target symbols the five that both agree would be 
“sendable.” Later, for the experimental trials, the 
sender shuffled a deck of five cards of each of the 
five symbols and then dealt and transmitted them 
to the receiver one by one. The shuffling and dealing 
procedure was repeated on the same deck for each 
trial block. 

To produce the no-choice-and-involvement ma- 
nipulation (NCI), the instruction booklets told the 
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sender and receiver which 5 of the 20 displayed 
symbols they had to use. To control for symbol 
popularity, the particular symbols in this forced 
choice were prearranged as the 5 symbols chosen by 
a corresponding pair in the choice condition of a 
pilot study. (It was not feasible to yoke the NCI 
pairs to the CI pairs of the present study as this 
would have necessitated systematically running CI 
pairs ahead of NCI pairs, with consequent loss of 
experimenter blindness.) When alone again, the 
sender shuffled a deck of cards made up of five each 
of the numbers 1, 2, 3, 4, 5. Each of the five digits 
had been listed alongside one of the forced-choice 
symbols. To know which symbol to send on each 
trial, the sender dealt a number card and consulted 
this list to locate the appropriate symbol. 

The purpose of this manipulation was to convey 
a sense of remoteness of involvement in the trial-by- 
trial determination of the symbol to be sent. The 
hand of the experimenter, as it were, was phenome- 
nologically present on each card of the deck. An 
alternative no-involvement manipulation (viz., having 
the experimenter deal the real symbol deck) was 
used in the pilot study but was abandoned here 
because the experimenter would not have been blind. 

Communicative experience. The EX manipulation 
was explained in the instruction booklets after the 
five target symbols had been designated (either in 
CI or NCI manner). It involved a warm-up of five 
trials with the intercom on between rooms. The 
sender was instructed to transmit each symbol once, 
first naming it, and then “narrating for your Re- 
ceiver the different visual and/or verbal impressions” 
used as it is transmitted mentally. Concurrently, the 
receiver was instructed to “respond to the Sender’s 
narration” on each trial. This procedure was pre- 
sented to both partners as an opportunity to “hook 
up” their impressions, establishing what visual or 
verbal imagery should be used. 

The no-communicative-experience (NEX) varia- 
tion also involved five warm-up trials with the 
intercom on. Both sender and receiver were in- 
structed to think about their visual and verbal im- 
pressions and to write these down carefully after 
each trial. However, the only oral communications 
suggested between partners were one or two word 
signals indĝating the readiness for and termination 
s. This minimal verbal signaling was 
used for the XIX condition for two reasons: From 
a pilot stage knew it to be ineffective as com- 
municative eXperience, and we wanted to have the 
intercom switch on so that the experimenter would 
be blind to the EX vs. NEX variation. 

It should be noted that the present EX variation 
involves not only the exchange of task-related im- 
pressions but also an accompanying pseudosuccess 
experience not available to NEX pairs. The sender 
told the receiver before each trial what symbol he 
or she would transmit. The receiver, therefore 
would be likely to have mental impressions consist- 
ent with the correct symbol and could perhaps par- 
tially attribute those impressions to telepathic signals 
accompanying the verbally transmitted information. 
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Because task success (either pseudosuccess or real) 
is an important general feature of good communica- 
tive experience, this difference between EX and 
NEX conditions was very much in the spirit of what 
we wished to manipulate. 


Procedure 


The two subjects entering the laboratory at the 
appointed hour were met informally by a student 
experimenter (the first author). He ascertained 
whether this happened to be speaking acquaintances 
or friends. If they were acquainted, he rescheduled 
them separately. If not, he showed them the experi- 
mental layout, explained that they would each have 
a turn as the sender, and described himself as “open- 
minded” about mental telepathy. The study, he ex- 
plained, was a research project to be conducted 
under scientific conditions, exploring several different 
procedures. 

Scientific objectivity required the detailed instruc- 
tions to be standardized and not to be discussed 
with the experimenter. Therefore, each subject, he 
explained, would receive mimeographed instructions 
to be read very carefully (and reread, if necessary) 
and retained for reference. Matched instruction 
booklets (with the experimenter unaware of which 
condition they represented) were then given out. At 
random, one subject received the sender instructions, 
and the other was given the receiver instructions. 
They sat reading these for several minutes before the 
“the experiment began. The entire procedure was 
reflected step by step in these instructions. 

Instructions consisted of four segments: (a) Target 

card selection, (b) practice, (c) first 50 experimental 
trials, (d) second 50 experimental trials (after re- 
versal of sender and receiver roles). Below are given 
these instructions for the (CI, EX) group, for sub- 
jects with the initial sender role. Instructions for 
other groups involved minimal critical alterations. 
(The NCI variation involved an altered target card 
selection segment only. The EX variation involved 
an altered practice segment only, Initial receiver role 
instructions differed from initial sender role instruc- 
tions via small alterations in the target card segment, 
an appropriately different practice segment, and a 
simple interchange of Segments 3 and 4). 


* For the first half of the experiment (Sections 1 
and 2) you will assume the role of SENDER. 
Target Cards 

*On the grey table against the wall, you will find 
twenty piles of “target cards.” Together with 
your RECEIVER, select from these piles a total 
of five (5) different symbols that you both agree 
are “sendable.” 

* There should be a total of six cards in each little 
pile, Give the RECEIVER one (1) target card from 
each selected pile: you will have a total of twenty- 
five (25) cards, while the RECEIVER will have 
one (1) target card of each kind, giving him a 
total of five (5) target cards, 

* At this point, the RECEIVER shall go directly 
‘across the hall to the RECEIVER’S ROOM. 


[R went across the hall] 

* After the RECEIVER has left, shuffle your deck 
of target cards until you are satisfied that they 
have been randomized. 

* Place the deck face down on the table that has 
been placed against the curtained wall, 

* Let me know when you have done this. 

[E then told both S and R to follow the instruc- 
tions for the practice periods. ] 


Practice 


I have arranged it so that during this practice 
period, you and your RECEIVER will be able to 
discuss each of the five target cards. You will be 
able to talk and listen to each other by way of 
microphone during this pre-test period. 

PEAT ine ra 
* Collect one of each of the five different target 
cards, and turn this little pile face down. You will be 
sending, therefore, a total of five (5) target cards. 
* One at a time, send each target card. 
* Just so that the Receiver becomes acquainted 
with its use, utilize the “sending light” during this 
practice period. 
*It will not be necessary for you to rate how 
confident or nonconfident you feel about each 
transmission during this practice period. 
* You may discuss, one at a time, each of the five 
different target cards. 
* As you transmit each target card, narrate for your 
Receiver the different visual and/or verbal impres- 


sions that you use in transmitting each target card. 


*Tell your receiver which target card you are 
about to send, then relate to him the visual and/or 
verbal images that you will use when you transmit 
that target card during the actual test, 

* This is an opportunity for you and your partner 
o “hook up” your impressions. Be sure to find 
out what kind of impressions or impulses your 
Receiver gets from you as you relate yours. 

* You and the Receiver might use this practice 
period to establish what visual or verbal imagery 
you should use. 

* Therefore be sure to find out how the Receiver 
feels about each target card. 

* Please do not test each other during this pre- 
test period: Be sure you tell the Receiver which 
card you are sending before you “narrate your 
method,” 

When you have taken a few moments to discuss 
each target card, Tet, me know. At this point, I will 
turn off the microphones and we will begin the 
experiment, 

[E then told both S and R to follow the next 
instructions. ] 

FIRST HALF OF EXPERIMENT 

*Take your deck of target cards that you had 
placed face down on the table. 

* At my signal, you will flip over a target card, 
*Immediately upon flipping a target card, press 
firmly down on the red “signal light” button until 
the light goes on. This lets the RECEIVER know 
that you are transmitting a target card. 
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* Immediately after turning on the “signal light,” 
record the particular card that you are sending on 
the appropriate line on your SCORE SHEET. 

* Begin at line 1 of Section 1, and complete each 
Section consecutively. g 

* Transmit each target card for about three or 
four seconds. 

* When 3 or 4 seconds have passed, turn off the 
“signal light” by once again pressing firmly down 
on the red button. This indicates to the RE- 
CEIVER that you no longer are transmitting the 
target card. 

* After turning off the signal light, I’d like you to 
Jet me know how confident or non-confident you 
feel, relatively, that you have transmitted the 
information on a particular target card, and that 
the RECEIVER has correctly recorded that infor- 
mation on his SCORE SHEET. You do this by 
flipping one of the two toggle switches on the box 
labeled “CONFIDENT/NOT CONFIDENT,” and 
placed in front of you. Keep this switch on for 
about three (3) seconds. Then, be sure to flip it 
off to avoid confusion. 

*Go on to the next target card and repeat the 
procedure for the remainder of the Section. 

* When you have completed sending the twenty- 
five (25) target cards of Section 1, reshuffle them 
and proceed to Section 2 (line 26). 

* Complete Section 2 (lines 26 through 50) in this 
fashion, 

* When you have completed Section 2, let me 
know. I will have you exchange rooms at this 
time. For the rest of the experiment—(Sections 
3 and 4)—you will have a chance to be the 
RECEIVER, 

[S and R exchanged rooms.] 

SECOND HALF OF EXPERIMENT 


*In the RECEIVER ROOM, you will see before 
you the five (5) representative target cards. There 
will be a several minute wait while the SENDER 
arranges the deck. Be sure to watch for the 
“Sending Light” to come on. 

* When it does, it indicates that the SENDER is 
attempting to transmit a target card. 

*Take about 3 or 4 seconds to scan the array of 
target cards and mentally select the one you feel 
is the target card being transmitted. 

* Beginning with Section 3 (lines 51 through 75) 
record the word associated with the target card 
you sense the SENDER is transmitting to you. 
* When the “Sending Light” has been turned off 
(indicating that the t t card is no longer being 
transmitted), let me } ’ how confident or non- 
confident you feel, ively, that the SENDER 
has transmitted th ation on the target card, 
and that you } ly recorded that infor- 
mation on your SCOR! TEET. 

* Do so by flipping one c! two toggle switches on 
the box labeled “CONFIDENT/NOT CONFI- 
DENT.” Keep the switch on for about three (3) 
seconds. Make su i flip it off again to avoid 
confusion. 
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* When the “Sending Light” comes on again, repeat 
this procedure. 

* Complete Section 3 (lines 51-75) in this fashion. 
*There will be a pause of several minutes upon 
completing Section 3 so that the Sender can re- 
shuffle the deck. 

* Be sure to watch for the “Sending Light.” 

* Complete Section 4 (lines 76 to 100) in the same 
fashion. 

* When you have completed Section 4, stop and 
let me know that you have finished. 


Throughout the 100 trials, the experimenter tran- 
scribed the confidence calls of both sender and re- 
ceiver. These were the ESB data. After the last trial, 
he collected the cumulative records by senders of 
what symbols had been transmitted and by receivers 
of what symbols had been received. These, when 
scored for correspondence, constituted the ESP data. 
The experimenter also recorded the appropriate ex- 
perimental conditions by noting which instructions 
had been used. 

Before subjects left the laboratory, it was ex- 
plained to them that some but not other pairs had 
been given a choice of target symbols and that some 
but not other pairs had been allowed to talk about 
the task during the practice period. Our interest, we 
said, was in exploring the effects of these variables 
on feelings of confidence and on telepathic perform- 
ance, as well as in the relation between confidence 
and performance. 


Resutts 


The experimental unit was taken to be the 
subject pair, rather than the individual sub- 
ject.* For each pair, an ESP score based on 
100 trials was available, as well as 100-trial 
ESB scores from the points of view of both 
sender and receiver. Each of the two individ- 
uals contributed 50 trials to the ESB scores 
of both sender and receiver roles, 


ESP Data 


For ESP data, the design was a simple 
2 X 2 factorial with n = 4 per cell. Table 1 
gives the proportions of correct hits for each 


2In any given experimental condition, subjects 
run concurrently are apt to give results more similar 
to each other than subjects run Separately. This is 
because of correlated irrelevancies such as mood on 
first meeting, experimenter tone, hour of the day, 
and so on. For the ESB data, using the 28 dj of 
subjects within groups as error term would therefore 
be nonconservative, and the only correct procedure 
is to use the 12 df for pairs within groups. For the 
ESP data, of course, the original scores attach 
directly to the pairs, 
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of the 16 pairs, arrayed according to CI 
versus NCI and EX versus NEX. 

The average percentage of correct hits over 
all 16 pairs (1,600 trials total) was 19.25%, 
which was insignificantly different from the 
chance baseline of 20%. (On this number of 
trials, the standard error of overall percent 
correct is exactly 1%.) Furthermore, there 
was no significant variation in hit proportion 
among the four groups (F < 1). Combining 
experimental groups, the variance between the 
16 pairs in hit percentage (17.27) was very 
close to the expected variance on a binomial 
model (16.00), and a chi-square test of this 
variance comparison was clearly nonsignifi- 
cant. In other words, the overall hit rates give 
no evidence supporting ESP. 

For the ESB data, the design may be re- 
garded as a 2 X 2 factorial with repeated 
measures because each subject pair generates 
two ESB scores: the proportion of trials the 
sender believed were hits and the proportion 
of trials the receiver believed were hits. 
Table 2 gives these data, averaged over pairs. 

The scores in sender and receiver roles both 
appeared sharply affected by the experimental 
conditions. Averaging over the two roles, the 
mean ESB in the (NCI, NEX) group was 
25.75%, a perceived success rate barely above 
the chance level of 20%, whereas the mean 
ESB in the (CI, EX) group was 56.12%, a 
rate almost three times the chance level. The 
(CI, NEX) and (NCI, EX) means fell be- 
tween these two extremes. The analysis of 
variance of these data is given in Table 3. 
The main effects for choice-and-involvement 


TABLE 1 
ESP Hits 1N 100 Triats ror Each Parr 


No com- 
Communi- mu: 


cative ex- tive 
Experimental variations perience perience 
Choice and involvement 27 19 
25 16 
10 22 
21 22 
M 20.75 19.75 
No choice and involvement 20 20 
22 18 
15 19 
15 17 
M 18,00 18.50 


_~ 
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TABLE 2 


CONFIDENCE CALLS PER 100 TRIALS FoR Eacn ROLE 
AND EXPERIMENTAL CONDITION 


Experimental variations Sender Receiver M 


Communicative experience 


55:25 
53.75 


57.00 
49.50, 


56,12 
51.62 


Choice-and-involvement 
No-choice-and-involvement 


No communicative experience 
$2.25 44.25 
30.00 21,50 


48.25 
25,75 


Choice-and-involvement 
No-choice-and-involvement 


and communicative experience were significant 
at the .02 level or better, supporting the main 
hypotheses of the study. An additional, un- 
predicted finding was the main effect of role 
(b < .02), with senders on the average pre- 
dicting close to 5% more successes than re- 
ceivers. This effect appears somewhat larger 
(over 8%) among pairs with no communica- 
tive experience. However, this interaction was 
not significant at the .05 level, nor were any 
other interactions, 

Several other analyses were run to check 
whether there might have been any “reality” 
basis to the subjects’ confidence calls. Com- 
parisons were made trial by trial of expres- 
sions of confidence (or nonconfidence) by 
sender and receiver, with the idea that tele- 
pathic coordination would tend to make both 
members simultaneously optimistic on suc- 
cessful trials and pessimistic on unsuccessful 
trials, For each pair a phi coefficient was 
calculated between confidence of sender and 
confidence of receiver over the 100 trials. 
These 16 phi coefficients averaged —.01, and 
none were individually significant at the .05 
level. Furthermore, there was no indication of 
any systematic relationship between trial-to- 
trial confidence and trial-to-trial accuracy. 


Discussion 


In this paper, we have demonstrated that 
with simple variations in the scenario of a 
mental telepathy experiment, the participants 
can have very different beliefs about their 
efficacy of performance, without any relation 
whatsoever to actual task success. In the least 
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TABLE 3 


ANALYSIS OF VARIANCE OF CONFIDENCE CALLS PER 100 
‘TRIALS FoR EACH ROLE AND EXPERIMENTAL CONDITION 


Source df MS F 


Choice-and-involvement vs. 
no-choice-and-involvement 


(C) 1 1,458 8.46* 
Communicative experience vs. 

no communicative experi- 

ence (E) 1 2,278 13.22* 
CXE 1 648 3.76 
Error between 12T 2,3 = 
Role (R) 1 180.5 7.87* 
CXR $ 21.1 1 
EXR 1 98.0 4.27 
CXEXR 1 15.1 1 
Error within 12 22:94- == 


* p <.02 (for all other Fs, p > .05). 


favorable variation, subjects barely believed 
they were doing any better than chance, 
whereas in the most favorable variation, sub- 
jects were willing to presume that they were 
getting fully as many as 14 of every 25 calls 
correct. This was a very high rate, especially 
in view of the fact that the true success rate 

did not exceed the chance level of 5 in 25. 

This illusion of telepathic efficacy was a con- 

sistent phenomenon across subject pairs: The 

results were highly significant despite a very 
small number of subjects. 

We do not know whether the very high 
confidence induced on our transient task 
would carry over into a general belief in a 
variety of parapsychological phenomena. It is 
not necessarily the case that a confirmed 
skeptic on ESP, brought into the laboratory 
and caught up in the illusion that he or she 
was systematically succeeding on an objec- 
tively “impossible” task, would thereby be 
converted to a general belief in ESP. The sub- 
jects, in fact, were not told how well they had 
actually done. A subject could always say 
afterwards, “Well, I just felt as though I were 
doing well. I don’t know why. Maybe I 

wasn’t really doing well after all. I guess I 
still don’t believe in ESP.” 

If subjects were given not only the ma- 
nipulations of the present study but were also 
(falsely) told on a majority of trials that 
they had scored hits, then they might genu- 
inely become strong believers on a permanent 
basis. In other words, reinforcement is an 


obviously skill-related situational variable, 
and according to the theoretical analysis in 
this paper (see also Langer & Roth, 1975), 
we would certainly expect it to promote ESB. 
We did not employ this variable for ethical 
reasons. So many of the student subjects were 
so earnestly involved in the task and took the 
possibility of ESP so seriously that to deceive 
them might have been cruel. It does, never- 
theless, seem very plausible that critical “con- 
firmation” experiences of occult nature, espe- 
cially where cues to skill exercise are present, 
leave people with a belief in ESP. Writers on 
the subject often report such autobiograph- 
ical incidents (e.g., Child, 1973), and the 
second author, in systematic interviews with 
a number of “true believers,” has found a key 
confirmatory incident to be often, if not 
always, present in their lives.* 

The present study, although not directly 
addressed to the permanent acquisition of 
ESB, demonstrated the great power of objec- 
tively irrelevant situational variations in pro- 
ducing strong subjective feelings of success at 
mental telepathy. The data were very sup- 
portive of Langer’s (1975) “illusion of con- 
trol” theory, which states that elements typ- 
ical of skill situations will, when introduced 
into a chance situation, produce the illusion 
that skill is being exercised. Success in the 
present task was from all evidence entirely 
chance determined; yet subjects given a 
choice of telepathic symbols, the opportunity 
to handle the deck of symbol cards, and the 
prior experience of cooperative communica- 
tion with the partner were wildly optimistic 
about how well they were doing. On almost 
60% of the trials in the (CI, EX) group, one 
or the other or both pair members felt suc- 
cessful in the presence of a failure. The fac- 
tors of choice and active involvement repli- 


3 Adult subjects recruited by a newspaper ad for 
a series of experiments were screened by question- 
naire, and those stating a very strong belief in ESP 
were interviewed. Besides the critical confirmatory 
incident, another very common theme among be- 
lievers is that telepathic skill is possessed by artisti- 
cally or socially sensitive people (usually including 
themselves). There are heterogeneous types of be- 
lievers, however, and no single generalization covers 
them all. 


a 
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cate Langer’s results in nonparapsychological 
chance tasks. The factor of communicative 
experience is a new factor, which is appropri- 
ate when the theory is applied to social per- 
formances. Presumably there are other impor- 
tant social factors. 

The manipulated factors in our study were 
much more powerful than individual differ- 
ence factors. It was not our intention to ex- 
plore individual variables such as internality 
of control in relation to ESB, in the tradition 
begun by Liverant and Scodel (1960), be- 
cause of inconsistencies and complexities in 
that literature. Such variables may eventually 
prove to have some place, but we find a 
situational analysis more compelling. 

The unanticipated result that the sender 
was more confident than the receiver might 
be explained by some such skill-related factor 
as the clarity of articulation of the required 
performance. The sender can develop a 
method for transmission that involves concen- 
trated mental focus on some kind of image 
of the appropriate symbol for a fixed time 
period under his or her own control. The 
receiver, on the other hand, does not quite 
know in what form the symbol will arrive, or 
at what instant, and how long it will last. By 
analogy with familiar skill situations, there- 
fore, the sender’s more actively articulated 
role is more readily prone to the illusion of 
effectance than the receiver’s diffuse and pas- 
sive role. Communicative experience might 
perhaps diminish the distinctiveness of the 
two roles, 
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Density Does Affect Task Performance 


Paul B. Paulus, Angela B. Annis, John J. Seta, 
Janette K. Schkade, and Robert W. Matthews 
University of Texas at Arlington 


Three experiments examined the independent effects of group size, room size, 
and interpersonal proximity on task performance. It was found that increasing 
group size, decreasing room size, and decreasing interpersonal distance all in- 
dependently led to decrements in task performance in a short-term situation. 
Whereas the effects of group size and interpersonal distance were similar for 
males and females, the effect of room size was obtained only with males. The 
results of the present study call into question the previous conclusions that 
task performance is not affected by density. Affective responses to crowding 
were also examined in two of the experiments. The implication of the present 
study for educational settings was discussed. 


The psychological effects of crowding are 
increasingly becoming the concern of scientific 
investigation. Crowding has been assumed to 
produce negative effects on humans similar to 
those found in animal studies (Calhoun, 
1962; Christian, 1970). In fact, surveys and 
Statistical studies have generally yielded sig- 
nificant correlations between high population 
density among humans and such factors as 
crime, illness rate, and juvenile delinquency 
(e.g., Galle, Gove, & McPherson, 1972). Sev- 
eral laboratory and field studies have also 
provided evidence that crowding can produce 
negative psychological effects on humans (e.g., 
Eoyang, 1974; Freedman, Levy, Buchanan, 
& Price, 1972; Griffitt & Veitch, 1971; Mc- 
Cain, Cox, & Paulus, in press; Paulus, Cox, 
McCain, & Chandler, 1975; Ross, Layton, 
Erikson, & Schopler, 1973; Valins & Baum, 
1973). 

One interest of several studies has been the 
effects of being crowded during task perform- 
ance (e.g., Freedman, Klevansky, & Ehrlich, 
1971; Kutner, 1973; Sherrod, 1974; Stokols, 
Rall, Pinner, & Schopler, 1973). Typically, 
subjects in groups ranging from five to nine 
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perform in large or small rooms. These ex- 
perimental situations have differed greatly in 
their variation of group size (social density), 
amount of space (spatial density), and inter- 
personal distance (proximity). However, none 
of the above studies found any effects of being 
crowded during task performance on the qual- 
ity of task performance. Studies by Valins 
and Baum (1973), Sherrod (1974), and Ep- 
stein and Karlin (1975) have found after- 
effects of crowding on task performance. Also, 
a study by Martens and Landers (1972) using 
groups of size two, three, and four did find a 
decrement in task performance due to in- 
creased group size but only under conditions 
designed explicitly so that subjects could di- 
rectly or indirectly monitor one another’s 
performance. 

It is not clear whether the failure of pre- 
vious studies to find an effect of density on 
task performance is due to the use of an 
insensitive task, to stress adaptation during 
the experimental session (cf. Sundstrom, 
1975), or to the fact that the levels of density 
employed were not sufficiently ayersive. The 
present series of experiments was designed to 
clarify these issues and to determine to what 
extent the three primary potential components 
of density (group size, room size, and inter- 
personal proximity) may independently affect 
the quality of task performance, Past studies 
(e.g., Freedman et al., 1971; Freedman et 
al., 1972; Marshall & Heslin, 1975; Ross et 
al., 1973; Stokols et al., 1973) have often 
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confounded two or more of these factors in 
their manipulation of density. In the first 
experiment of the present series of experi- 
ments, group size, room size, and proximity 
were varied independently in a 2 X 2 Xx 2 fac- 
torial design, whereas the other experiments 
only manipulated either room size or inter- 
personal proximity. Furthermore, a maze- 
learning task that has been found to be sensi- 
tive to variations in socially induced stress by 
Hunt and Hillery (1973) was employed, and 
task performance was measured during a rela- 
tively brief exposure to crowding (about 10- 
15 minutes) instead of sessions lasting 1 hour 
or longer employed in the previous experi- 
ments. A secondary interest was to examine 
differences in affect produced by variations in 
the three density variables, since several stud- 
ies have found that high levels of density in- 
crease negative affect (e.g., Griffitt & Veitch, 
1971; Ross et al., 1973). 


EXPERIMENT 1 
Method 


Subjects. Subjects were 127 male and female sub- 
jects from introductory psychology classes. Of these, 
7 subjects were excluded from the data analysis due 
to their failure to understand instructions or due to 
malfunctioning of the equipment, which left 15 sub- 
jects in each of the conditions. Upon their arrival, 
the subjects were randomly assigned to one of the 
eight experimental conditions. Sexual composition of 
the group was left uncontrolled.+ 

Task. The task used was a “multiple-U” maze 
machined in Plexiglas similar to that employed by 
Hunt and Hillery (1973).% On this task, the starting 
point is located near the bottom edge of the maze 
and the goal near the top. The maze comprised 12 
levels with two alternatives at each level and was 
geometrigally balanced, requiring as many left turns 
at the decision points as right turns in order to 
trace it correctly. Located at each cul-de-sac and at 
the goal is an electrical contact. When the subject 
touches a cul-de-sac contact with his stylus, the red 
light signals in front of him indicating that he has 
made an error, and a cumulative counter in the 
experimenter’s room is activated. When the contact 
at the goal is touched, white lights on the subject’s 
apparatus and on the experimenter’s desk in the 
adjacent room signal that the subject has completed 
a trial. After completing each trial, the subject 
immediately starts on the next trial. A raised block 
is located at the starting point, enabling the subject 
to begin each trial without looking at the maze and 
without receiving assistance simply by finding the 
block. 

The mazes were housed in individual plywood 
boxes with an opening at the bottom of the front 
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panel that allowed access of the subject’s hand and 
arm for tracing and prevented subjects from viewing 
the maze during their performance, The maze was 
affixed to the back panel of the box and per- 
pendicular to the top of the table. The signal lights 
were mounted on top of the plywood boxes, 2 inches 
(5.08 cm) from the front edge. The lights were cov- 
ered by a box with an open side facing the subject 
and were not visible to other subjects. 

Experimental design, The experimental design was 
a 2X22 between-subjects factorial design with 
the three variables being proximity, group size, and 
room size. The proximity factor was varied by 
manipulating the distance between the subjects while 
they were tracing the mazes. In the close condition, 
the mazes were 6 inches (15.24 cm) apart, while in 
the far condition they were 18 inches (45.72 cm) 
apart. The group size factor was varied by having 
the subjects trace the mazes either in groups of four 
or groups of eight. Room size was varied by using 
either a 94.5 square foot (9.68 m”) room or a 198.5 
square foot (18.05 m*) room. The dimensions of the 
large room were 13 feet 7 inches X 14 feet 5 inches 
(4.12 m 17.78 cm X 4.38 m 12.70 cm), while those of 
the small room were 11 feet X 9 feet 6 inches (3.35 
m X 2.71 m 15.24 cm). 

Procedure. Each group of subjects was taken en 
masse from the waiting room to the appropriate ex- 
perimental room, In the conditions with four sub- 
jects, they were seated on the same side of the 
table. In the conditions with eight subjects, there 
were four subjects on each side of the table, Once 
they had been seated, the experimenter read the 
task instructions. After all questions had been an- 
swered, the experimenter left the room to monitor 
the equipment in the adjacent room and to record 
the data with the aid of several assistants. The sub- 
jects were instructed to continue until told to stop. 
When the slowest person in the group finished 10 
trials, the maze performance was terminated with 


1 In some cases, the full complement of naive sub- 
jects did not report for the experiment. In these 
cases, a confederate was employed to produce a 
complete group. The experimenter purposely assigned 
these “incomplete” groups to the eight conditions in 
such a manner that only one group of this type was 
assigned to a condition. The sexual composition of 
the groups varied from all male to all female. The 
total number of males in various conditions varied 
from 4 to 12. However, the performance of the males 
and females as a function of the experimental ma- 
nipulations was found to be similar, so that discrep- 
ancies in the number of males and females in the 
various conditions does not present an interpretative 
problem. 

2Hunt and Hillery (1973) initially found en- 
hanced performance on this task in the presence of 
coactors. We have found the opposite result in all 
of our studies using this task. A subsequent study in 
the Akron laboratory (Byers, Note 1) also found 
poorer performance in the presence of coactors. 
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TABLE 1 


Mean Errors IN NINE TRIALS OF 
MAzE PERFORMANCE 


Large room Small room Com- 


R = bined 
Group size Close Far Close Far M 
Small 31.40 29.47 33.53 34.13 32.13 
Large 38.80 37.13 41.27 40.80 39.50 
35.10 33.30 37.40. 37.47 
Combined M 34.20 37.44 


the experimenter reentering the room. The time taken 
for cach subject to complete 10 trials was obtained 
by recording the time elapsed on an electrical clock. 
When the experimenter reentered the room, the short 
form of the Mood Adjective Check List (Nowlis, 
1965) and six 7-point semantic differential scales as 
used by Griffitt and Veitch (1971) were admin- 
istered to assess the subjects’ feclings about them- 
selves and the experimental room. 


Results and Discussion 


Since the performance of the first trial on 
the maze task is essentially a random process, 
this trial was omitted from the analysis. Be- 
cause the number of errors in nine-trials was 
characterized by skewness and heterogeneity 
of variance, a log transformation was applied 
to the data, A 2 xX 2 X 2 between-subjects 
analysis of variance of this data yielded a 
main effect for group size that was highly 
signfiicant F(1,112) = 15.75, p < .001. As 
shown in Table 1, subjects in the large-group 
conditions made more errors on the maze per- 
formance than those in the small-group condi- 
tions. The main effect of room size approached 
significance F (1,112) = 3.01, p < .10, with 
those subjects in the small-room conditions 
making more errors than those in the large- 
room conditions. Proximity did not produce a 
significant main effect, nor were there any 
significant interactions in the analysis of vari- 
ance. A similar analysis of variance of the 
latency scores yielded no significant effects, 
suggesting that the error findings cannot be 
attributed to differential speeds of task com- 
pletion. 

Analysis of variance of the results of the six 
room-rating scales revealed only two statisti- 

cally reliable effects. The pleasant—unpleasant 
ratings of the room yielded a significant room 
size effect, F(1,112) = 4.84, $ < 05, with 
the small room being rated as more pleasant 
than the large room. The other significant ef- 


fect was an interaction of group size and 
proximity in regard to the attractiveness of 
the room, F(1, 112) = 4.03, p < .05. In the 
small group, subjects seated close together 
rated the room more unattractive than those 
seated far apart, whereas in the large group, 
subjects who were seated far apart rated the 
room as more unattractive than those seated 
close together. The only significant effect ob- 
tained in an analysis of the 11 categories of 
the Mood Adjective Check List was a main 
effect of room size in the category of sadness, 
F(1, 112) = 5.24, p < .05. Subjects in the 
small room rated themselves as less sad than 
those in the large room. The affect results are 
of course inconsistent with the task perform- 
ance results. However, since effects were 
found with only 3 of the 17 affective mea- 
sures, these results are likely to be spurious, 
and hence no attempt will be made to inter- 
pret their significance until they are repli- 
cated. 

Since the effect of the room size variation 
on task performance was somewhat weak, a 
second experiment was designed to examine 
further this effect. This experiment employed 
a more extreme variation of room size, sexu- 
ally homogeneous groups, and a more detailed 
task instruction phase. The revision in task 
instructions was designed to reduce intersub- 
ject variability in task performance. 


EXPERIMENT 2 
Method 


Subjects. Subjects were 60 students from intro- 
ductory psychology classes who were recruited as all- 
male or all-female groups of four. However, when 
one of the subjects in a group failed to report, sev- 
eral groups of three were run, These were evenly 
distributed across the two room size conditions. 

Procedure. Subjects performed the same maze task 
employed in Experiment 1 either in a small room (6 
feet 6 inches X 5 feet 5 inches, or 1.8 m 15.24 cm X 
1.5 m 12.70 cm) or in a large room (11 feet X 9 feet 
6 inches, or 3.3 m X 2.7 m 15.24 cm). It should be 
noted that the relationship of the length to width in 
the two rooms was similar. The smaller room was 
actually produced by shifting several partitions in 
the large room to form a smaller confined area, 
About one foot (.3 m) separated the top of the par- 
tition and the ceiling to allow for unhampered venti- 
lation and lighting. A subject was seated on each 
side of a small rectangular table that held the four 
maze boxes. 
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The instructions were similar to those employed 
previously with the following exceptions. A chart 
showing the nature of the maze (with a different 
pattern of correct turns) was employed to help the 
subjects better understand the instructions. After 
the instructions, the subjects were allowed one prac- 
tice trial on the maze, and any remaining questions 
were answered, The experimenter and assistant ré- 
corded the data for 10 trials behind the partition in 
the same room. 
` The Multiple Affect Adjective Check List (Zucker- 
man & Lubin, 1965) was administered to the sub- 
jects prior to the task instructions to assess the sub- 
jects’ immediate affective reactions (anxiety, hostil 
ity, and depression) to their environment. 


Results and Discussion 


Since the maze data were again character- 
ized by skewness and heterogeneity of vari- 
ance, a’log transformation was applied to the 
data (10 trials), A 2 X 2 between-subjects 
‘analysis of variance with room size and sex of 
: the group as factors revealed a significant 

main effect of room size, F(1, 56) = 5.37, p < 
* 025, and a significant Room Size X Sex inter- 
. action, F(1, 56) = 4.59, p < .05. The males 
` tended to make more errors in the small room 
(43.56; m= 16) than in the large room 
(31.47; n = 15), whereas the females made 
slightly more errors in the large room (37.0; 
n= 14) than in the small room (35.5; n= 
15). Specific comparisons indicated that the 
room size effect was significant only for the 
males (p < .05). An analysis of the Adjective 
Check List data resulted in no statistically 
reliable effects. 

Although the previous two experiments 
have demonstrated that group size and room 
size do affect task performance, it remains to 
be determined whether proximity affects task 
performance. Although no effect of proximity 
was obtained in Experiment 1, it could be 
argued that the variation in interpersonal dis- 
tance employed in this experiment may not 
have been extreme enough (i.e., only 6 vs. 18 
inches, or 15.24 vs. 48.26 cm). Furthermore, 
proximity may be a more powerful factor if 
more potential eye contact is involved than 
was possible in the first experiment. The 
height of the boxes in Experiment 1 prevented 
eye contact for most of the subjects. A third 
sexperiment was designed to examine further 
the’ effect of proximity with more extreme 
variations in interpersonal distance and 
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greater opportunity for eye contact. This ex- 
periment was actually run prior to Experiment 


` 2 and hence did not employ the improvements 


in procedure introduced in Experiment 2. 


EXPERIMENT 3 
Method 


Subjects. Subjects were 56 students from intro- 
ductory psychology classes who served as subjects in 
the experiment to fulfill a course requirement. They 
were recruited in all male or all female groups of 
four. In cases where a complete complement of sub- 
jects did not report, confederates were employed to 
maintain the constant group size. Male and female 
groups were assigned to the two proximity conditions 
in an alternating fashion. 

Task, Subjects were given the same maze task as 
in the previous two experiments, The mazes were 
placed on individual TV tables for each subject to 
allow for the variation in proximity. Subjects were 
seated in the pattern of an imaginary square, with 
each subject facing the center of the square. The 
tables were low enough to allow the subjects com- 
plete eye contact with the person across from him 
(her) as well as those on the left and right sides. 
In the close conditions, groups of four males or 
four females were seated in a 2-foot (.6 m) square 
configuration. In the far condition, the four subjects 
were seated in a 5-foot (1.5 m) square. The dimen- 
sions of the room were 13 feet 1 inch X 11 feet 1 
inch (3.9 m 2.54 cm X 3.3 m 2.54 cm). A 7 foot 6 
inch X 2 foot 7 inch (2.1 m 15.24 cm X .6 m 17.78 
cm) one-way mirror was contained on one wall. 


Results 


The results of the last nine trials of maze 
performance were subjected to a log transfor- 
mation and analyzed in a 2 X 2 between-sub- 
jects analysis of variance with sex of the 
group and interpersonal distance as between 
factors. A significant main effect of proximity 
was obtained, 7(1,48) = 4.15, p < .05, with 
those in thi Wose condition performing more 
poorly than those in the far condition, In the 
close condition, males made 41.77 errors (n = 
13) and females made 41.31 errors (n = 15), 
whereas in the far condition, males made 
33.87 errors (z = 15) and females made 34.61 
errors (n = 13). There were no other signifi- 
cant effects. ó 


GENERAL DISCUSSION 


The results of this series of experiments 
have shown that previous conclusions (e.g., 
Freedman et al., 1971; Lawrence, 1974; Lin- 
der, 1974) that density does not have dele- 
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terious effects on task performance were in- 
correct, at least for the conditions of the 
present series of experiments. Increasing 
group size, decreasing room size, and decreas- 
ing interpersonal proximity all led to decre- 
ments in task performance. Furthermore, in 
contrast with the results of previous studies 
(eg., Freedman et al., 1972; Marshall & 
Heslin, 1975; Ross et al., 1973), the effects of 
two of these variables on task performance 
appear to be independent of the sexual compo- 
sition of the group. In Experiment 1, a strong 
group size effect was obtained using sexually 
heterogeneous groups. Freedman et al. (1972) 
found density effects only with sexually 
homogeneous groups. In Experiment 3, the 
effect of proximity was similar for both males 
and females in homogeneous groups. Only 
room size interacted with sex of the group (in 
Experiment 2) in a similar fashion as in pre- 
vious studies (Freedman et al., 1972; Ross et 
al., 1973). Although these previous studies 
found that males reacted more negatively to 
crowded conditions and females reacted more 
negatively to uncrowded conditions, it is not 
clear whether this effect was due to the room 
size or the proximity factor, since they were 
confounded. It is usually assumed that the 
proximity or personal space factor is the im- 
portant mediating variable (cf., Freedman et 
al., 1972; Ross et al., 1973). The results of 
Experiment 2 suggest that room size may in 
fact be the important factor or at least a con- 
tributing factor. 

The lack of a consistent effect of density 
on affect in the present study suggests that 
density may have adverse psychological ef- 
fects, as evidenced by the task performance 
measure, without involving subjective aware- 
ness of these effects. Possibly, a behavioral 
measure may be more sensitive to the effects 
of density in a short-term situation. In such 
situations, subjects may not have sufficient 
time to discern or even consider their feelings 
about their environmental and social setting. 
Alternatively, the conditions of density em- 

ployed in the present study may not have 
been aversive enough to produce strong emo- 
tional states. For example, Griffitt and Veitch 
(1971) did find negative affect results due to 
density, using more extreme variations in 


group size and a longer exposure period (more 
than 1 hour). 

The failure of previous studies to obtain 
effects of density on task performance cov 
have been due to the use of tasks insensiti 
to stress or to behavioral adaptation to crow 
ing stress in long-term crowding situations 
Another possibility is that these studies ust 
multiple tasks without providing evaluatis 
feedback to subjects or evidencing conce: 
with task evaluation. This procedure m: 
have lowered the level of evaluation app: 
hension necessary to obtain the effects of ti 
density variables on task performance (i 
Henchy & Glass, 1968; Paulus & Murdoc 
1971). Further research is required to dete: 
mine which of these factors is involved. 

The results of the present study may o 
course be limited to short-term crowded situ 
ations. However, since many crowded situa 
tions in everyday life are of the short-term 
variety (e.g., in elevators, buses, and testing 
situations), the effects of such short-term 
exposures to crowding are of interest in 
themselves. One implication of the present 
study is that the number of students in a 
classroom, classroom size, and chair arrange- 
ment may have bearing on the quality of 
performance in test situations. Thus, the per- 
formance of students in dense classroom situ- 
ations (high social density, high spatial den- 
sity, close proximity) may be poorer on 
complex tasks and better on simple tasks or 
well-learned tasks than students in low-den- 
sity classroom settings. Some evidence in 
support of this suggestion is provided in a 
recent study by Hillery and Fugita (1975). 


2 In contrast to this possibility, Paulus et al. (1975 
found that negative reactions increased with in- 
creased length of exposure to crowding in a long-term 
prison setting. 


REFERENCE NOTE 


1, Byers, P. J. Sex differences and evaluation poten- 
tial as determinants of social facilitation effects 
in @ cognitive task. Unpublished master’s thesis, 
Akron University, 1973. 
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Behavior Change Through Value Self-Confrontation: 
A Field Experiment 


Theodore Greenstein 
Department of Sociology, Washington State University 


A field experiment was conducted to determine the generalizability of Rokeach’s 
self-confrontation model. A total of 182 student teachers were randomly as- 
signed to either an experimental group, which received objective feedback con- 
cerning their own values and the values of previously surveyed “good” and 
“mediocre” teachers, or to a control group that did not receive such feedback, 
Posttests administered 13 weeks after the treatment showed that the experi- 
mental subjects ranked the values, mature love and loving, significantly higher 
and the value, self-respect, significantly lower than control subjects. More im- 
portant, experimental subjects were rated significantly higher on a measure of 
teaching ability by double-blind judges. Implications and suggestions concern- 
ing the applicability of self-confrontation theory in modifying other kinds of 


behayior are offered. 


Previous research (Rokeach, 1973) has 
claimed that long-term cognitive and behavior 
changes can be effected by a simple proce- 
dure called self-confrontation. This procedure 
consists of the objective feedback of one’s 
own and significant others’ values, attitudes, 
and behaviors. Available evidence suggests 
that this feedback made certain experimental 
subjects aware of previously existing contra- 
dictions within their own belief systems. Such 
awareness was hypothesized to have produced 
an affective state of self-dissatisfaction that 
culminated in behavior changes as long as 21 
months after the experimental treatment 
session, 

If self-confrontation does in fact offer a 
means to modify complex social behaviors en- 
duringly, it represents a major development 
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in social psychology. Any theory making such 
claims, however, demands rigorous experi- 
mental verification. The present research is 
designed to study the generalizability of self- 
confrontation theory and to test its effective- 
ness in a field setting. 

Rokeach (1973, Ch. 8) has posited a theory 
of cognitive and behavioral change that ad- 
dresses itself to the contradictions that may 
exist between an individual’s self-conceptions 
and his/her own behaviors, attitudes, and 
values. The theory is based on the assumption 
that human beings possess hierarchically 
ordered belief systems in which self-concep- 
tions (i.e., cognitions about one’s own compe- 
tence or morality in specific situations or 
roles) are the most central elements. Values 
defined as enduring beliefs [i.e., “a specific 
mode of conduct or end-state of existence is 
personally or socially preferable to an oppo- 
site or converse mode of conduct or end-state 
of existence” (Rokeach, 1973, p. 5)] are the 
next most central or important elements in 
the belief system. Less central than self- 
conceptions and values are attitudes or orga- 
nizations of “beliefs around a specific object 
or situation” (p. 18). 

According to Rokeach (1973), values pro- 
vide the individual with “standards employed 
to maintain and enhance self-conceptions” 
(p. 217). Rokeach theorizes that “a contra- 
diction between values and seli-conceptions 


can be most effortlessly resolved by changing 
the less-central values” (p. 217). If one 
becomes aware vf previously existing con- 
tradictions or inconsistencies between one’s 
values and self-conceptions, one should re- 
organize one’s value system to become more 
consistent with one’s self-conceptions. Since 
valués are postulated to be dynamically 
a related to behavior, value system reorganiza- 
tions are conceptualized to lead to some form 
of value-related behavioral change. 

Although superficially similar to other the- 
ories concerned with the perception of incon- 
sistency, Rokeach (1973, pp. 230-234) has 
pointed out several ways in which self- 
confrontation theory is fundamentally differ- 
ent from other consistency theories. First, 
the inconsistency irivolved stems from the 
recognition of contradictions between self- 
conceptions, on oné hand, and values, atti- 
tudes, or behaviors, on. the other, Second, the 
primary focus’ is on values rather than atti- 
tudés: “As long as the values underlying a 
changed attitude remain intact, there is no 
compelling theoretical reason why a short- 
termvattitude change should lead to behavior 
clfange” (p..232). Third, the theory is explic- 
itly about the conditions that lead to long- as 

opposed .to short-term changes in cogni- 
tions and behavior. Rokeach notes that 
“most cognitive theories in social psychology 
_ turn out . . . to be theories about conditions 
leading to short-term change” (p. 232). 

Most interesting, perhaps, is the technique 
employed to induce a long-term change. 
McGuire's (1969) review of attitude change 
research suggests that such research may 
generally be Placed in one of two types: (a) 
ï peřson may be exposed to information 
about thé cognitions or behavior of a signifi- 

chnt other that is discrepant with one’s own 
cognitions or behavior, or (b) a person may 
be induced to engage in behavior that is dis- 
crepant with one’s own cognitions. Self- 
confrontation, on the other hand, exposes a 
_ person to information about oneself in order 
to make one aware of previously existing 
. contradictions pvithin one’s own belief system 
or behavior. 
` Rokeach’s main experimental research using 
self-confrontation procedures has concentrated 
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on values, attitudes, and behaviors relating to 
racism. Certain experimental subjects discov- 
ered—through an experimental session that 
supplied objective feedback concerning the 
subjects’ own and other students’ values and 
behavior—that they held value rankings for 
equality that were inconsistent with their 
(assumed)  self-conceptions as nonracists, 
As long as 15 months after the treatment, 
experimental subjects showed significantly 
higher rankings for equality than comparable 
control subjects. More importantly, experi- 
mental subjects showed a significantly higher 
rate of joining the National Association for 
the Advancement of Colored People when 
solicited to do so 3 months after the experi- 
mental session, registered significantly more 
often for ethnic core courses, and changed 
their academic majors as long as 21 months 
after the experimental treatment. Penner 
(1971), Rokeach and Cochrane (1972), Rok- 
each and McLellan (1972), and McLellan 
(1974) have confirmed these findings in 
subsequent research, 

Other researchers have shown that self- 
confrontation may be used to induce long- 
term change in other contexts. Hollen (1972) 
successfully induced long-term value and atti- 
tude change concerning environmental issues. 
Conroy, Katkin, and Barnette (Note 1) 
focused their study on smoking behavior and 
values relating to self-control. Hamid and 
Flay (1974) induced value change and 
changes in locus of control among college 
students in New Zealand, while Rokeach 
(1975) has induced long-term change through 
feedback supplied by a computer terminal. 
Sherrid and Beech (in press) induced change 
in values among New York Police officers 
using self-confrontation procedures. 

The purpose of the present study is to 
provide a theoretical replication of Rokeach’s 
original (1973) experiments using self-con- 
frontation. To provide a totally different 
context in which to conduct the experiment, 
the replication was attempted using students 
enrolled in a teacher preparation program as 
subjects. While the field setting of the experi- 
ment is irrelevant to the theoretical replica- 
tion, it offers a number of methodological 
refinements over previous self-confrontation 
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research. First, the experiment was conducted 
outside the laboratory and designed so that 
the subjects were unaware that they were 
participating in an experiment. Second, the 
behavior under study—performance in the 
classroom during a 16-week student teaching 
experience—was of undeniable consequence 
to the subjects, possibly affecting their career 
opportunities. Third, the study employed an 
unobtrusive, nonreactive behavior measure 
obtained from judges who were also unaware 
that an experiment was in progress. All of 
these factors combined to provide the most 
rigorous test of the self-confrontation tech- 
nique to date. 

In the present experiment, student teachers 
were given the opportunity to compare their 
own values with the values of “good” and 
“mediocre” teachers, as determined from an 
earlier survey. More specifically, this previous 
research had shown that good teachers (those 
who had scored in the upper 10% on a be- 
havioral measure of teaching ability) had 
ranked the personal achievement value, a 
sense of accomplishment, significantly lower, 
and the socially oriented value, mature love, 
significantly higher, than did mediocre teach- 
ers (those who were judged to be in the 
bottom 10% in teaching ability). 

The self-confrontation model would predict 
that an affective state of self-dissatisfaction 
would result if an individual learned of 
contradictions between his values and self- 
conceptions. If student teachers discovered 
that they valued personal achievement over 
sensitivity to the problems of others, we 
would expect them to experience a state of 
self-dissatisfaction, since such values would 
contradict their self-conceptions as good or 
competent teachers. This self-dissatisfaction 
should lead to a reorganization of the student 
teachers’ value systems and ultimately to 
changes in behavior that are consistent with 
these changed values. The primary hypothesis 

of this research, therefore, is that values and 
behaviors related to teacher competency will 
be significantly modified by the objective 
feedback of information concerning one’s 
own values and those of others differing in 
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METHOD 


Subjects 


A total of 182 student teachers, enrolled in the 
teacher certification program at Central Michigan 
University, participated in the study. The teacher 
preparation program places the student teacher with 
an experienced supervising teacher in a cooperating 
public school in the appropriate subject area for 
approximately 16 weeks. During this time the student 
teacher is required to assume an increasingly greater 
responsibility for the supervisor’s class. After an ap- 
propriate interval, the student has accepted virtually 
all of the supervisor’s teaching responsibilities such 
as planning, classroom management, and pupil eval- 
uation. 

The students are initially assigned to teaching cen- 
ters composed of about 25 students and headed by 
a full-time university faculty coordinator. Assign- 
ment to the center is determined on the basis of 
availability of appropriate student teaching positions 
in the adjacent public school systems and the prox- 
imity of the student’s home to the center. Thus, 
while assignment to the center is decided on a non- 
random basis, it is nonsystematic insofar as the ex- 
perimentally relevant variables are concerned. 


Instrumentation 


Rokeach’s (1973) Value Survey, Form E, was 
used to assess the value systems of the subjects at 
pretest and posttest. Subjects were asked to rank 
the two lists of terminal and instrumental values “in 
order of their importance to you, as guiding prin- 
ciples in your life.” Test-retest reliabilities for the 
terminal scale are in the .70s, while instrumental 
reliabilities range from .65 to .70 (Rokeach, 1973, 
p. 32). 

Measurement of teaching competency was accom- 
plished through the use of the teaching program’s 
Evaluation of Student Teaching form. This form, 
which is compiled at the end of the teaching ex- 
perience by the student's supervising teacher, consists 
of 6-point ratings of performance on each of seven 
dimensions relating to teaching ability: personal 
qualities, human relations, communication skills, aca- 
demic preparation, classroom effectiveness, profes- 
sionalism, and overall teaching potential. Factor 
analyses of pilot study data revealed that the scale 
is unidimensional; using an oblique rotation, one 
factor accounts for over 75% of the variation in 
scores. Accordingly, the seven subscales were summed 
to form a single measure of teacher competence, 
Cronbach’s a for this scale is .973, 


Procedure 


The experiment is a pretest—posttest control group 
design with pencil-and-paper and behavioral post- 
tests given approximately 13 weeks after the experi- 
mental session. 

Pretest. During the first week of the teaching pro- 
gram, the experimenter administered the pretest ques- 
tionnaire, meeting with each of the seven teaching 
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centers separately. The experimenter was introduced 
by the center's coordinator with the statement that 
„he was conducting a research Program with the co- 
operation of the university, The students completed 
a questionnaire containing the Value Survey and 
various attitudinal and demographic items unrelated 
to the present research; the entire session lasted 
about an hour. 

Treatment. Experimental and control group ses- 
sions for each center took place on the same day, 
approximately 10 days after the pretest was admin- 
istered to each center, The same experimenter con- 
ducted both sessions, and the order in which he met 
with the experimental and control groups was de- 
termined randomly. Within each center, students 
were randomly assigned to experimental or control 
groups. These groups ranged in size from 10 to 15 
students, 

` Experimental session, Subjects received a “result 

summary” booklet with their name on the front 
page. Upon opening the booklet, the subject found 
his or her own rankings of the terminal value scale, 
as obtained from the pretest session 10 days before, 
The scale had been physically removed from the 
pretest questionnaire so that the subjects would iden- 
tify their own handwriting. 

The subjects were then given feedback that en- 
abled them to compare their own values with those 
of others. This feedback was presented in written 
form as well as orally by the experimenter: 


` Last spring, 308 student teachers from Central 
Michigan University filled out the same value sur- 
vey that you completed a short time ago. The re- 
sponses of these 308 student teachers were col- 
lected and averaged together. 


The subjects were then shown Table 1, which 
displayed these results, and the experimenter drew 


TABLE 1 


VALUE RANKINGS oF 308 CENTRAL Micnican 
Universiry STUDENT TEACHERS 


Rank Value 
14 a comfortable life 
12 an exciting life 

4 a sense of accomplishment 
10 a world at peace 
13 a world of beauty 
11 equality 

9 family security 

7 freedom 

2 happiness 

1 inner harmony 
8 mature love 
18 national security 
15 pleasure 
16 salvation 
3 self-respect 
17 social recognition 

6 true friendship 

5 wisdom 


TABLE 2 


AVERAGE VALUE RANKINGS FOR GOOD AND 
MEDIOCRE TEACHERS 


Value 
A sense of accomplishment 9 2 
Mature love 2 9 


the subjects’ attention to two target values in par- 
ticular, as follows: 


One of the most interesting findings shown in 
Table 1 is that the student teachers, on the aver- 
age, felt that A Sense of Accomplishment was 
very important—they ranked it 4; but that Ma- 
ture Love was considerably less important—they 
ranked it 8. Apparently, Central Michigan Univer- 
sity student teachers value A Sense of Accomplish- 
ment far more highly than they value Mature 
Love. 


The experimenter then invited the student teach- 
ers to compare their own value rankings with those 
of the previously sampled group of student teachers, 
After allowing them a few minutes to do so, the 
experimenter continued as follows: 


At the same time that we collected the value rank- 
ing data on these 308 student teachers, we were 
able to compare their value rankings with the 
evaluation report that each student teacher re- 
ceived from his or her supervising teacher at the 
end of the student teaching experience. We divided 
the student teachers into two groups: “good teach- 
ers,” or those who received favorable evaluations; 
and “mediocre teachers,” or those who had re- 
ceived relatively poor evaluations from their su- 
pervisors. Table 2 shows what we feel are some 
very interesting differences between these two 
groups of student teachers. 


The experimenter then went on to say: 


Notice that in Table 2 the rankings of A Sense 
of Accomplishment and Mature Love are exactly 
reversed for the two groups of student teachers, 
Good teachers value Mature Love rather highly, 
and place much less emphasis on A Sense of Ac- 
complishment. Mediocre teachers, on the other 
hand, place a great deal of emphasis on A Sense 
of Accomplishment while deemphasizing Mature 
Love. 

These data raise the question as to whether 
being concerned with the problems of others, and 
placing less emphasis on personal achievement, is 
essential to success as a public school teacher, 
These data could be interpreted to mean that good 
teachers value the problems of others above their 
own personal gain or advantage. 


The student teachers were again invited to com- 
pare their own value rankings with those of the 
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“sood” and “mediocre” teachers. The subjects were 
then invited to discuss the interpretation of the dif- 
ferences, and during the course of this discussion, 
some offered their own interpretations of the find- 
ings. After the discussion tapered off, the experi- 
menter thanked the students and dismissed them 
with the comment that he hoped that the students 
had found the comparisons of their own values with 
those of others interesting and meaningful. 

Control session. The control session was deliber- 
ately designed to create interest on the part of the 
control subjects. The experimenter realized that the 
student teachers were likely to discuss the study 
with each other, so the sessions were created to be 
nearly identical save for the crucial experimental 
variable of feedback of information designed to im- 
plicate the subjects’ self-conceptions. The control 
subjects received a “result summary” booklet with 
a cover page identical to that received by the ex- 
perimental subjects. Upon opening the booklet, how- 
ever, the control subjects were asked to rank the 
values of the Value Survey as they perceived pro- 
fessors of education might rank them. Upon com- 
pleting the rankings, the control subjects turned to 
the next page, where the actual composite rank 
orderings of the values of 347 professors of educa- 
tion were listed (Spear, 1973). The subjects were 
encouraged to discuss these findings, and after about 
15 minutes, the experimenter thanked the control 
subjects and dismissed them. 

It is important to note that there are no com- 
pelling reasons to expect that the control treatment 
might have affected the values or behavior of the 
control subjects, The control subjects were not ex- 
posed to information concerning their own values 
(remember that all subjects completed the Value 
Survey 10 days before at the pretest, but that only 
experimental subjects were shown their value rank- 
ings at the treatment session). Second, no interpre- 
tation concerning the professors’ value rankings were 
offered. Third, McLellan (1974) has demonstrated 
that long-term change is unlikely to result in the 
absence of such an interpretation of target value find- 
ings. For these reasons, it is apparent that the ses- 
sion is an appropriate control for the experiment. 

With the exception of the crucial experimental 
variable of the feedback of information concerning 
one’s own and others’ values, the experimental and 


control sessions were identical. They both involved 
the discussion of value data from some education- 
related group: Both asked subjects to respond to 
the findings presented, and all sessions lasted about 


an hour and were conducted by the same experi- 
menter. 

Posttest. Thirte: 
sion, the post 
(either by the 


cks after the treatment ses- 
tionnaire was administered 
oordinator or by another 
experimenter) to ts meeting in their respec- 
tive centers. This aire contained the Value 
Survey and seve linal and demographic 
t study. 
perimenter was given 
dent Teaching forms 


items unrelated to 
Behavioral measu 
access to the Evaluat 
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that are routinely completed by each student's su- 
pervisor some 12 to 13 weeks after the treatment 
session. The evaluation score on this form served 
as a behavioral measure of teaching ability. 

The judges rating the student teachers on teach- 
ing ability (i.e., the supervising teachers) were blind 
both to the group assignment of the student teacher 
they were evaluating and to the purposes and hy- 
potheses of the study. In fact, none of the super- 
vising teachers were aware that any kind of experi- 
ment was in progress. 


RESULTS 


Since subjects were randomly assigned to 
treatment conditions within centers, it will 
be useful to test the effects of the treatment 
within a two-way design, with treatment and 
center as the independent variables. A signifi- 
cant interaction between these two variables 
would suggest that the treatment had differ- 
ential effects across the seven teaching centers 
and that the study might best be conceptual- 
ized as seven separate experiments, one within 
each center. Accordingly, such interaction ef- 
fects will be considered when interpreting 
the differences between experimental and con- 
trol subjects across centers. No statistically 
significant value differences between experi- 
mental and control groups were observed at 
the pretest. 


Effects on Values 


The target values of the treatment were a 
sense of accomplishment and mature love. 
The self-confrontation model predicts that the 
experimental subjects who discovered that 
they held rankings for these values that were 
unlike those of the good teachers would sub- 
sequently change their rankings of these val- 
ues to become more consistent with that self- 
conception. The results are shown in Table 3. 
Also shown are the posttest median rankings 
for the remaining 34 values, since Rokeach 
(1973, p. 257) has suggested that values 
other than those directly implicated by the 
treatment may also undergo long-term change. 

Posttest differences in the rankings for ma- 
ture love indicate that the treatment had the 
predicted effect on this value. Thirteen weeks 
after the treatment, the experimental sub- 
jects had a median ranking of 4.35, while 
comparable control subjects had a ranking 
of 9.00. This difference is significant beyond 


BEHAVIOR CHANGE THROUGH SELF-CONFRONTATION 


259 


TABLE 3 
POSTTEST TERMINAL AND INSTRUMENTAL VALUE MEDIANS AND COMPOSITE 
RANK ORDERS FOR CONTROL AND EXPERIMENTAL SUBJECTS 
Subjects Median test 
Value Control* Experimental x p 
Terminal 
a comfortable life 11.00 (12) 12.36 (13) 1.60 203 
an exciting life 11.38 (13) 11.88 (12) 1.25 203 
a sense of accomplishment 6.70 ( 7) 7.25 ( 8) 02 898 
a world at peace 11.00 (11) 11.14 (11) 01 O10 
, a world of beauty 14.05 (16) 12.94 (15) 2.42 116 
equality 11.00 (10) 10.00 (10) 62 503 
family security 9.50 ( 9) 8.78 ( 9) 67 583 
freedom 6.63 ( 6) 6.78 ( 7) 00 987 
happiness 4.41 ( 3) 3.67 ( 2) 2.49 a111 
inner harmony 3.50 ( 1) 3.27 ( 1) .05 822 
mature love 9.00 ( 8) 4.35 ( 3) 16,99 <.001 
national security 16.42 (18) 16.63 (18) 02 891 
pleasure 13.00 (14) 12.81 (14) 00 985, 
salvation 15.30 (17) 16.44 (17) 84 -638 
self-respect 4.17 ( 2) 6.22 ( 5) 7.17 .006 
social recognition 13.50 (15) 15.00 (16) 2.77 092 
true friendship 6.21 ( 4) 6.63 ( 6) 42 526 
wisdom 6.30 ( 5) 6.00 ( 4) 12 732 
Instrumental 
ambitious 8.25 ( 8) 8.33 ( 8) 00. 984 
broadminded 6.00 ( 3) 7.33 ( 6) 50 512 
capable 7.78 ( 5) 8.35 ( 9) 50, 512 
‘ cheerful 8.86 (10) 10.56 (11) 70 593 
clean 14.00 (17) 14.44 (17) 27 610, 
courageous 13.00 (15) 13.11 (15) O01 901 
forgiving 8.06 ( 6) 6.94 ( 4) 1.62 200 
helpful 6.67 t 4) 7.56 ( 7) Bb 137 
honest 2.67 ( 1) 3.78 ( 1) Al 531 
imaginative 12.00 (14) 10.60 (12) 88 -650 
independent 8.13 ( 7) 7.06 ( 5) .62 563 
intellectual 10.88 (12) 12.42 (14) 1.05 306 
logical 11.71 (13) 11.00 (13) 10 «149 
loving 8.57 ( 9) 4.00 ( 2) 10.15 :002 
obedient 16.15 (18) 17.07 (18) 1.33 247 
polite 13.44 (16) 13,57 (16) 00 1.000 
responsible 5.00 ( 2) 5.80 ( 3) 1.34 246 
self-controlled 9.75 (11) 8.71 (10) 86 642 


Note. Numbers in parentheses are composite rank order: 
an = 76 and 73 for terminal and instrumental val 
> = 87 and 85 for terminal and instrumental values 


the .001 level using the median test. Addi- 
tional analyses indicated that nearly two 
thirds of the experimental subjects increased 
their ranking for mature love, while only 25% 
of the control subjects did so. 

However, there is no significant difference 
in the rankings of a sense of accomplishment 
between the two groups. Inspection of the 
pretest data (not presented) shows that both 
~ groups had -relatively low rankings for this 
value (experimental group, 8.08; control 
group, 7.13). Since the overall pretest rank- 
ings for this value were already consistent on 
the whole with the values of good teachers, 


feelings of dissatisfaction should not have 
been induced among most of the experimental 
subjects. Consequently, the theory does not 
predict that the overall effect for the experi- 
mental group as a whole would be significant. 
On the other hand, the pretest median rank- 
ing for mature love for experimental subjects 
was 6.88, which is clearly inconsistent with 
the values of good teachers. 

Two other values do show significant post- 
test differences, however, and both of these 
differences are consistent with theoretical ex- 
pectations. Experimental subjects ranked lov- 
ing significantly higher—nearly five units 


TABLE 4 


MEAN EVALUATION SCORES 
EXPERIMENTAL SUBJECT: 


R CONTROL AND 
CENTER 


ASSIGNMENT 
Group 
Center Control Total 
1 34.07 (15) 34.47 (15) 34.27 ( 30) 
2 37.90 (10) 39.08 (12) 38.55 ( 22) 
3 34.45 (11) 34.85 (13) 34.67 ( 24) 
4 34.90 (10) 35.50 (12) 35.23 ( 22) 
5 35.00 (11) 35.38 (13) 35.21 ( 24) 
6 31.13 i) 36.00 (10) 33.08 ( 25) 
7 31.25 (12) 36.00 (12) 33.63 ( 24) 
Total 33.87 (84) 35.83 (87) 34.87 (171) 
Note. Numbers in parentheses indicate the number of sub- 


jects in each cell. 


higher—while they ranked self-respect sig- 
nificantly lower than control subjects. 

Further analyses of the posttest data indi- 
cate that the Treatment X Center interaction 
is not significant for any of the values dis- 
cussed above, permitting us to consider the 
data from all seven centers in one analysis. 

It may be argued that a more appropriate 
analysis technique for these data would be 
the analysis of covariance, controlling for pre- 
test value ranks. Such a technique is recom- 
mended by Campbell and Stanley (1963, p. 
23) and, in fact, yields even more striking 
differences between experimental and control 
groups. Due to the nonparametric nature of 
the value rank data, however, the results 
from the more conservative median test are 
reported. 


Effects on Teacher Behavior 


Table 4 presents mean teaching evaluation 
scores for subjects grouped by treatment 
within centers. These scores range from 18 to 
42, with higher scores indicating better teach- 
ine ability. Inspection of the data shows that 


experimental subjects scored higher on the 
ev on measures (i.e., were judged, on 
th o. to be better teachers) in each of 
the ters. This finding is highly un- 
likel \bability of experimental sub- 
ject gher in all seven centers by 


chance 008 (by randomization test 


for mat s). 
Table ı two-way analysis of vari- 
ance for t The Treatment X Center 
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interaction is not significant, and we are con- 
sequently justified in conceptualizing the 
study as a single experiment. Overall, the 
experimental subjects show a mean evaluation 
score nearly two points higher than the con- 
trol subjects, a difference that is significant 
at the .008 level. 


Discussion 


In a substantive sense, the major finding 
of the research reported here is that a self- 
confrontation procedure showed significant 
effects on the values and behavior of experi- 
mental subjects 13 weeks after the treatment 
session. Student teachers who received objec- 
tive feedback concerning their own values and 
those of good and mediocre teachers (a) €x- 
hibited significantly higher value ranks for 
mature love and loving and lower ranks for 
self-respect, and (b) showed significantly 
higher scores on a behavioral measure of 
teaching ability than did student teachers not 
receiving such feedback. 

Why should people who value mature love 
be better teachers than those who value a 
sense of accomplishment? This is certainly 
not an obvious relationship, and we can only 
speculate at this time about a possible ex- 
planation. It might be conjectured that indi- 
viduals who are primarily concerned with the 
problems of others (as indicated by higher 
rankings of socially oriented values such as 
mature love) and less concerned with personal 
achievement (as indicated by lower rankings 
of values such as a sense of accomplishment) 
should be better able to empathize and iden- 
tify with their students. Whether or not this 
speculation is correct, our empirical finding 
that these values distinguish between good 
and mediocre teachers is supported by studies 


TABLE 5 


‘Two-Way ANALYSIS OF VARIANCE FOR MEAN 
EVALUATION SCORE BY GROUP AND CENTER 


Mean 


Source df square F > 
Treatment 1 163.93 7.08  .008 
Center 6 71.99 3.11 007 
Treatment X Center 6 21.02 .91 508 
Residual 157 12316 

Total 170 
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= of teacher trainees in Australia (Feather, 
1975) and Michigan (Oberle, 1974). 

Significant differences in posttest value 

rankings were found for mature love but not 

for the other target value of the self-confron- 

tation treatment, a sense of accomplishment. 

Inspection of the pretest data reveals that 

the reason that no differences were found for 

s a sense of accomplishment is relatively 
straightforward and consistent with the the- 
ory. Upon receiving the feedback data, the 
majority of the experimental subjects discov- 
ered that they already ranked this value as 
did the good teachers discussed in the treat- 
ment. Consequently, there was no motivation 
for change on a sense of accomplishment. 

On the other hand, experimental subjects 
generally discovered that their relatively low 
® value for mature love, which experimental 
subjects ranked eighth on the average at the 
pretest, conflicted with what they had found 
out about the values of competent, effective 
teachers. Thus, we would expect a significant 
upward shift in this value as a result of the 
feedback, and the posttest analysis confirms 
this expectation. 

w% Two other values not directly implicated 
by the treatment also showed sizable and sig- 
nificant differences between experimental and 
control subjects at the posttest. The instru- 

) mental value, loving, was ranked more than 
four units higher by experimental subjects at 
ef the posttest, while the terminal value, self- 
respect, was ranked two units lower, These 
findings provide additional data which suggest 
that a primary effect of the treatment was 
to decrease the emphasis that the experimen- 
tal subjects placed on personal achievement 
values while increasing the importance they 
placed on values relating to concern for the 
problems of others. 

All of these differences observed 13 weeks 
after the experimental treatment can be in- 
terpreted as changes occurring in the direction 
of increasing compatibility with self-concep- 
tions. According to Rokeach’s (1973) theory, 
cognitions and behaviors will undergo long- 
term change if they are demonstrated to be 
inconsistent with self-conceptions concerning 
competence or morality. The experimental 

treatment in the present study was explicitly 


———— ee — ltt (t‘ié‘a 


designed to implicate the subjects’ self-con- 
ceptions concerning their competence as 
teachers. It can be assumed that when sub- 
jects learned that they held values that were 
inconsistent with their self-conceptions, the 
result was a state of self-dissatisfaction, 
which then led to the observed cognitive and 
behavioral changes. 

We were not able to gather empirical data 
on self-dissatisfaction in the present study 
(so that the subjects would not suspect that 
they were participating in an experiment), 
but previous research strongly suggests that 
this is the most parsimonious account of the 
psychological process that explains the ob- 
served value and behavioral changes. Experi- 
mental studies by Rokeach (1973), McLellan 
(1974), Hamid and Flay (1974), and Sherrid 
and Beech (in press) have confirmed the hy- 
pothesis that self-dissatisfaction is a primary 
determinant of value change. These studies 
have uniformly shown that those experimental 
subjects who reported at the end of the ex- 
periment that they were dissatisfied with their 
ranking for a particular value, manifested 
significantly more long-term change than sub- 
jects reporting that they were satisfied. 

The fact that a single brief experimental 
treatment had such long-lasting effects on 
complex social behavior should lead the reader 
to regard the design of the study with the 
tightest scrutiny. However, various elements 
of the research design—including the field 
setting of the study, the length of time (13 
weeks) between the treatment and the post- 
test, and the use of judges who were unaware 
that an experiment was in progress—combine 
to provide a preponderance of evidence that 
the observed differences are, in fact, due to 
the experimental treatment. The design of 
the study tends to rule out the usual threats 
to validity, such as demand characteristics, 
experimenter effects, or selection problems. 

When considered in conjunction with the 
experimental studies discussed earlier, the 
findings of this study suggest that self-con- 
frontation techniques offer an effective and 
practical means of modifying complex social 
behaviors. Research using such techniques has 
now successfully modified values and/or be- 
havior relating to racism, smoking, ecology, 
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police-community relations, locus of control, 
and teaching. 

The experimental confirmation of a cogni- 
tively based theory of behavior change repre- 
sents a major development in social psychol- 
ogy and may provide an alternative to 
behavior modification for field researchers and 
practitioners. One apparent advantage of self- 
confrontation over behavior modification is 
that while the latter is generally effective in 
modifying simple or situation-specific behay- 
iors, self-confrontation has been shown to be 
applicable to a wide range of complex social 
behavior. It may well be that self-confronta- 
tion is especially useful in modifying those 
behaviors whose reinforcement contingencies 
are too complex or obscure for the applica- 
tion of behavior modification techniques. 


REFERENCE NOTE 
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Modification of smoking behavior by the use of a 
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Correlational Bias in Observer Ratings 


Jeffrey S. Berman and David A. Kenny 
Harvard University 


A variety of evidence suggests that implicit assumptions about traits and be- 
haviors may seriously bias the correlations derived from observer ratings. Up 
to now, however, there has been little experimental examination of the process. 
In the present study, pairs of traits that are commonly assumed to be either 
positively correlated, uncorrelated, or negatively correlated were assigned to 
fictitious stimulus persons in such a way that there actually was either a posi- 
tive, zero, or negative correlation between the two traits in each pair. Subjects 
made trait ratings of the stimulus persons, and correlations were derived from 


these ratings, The results indicated th 


at implicit assumptions concerning the 


co-occurrence of the traits had a systematic effect on immediate as well as de- 
layed ratings made by the subjects. Moreover, the degree of this correlational 
bias appeared to be constant not only across different levels of actual correlation 


but also across ratings made at different 


points in time. These findings have 


important implications for the use of human observers and raters in the social 


sciences. 


- 


Human observers and raters have long been 
an important source from which the psychol- 
ogist gathers data. For many situations, in 
fact, the recording of complex behaviors is 
simply beyond the capacity of any mechan- 
ical coding device and can only be accom- 
plished by the use of rating techniques. Even 
early researchers recognized, however, that 
there might be systematic biases in this type 
of data collection. 

Thorndike (1920), as well as others (Rugg, 
‘1922; Wells, 1907), for example, suggested 
the presence of a “halo effect” in ratings. 
These researchers argued that the recall of 
information on a rating task would be affected 
*by the general impression the rater formed 
of the ratee. Thus, if a person was seen as 
being highly friendly, raters would tend to 
overestimate that person’s standing on other 
positive traits such as sociable, kind, or honest. 
Sugh a halo effect would therefore result in 
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the appearance of spuriously high correlations 
between those traits or behaviors sharing 
similar connotations, 

The original evidence for the halo effect 
was simply the occurrence of improbably high 
levels of association between items on a rating 
scale. Newcomb (1931), however, later pro- 
vided a more direct demonstration of this type 
of bias. In his research, Newcomb found that 
the correlations between behaviors referring 
to the same trait were consistently higher 
when they were derived from retrospective 
ratings than when they were derived from 
immediate observations. He reasoned that the 
higher correlations in the retrospective ratings 
must have resulted from “logical presupposi- 
tions in the minds of raters rather than from 
actual behavior” (1931, p. 288). Psychologists 
since then have often designated this biasing 
problem as “logical error.’ And a recent 
analysis by D’Andrade (1974), using similar 
comparisons between immediate and retro- 
spective ratings, has further confirmed the 
possibilities of this rating bias, 

The existence of the logical error and the 
halo effect suggests that raters share certain 
assumptions of how traits and behaviors “go 
together”; in recent years the nature of these 
assumptions has itself become a major area 
of interest in psychology. Arising out of re- 
search on person perception, these assumed 


263 


264 


correlations have been labeled “implicit theo- 
ries of personality” (Bruner & Tagiuri, 1954; 
Cronbach, 1955). Significantly, the research 
that has emerged from this area (e.g., Koltuv, 
1962; Rosenberg, Nelson, & Vivekananthan, 
1968; Wishner, 1960) indicates that implicit 
personality theories are both clearly struc- 
tured and, to a large extent, commonly agreed 
upon. Furthermore, there are indications that 
implicit personality theories may rest on very 
basic semantic foundations (e.g., Kuusinen, 
1969; Rosenberg & Olshan, 1970). 

The biases that may be generated by im- 
plicit personality theory, whether they are 
labeled as logical error, halo effect, or given 
some other name, would all seem to highlight 
the same general difficulty in observer ratings. 
In the present paper, we shall simply refer 
to this problem as correlational bias. The 
basic process seems clear: Raters often ap- 
pear to possess assumptions concerning the 
co-occurrence of rated items, and these as- 
sumptions may introduce systematic distor- 
tions when correlations are derived from the 
ratings. The possibility of such correlational 
bias could therefore exist whenever a re- 
searcher wishes to use raters, or observers, as 
a means of assessing the association between 
behaviors or between traits; for, in such situ- 
ations, it may be unclear whether the inter- 
correlations obtained from the ratings are due 
to the assumptions and biases of the rater or 
to the actual behavior of the persons being 
rated. 

The potential dangers of correlational bias 
are perhaps nowhere more clearly illustrated 
than in the experience of a number of per- 
sonality researchers who relied heavily upon 
trait ratings in an attempt to outline basic 
structures of personality (e.g., Cattell, 1946, 
1947, 1957; Fiske, 1949; Lorr & McNair, 
1963, 1965; Norman, 1963). Whereas these 
researchers claimed to have discovered rather 
sta »ersonality dimensions, later investi- 
a g., D’Andrade, 1965; Mulaik, 1964; 
P: Norman, 1966) have quite strongly 
s the dimensions derived from 
- studies may have been more 


2 


ig 


these pei 
a product stereotypic and. semantic 
assumptions raters than the personal- 


ity of the p being rated. Studies by 
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Chapman and Chapman (1967, 1969), as well 
as others (e.g., Starr & Katkin, 1969), have 
indicated that a similar problem of correla- 
tional bias seems to occur in judgments and 
ratings made by clinicians. 

Psychologists who rely on observers and 
raters have not been totally unaware of this 
problem, and suggestions have periodically 
arisen concerning ways in which correlational 
bias may be minimized or eliminated. The 
most common of these suggested strategies 
have included the use of carefully defined 
behavioral terms on the rating scale, the 
rating of all persons on one trait at a time, 
and the computation of correlations from the 
mean ratings of a number of observers (e.g., 
see Guilford, 1954, Ch. 11; Vernon, 1938, 
Ch. 4). Unfortunately, these simple tech- 
niques may very well be ineffective. For ex- 
ample, the suspect personality studies men- 
tioned above generally employed carefully de- 
fined rating terms, and they often computed 
correlations from mean ratings. Similarly, an 
experiment by Johnson (1963) failed to find 
any reduction in correlational bias when all 
persons were rated one trait at a time. 

More complex statistical manipulations, in- 
cluding an early procedure by Chi (1937) as 
well as a later formulation by Stanley (1961), 
have also been proposed to deal with the 
problem of bias in ratings, However, these 
statistical methods reduce correlational bias 
by relying on differences between raters; 
thus, they are not effective in compensating 
for any biases which result from common 
stereotypes of certain ratees, Morever, these 
techniques require much added computation 
and have not enjoyed widespread usage in 
rating research, 

The majority of researchers who employ 
observer ratings, in fact, have generally 
seemed to ignore or dismiss the problem of 
correlational bias and have continued to re- 
port correlations derived from uncorrected 
ratings. Perhaps one reason for this apparent 
indifference may be that most of the evidence 
of a bias in ratings has been circumstantial, 
rather than experimental, in nature, Thus, 
correlational bias has been inferred from un- 
reasonably high correlations among rated 
items (e.g., Thorndike, 1920), by comparison, 
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between immediate and retrospective ratings 
(e.g., Newcomb, 1931), or from the similarity 
between semantic structures and the struc- 
tures of actual ratings (e.g., D’Andrade, 1965; 
MujJaik, 1964). In none of these cases, how- 
er, has the true correlation between the rated 
items been known; the skeptical psychologist 
might therefore devise alternate explanations 
for each of these findings and question 
whether correlational bias occurs in ratings 
at all. 

One obvious way to resolve this issue and 
highlight any bias that does occur in ratings 
is to experimentally manipulate the actual 
correlation between items that are being rated 
by observers and then examine the correla- 
tional bias which may appear under these 
conditions. Up to now, however, the few 
experimental studies in this area have em- 
ployed some variant of the Chapman and 
Chapman (1967, 1969) design, in which sub- 
jects are asked to directly judge the co-occur- 
rence of items they have observed. These 
studies are less than ideal in any controversy 
over ratings because in the normal observer 
rating situation observers are not required to 
directly judge the co-occurrence of rated 
items but must simply indicate the standing 
of persons on traits, Psychologists can right- 
fully argue that the bias that appears in 
direct co-occurrence judgments may be quite 
different from any bias that may appear when 
observers rate individual traits, and correla- 
tions are then derived from these individual 
ratings. 

The investigation to be reported here was 
therefore designed to experimentally demon- 
strate the process of correlational bias with a 
procedure that more closely parallels a typical 
rating situation. The study itself consisted of 
two parts and involved two separate groups 
of subjects. A questionnaire was given to the 
first group of subjects to assess the correla- 
tions that are assumed to exist among pairs 
of personality traits. From the responses to 
this questionnaire, trait pairs were chosen 


@ whose traits were thought to be (a) very 


likely to co-occur, (b) not at all likely to 
co-occur, or (c) unrelated to one another. 
Fictitious stimulus persons were then assigned 
standings on these traits in such a way that 
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there would actually be either a positive, 
negative, or zero correlation between the two 
traits in each trait pair. The second group of 
subjects was asked to make trait ratings for 
each stimulus person, and correlations be- 
tween the traits in each trait pair were de- 
rived from these ratings. In this way, trait 
pairs that are commonly assumed to have 
three different types of relationships were 
presented in three types of actual correla- 
tional patterns. From these data, the dynamics 
of any correlational bias could be clearly 
examined: One could assess the separate 
effects on the ratings of (a) the actual cor- 
relations among the traits, (b) the assumed 
correlations among the traits, and (c) any 
interaction of the actual and assumed cor- 
relations, 


METHOD 
Subjects 


Twenty subjects (11 males; 9 females) completed 
the initial questionnaire, A separate set of 30 sub- 
jects (20 males; 10 females) participated in the main 
experiment. All subjects were recruited through 
notices posted at Harvard University, and each 
subject was paid $3. 


Procedure 


Initial questionnaire. The purpose of the initial 
questionnaire was to assess the commonly shared 
notions of how personality traits go together. More 
specifically, the aim was to identify pairs of traits 
having one of three types of assumed correlation: 
positive, zero, or negative. As the label implies, the 
assumed correlation of a trait pair is (a) positive if 
two traits are generally thought to be likely to co- 
occur within the same person, (b) sero if two 
traits are considered equally likely as unlikely to 
co-occur within the same person, and (c) negative 
if two traits are generally thought to be unlikely 
to co-occur within the same person. Earlier studies 
(e.g., Warr & Knapper, 1968, Experiment 14; 
Wishner, 1960) have provided substantial evidence 
that such correlational assumptions among trait pairs 
do exist. 

The initial questionnaire involved 22 trait terms, 
many of which were taken from previous research 
(c.g, Rosenberg et al., 1968), On each page of the 
questionnaire the subject was informed that a per- 
son possessed one of the traits. He or she was then 
asked to indicate, on a 5-point scale, how likely it 
was that the person would also possess a series of 


Satine ne) alee 


other traits. The following is a typical item: 


‘A person is ctuMsy. How likely would he(she) be: 


equally some- gener- 
gener- some- likely what ally 
ally what asun- un- un- 
likely likely likely likely likely 
POLITE (m) o mj oO Oo 


Since the number of assessments needed to mea- 
sure every possible relationship between 22 traits 
was tediously large, two forms of the questionnaire 
Were produced. Half of the subjects (10) completed 
Form A and half completed Form B. For each form, 
only one implication direction between a pair of 
traits was assessed. For example, in Form A, the 
subject was told that a person was clumsy and was 
then asked to indicate the likelihood of that person 
also being polite. In Form B, on the other hand, 
the subject was told that a person was polite and 
was then asked to indicate how likely it was for 
that person to be clumsy. An asymmetrical relation- 
ship between a pair of traits therefore resulted in dif- 
ferent ratings on the two forms of the questionnaire. 

Analysis of the questionnaire. For each possible 
pair of traits, a mean rating was calculated sepa- 
rately for the implication direction of Form A and 
the opposite direction of Form B. If, according to 
a £ test, the two “directional” means were signifi- 
cantly different (p< .05), the pair was judged to 
contain asymmetrical inferences and was excluded 
from further consideration. 

Nine trait pairs were then selected from the re- 
maining combinations using an overall mean rating 
computed across both forms of the initial question- 


TABLE 1 


Tie Nine Trarr Pairs CHOSEN FROM THE 
INITIAL QUESTIONNAIRE 


t test of 
difference 
Assumed correlation and Mean from score 
trait pair scores of zerob 
Positive 
understanding-tolerant 1.7 13.31* 
persistent-industrious 1.5 11.05* 
friendly-sociable 1.8 22.58* 
Zero 
clumsy-polite 0 -00 
unreliable-aggri —.0 —.33 
irritable-cautious At 46 
Negative 
shy—domineering —1.8 
-frivolous — Ty 
-impulsive =S 


—2.0 to 2.0. Positive scores 
med to be positively cor- 
that the trait pair was as- 


a The maximum range 
indicate that the tr; 
related; negative scot 
sumed to be negatively co: 

v Based on 19 degrees of freedom. 

*p <.001, 


JEFFREY S. BERMAN AND DAVID A. KENNY 


naire. Three pairs were chosen to represent each type 
of assumed correlation described above. Thus, the 
trait pairs chosen to represent a positive assumed 
correlation were those with high positive ratings 
(closest to “generally likely” on the rating scale). 
Trait pairs chosen to represent a zero assumed cor- 
relation were those with ratings near the zero point 
on the scale (closest to “equally likely as unlikely”). 
Trait pairs chosen to represent a negative assumed 
correlation were those which received the largest 
negative ratings (closest to “generally unlikely” on 
the scale). The nine pairs, their mean scores on the 
questionnaire, and £ tests of their difference from 
a neutral, or zero, score are presented in Table 1. 
Stimulus material. The 18 chosen trait words were 
combined with eight short, male names in a series 
of printed sentences. In each of these sentences, one 
of the males is said to have been rated either high 
or low on one of the traits. Typical sentences 
appeared as follows: JOHN WAS RATED LOW ON 


THE TRAIT SOCIABLE. DAVE WAS RATED 


HIGH ON THE TRAIT IMPULSIVE. The various 
combinations of name, trait, and rating of high or 
low were typed out, and each sentence was put on a 
separate photographic slide. 

Presentation of the traits. Sessions of the main 
experiment were run in groups of five subjects. Upon 
arrival, subjects were seated at a large semicircular 
table facing a projection screen, Each subject was 
given a sheet of instructions, which began with the 
following cover story: 


Tn a recent experiment conducted at Harvard Uni- 
versity, trained psychologists extensively observed 
a group of eight male college students, These stu- 
dents were then rated on a number of personality 
traits. In the present study, we are interested in 
your ability to recall the personality characteristics 
of these eight students. 

During this experiment you will be shown a 
series of slides. Each slide will state how one of 
the eight students in the previous study had been 
rated on a personality trait. The student will be 
classified as having been rated either high or low 
on the trait. 


The subjects were told that, to help them recall 
the ratings, the slides would be shown in groups of 
16. They were further told that each group of 16 
Slides would be concerned with only two traits and 
would indicate how each of the eight students had 
been rated on these two traits. The two traits in each 
group of slides were, of course, one of the nine trait 
pairs selected from the initial questionnaire, 

The presentation of each group of slides was as 
follows: Each slide was shown on the projection 
screen for 5 seconds. After the 16 slides had been 
shown once, they were immediately shown in the 
same order a second time, again with 5-second ex- 
posures. 

Experimental manipulation. The trait ratings at- 
tributed to the eight “students” were constructed 
such that the two traits within each group of slides 


formed one of three actual correlations: positive, 
zero, or negative. When the actual correlation was 
positive, there was a perfect positive correlation 
between the two traits, Therefore, any student who 
was presented as rating high on one of the traits 
was presented as rating high on the second trait; 
and any student who had supposedly been rated low 
on one of the traits was presented as rating low on 
the other trait. When the actual correlation was 
zero, there was a zero correlation between the two 
traits. Therefore, a random relationship existed 
between a student’s supposed standing on the first 
trait and his standing on the second trait. Finally, 
when the actual correlation was negative, there was 
a perfect negative correlation between the two 
traits. Therefore, a student presented as ing low 
on one of the traits was presented as rating high on 
the other, 

While the actual correlation of the trait pairs 
varied, the frequency of high and low ratings within 
a group of slides was always equal; thus, on each 
trait, four students would be shown as rating high 
and the other four would be shown as rating low. 
To mask the actual correlation presented in each 
group of slides, the names and traits were shown in 
different random orders. 

Rating 1. Immediately following the presentation 
of a group of slides, the subjects were given a small 
booklet in which to recall the trait ratings of the 
eight students, Each page of the recall booklet had 
the name of one of the students and the two traits 
to be rated. A typical page of the recall booklet 
looked as follows: 


ON THIS PAGE RATE ALAN; 


O low 
0 low 


SHY * 
DOMINEERING 


O high 
O high 


The subjects were told to answer each page se- 
quentially. If, for any item, they were not sure of 
the correct answer, they were to make their best 
guess. Once a subject had gone on to another page, 
he or she was instructed not to return to earlier 
pages in the recall booklet. To avoid any effect for 
the order of names, the booklets for each of the 
trait pairs presented the students’ names in different 
random orders. 

Rating 2, After the 9 groups of 16 slides had been 
presented and the appropriate recall booklets filled 
out, the subjects were given a larger booklet with 
which to recall the ratings of all the traits that had 
been observed. At the top of each page of this 
booklet was the name of one of the eight students 
and below that were the 18 trait words. The trait 
pairs, however, were not grouped together but 
placed in a random order down the page. As in the 
first rating, subjects indicated ratings of high or 
low for each trait. In this way, ratings of all the 
traits were made for one stimulus person at a time. 
The subjects were once again instructed not to 
return to earlier pages of the booklet once they had 
gone on to a later page. 
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Experimental design. In the full experimental 
n, each of the nine trait pai is. presented 
equally often in each level of actual correlation. 
However, a single subject was presented with the 
nine trait pairs only once. Therefore, for any one 
subject, only one trait pair within each level of 
assumed correlation was presented in each level of 
actual correlation. To balance the design, the spe- 
cific pair assigned to a level of actual correlation 
was varied for different sets of subjects. In this 
way, the actual correlation of a particular trait pair 
was positive for 1/3, zero for 1/3, and negative 
for 1/3 of the subjects. 

The result of this assignment plan was that, for 
each subject, assumed correlation was crossed with 
actual correlation, Moreover, across all the subjects, 
trait pair was nested within assumed correlation. 

As a final feature of the design, half of the sub- 
jects saw the nine trait pairs in one order of pres- 
entation and the other half saw these trait pairs in 
the reverse order. 

Statistical analysis. Primary analysis of each of 
the two ratings made by the subjects involved a 
2X3X3X3 repeated-measures design. The two 
between-subject factors in this analysis of variance 
were the two orders in which the trait pairs were 
shown and the three different assignments of par- 
ticular trait pairs to an actual correlation level. 
Neither of these variables was of substantive in- 
terest in the present study, but each was introduced 
for the sake of improved experimental and statisti- 
cal control. The two within-subject factors were 
the three levels of actual correlation and the three 
levels of assumed correlation. Comparisons which 
were made across the two rating tasks required 
rating as an additional repeated-measures factor and 
thus involved a 2X 33X32 design, 

For all analyses involving the repeated-measures 
factors, orthogonal trends of main effects and inter- 
actions were tested by means of an appropriately 
partitioned error term (Winer, 1971, pp. 577-594), 


RESULTS 


Measures of association were calculated for 
each of the two rating tasks. The most basic 
of these measures was simply a count of the 
number of concordant responses a subject 
gave for a particular trait pair. A concordant 
response was recorded any time a subject 
rated one of the eight students either high on 
both traits or low on both traits. Thus, the 
number of concordant responses for a pair of 
traits could range from 8, indicating a perfect 
positive correlation between the two traits, 
to 0, indicating a perfect negative correlation, 

A measure such as concordant responses 
can be a misleading gauge of correlation if 
the frequencies of high and low ratings are 
highly skewed. However, inspection of the 
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Ficure 1, Mean number of concordant responses 
reported during the first rating task. 


marginal frequencies in the present ratings 
reveals few of these skewed distributions. 
Moreover, a parallel analysis with the phi mea- 
sure of association yields virtually identical 
results. 

Figure 1 presents the mean number of 
concordant responses reported by subjects 
during the first rating task. As might be 
expected if subjects were accurately reporting 
on the traits, the number of concordant re- 
sponses is lowest when the actual correlation 
of a trait pair is negative and highest when 
the actual correlation of a trait pair is posi- 
tive. However, it is also clear that the 
assumed correlation of a trait pair has a sys- 
tematic effect on the subjects’ rating. Thus, 
within each type of actual correlation, trait 
pairs assumed to be negatively correlated 
receive the lowest level of concordant re- 
sponses, trait pairs assumed to be uncorrelated 
receive a middle level, and trait pairs assumed 
to be positively correlated receive the highest 
level. Partitioning of the main effects for 
actual correlation and assumed correlation 
confirms that, in both cases, almost all the 
systematic variance between means is at- 
 tributable to the linear trend: for actual cor- 
relation, (1,24) = 103.09, p < .001; for as- 
~ sumed correlation, F(1, 24) = 46.68, p < .001. 
In addition, there is a significant Linear x 
Linear interaction between the actual corre- 
lation and the assumed correlation, F (1, 24) 
= 11.97, p < .005. 
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Figure 2 presents the mean number of 
concordant responses reported during the sec- 
ond rating task. As in Rating 1, the means 
for the three types of assumed correlation 
increase in an almost purely linear trend, F 
(1, 24) = 65.55, p <..001. Thus, the number 
of concordant responses is lowest for trait 
pairs assumed to be negatively correlated and 
highest for trait pairs assumed to be posi- 
tively correlated. However, in contrast to the 
first rating, the means for the three levels 
of actual correlation contain both a linear 
component, F(1,24) = 11.12, p < .005, and 
a nonlinear component, (1,24) = 5.72, p 
<.05. Moreover, in this second rating, the 
relative size of the two main effects is re- 
versed. That is, here, the assumed correlation 
of trait pairs has a greater influence on the 
rating than does the actual correlation of the 
trait pairs. Such results probably reflect the 
fact that the second rating occurred after 
some time lapse, thereby weakening the sub- 
jects’ recall of the actual correlation between 
traits. 

Because each trait pair was presented 
equally often in each level of actual correla- 
tion, the difference between the ratings of 
those trait pairs assumed to be positively cor- 
related and those trait pairs assumed to be 
negatively correlated provides one measure of 
correlational bias. In the present study, it is 
interesting to note that this measure of cor- 
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Ficure 2. Mean number of concordant responses 
reported during the second rating task. 
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relational bias, which corresponds to the 
linear effect of assumed correlation, is ap- 
proximately the same for Rating 1 (mean dif- 
ference = 1.6) and Rating 2 (mean difference 
= 1.9), Analysis, in fact, reveals that there 
is no significant change in this linear effect 
across the two rating tasks, F(1,24) = 1.39, 
p > .2, On the other hand, the linear effect 
@ of actual correlation is significantly lower for 
Rating 2 (mean difference = .8) than it is 
for Rating 1 (mean difference = 4.3), F 
(1, 24) = 62.80, p < .001. Thus, it appears 
that while the recall of the actual correlation 
of trait pairs may decrease over time, the 
degree of correlational bias remains relatively 
stable." 

In addition to the concordant response mea- 
sure, an accuracy score for each trait pair 
was calculated for each subject in both 
Rating 1 and Rating 2, This accuracy score 
indicates the percentage of responses in which 
the subject correctly identified the standing 
of a stimulus person on a trait. Because a 
pair of traits was rated for eight stimulus 
persons, the accuracy score for each trait pair 
is based on 16 responses. To avoid any con- 
fusion, it must be stressed that the accuracy 
score refers to accuracy in reporting the 
standing of each stimulus person on each of 
the two traits in a pair; the score does not 
netessarily indicate accuracy in reporting the 
level of association between the two traits. In 
fact, it is possible that a subject could be 
perfectly accurate in reporting association 
but perfectly inaccurate in reporting the 
individual trait scores. 

The mean level of the accuracy measure 
is 76.0% for Rating 1 and 57.0% for Rat- 
ing 2. Both of these percentages are signifi- 
cently higher than the chance level of 50% 
accuracy: for Rating 1, F(1, 24) = 158.99, 
p< .001; for Rating 2, F(1, 24) = 25.84, 
p < .001. Analysis across both rating tasks 
indicates that the mean for the second rating 
is significantly lower than that of the first rat- 
ing, F(1, 24) = 130.98, p < .001. These fnd- 
ings suggest that some degree of accurate re- 
call occurs during each of the ratings and, as 
might be predicted, that the level of accuracy 
decreases over time. 
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One question of interest is the relationship 
between a subject’s accuracy and the size of 
any correlational bias exhibited in his or her 
ratings. To test such a relationship, a corre- 
lational bias score and an accuracy score were 
calculated in each rating for each of the 30 
subjects. The correlational bias score for a 
subject was computed by taking the mean 
number of concordant responses the subject 
had given to trait pairs assumed to be nega- 
tively correlated and subtracting this from 
the mean number of concordant responses the 
subject had given to trait pairs assumed to be 
positively correlated. The accuracy score for 
a subject was simply the average of the nine 
accuracy scores the subject had obtained in 
rating the nine trait pairs. ; 

The correlations among the accuracy and 
correlational bias measures, both within and 
between ratings, are presented in Table 2. 
Surprisingly, the table reveals a significant 
positive relationship between accuracy and 
correlational bias, both for Rating 1 and for 
Rating 2. It appears that those subjects who 
recall the information most accurately for a 
given rating are also those most prone to ex- 
hibit strong correlational bias in the rating. 
This positive correlation, however, does not 
appear to be due to a stable factor within 
the rater. That is, although the degree of 
accuracy on the first rating task is positively 
correlated with accuracy on the second rating 
task, there is no significant correlation be- 
tween accuracy on Rating 1 and correlational 
bias on Rating 2 or, conversely, between 
accuracy on Rating 2 and correlational bias 
on Rating 1. Moreover, there is no relation- 
ship between the size of a subject’s correla- 


1 A common strategy advocated for eliminating cor- 
relational bias is to average the ratings of a number 
of observers and then use these mean ratings to 
compute correlations between the events, or behav- 
iors, on the rating scale. To test this technique em- 
pirically, mean scores for the present data were 
calculated by summing across groups of five sub- 
jects. Analysis of the Pearson correlations derived 
from these mean ratings reveals that a strong and 
significant linear effect for assumed correlation per- 
sists both in Rating 1 and in Rating 2. Moreover, 
as with the individual ratings, the size of this cor- 
relational bias remains constant across the two 
ratings tasks. 
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TABLE 2 


CORRELATIONS AMONG ACCURACY AND CORRELATIONAL 
Bias ON THE Two RATING TASKS 


Rating 1 Rating 2 
Correla- Correla- 
tional tional 
Accuracy bias Accuracy bias 
Rating 1 
Accuracy 1.00 
Correlational bias 1678 1,00 
Rating 2 
Accuracy AT .23 1,00 
Correlational bias Js 08 43" 1.00 


Note, All correlations are based on 30 observations. 
*p < 05. 
* p < 01. 

wD < 001. 


tional bias on Rating 1 and his or her cor- 
relational bias on Rating 2. 


Discussion 


In 1945 Johnson pointed out that while the 
existence of a halo effect in ratings was gen- 
erally accepted, there had been, up to that 
time, no experimental manipulation of the 
phenomenon. Belief in a correlational bias 
was only inferred from circumstantial evi- 
dence such as differences in the correlations 
of immediate and recalled ratings or correla- 
tions which were suspiciously high. After an 
unsuccessful attempt to provide such an ex- 
perimental demonstration, Johnson was once 
again forced to conclude in 1963 that no 
experimental validation of the phenomenon 
existed. Since then, the evidence of 
bias in ,ersonality ratings supplied by such 
ré s as Mulaik (1964), D’Andrade 

id Passini and Norman (1966) has 

} suggestive but still not experimen- 
ta ‘led. And although the studies by 
Ch: | Chapman (1967, 1969) provide 
> of experimental manipulation 
yal bias, these studies are pat- 
clinical assessment model and, 


yet 


ternec i 


thus, idely from a typical rating pro- 
cedure. In Us way, the results of the present 
study probably represent the clearest experi- 


mental manipulation of correlational bias in 
an observer rating situation. 

It is, in fact, clear in the present study that 
correlational bias had a strong and systematic 
effect on both the immediate recall of the 
first rating and the later recall of the second 
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rating. In both cases, the correlations derived 
from these trait ratings were affected by im- 
plicit trait assumptions. Thus, within each 
level of actual correlation, those traits that 
are assumed to co-occur produced the most 
positive correlations, those traits assumed to 
have no relationship with one another resulted 
in an intermediate degree of correlation, and 
those traits thought to be unlikely to co-occur 
exhibited the most negative correlations. 
Moreover, these biasing effects persisted even 
when correlations for each rating task were 
computed from the mean ratings of a number 
of observers. 

A particularly striking finding in the ex- 
periment was the constancy of the correla- 
tional bias, Although there was a small inter- 
action in the first rating task, the ratings 
generally appeared to be an additive function 
of the true correlation plus a constant influ- 
ence of the assumed relationship between the 
traits. Such a constant, additive effect casts 
serious doubts on Campbell and O’Connell’s 
(1967) argument that correlational bias oper- 
ates in a multiplicative fashion. It should be 
further noted that the degree of bias for a 
pair of traits was not only constant across 
the levels of actual correlation but also across 
the ratings made at different points in time. 
Of course, some care must be exercised in 
interpreting the constancy across ratings be- 
cause the two rating tasks were not identical 
and the time span between them was not 
extremely long. 

In addition to HS constancy, the sturdiness 
of the correlational \bias was also striking in 
the present design. For example, the effect of 
the implicit trait assumptions was readily 
apparent even during the first rating task. 
What makes this outcome quite dramatic is 
that this first rating was made almost im- 
mediately after the subjects had viewed the 
evidence for a pair of traits. There was there- 
fore little time for the “memory drift,” which 
some writers (e.g., D’Andrade, 1974) have 
suggested might be necessary to produce cor- 
relational bias. Also impressive is the fact 
that bias occurred in this study even though 
the subjects were presented with trait names 
and were not asked to infer traits from be- 


haviors. The persistence of a correlational 
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bias under these conditions undermines any 
contention that correlational bias arises only 
when raters are required to make inferences 
from specific behaviors to general trait, or 
behavior, categories, 

Probably the most surprising outcome in 
the experiment was the relationship found to 
exist between a subject’s accuracy and his 
or her correlational bias. That is, the results 
indicated that the amount of correlational 
bias exhibited by any one subject on a rating 
was positively correlated with that subject’s 
level of accuracy on the rating. This positive 
correlation between accuracy and correla- 
tional bias was significant both within Rating 1 
and within Rating 2. On the other hand, a 
subject’s accuracy on one of the ratings did 


4 not predict the amount of correlational bias 


which the same subject exhibited on the other 
rating. Moreover, the relative amount of cor- 
relational bias exhibited by any one subject 
owas not constant across ratings. Such findings 
suggest that the positive correlation between 
accuracy and bias does not result from a 
stable attribute within the rater; rather, it 
may be due to some transitory factor such as 

, concentration or attention, The psychologist 
could, therefore, be faced with the paradox- 
ical situation that when observers are trying 
their hardest and are most accurate, they are 
also exhibiting the greatest correlational bias. 
The implications of this study, as well as 
other research on correlational bias, prob- 
« ably extend beyond the situation of observer 
.tatings. For example, some researchers (e.g., 
*Lay & Jackson, 1969; Strickler, Jacobs, & 
Kogan, 1974) have shown close structural 
similarities between responses to personality 
inventories, such as the Minnesota Multi- 
ı phasic Personality Inventory, and implicit 
assumptions that people have concerning the 
co-occurrence of items on these inventories. 
The researchers in these studies have argued 
that because implicit assumptions correspond 
to self-ratings, the implicit assumptions pos- 
sess a high degree of validity. However, it 
seems just as logical to view these results in 
the opposite manner. That is, correlational 
bias may be operating on self-ratings, and 
this is why these ratings tend to be similar 
to assumed judgments. In fact, from the 
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viewpoint of Bem’s (1972) self-perception 
hypothesis, it makes perfect sense that biases 
operating when a person views others will 
also function when a person views his or her 
own actions. If similar types of correlational 
bias do occur in self-report ratings, then an 
enormous amount of correlational information 
which has been gathered from inventories and 
other self-rating methods will have to be 
reassessed, 

Confronted with the possibility of correla- 
tional bias in ratings, the potential researcher 
has a number of options. First, he or she 
may simply ignore the problem, hoping that 
the degree of distortion is relatively minor. 
Such a course has, in fact, been advocated by 
some psychologists. For example, Cattell and 
Digman (1964), as well as others (e.g., Lay 
& Jackson, 1969; Stricker et al., 1974), have 
argued that correlational bias does not pose 
any grave threat because it probably mirrors 
true correlations in the world, In support of 
their position, these researchers cite the close 
structural similarity between implicit assump- 
tions and actual ratings, Unfortunately, the 
fact that personality factor structures have 
been almost perfectly duplicated from ratings 
of implicit assumptions could very well indi- 
cate that observer ratings are almost com- 
pletely a product of bias. If the researcher is 
truly interested in the relationships between 
items that are being rated, then it would 
seem unwise to simply ignore this problem, 

A second strategy of the researcher would 
be to minimize the task confronting the rater 
or avoid the use of human observers alto- 
gether, In fact, much valuable information 
can be gathered by means of highly me- 
chanical coding (e.g., Matarazzo, Saslow, & 
Matarazzo, 1956) or through the use of 
various unobtrusive measures (e.g, Webb, 
Campbell, Schwartz, & Sechrest, 1966), By 
avoiding correlational bias in these ways, 
however, the researcher is also forfeiting the 
integrative and multicategory observational 
powers which have always made the human 
rater an attractive measuring instrument in 
the social sciences. 

For the researcher who wishes to retain the 
powers afforded by human observers, the re- 
sults of the present experiment do suggest a 
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third course of action. The researcher could, 
for instance, videotape actors performing the 
various behaviors contained in a rating scale. 
These videotapes could then be presented to 
observers in such a manner that there would 
be no correlation between the performances 
of one behavior and any of the other behav- 
iors. Any correlations that are found in 
ratings under these conditions would there- 
fore represent bias. The present findings sug- 
gest that this type of correlational bias 
is constant whatever the true correlation, so 
the bias values from this first study could be 
used to correct ratings from subsequent 
studies. Of course, this type of calibration 
technique would require a large amount of 
effort and might be specific not only for each 
type of rating scale but also for each situa- 
ation in which the rating scale is used. More- 
over, certain scaling issues would remain to 
be solved, However, this calibration procedure 
would provide one way of salvaging correla- 
tional information from observer ratings. 

Recognition of the pervasive nature of cor- 
relational bias, as well as the considerable 
effort necessary to correct for it, should not 
lead to undue pessimism. For example, one 
alternative always left open to the researcher 
is to reinterpret the meaning of the data that 
are obtained from observer ratings. Although 
a rating may produce distorted information 
about the actual behaviors being observed, it 
certainly provides a clear picture of how 
people view others. Such knowledge of sub- 
jective perceptions is of central concern to 
a large number of psychological issues 
ranging from person perception to human in- 
formation processing. McGuire (1969) has 
quite perceptively pointed out that “one 
man’s artifact may be another man’s main 
effect” (p. 13). Uncorrected ratings can 
clearly remain of special value as long as the 
researcher carefully delineates the meaning 
and limitations of the information generated 
by this method. Indeed, perhaps we psy- 
chologists are fortunate that in examining the 
problems of one of our most powerful meth- 
ods, we may also be exploring substantive 
topics. 
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i 
This study examines how adult warmth affects reflective and impulsive chil- 
dren’s imitative donation and rescue effort (willingness to call for help for a 
distressed peer). First-grade males’ reflectivity or impulsivity was measured on 
Kagan’s Matching Familiar Figures Test. Later, children interacted with either 
a noncontingently high- or low-warmth adult prior to helping opportunities. 
Half of the children participated in a 2X 2 design to assess the effect of 
warmth and conceptual tempo on imitative donation, and all children partici- 
pated in a 2X 2X2 design to assess the effect of these variables and prior 
donation on rescue effort. Results indicate that (a) high as compared to low 
warmth decreased imitative donation but increased rescue effort, (b) impulsive 
boys tended to be more genercus than reflective boys, and (c) donation and 
rescue effort were not related, 
Interest in children’s helping behavior has entially respond to affect inductions and sub- 
primarily focused on the elicitation of behav- sequent helping opportunities, 
iors in donation opportunity and emergency In the literature concerning the effect of 
situations. These helping opportunities, how- nurturance on sharing and donation, there 
ever, are different with regard to cost to the have been no studies that have shown nur- 
actor and urgency of the needed help. It turance to increase imitative donation behay- © 
would seem important to study the anteced- ior (Grusec, 1971; Grusec & Skubiski, 1970; 
A ; 
ents of donation and rescue behavior to Rosenhan & White, 1967). Nurturance in 
assess whether similar factors encourage both these studies has been most frequently opera- 
types of behavior or whether donation and tionalized as 5-10 minutes with an experi- 
rescue are encouraged by different antece- menter who is noncontingently either warm 
dent variables. Of interest in the present study friendly, and interactive (high warmth) a 
is the effect of noncontingent warmth on aloof, supervisory, and task oriented (low 
these different modes of helping behaviors. warmth). Rather, the findings suggest that 
Additionally, it is of interest to know whether high noncontingent warmth seems to encour- 
children of varying cognitive style will differ- age self-gratification. ; 
; These findings are in contrast to predictions 
5 eg or the effect of nurtur: imitati 
This study is based on a doctoral dissertation sub- havior made b tat ee imitative be- 
mitted to the Graduate School of Northwestern Uni- M y certain theories (Freud, 1933; 
versity. The author wishes to thank James H. “s0WFer, 1950). They are also in contrast to 
Bryan, Lee Sechrest, Richard Bootzin, Peter Herman, results of investigations that assess the ef- 
and Philip Brickman for their suggestions concern- fect of adult warmth on the imitation of 
ing this study and Jean Slater for her assistance as neutral non-self-denial behavi T a 
experimenter. The cooperative participation of ad- PA ation h fi ayore hese in- 
Ministrative personnel, faculty, and students in 8 > Hae ound that high warmth in- 
creases the imitation of neutral behaviors! 


wWanston School District #65 and Morton Grove 
dool District #70 is also appreciated. This study 
Was supported in part by National Institute of Child 
Health and Development Grant #03234 to James H. 
Bryan, 

vatussi for reprints should be sent to Carol S. 
a, Department of Psychology, The Ameri- 

niversity, Washington, D.C. 20016. 
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(Bandura & Huston, 1961; Bandura, Grusec 
& Menlove, 1967; Mischel & Grusee, 1966), 
To explain the varying effect of warmth ra 
different behaviors, Grusec (1971) suggests 
that although the secondary reinforcing as- 
pects of imitating a warm model’s behavior 
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may encourage imitation of neutral behaviors, 
for high-cost behaviors such as self-denial, 
the secondary reinforcement for imitation is 
not powerful enough to offset the cost of 
performing a self-denial act, Further, a 
model’s noncontingent warmth may decrease 
self-denial by enabling a child to feel at ease 
in the environment, thus disinhibiting self- 
gratification (Bandura et al., 1967). 

High warmth may, however, facilitate help- 
ing in the rescue situation. The feeling of ac- 
ceptance and well-being that a high-warmth 
interaction induces may enable the child to 
help in the usually unexpected and ambiguous 
rescue situation, Staub’s (1970, 1971a, 1971b, 
1973) work suggests that concern about in- 
curring criticism and lack of clear standards 
for behavior may operate to inhibit children’s 
helpfulness in an unexpected rescue situation, 
Children in these situations indicate that the 
reason they do not rescue is because they are 
afraid they might break rules (Staub, 1970). 
Moreover, children are more willing to help 
a distressed child when they are given ex- 
plicit permission to enter an adjoining room 
than when they are given no information or 
prohibitions about entering the room (Staub, 
1971b), Having a feeling of acceptance and 
self-confidence in the environment may be 
conducive to overcoming perceived environ- 
mental sanctions and self-distress in a sud- 
den emergency. 

Thus, an adult’s high noncontingent warmth 
may operate in a donation opportunity, in 
part, to give children feelings of acceptance 
and to decrease children’s inhibitions about 
self-gratification, thus decreasing charitabil- 
ity. An adult’s low warmth, in contrast, may 
focus the child on either a search for stan- 
dards or the need to avoid criticism. In the 
rescue opportunity, high warmth may enable 
the child to take action, thus overcoming 
distress and self-concern. Low warmth, in 
contrast, would not tend to disinhibit the 
child but rather increase his or her concern 
about appropriate behavior. 

Little is known about the cognitions in- 
volved in helping behavior. While investiga- 
tors typically use models to elicit imitation 
and experimenters to influence altruistic be- 
havior, little attention is given to the process 
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involved in the child’s consideration or eval- 
uation of these individuals. Although it is not 
possible to tap the content of ongoing cog- 
nitions, at least the thinking process seems 
an important variable for study. The primary 
subject variable under investigation in this 
study is conceptual tempo (Kagan, 1965). 
This dimension refers to children’s tendencies 
to be either reflective, pausing to consider re- 
sponse alternatives, or impulsive, not pausing 
to consider them, prior to responding. Kagan 
and Kogan (1970) also suggest that reflec- 
tives may be more concerned about making 
mistakes. Thus, reflectives may feel greater 
self-concern, which may impede the empa- 
thetic and helping process. In contrast, im- 
pulsive children hypothetically may not fear 
making mistakes, may or may not empathize, 
but do not spend time thinking about alter- 
native modes of action. From these hypothe- 
ses, it could be predicted that high warmth 
should thus affect the reflective’s response 
tendencies by making him or her less con- 
cerned about external criticism, whereas high 
warmth will have little effect on impulsives’ 
response tendencies. In contrast, low warmth 
should influence the reflective child to be more 
concerned about external sanctions, while 
probably having little effect on the impulsive 
child. 

Two hypotheses were thus made with re- 
gard to the present study. First, experimenter 
warmth would differentially influence altruis- 
tic behavior of different modality, specifically 
that warmth would decrease donation behav- 
ior and increase aiding behavior (Bryan, 
1972). Second, individuals of differing re- 
sponse style would respond differently to the 
warmth induction and to helping opportu- 
nities. Specifically, it was hypothesized that 
in the high-warmth condition, reflective chil- 
dren would feel more free to do what they 
felt like doing. They would, thus, donate less 
and call for aid more than reflectives in the 
low-warmth conditions. In contrast, with re- 
gard to both donation and rescue behavior, 
it was hypothesized that because impulsive 
children do not presumably pause to think 
about response consequences, the warmth in- 
duction would not as greatly affect their help- 
ing responses, Thus, their helping behavior 
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would be similar in high- and low-warmth 
conditions. It was also of interest to deter- 
mine whether helping in a donation situation 
would be related to making a rescue effort. 


METHOD 


Participants and Experimenters 


Participants were 72 white first-grade males from 
two Chicago suburban school districts. Two females 
participated in this study as experimenters. Experi- 
menter 1 administered the Matching Familiar Fig- 
ures Test (MFFT) (Phase 1), escorted children to 
and from the trailer for the experimental session, 
and administered the final postrescue questionnaire. 
Experimenter 2, who was blind to the conceptual- 
tempo condition of the subjects and to the experi- 
mental hypotheses, administered the warmth induc- 
tion (Phase 2), the donation sequence (Phase 3), 
and the rescue rationale (Phase 4). 


Procedure 


Design. The present study was a 2X2 factorial 
design to study the effect of warmth and conceptual 
tempo on donation. Thirty-six males participated 
in this aspect of the study. These subjects and an 
additional 36 subjects who had not participated in 
the donation study took part in the 2X 2X2 fac- 
torial design to assess the effect of warmth, con- 
ceptual tempo, and prior donation on rescue effort. 

Apparatus. The experimental aspects of this study 
(Phases 2, 3, and 4) were conducted in a research 
trailer parked on school grounds. The experimental 
room contained a bowling game similar to that em- 
ployed by Rosenhan and White (1967) and the fol- 
lowing toys: a Peanuts book, two marble games, a 
puzzle, dominoes, a toy car, and a toy airplane. 
Also in the room were a March of Dimes cannister 
containing six pennies and a “my money” cannister 
for subjects’ retained winnings. Directly above the 
cannisters was a poster depicting a crippled child 
with the caption, “Fight Birth Defects—Join the 
March of Dimes.” The experimental room contained 
a Panasonic videotape recorder and a Concord VTR 
monitor with a 10 X 8-inch screen to be used in 
the donation sequence; an adjacent room contained a 
Wollensak tape recorder. Subjects’ calling for aid 
‘sponse latencies were recorded by 
rning on an Esterline timer with 


and/or rescue 


simultaneously 
the onset of the ‘-ped simulation of distress sounds 
in the rgency si uation. 

Ph ‘. Phase 1 was administered to each sub- 
ject in a room in the child’s school 
building onceptual- tempo task, the MFFT, and 
questions ning fariily size, teacher, and school 
were administered. Chil'ren were divided by me- 
dian split of total response latencies to 12 MFFT 
items into reflective and impulsive groups. The total 
sample median w 1 seconds; the median for the 


134 seconds, and the median 


reflective group 
vas 63 seconds. A total of 


for the impulsive oup 
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115 first-grade males were given the MFFT. How- 
ever, because of the proportional assignment of sub- 
jects by school to experimental conditions, 72 males 
were used. 

Phase 2. Phase 2 involved the warmth manipula- 
tion. Entering the trailer, children encountered a 
high-warmth or a low-warmth adult. The high- 
warmth adult was interested, friendly, verbal, and 
interactive with the child. In contrast, the low- 
warmth adult remained aloof, uninterested, and 
supervisory toward the child. Tone of voice differed 
in the two conditions: The high-warmth adult’s 
tone of voice was pleasant and warm, whereas the 
low-warmth adult’s voice was matter-of-fact and 
cold. In addition, in the high-warmth condition, the 
adult was careful not to confound sharing or help- 
ing behavior with the high-warmth manipulation. 
As the child entered the trailer, he was told that 
he was there to test toys for a toy company and 
that although Experimenter 2 had some work to 
do, (a) she “would rather play with him instead” 
(high warmth) or (b) she “would sit here and read” 
and that the child should “go ahead and play, but 
play quietly” so as not to disturb her (low warmth). 

Phase 3. Phase 3, which began after 9-10 minutes 
of Phase 2, involved the donation sequence. A tele- 
vision monitor was placed on a stand in front of the 
child, who was told the following: 


I would like you to test out a new bowling game. 
Watch the television show of me playing the game 
awhile back to see how the game is played. You 
will have a chance to win money when you play 
just like I did. There is a campaign going on now 
to collect money for crippled children who really 
need help—you'll see it going on when I played 
the game too. You will also see, just like these 
cans [points to each can] where I could keep some 
of my winnings or give some of my money or 
all of my money to crippled children, Now watch 
the film so you will know how to play. 


A film was then shown on a videotape monitor; 
the Same film was shown in all conditions, The film 
depicted the adult playing six trials on the bowling 
game, making charitable exhortations before each of 
three winning trials, and donating all of her money 
to the crippled children’s can each time she scored 
20 on the bowling game. This film, with very salient 
reminders of oR good it is to give to charity, was 
so constructed to elici i 
Kae t some charity by most 

After the film, Experimenter 2 asked the children 
to repeat the game instructions to ensure that they 
knew how to throw the ball and that a score of 20 
signified a win. The bowling game was prepro- 
gramed so that children bowled 10 trials, winnin 
on 5 randomly predetermined trials. Eapen ie 
2 told the children that she was leaving so that 
she could do some work and (a) that she had “reall; 
enjoyed [their] time together and hoped that ithe 
on sre A too” (high warmth), or (b) sim- 
ply that she had “more work insi ildi. 
PRERE E inside the building™ 
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She then repeated the instructions concerning chil- 
dren’s options when they won money: “You can 
put the pennies in the ‘my money’ can or can put 
some or all of them in the crippled children’s can. 
When you are all finished with the game, you can 
take the money out of the ‘my money’ can and put 
it in your pocket.” She then indicated that she 
would be back in a while. 

Phase 4. Phase 4 involved the toy questionnaire 
and the rescue sequence. Phase 4 began when Ex- 
perimenter 2 returned to the room 20 seconds after 
“the bowling game was over. To continue the ra- 
tionale for being in the trailer, in all conditions she 
asked children three “dummy” questions about the 
toys they had played with. At the same time, she 
also gave children a rationale for the rescue se- 
quence. The instructions given to children were es- 
sentially the same in both high-warmth and low- 
warmth conditions. The adult’s tone of voice differ- 
entiated the low-warmth and high-warmth condi- 
tions, The instructions were as follows; 


Now I'd like to ask you three questions. After 
each question give your answer to me and I will 
write it down. There is another child through that 
door who is playing with the same toys that you 
did. I hope he listened to me and doesn’t fool 
around in there because he could tip a cabinet 
over, Anyway, we should talk quietly, because 
he'll have to give his own opinions after the game. 
[The experimenter asks the following three ques- 
tions: “Which toy did you like best?” “Would you 
like someone to buy it for you?” “Do you think 
other kids would like it?”] I have to go back 
into the school building to do some work again. 
But you will have a chance, after I leave, to 
play with a surprise toy in that bag. When you're 
done, the child in the next room will have a 
chance to answer the same questions as you did 
and look at what’s inside the box. My assistant 
will be back to take you to class in a little while, 
but there’s a buzzer over there that will call my 
assistant, the lady who brought you in [Experi- 
menter 1], back to the trailer. Why don’t you 
go over there now and show me that you know 
how to work it. . Here is the toy [takes it 
out of the bag, shows it to the child, indicating 
how the ball moves along the rods, and replaces 
it in the bag.] My assistant will be back for you 
in a little while. 


After these instructions, she left the room, exiting 
from the trailer. Fifteen seconds after she left, the 
sound-of-distress tape was turned on. The tape was 
a 60-second simulation of a crashing cabinet and the 
subsequent crying and pleas for help of a little boy. 
A timer was turned on simultaneously with the tape 
recording and was automatically terminated when 
children pressed on the buzzer for the assistant to 
return. If a child tried the door to the adjoining 
room, rather than pressing the buzzer to get help, 
the timer was turned off manually when the child 
turned the door handle. 
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TABLE 1 
Means, STANDARD Deviations, AND MEDIANS OF 
PROPORTION or Money DONATED BY SUBJECTS IN 
WARMTH AND CONCEPTUAL TEMPO 
CONDITIONS 


Experimenter behavior 


High warmth 


M 348 522 

SD 252 360 

Màn 40 50 
Low warmth 

M 539 -696 

SD 300 193 

Mdn 40 80 


If the child pressed the buzzer, Experimenter 1 
waited 20 seconds before entering the experimental 
room, and upon entrance to the room she said, 
“You called?” and recorded the child’s response. If 
the child failed to press the buzzer, Experimenter 
1 entered the experimental room 120 seconds fol- 
lowing the termination of the tape. Subsequently, 
those children who called for aid and those who 
did not were verbally reassured as to the other 
child's well-being and were given a postexperiment 
interview. The postexperiment interview was de- 
signed to assess the validity of the experimental 
manipulations (eg, “How happy were you while 
you were in the trailer?”), reactions to the rescue 
sequence (eg, “How afraid were you when you 
heard the sounds?”), concern about the victim (e.g., 
“How sorry were you for the other child?”), and 
knowledge of the procedure for calling Experimenter 
1 to the experimental room, 


RESULTS 


The Effect of Warmth and Conceptual 
Tempo on Imitative Donation 


Two measures were used to assess the ef- 
fects of warmth and conceptual tempo on 
donation behavior: amount of money donated 
by each child and number of trials in which 
each child donated. Occasionally, children did 
not see a winning score on the bowling game 
and did not take a stack of pennies on a trial 
or took more than one stack of pennies on a 
winning trial. Thus, raw data for trials and 
amount donated were converted to propor- 
tions and subsequently transformed to their 
arc-sine equivalents. Since trials and amount 
donated were highly correlated (rw = .91), 
the focus of the donation analysis was on the 
amount of money children donated. The 
means, standard deviations, and medians of 
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TABLE 2 


Means, STANDARD DEVIATIONS, AND MEDIANS 
FOR Supyects’ RESCUE LATENCIES IN 
WARMTH, CONCEPTUAL TEMPO, AND 
EXPERIMENT CONDITIONS 


Condition 
Donation and rescue Rescue only 
Subject 
type High Low High Low 
Reflective 
M 79.440 111.000 80.778 148.788 
SD 76.410 82.009 75.120 62.269 
Mdn 43 180 50 180 
Impulsive 
M 100.670 105.556 113.556 142.220 
SD 75.850 75.120 79.144 64.009 
Màn 54 109 48 180 


Note. High and low refer to degree of warmth shown by ex- 
perimenter. 


the proportion of money donated by children 
in each condition are reported in Table 1. 

A two-way analysis of variance assessing 
the effect of warmth and conceptual tempo 
on the arc sines of the proportion of money 
that children donated revealed marginal ef- 
fects for warmth, F(1, 32) =3.44, P< 07, 
and conceptual tempo, /(1, 32) = 2st 
11. While warmth decreased donation as pre- 
dicted, the expected interaction of the effects 
of warmth and conceptual tempo on donation 
was not found. 

Because the data were skewed and the dis- 
tribution truncated, a nonparametric anal- 
ysis, a median test, was performed on the 
amount of money children donated. The me- 
dian for donating was higher in low-warmth 
conditions than in high-warmth conditions, 

4.1, p<.05, thus supporting the 

hat low warmth increases donations. 
test also indicated that the median 
yns of impulsive children was higher 
nedian for donations of reflective 
(1) = 4.76, $ < .03. 


oct oj Warmth, Conceptual Tempo, 
viment on Rescue Behavior 


chavioral measure used to assess the 


effects of warmth, c aceptual tempo, and 

prior on on rescue behavior was latency 
of r 80 secot 

I data w skewed and the dis- 

in ition to 


tri ited, ins, H 
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means and standard deviations, are reported 
in Table 2. An analysis of variance of the ef- 
fect of warmth, conceptual tempo, and prior 
donation on latency to rescue yielded one 
main effect, that of high warmth affecting 
shorter rescue effort latency, F(1, 64) = 3.81, 
p < .056. Although the means ordered as pre- 
dicted in warmth-conceptual tempo groups, 
there was no significant interaction of 
Warmth Xx Conceptual Tempo. 

Because the data were skewed and reflected 
a truncated distribution, median tests were 
performed on the rescue effort data. The me- 
dian for rescue latency is higher in low- 
warmth groups than in high-warmth groups, 
y2(1) = 4.558, p < 05. The medians for re- 
flective and impulsive groups, (1) = 111, 
p> 1, and the medians for donation plus 
rescue versus rescue alone groups, xX) = 
1.715, p > .2, did not differ. 

The frequency of rescue effort by children 
in each condition found in Table 3 corrobo- 
rate the above analyses; children in high- 
warmth conditions called for help more fre- 
quently than children in low-warmth condi- 
tions, x?(1) = 3.92, p< 05. 

Of interest in the present investigation was 
the relationship between children’s donation 
behavior and rescue effort behavior. No sig- 
nificant relationship was found between these 
two modes of helping (r = .—194, p > .05). 


Attitudes and Reactions to Experimental 
Manipulations 


Children’s reactions to the experimental 
situations and the effectiveness of the experi- 


TABLE 3 


PROPORTION OF SUBJECTS IN WARMTH, CONCEPTUAL 
‘Tempo, AND EXPERIMENT CONDITIONS 
WnHo RESCUED 


: Experim iti 

Huperimenter Bee periment condition 

havior and subject 
type 


Donation and 


rescue Rescue only 


High warmth 
Reflective 


-666 
Impulsive {6) stad a 


+555 (5) 

Low warmth 
Reflective 
Impulsive 


Aad (4) 
555 (5) 


Note. Numbers within parentheses indicate frequencies. 
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mental manipulations were assessed in a post- 
experimental questionnaire. Items were scored 
on a 4-point Likert-type continuum. Scores 
on five items dealing with subjects’ liking 
of the experimenter (e.g., “How much did 
you like the other lady?”) and their affect 
during the experiment (e.g, “How happy 
were you?”’) were highly intercorrelated (e.g., 
r = 333, p < .01) and were summed to form 
a total affect score. Similarly, scores on three 
items dealing with children’s concern for the 
victim (e.g., “How sorry were you for the 
other child?”) were highly intercorrelated and 
summed to form a total concern score. 

On the total affect variable, the analysis 
of variance yielded a main effect of warmth, 
F(1, 64) = 13.67, p < .001, with means indi- 
cating that children in high-warmth condi- 
tions were happier (M = 16.33) during their 
stay in the trailer and enjoyed Experimenter 
2 more than children in low-warmth condi- 
tions (M = 13.95). It was also found that 
impulsive children (M = 14.30) tended to be 
unhappier than reflective children (M = 
15.69), F(1, 64) = 3.46, p < .07. Of interest 
was the relationship between children’s re- 
sponse to “how much they would like to see 
the experimenter again” and donation behav- 
ior (r = —.439, p < .01) and the total affect 
score with donation behavior (r = —.288, .05 < 
p < 10). Thus, while the warmth manipula- 
tion was effective, the results also indicate 
that generosity was negatively related to chil- 
dren’s happiness in the experiment and good 
feelings about Experimenter 2. 

In contrast to the behavioral finding of high 
warmth increasing rescue effort behavior, the 
analysis of variance of children’s scores on 
the total concern items indicated that low- 
warmth subjects (M = 8.47) report greater 
concern for the distressed victim than high- 
warmth subjects (M = 7.12), F(1,64) = 
6.989, p < .01. This finding cannot be the re- 
sult of different information concerning the 
welfare of the other child because all chil- 
dren were reassured about the victim’s well- 
being at the beginning of the posttest. No 
significant relationship was found between the 
total concern sum score and rescue effort 
latency. There was, however, a significant re- 
lationship between how fearful children re- 
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ported they were (e.g., “How afraid were you 
when you heard the sounds?”) and how 
quickly they rescued (r = —.480, p <.001). 
Thus, fearfulness was related to short rescue 
latency. 

One interesting individual difference 
emerged from the questionnaire data. While 
children from large families tended to be 
more generous (r= .348, p<.05), they 
tended to rescue with longer latency (r= 
.248, p < .05). 


Discussion 


The present study gives some support to 
the first hypothesis, that high noncontingent 
warmth decreases imitative donation but 
tends to increase children’s willingness to call 
for help. The data did not support the sec- 
ond hypothesis, which predicted a Warmth Xx 
Conceptual Tempo interaction for both dona- 
tion and aiding behavior. Instead it was found 
that (a) impulsive children were more gen- 
erous than reflective children and (b) al- 
though the means for conceptual tempo/ 
warmth groups ordered as predicted, the 
interactions were not significant. 

Children in this and other studies (Grusec, 
1971; Grusec & Skubiski, 1970; Rosenhan & 
White, 1967) who were exposed to a high- 
as compared with a low-warmth model did 
not donate more money. Children in the high- 
warmth condition, however, reported greater 
happiness and liking of the experimenter than 
children who had been exposed to a low- 
warmth interaction. Perhaps the experimen- 
ter/model’s noncontingent high warmth sug- 
gests to the child that any of the child’s 
behaviors are acceptable to her and that the 
child does not have to be concerned with con- 
forming to standards. The low-warmth inter- 
action, in contrast, may have signaled to the 
child that few of his behaviors are acceptable, 
thus making him less happy and perhaps more 
concerned about performing according to 
standards. High warmth, then, may have 
given the child permission to nonimitatively 
self-gratify in a way that the low-warmth 
interaction could not possibly have given. 

Thus, the effect of a model’s noncontingent 
warmth to the child seems to influence self- 
denial behavior quite differently than when 
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the behavior to be performed is a neutral 
nondenial behavior or when the model dis- 
plays positive high affect which is contin- 
gent on performing an altruistic act (Aron- 
freed, 1970; Bryan, 1972; Midlarsky & 
Bryan, 1967). In these types of situations, 
the child imitates altruistic behavior pre- 
sumably because of the strong association 
between the contingency of the model’s affect 
and altruistic behavior. In the present study, 
the child, because of the noncontingent na- 
ture of the model’s warmth, probably inter- 
preted his own behavior in terms of accepta- 
bility or nonacceptability. The finding that 
children who donated more did not want to 
see Experimenter 2 again (r = .44) suggests 
that behaving like Experimenter 2 or seeking 
her approval to “feel good” was not a power- 
ful motivator. Children who imitatively do- 
nated more liked the model less and perhaps 
even feared her criticism if they did not use 


= her own behavior as a standard. These in- 


ferences, although suggested by the data, 
must, however, be independently tested. 

The aiding in an emergency situation dif- 
fers from the donation opportunity because 
of its unexpected nature and the lack of 
available conformity standards for a child’s 
behavior. While a high-warmth interaction 
may allow a child to infer no prohibition 
about self-gratification, it may, for a number 
of reasons, also enable the child to aid— 
call for help in a rescue situation. One ex- 
planation for the differential impact of high- 
versus low-warmth on rescue effort behavior 


is that because another person in distress 
causes anxiety in the observer (Staub, 1970), 
7 en- 


high- as compared with low-warmth 
able the child to act quickly not only to help 
but also to decrease his own distress. The 
strong correlation found between strength of 
fear when the distressed sounds were heard 
and children’s speed of aiding supports the 
existence of a self-distress component to the 
rescue effort situation. Perhaps those children 
in the low-warmth condition were either too 
concerned about the sudden environmental 
uncertainty to focus on either the distressed 
child or on their own affective response to the 
emergency and, thus, called for help less fre- 
quently and with longer latency. A second ex- 
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planation for the effect of high warmth on 
aiding behavior would be that the warmth 
would enable the child to empathize and, 
thus, act more helpfully toward the other 
child (Yarrow, Scott, & Waxler, 1973). This 
explanation for the results is less tenable, par- 
ticularly in view of the lack of relationship 
between reported concern for the child and 
actual behavior. 

The impact of warmth on children’s imita- 
tive donation and aiding behaviors was not, as 
hypothesized, moderated by the child’s con- 
ceptual style. Instead, it was found that im- 
pulsive children imitatively donated more 
than reflective children. Recently, Block, 
Block, and Harrington (1974) have empha- 
sized the notion that children who are anx- 
ious may be impulsive rather than reflective, 
responding quickly to avoid criticism. Indeed, 
in this study, impulsive children, as reported 
on the total affect measure, tended to be less 
happy and more nervous than reflective chil- 
dren. Campbell (Note 1) reports that mothers 
of reflective and impulsive children do not 
differ on the amount of positive encourage- 
ment that they give their children but that 
they give impulsive children more negative 
reinforcement. These findings are suggestive, 
although not demonstrative, of the possibility 
that impulsive children may be more con- 
cerned about incurring negative reinforce- 
ments. Impulsive children, possibly, do not 
give themselves the opportunity to evaluate 
their own needs in the same way reflective 
children do; that reflective children may be 
more egocentric than impulsive children re- 
quires further exploration. 

The nonsignificant difference in the aiding 
latency between conceptual tempo groups is 
somewhat surprising. Perhaps, as Kagan and 
Kogan (1970) suggest, the disposition to re- 
spond quickly or slowly on tasks of uncer- 
tainty may vary across different types of 
tasks. Kopfstein (1973) and Mann (1973), 
for example, report in their respective studies 
of risk taking and delay of gratification that 
reflective and impulsive children’s response 
latencies did not differ on these dependent 
measures. In the present study, perhaps the 
unfamiliar environment, the warmth interac- 
tions, and the distressing nature of the unex- 
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pected aiding opportunity had effects which 
overrode the expected moderating effect of 
conceptual tempo. 

Having the opportunity to donate prior to 
the emergency did not significantly increase 
children’s rescue effort latency. And, although 
the correlation between amount of imitative 
donation and aiding latency was confounded 
in this study by the fact that donation pre- 
ceded aiding opportunity, the finding of a 
nonsignificant relationship between these help- 
ing behaviors is consistent with the results 
of other investigators (e.g., Staub, Note 2). 
Indeed, it may be that the differing situational 
contexts of and behavioral standards for an 
imitative donation versus an unexpected op- 
portunity to aid another child require behav- 
iors perhaps encouraged by different ante- 
cedents. Whether noncontingent high or low 
warmth influences children’s helping behav- 
iors by differentially making them seek con- 
formity standards, be concerned about en- 
vironmental sanctions, and/or have greater 
empathy must be further explored. 
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The concept of reasonable doubt was examined as both an individual and group 
decision criterion. Previous research indicates that neither criterion has an 
effect on verdicts. A reexamination of this research suggested that such effects 
might occur for cases producing maximum disagreement. An experiment was 
performed in which mock jurors reached individual and group verdicts for 
such a case. The decision criteria for individuals (judge’s definition of reason- 
able doubt) and groups (assigned decision rule) were varied in a factorial de- 
sign. As predicted, mock juries assigned a unanimity decision rule were sig- 
nificantly less likely to reach a verdict than juries assigned a majority rule. 
Minority members of juries assigned a majority decision rule were particularly 
dissatisfied with group deliberation. Definitional variations in reasonable doubt 
significantly affected both individual and group verdicts. The effects of the in- 
dependent variables for the group decision-making process were also examined 
using a model-fitting approach to Davis’ social decision scheme model, 


criminal trial, the criterion is that of reason- 
able doubt, that is, the juror should not vote 
for conviction unless he or she is satisfied of 
the defendant’s guilt beyond any reasonable 


The task facing a juror in a criminal trial 
may be likened to that of an investigator 
testing a statistical hypothesis (Feinberg, 
1971). Both assume a null hypothesis: For 


example, the juror must presume that the 
defendant is innocent. Both evaluate the 
likelihood of the available evidence under the 
null hypothesis; if the evidence is sufficiently 
unlikely, the null hypothesis will be rejected 
in favor of an alternative hypothesis (e.g., 
the guilt of the defendant). Both are faced 
with the difficult criterion problem of specify- 
ing just how unlikely the evidence need 
be to abandon the null hypothesis. The an- 
swer for both has been set by convention. For 
most statistical tests, a = probability of find- 
ing as unlikely an event under the null hy- 
pothesis =.05. For the juror in an American 
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doubt. But while the statistician may specify 
his or her criterion explicitly and precisely 
so that the outcome of a test for any 
particular set of data is unambiguous, the 
reasonable-doubt criterion, representing as it 
does a subjective probability, is liable to 
varied interpretations. 

The ambiguity of the concept is com- 
pounded by the fact that verdicts are group 
decisions. Reasonable doubt may be inter- 
preted as both an individual and a group 
decision criterion. As an individual criterion, 
it relates to the degree of certainty about the 
defendant’s guilt which a juror must feel 
before assenting to a guilty verdict. As a 
group criterion, it may be related to the 
degree of group consensus required for a guilty 
verdict. Both criteria were examined in this 
study. 

A description of the reasonable-doubt cri- 
terion to the jury is typically provided by the 
judge during the final instructions to the jury. 
The wording varies a good deal from judge 


282 


eo 


REASONABLE DOUBT IN MOCK JUROR JUDGMENTS 


to judge and from state to state (cf. 
McBaine, 1944; Reid, 1962). The relative 
disutilities our society places on making a 
Type I error (falsely convicting an innocent 
man) versus a Type II error (freeing the 
guilty one) have tended to identify nearly 
any doubt about the defendant’s guilt as a 
“reasonable” doubt (e.g., common instruc- 
tions indicate that in order to convict, jurors 
must be sure and certain of guilt, or they 
must believe in guilt to a moral certainty). 
Yet the recognition that all decisions are sub- 
ject to some doubt and the unwillingness to 
suffer a high Type II error rate have had a 
moderating influence, tending to qualify the 
interpretation of what constitutes a reason- 
able doubt. For example, a reasonable doubt 
is sometimes described as one (a) for which 
reasons can be given; (b) which would cause 
a reasonable man to hesitate in making an 
important decision; (c) which is substantial; 
(d) which is serious and well founded on the 


. evidence; and (e) which is not trivial or 


imaginary. 

The obvious difficulty of conveying the 
concept of reasonable doubt in unambiguous 
and uniform terms has prompted several sug- 
gestions: (a) defining the concept in subjec- 
tive probability terms (Simon & Mahan, 
1971); (b) adopting standardized or “pat- 
tern” instructions to jurors; or (c) refraining 
from any definition, thereby leaving it en- 
tirely to the jury (e.g., Committee on Pattern 
Jury Instructions in Criminal Cases, 1968). 
Although most agree that the concept is 
vague, the courts consider it to be a crucial 


‘one. Verdicts are occasionally reversed by 


appellate courts because of the particular 
wording chosen by a judge (e.g., Orfield, 
1963), Many observers of actual jury behav- 
ior confirm the law’s behavioral assumption; 
both lawyers (Bailey & Rothblatt, 1974; 
Owen, 1973) and jurors (e.g., “ ‘Reasonable 
Doubt’ Tips Scale Against SLA Pair,” 1975) 
have attributed a central role to the jury’s 
interpretation of this concept. However, it 
has also been suggested to the contrary that 
a judge’s instructions may have very little 
effect on juries’ verdicts (e.g., Frank, 1973, 
pp. 112-114). The instructions are said to 
be confusing to the layman and are given 


283 


after the juror has heard all the evidence and 
may have already made up his or her mind. 
Several psychological issues bearing on the 
effects of the definition of reasonable doubt 
were considered in this study. First, we 
sought to determine whether the behavior of 
mock jurors is sensitive to definitional varia- 
tions in the concept as put forth in the 
judge’s charge to the jury. Sealy and Cornish 
(1973) gave their mock jurors one of three 
descriptions of the proof required for convic- 
tion, ranging from “you must feel sure and 
certain on the evidence you have heard that 
the accused is guilty” (p. 213) to “you must 
feel satisfied that it is more likely than not 
that the accused is guilty” (p. 214). The 
authors concluded that their subjects’ verdicts 
were sometimes influenced opposite to expec- 
tations and that the effect of such instructions 
is very weak, However, a more detailed inter- 
nal analysis of their data suggest to us that 
the effects were usually orderly, frequently 
significant, and particularly strong when juror 
opinion was sharply divided. This may be 
readily understood in terms of the hypothesis- 
testing analogy. A constant shift in the crit- 
ical value of some test statistic will affect 
test results more when the critical values are 
near the median of the sampling distribution 
than near the tails. The case used in the 
present study was designed to produce such 
strong disagreement. Although the “more 
likely guilty” instruction used by Sealy and 
Cornish probably contributed to a strong 
manipulation of the required burden of proof, 
such a “preponderance of evidence” criterion 
is proscribed in criminal trials as requiring 
too low a level of proof of guilt for convic- 
tion. We restricted our attention to the ex- 
tremes of instructions in actual use. Under 
such a constraint, it is of interest to examine 
the range of interpretations mock jurors 
place on the reasonable doubt concept. A final 
goal was to see whether variations in reason- 
able doubt instructions would affect the 
decision-making process of the jury itself. 
Interest in reasonable doubt as a group 
decision criterion has been stimulated by the 
cases of Johnson vs. Louisiana (1971) and 
Apodaca et al. vs. Oregon (1971), in which 
the Supreme Court focused on the relevance 
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of a jury’s assigned decision rule. The Court 
held by narrow majorities that nonunani- 
mous juries were not constitutionally forbid- 
den in state trials. The majority explicitly 
stated that the legal standard of reasonable 
doubt did not require unanimity, while those 
dissenting argued that it did. The major dis- 
agreements between the Justices hinged on 
psychological issues. The majority held that 
departure from the traditional unanimity rule 
would not result in any bias for or against a 
defendant and further would serve the purpose 
of expediting deliberation at no risk to full 
participation by all jurors. However, dissent- 
ing Justices noted the possibility that mem- 
bers of small minorities in juries assigned a 
nonunanimous decision rule could be alto- 
gether excluded from participation by the 
majority. The Court’s majority felt these fears 
were groundless. They argued that majority 
jurors will not automatically cease delibera- 
tion after attaining the requisite majority, 
but that the “majority will cease discussion 
and outvote a minority only after reasoned 
discussion has ceased to have a persuasive ef- 
fect or to serve ony other purposes” (Johnson 
vs. Louisiana, 1971, p. 361). 

These decisions have stimulated several at- 
tempts at experimentally checking the Jus- 
tices’ contentions about juror behavior. In 
three separate studies (Bray, 1974; Davis, 
Kerr, Atkin, Holt, & Meek, 1975; Saks, 1975) 
no significant relation has been obtained be- 
tween the decision rule assigned to mock 
juries and their overall distributions of ver- 
dicts. However, some results from these stud- 
ies do suggest that such effects might exist. 
All three studies showed a trend toward more 
juries when the jury was assigned a 
jimous rule than when assigned a nonunan- 
le. A similar, nonsignificant trend was 
Kalven and Zeisel (1966), who 
compared the frequency of hung juries in 
} without unanimity require- 
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These studies indicate that the assigned 
decision rule may affect verdicts and the 
deliberation process. However, the question of 
the conditions under which such effects obtain 
remains largely unanswered. One answer is 
suggested by a social decision scheme analy- 
sis (Davis, 1973). The social decision scheme 
model is a stochastic model of group decision 
making and problem solving. The formal 
model is described in full detail elsewhere 
(Davis, 1973). A brief summary appropriate 
to the present application is provided here. 

Assume that the preferences of individuals 
on a decision task (e.g., reaching a verdict) 
may be described by a probability distribu- 
tion, p= (Pı, Pz, - . + 5 Pn), where n = the 
number of available response alternatives 
(e.g., for jurors, n = 2: guilty and not guilty). 
If groups of size r are assigned the decision 
task, and group members are distinguishable 
by their opinions only, then there are m = 


(i i Ame 8 
r 
unique ways the r members could be distrib- 
uted across the m alternatives (these are 
termed the “distinguishable distributions’). 


For example, in a four-person jury making a 
guilty/not guilty judgment, there are 


2+4-1 
Cr) 

= 5 distinguishable distributions: (4, 0), 
(3, 1), (2, 2), (1, 3), and (0, 4). If the 
groups are formed by random sampling from 
the population described by p, then the prob- 
ability (m;) that the ith distinguishable dis- 
tribution will occur at the outset of group 
discussion ‘may be calculated using the multi- 
nomial distribution. The model conceptualizes 
the group’s task as constructing a single, 
group response from the initial, often dispar- 
ate, opinions of its members. The model does 
not require that members never change their 
opinions during deliberation; it requires only 
that the final group decision can be predicted 
from individual member preferences. 

To make its predictions, the social decision 
scheme model must specify d; = the prob- 
ability that the group will choose the jth 
decision alternative if it begins discussion with 


the ith distinguishable distribution, where 
j=1,2,..., 0 (n = the number of deci- 
sion alternatives available to the group) and 
i=1,2,...,m. The predicted distribution 
of decisions is then given by the matrix prod- 
uct P= 2-D, where z is a row vector con- 
taining the m,i=1,2,...,m, and D repre- 
sents the “social decision scheme matrix” with 
elements dy, as described above. The speci- 
fication of D is the key theoretical problem. 
Past theoretical work has relied primarily on 
familiar and plausible voting rules (e.g., sim- 
ple majority) as the basis for construction of 
hypothetical social decision schemes. 
Note that these voting rules must often be 
supplemented with a secondary subscheme to 
define all the entries of D (e.g, a simple 
majority rule may be supplemented with an 
“averaging” subscheme to handle all cases of 
initial nonmajority, to create a “majority- 
averaging” social decision scheme). Several 
studies (Bray, 1974; Davis et al., 1975; 
Davis, Kerr, Stasser, Meek, & Holt, Note 1) 
found that the distribution of mock jury ver- 
dicts was well predicted by a social decision 
scheme which assumes that the jury will 
decide for that alternative (guilty or not 
guilty) which at least two thirds of the group 
advocates at the outset of deliberations; the 
data have been equivocal on the theoretically 
best subscheme, given particular conditions. 
Even though no significant differences have 
been found between the overall verdict dis- 
tributions of mock juries assigned unanimous 
and nonunanimous rules, there have been in- 
dications that different social decision schemes 
are applicable to each (Bray, 1974). If we 
assume momentarily that a more stringent 
decision scheme (i.e., one requiring greater 
initial agreement for a decision) applies to 
unanimous juries, it can be shown (e.g., Bray, 
1974) that the differences between unanimous 
and nonunanimous juries would become most 
pronounced when initial disagreement was 
maximal (i.e., when there was nearly an even 
split favoring conviction and acquittal) and 
would be localized primarily in the frequency 
of hung juries. The existing data are con- 
sistent with this analysis. Davis et al. (1975) 
found only a weak trend, y?(1) = 1.86, $ 
< .20, for more hung juries among unanimous 
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juries using a case for which only 22% of the 
individual jurors initially favored conviction. 
However, when more nearly equal splits in 
opinion are obtained, the effect is much 
stronger [e.g x7(1) = 4.94, p< .05, when 
35% of the individuals favor conviction, 
Bray, 1974]. 

The present experiment examined the effect 
of assigned decision rule for a case producing 
strong initial disagreement. We also sought to 
extend previous research utilizing the social 
decision scheme model to such a case. In most 
previous work, a model-testing approach was 
used in which the predictive accuracies of 
several social decision schemes, which seemed 
plausible prior to the research, were compared 
(eg., Davis, Kerr, Sussman, & Rissman, 
1974; Kerr, Davis, Meek, & Rissman, 1975). 
A model-fitting approach was used in this 
study, that is, we estimated D directly. This 
may be done in several ways (¢.g., Laughlin, 
Kerr, Davis, Halff, & Marciniak, 1975), but 
the most straightforward technique is to clas- 
sify each group according to its initial dis- 
tribution of opinion. The relative frequency 
distribution of final decisions (verdicts) for 
the groups with the ith initial distinguishable 
distribution of initial opinion estimates the 
ith row of D. 


METHOD 
Subjects 


A total of 645 subjects (359 males and 286 fe- 
males) participated for 2 hours each, Their par- 
ticipation fulfilled class requirements for introductory 
psychology and social psychology courses. Of the 
total, 606 subjects participated in 6-person mock 
juries. The balance of subjects were those who re- 
mained after the maximum number of randomly 
composed, 6-person groups were formed at each 
session. The data for these 39 subjects were not 
analyzed. 


Design 
r 


The experimental design was a 2 3 factorial with 
independent variables of assigned decision rule 
(unanimous vs. majority) and definition of reason- 
able doubt. For the latter variable, subjects were 
either (a) told that essentially any doubt about the 
defendant’s guilt qualified as a reasonable one 
(stringent criterion of guilt), (b) told that a reason- 
able doubt must be a substantial one (lax criterion), 
or (c) given no definition of the concept (undefined). 
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Procedure 


The procedure closely followed that used in previ- 
ous research (e.g., Davis et al., 1975; Davis et al., 
Note 1). Therefore, only the essential features will be 
outlined here. The experiment had two parts. During 
the first part, which was conducted in a large class- 
room, all subjects participated individually. The 
experiment was described to the subjects as an in- 
vestigation of decision making in a trial setting. They 
then observed a simulation of an actual rape trial. 
The trial was one used in a previous study (Davis 
et al., Note 1). This particular version was chosen 
because of the strong, predeliberation disagreement it 
had produced. The simulation included summaries 
of news articles that purportedly appeared in a local 
newspaper prior to the beginning of the actual trial 
and a 50-minute, audio- and videotape of testimony, 
attorneys’ summaries, and judge’s charge to the jury. 
The experimental manipulations were contained in 
the judge’s instructions to the jurors at the end of 
the trial. In the undefined condition, only the fol- 
lowing statement concerning reasonable doubt was 
included: 


Ladies and gentlemen of the jury, by the law of 
this state the burden is not upon the defendant to 
prove his innocence, but, on the contrary, the bur- 
den is on the prosecution to convince you beyond 
any reasonable doubt that the defendant com- 
mitted the crime of rape. 


The other two levels of the reasonable-doubt factor 
contained the above statement but continued to 
further define reasonable doubt. The phrases used to 
compose these definitions were drawn from those in 
actual use (McBaine, 1944; Reid, 1962). In the 
Jax-criterion condition, the judge’s statement con- 
tinued as follows: 


You have heard the term reasonable doubt several 
times here today. A reasonable doubt is not just a 
possible doubt, not a capricious or trivial doubt, 
but a serious and well-founded misgiving about the 
defendant’s guilt of this crime. Such a doubt must 
be based only on the evidence of this case and 
not upon anything outside of or not included in 
the evidence. The prosecution does not have to 
convince you that the defendant is guilty beyond 
ven the possibility of a doubt, nor need they 
we that their facts are absolutely true. Of 
r -erything that has to be proven with 
is subject to some doubt, but just any 
not the same thing as a reasonable doubt. 
ible doubt about the defendant’s guilt 
ı substantial one, a fair one, one based on 
{| one for which reasons can be given. 
you need not be absolutely sure that 

it is guilty to find him guilty. 


criterion condition, the above defini- 
with the following: 


; assumed innocent under the law. 
presumption of innocence the 
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prosecution must have convinced you to a moral 
certainty, with absolute and positive proof, that 
the defendant is guilty. That is, before you can 
return a verdict of guilty you must be sure and 
certain that the defendant is guilty. If you are not 
sure and certain of his guilt, you must find him 
not guilty. In reviewing the evidence of the case, 
you may be able to form many theories about 
what might have occurred. One of those theories 
of course is that the defendant is guilty, as 
charged, of the crime of rape. But you may be 
able to come up with other explanations of the 
evidence. If you feel that the facts of this case 
are compatible with any other theory besides the 
one in which the defendant is guilty, then you 
have a reasonable doubt about his guilt and 
must find him not guilty. In order for such a 
doubt to qualify as a reasonable doubt it is not 
essential for everyone to agree with you. If you 
feel any such doubt, you must find the defendant 
not guilty. If you are certain that you have no 
such doubt, you should find him guilty. In sum- 
mary, to convince the defendant, you must believe 
beyond any reasonable doubt that he is guilty of 
the crime of rape as I have defined it to you. 


The assigned decision rule factor was also manipu- 
lated at the end of the judge’s instructions to the 
jurors. In the unanimous condition jurors were in- 
structed that all six jurors must agree to convict or 
acquit. In the majority conditions a simple major- 
ity of four out of six was described as sufficient. 
After the presentation of the case, all subjects re- 
sponded individually to a personal opinion question- 
naire that contained several items concerning their 
opinions on the guilt or innocence of the defendant: 
a trichotomous (guilty, not guilty, cannot form an 
opinion [no opinion]) verdict scale; a rating of the 
certainty of this verdict; a sentence recommendation 
for the defendant if he were convicted; a rating on 
an 11-point scale of the likelihood that the defen- 
dant committed the crime; and a similar scale for 
rating the necessary likelihood of guilt for conviction 
beyond a reasonable doubt. Subjects were then 
randomly assigned to six-person juries. The sex ratio 
of the groups was not controlled. 

The second part of the study took place in smaller 
rooms. Instructions to all groups were read over an 
intercom. The jury was told to deliberate for up to 
30 minutes and reach a final verdict concerning the 
guilt or innocence of the defendant. An arbitrarily 
appointed foreman was instructed to take periodic 
polls of the jury and to record the time and each 
juror’s vote. Juries then deliberated until a final 
verdict was reached. If the jury could not reach a 
final verdict after approximately 30 minutes, the 
foreman was instructed to declare the jury “hung.” 
The jury then reached a group decision on the 
11-alternative probability-of-guilt scale. The assigned 
decision rule also applied to this decision task. After 
these tasks were completed, the jurors individually 
filled out a questionnaire that included the same 
items on the guilt or innocence of the defendant 
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a 
TABLE 1 
FREQUENCY AND RELATIVE FREQUENCY DISTRIBUTIONS OF PREDELIBERATION, INDIVIDUAL 
VERDICTS, AND JURY VERDICTS 
Assigned decision rule 
Unanimous Majority Row total 
Criterion of reason- Not No Not No Not No 
able doubt Guilty guilty opinion Guilty guilty opinion Guilty guilty opinion 
K 3 Predeliberation, individual 
verdicts 
Stringent 49 52 13 55 48 11 104 100 24 
A430 A56 114 A82 421 096 A450 439 105 
Lax 69 34 11 67 35 12 136 69 23 
-605 .298 «096 588 -307 -105 596 303 101 
j Undefined 32 24 10 44 24 16 76 48 26 
485 364 152 524 286 -190 507 320 A73 
Column total 150 110 34 166 107 39 316 217 73 
510 374 116 +532 343 125 521 358 120 
a Jury verdicts Hung Hung Hung 
4 Stringent 2 10 zi 7 11 1 9 21 8 
i 105 526 368 368 579 053 237 -553 211 
f 
j Lax 8 3 8 11 8 0 19 11 8 
421 158 421 .579 AT 000 500 289 211 
Undefined 3 4 4 6 4 4 9 8 8 
273 364 364 429 286 286 360 320 320 
Column total 13 17 19 24 23 5 37 40 24 
we 205 347 -388 61 422 096 366 396 238 


that were included in the predeliberation question- 
naire. The questionnaire also included manipulation 
checks, yes-no judgments as to the sufficiency of 
deliberation time and whether the juror was able to 
state all of his/her arguments, and ratings of juror 
satisfaction with the jury’s verdict and deliberation. 
A debriefing and request for confidentiality com- 
pleted the experimental session. 


RESULTS 


Predeliberation Personal Opinions 

Following the presentation of the trial, 
jurors gave their first opinions concerning the 
case. The subjects could make one of three 
judgments: guilty, not guilty, or no opinion. 
The jurors’ data are shown in Table 1. The 
data were treated as a 2 (decision rule) 
X 3 (reasonable-doubt criterion) X 3 (guilty, 
not guilty, no opinion) contingency table, and 
chi-square tests of association were carried 
out. The only significant effect was for rea- 
sonable doubt, y?(4) = 15.73, p < .005. As 
expected, the largest proportion of guilty ver- 
dicts was obtained in the lax-criterion con- 
dition (.596), with smaller proportions in 


the undefined (.507) and stringent-criterion 
(.456) conditions. There were also dispropor- 
tionately more no-opinion responses in the 
undefined condition (.173) than in the strin- 
gent (.105) plus lax (.101) conditions, y?(1) 
= 5.26, p < .05. The interaction of Assigned 
Rule X Reasonable Doubt was nonsignificant, 
and sex was significantly associated with pre- 
deliberation opinion, x*(2) = 8.68, p < .02. 
Females favored conviction more often than 
males (.58 vs. .47) and less frequently had 
no opinion (.093 vs. .143). The certainty of 
verdict, sentence recommendations, perceived 
probability of committing the crime, and cri- 
terion of reasonable doubt questions were 
asked both before and after group delibera- 
tion and analyzed using repeated-measures 
analyses of variance. Results of these analy- 
ses are postponed to the description of the 
postdeliberation data. 


Jury Verdicts 


Juries could reach a verdict of guilty, not 
guilty, or be hung. The verdict distributions 
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for juries are presented in Table 1. These 
data were treated as a three-dimensional 
contingency table on which a chi-square 
analysis of association was performed. For 
juries, decision rule was highly significant, 
x°(2) = 12.26, p < .005. It can be seen that 
there are more guilty verdicts in the majority 
condition, but the greatest difference lies in 
the frequency of hung juries. This conclusion 
was reinforced by another three-way chi- 
square analysis in which hung juries were 
excluded. Here the decision rule did not have 
a significant effect on verdicts, x°(1) = .44. 
The results for the reasonable-doubt manipu- 
lation were somewhat more complex. When 
hung juries were included in the analysis, 
reasonable doubt only approached signifi- 
cance, x?(4) = 8.35, p<.08, in spite of 
rather large differences between conditions in 
the proportion of guilty verdicts. When only 
guilty and not-guilty verdicts were considered, 
reasonable doubt was significantly associated 
with verdict, x7(2) = 6.89, p < .05. While the 
overall interaction term (i.e., the Reasonable 
Doubt x Assigned Rule X Verdict test of 
association) was nonsignificant, x*(4) = 3.65, 
there appeared to be a trend for more hung 
juries in the undefined majority condition 
than in the defined majority conditions. A 
similar trend was not apparent for the unani- 
mous juries. This apparent interaction was 
statistically supported when the data were 
pooled to form a 2 (reasonable doubt: de- 
fined, undefined) > ? (assigned rule) X 2 
(not hung, hung itingency table, y°(1) 
= 4.835 p< .05. 
An unweighte 
of the juries’ p 
produced no s 
nificance of tl 
particularly in 
variable on vercic 
tions in the perc 
against the de 


analysis of variance 
- of guilt judgments 
affects. The nonsig- 
le-doubt effect was 
he effects of this 
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tion to final decision transition frequencies 
appear in Table 2. For all conditions, a 5/6 
majority rule was generally valid. For only 
2 out of 31 cases did a jury fail to settle 
upon the verdict favored by such an initial 
majority (and 1 of those cases involyed a 
unanimous jury with 2 no-opinion jurors). 
The interesting differences between experi- 
mental conditions were localized in three dis- 
tinguishable distributions: (4, 2), (3,3), and 
(2, 4). According to the assigned rule, a 4-2 
initial split was sufficient to determine a ver- 
dict in the majority juries, and in most of the 
cases (19 out of 22) it did. Previous research 
(Davis et al., 1975; Davis et al., Note 1) 
has found that social decision schemes with 
a 2/3-majority primary social decision scheme 
could adequately account for verdict distribu- 
tions of juries assigned a unanimous rule. 
However, the data in Table 2 suggest that a 
4-2 initial majority was frequently (17 out 
of 26 juries) not sufficient to determine the 
verdict of the unanimous juries in this study. 
The data also suggested an asymmetry in the 
decision scheme; although four unanimous 
juries with a 4-2 split for conviction reversed 
themselves completely (unanimously voting 
for acquittal), no such reversal ever occurred 
when an initial 4-2 split favored acquittal. 


1 This was done for several reasons. First, prior 
use of the same case with a very similar subject 
population (Davis et al., Note 1) resulted in a 55% 
conviction rate in predeliberation individual verdicts 
on a forced-choice, guilty/not guilty item. The over- 
all 52% conviction rate in the present study sug- 
gested that most no-opinion voters would have voted 
not guilty had a forced choice been made. Also, by 
assuming only two individual response alternatives, 
the size of the D matrix was reduced dramatically, 
The number of juries, while too small for reliably 
estimating D when individuals are allowed three 
response alternatives, was sufficient for a first esti- 
mate of the simpler model. When »=2, m (the 
number of distinguishable distributions) = 7, whereas 
when x=3, m=28. Thus, the new categorization 
reduced the number of model parameters to be esti- 
mated (i.e., the entries in D) from 84 to 21. How- 
ever, it should be noted that when no-opinion jurors 
were identified as not-guilty jurors, some loss of in- 
formation occasionally resulted (e.g., one majority- 
undefined jury that convicted the defendant began 
deliberation with a 2 guilty, 4 not guilty split under 
the simplified categorization; the actual split was 2 
guilty, 2 not guilty, and 2 no opinion). 


z 
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TABLE 2 


PREDELIBERATION DisTRIBUTION TO JUR 


Y VERDICT TRANSITION FREQUENCIES 


Jury verdict 


Predeliberation 
individual dis- 


med rule 


— Reasonable-doubt 


tribution Unanimous Majority total 
Not Not Not Not 
Reasonable doubt Guilty guilty Guilty guilty Hung Guilty guilty Hung Guilty guilty Hung 
Stringent criterion 6 0 0 0 0 1 0 0 1 0 0 
5 1 0 0 0 2 0 0 2 0 0 
4 2 2 2 0 4 1 0 6 3 0 
3 3 0 2 4 0 1 1 0 3 5 
2 4 0 4 3 0 5 0 0 9 3 
1 5 0 1 0 0 3 0 0 4 0 
0 6 0 1 0 0 1 0 0 2 0 
Lax criterion 6 0 2 0 0 2 0 0 4 0 0 
5 1 3 0 0 3 0 0 6 0 0 
4 2 1 1 4 5 1 0 6 2 4 
3 3 2 0 1 1 2 0 3 2 1 
2 4 0 2 3 0 2 0 0 4 3 
1 5 0 0 0 0 3 0 0 3 0 
0 6 0 0 0 0 0 0 0 0 0 
Undefined 6 0 0 0 0 0 0 0 0 0 0 
5 1 2 0 0 2 0 1 4 0 1 
4 2 0 1 1 2 0 0 2 1 1 
3 3 1 2 0 1 1 3 2 3 3 
2 4 0 0 2 1 1 0 it 1 2 
1 5 0 0 1 0 2 0 0 2 1 
0 6 0 1 0 0 0 0 0 1 0 
Assigned rule total 6 0 2 0 0 3 0 0 5 0 0 
5 1 5 0 0 7 0 1 12 1 
4 2 3 4 5 11 2 0 14 6 5 
3 3 3 4 5 2 4 4 5 8 9 
2 4 0 6 8 1 8 0 1 14 8 
1 5 0 1 1 0 8 0 0 9 1 
0 6 0 2 0 0 1 0 0 3 0 


This difference approached significance (p < 
.08, Fisher’s exact test, two-tailed). Although 
the number of cases was very small, compari- 
son of the evenly split (3-3) juries with 
stringent and lax definitions of reasonable 
doubt did suggest that this variable may have 
influenced the juries’ decision-making process. 
In the stringent-criterion condition, no such 
jury conyicted; in the lax-criterion condition, 
half convicted (p < .12, Fisher’s exact test, 
two-tailed). 


Polls and Deliberation Time 


For both the mean number of polls to reach 
a verdict and the mean time consumed in de- 
liberation there was a significant effect of 
decision rule: polls, F(1, 95) = 24.09, p< 
001; time, F(1, 94) = 9.44, p < .01, with 
majority-rule juries reaching verdicts more 


quickly (25.1 vs. 21.1 minutes). No other ef- 
fects were significant. 


Juror’s Postdeliberation Personal Decisions 


After jury deliberation, jurors again ex- 
pressed their personal opinion on the guilty/ 
not-guilty/no-opinion item, These data were 
analyzed via a three-way (assigned rule X 
reasonable doubt X verdict) chi-square. Once 
again, reasonable doubt was significantly asso- 
ciated with jurors’ opinion, x°(4) = 37.14, 
p<.001. Again, the largest proportion of 
guilty verdicts was in the lax-criterion condi- 
tion (.615) followed by the undefined (.573) 
and stringent (.352) conditions. Also, there 
were again more no-opinion responses in the 
undefined condition (.10) than in the strin- 
gent (.062) plus lax (.053) conditions, al- 
though the effect is only marginally significant, 
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x7(1) = 3.24, p < .10. Finally, females again 
favored conviction more than males, x?(2) = 
6.60, p < .05. 

Subjects made individual ratings on a series 
of four items during both the pre- and post- 
deliberation periods. These responses were 
analyzed in 2 (decision rule) X 3 (reasonable 
doubt) X 2 (sex) X 2 (testing) repeated 
measures, unweighted-means analyses of vari- 
ance. The most important of these scales was 
a manipulation check on the reasonable-doubt 
manipulation; subjects were asked to indicate 
on an 11-point scale the probability of guilt 
necessary for conviction beyond a reasonable 
doubt. As expected, the reasonable doubt 
main effect was significant, F(2, 590) = 
19.92, p < .005. Stringent-criterion subjects 
required a higher probability (unweighted 
mean = .87) than either the undefined or lax- 
criterion subjects (unweighted mean = .82 
for both). The subjective probability levels 
found here are similar to those found by 
Simon and Mahan (1971) who obtained a 
mean of .89 for college students judging a 
murder case. No other effects were significant 
for this variable. One possible result of leav- 
ing the reasonable-doubt criterion undefined 
could have been uncertainty by individual 
jurors and disagreement by jury members on 
the meaning of this concept. In the undefined 
condition, the relatively large numbers of no- 
opinion individual verdicts and hung majority- 
rule juries were consistent with this reason- 
> data on the “necessary probability 

item also were corroborative: For 

redeliberation ratings in the un- 

‘ition had a significantly larger 

vat ; , 453) = 3.83, p < .01, than 

the pooled variance of the lax- and stringent- 
riterion conditions 

atings cf the probability that the 

ommitte ' were significantly af- 

¿z (femai:s judging guilt more 

sting (si bjects thought guilt 

y y after d liberation than be- 

fo effects wee significant. The 

involving the remaining two 

varia t (a) reasonable doubt sig- 

) affected the jurors’ cer- 

sonal verdicts (with lax- 

st certain and stringent- 


criterion subjects least certain), and (b) 
jurors recommended a significantly (p< 
.002) shorter sentence after deliberation than 
before (6.5 years vs. 5.7 years). 


Personal/Social Responses 


The results described thus far have sup- 
ported the Supreme Court’s view that use of 
a majority instead of a unanimity decision 
rule may not affect the conviction—acquittal 
ratio but may increase the jury’s likelihood 
of reaching a verdict and speed of delibera- 
tion. However, other data support the conten- 
tion that minority opinion often may not be 
heard in a jury using a majority rule. Sub- 
jects were asked if they had made all the 
arguments they wished to make. Majority- or 
minority-juror status was determined by 
whether or not the jurors’ initial opinion 
agreed with the jury’s final verdict. Members 
of hung juries and jurors indicating no opin- 
ion prior to deliberation were excluded from 
this analysis. Chi-square analyses of associa- 
tion resulted in a significant, x (1) = 10.0, 
$ < .005, three-way interaction, indicating 
that minority members of majority-rule juries 
were by far the most likely to indicate that 
they were unable to make all their arguments 
(30% of these subjects said so vs. 4%, 13%, 
and 8% for the majority/unanimous-rule, 
minority/unanimous-rule and majority/ma- 
jority-rule groups, respectively). Although the 
minority-majority member status and as- 
signed-rule variables were also significantly 
associated with the “ability to make argu- 
ments” judgment, these effects were largely 
attributable to the interaction effect. Similar 
patterns emerged from subjects’ ratings of 
their “satisfaction with the way decisions 
were made in the jury” and “satisfaction 
with the final verdict.” Both variables pro- 
duced significant assigned-rule and minority— 
majority member status effects, such that mi- 
nority jurors and members of majority-rule 
juries were less satisfied, and both displayed 
the same assigned-rule x minority—majority 
interaction. These results demonstrated the 
dissatisfaction of minority members of non- 
unanimous juries and suggested that this re- 
sult may be attributable in Part to the inabil- 


ity of such subjects to express their views 
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fully. This interpretation is supported by the 
juries’ balloting. Of the 43 majority-rule 
juries without initial unanimity that eventu- 
ally reached a verdict, 24 (56%) ceased de- 
liberation upon reaching the necessary, simple 
majority. The average, postcriterion discus- 
sion time for the 19 juries that continued to 
discuss the case was 9.6 minutes, and the 
variability was quite high (§ = 9.04). How- 
ever, none of these juries achieved complete 
unanimity; in every case the majority even- 
tually outvoted an unconvinced minority. The 
tendency to continue discussion after the 
simple majority was obtained was related to 
the length of time required to reach that con- 
sensus. For example, 15 of the 27 (56%) 
juries that had a majority on the first ballot 
continued to deliberate (on average, for 11.13 
minutes), while only 4 of the 16 (25%) 
juries that took more than one ballot did so 
(for an average of only 3.75 minutes). 


Discussion 


Before discussing the results of this study, 
the issue of external validity should be briefly 
considered. While all experimental results 
may be generalized to uninvestigated condi- 
tions only with peril, the potential relevance 
of mock jury research for actual jury behay- 
ior requires special sensitivity to the dangers 
of premature generalization, At best, this 
study represents an imperfect simulation of 
the task confronting an actual juror, The use 
of such laboratory simulations was not only 
justified on the traditional grounds of experi- 
mental control, but was required because of 
ethical and legal constraints on the in situ 
study of the jury. This study, like nearly all 
mock jury experiments (cf. Davis, Bray, & 
Holt, in press), differed from the real court- 
room setting in many ways: The mock juror 
population was not representative of real 
jurors; the case was considerably abbreviated 
and was presented in a highly artificial man- 
ner; the mock jury deliberations were neces- 
sarily more abbreviated than is the case in 
most real juries; and, perhaps most impor- 
tant, mock jurors’ verdicts did not have ser- 
ious consequences, The relevance of these 
and other differences are empirical questions 
for which preliminary answers have begun 
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to emerge (e.g., Simon & Mahan, 1972; Kerr, 
Nerenz, & Herrick, Note 2; Diamond & 
Zeisel, Note 3), But until further research in- 
dicates the scope and importance of such 
differences, the results of research using mock 
juries in simulated trials (including the pres- 
ent study) should be viewed as suggestive 
rather than definitive. 

Most judges are required to define the rea- 
sonable doubt criterion to jurors; carelessness 
or missing definitions may result in a reversal 
of verdict. It is clear from the results of this 
study that mock jurors’ verdicts may indeed 
be affected by a judge’s instructions concern- 
ing reasonable doubt. Using variations in 
definition which were extreme, yet not un- 
representative of those in actual use, we found 
the expected pattern: Those who were told 
that nearly any doubt about the defendant's 
guilt should qualify as a reasonable doubt 
were much more willing to acquit than those 
who were told that a reasonable doubt must 
be substantial, nontrivial, defensible, and so 
on. This small variation at the end of a rather 
long and complex case produced a difference 
of over 26% in the group conviction rate, 
These clear-cut results contrast with the rather 
equivocal pattern of Sealy and Cornish’s 
(1973) data. Our internal analysis of those 
data suggested that the effects of this varia- 
ble were most pronounced when the case 
produced strong disagreement. The reliable 
effects found here for a case with a 52% 
overall conviction rate are consistent with 
this conclusion. 

Furthermore, our data suggest that the defi- 
nition of reasonable doubt does not affect 
the mock juror’s perception of the weight of 
evidence against the defendant; judgments of 
the probability that the defendant committed 
the crime were unaffected by this manipula- 
tion. Rather, the reasonable-doubt instruc- 
tions have the intended effect of setting the 
juror’s criterion for conviction, The subjects 
were able to discriminate between the per- 
sonal conviction and legal standard of guilt. 
There were also indications that the impact 
of the reasonable doubt instruction might ex- 
tend beyond the individual juror’s decision, 
Among those juries whose members were 
evenly divided at the outset of deliberations, 
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those who required a high level of proof (i.e., 
stringent-criterion juries) were somewhat less 
likely to convict than those who required less 
proof (lax-criterion juries). This may have 
reflected a greater vulnerability of the guilty- 
voting members of the stringent-criterion 
juries to any novel arguments advanced by 
the opposing faction. The significantly lower 
certainty stringent-criterion jurors expressed 
in their individual verdicts is consistent with 
this conclusion. However, larger samples of 
juries will be required to determine with con- 
fidence whether variation in the definition of 
reasonable doubt is accompanied by changes 
in the jury’s social decision scheme. 

The data also demonstrate that the mean- 
ing of the concept of reasonable doubt is not 
obvious to our college-student subjects. There 
was considerably greater individual uncer- 
tainty and group disagreement when reason- 
able doubt was undefined, and more hung 
juries resulted. We strongly suspect that no 
less confusion would characterize samples 
more representative of actual jurors. Although 
the practice of letting the jury define reason- 
able doubt is certainly expedient, its con- 
tinued use should be carefully reevaluated. 
Further research should be devoted to the 
development of a definition which (a) sum- 
marizes an acceptable level of proof, (b) is 
simple enough to be easily understood by the 
heterogeneous population of jurors, and (c) 
is unambiguous enough to produce reasonably 
close agreement among jurors on the appro- 
priate level of proof. 

In many respects, this experiment has sup- 
ported the psychological reasoning underlying 
the majority opinion in Johnson ys. Louisiana 
(1971). As in earlier research, we found no 

pro- or antidefendant bias to result from as- 
signing juries a nonunar’ us decision rule. 
However, we obtained strongest experi- 
mental evidence to dz hung juries are 
nous juries, par- 
is some have in- 
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decision rules. Of course, it is possible that 
many of the hung, unanimous-rule mock 
juries would have reached a verdict if given 
sufficient time; this possibility ought to be 
studied further. In either case, the nonunani- 
mous juries reached their decisions sooner. 
However, it should be noted that the delibera- 
tion time difference obtained in this study 
was fairly small. Further, even if the differ- 
ence were sizable, the effect of more wide- 
spread use on nonunanimous juries on the 
efficiency of judicial administration is likely 
to be small, since jury trials comprise a rather 
small proportion of all cases and jury de- 
liberation tends to represent a small propor- 
tion of the total trial duration (Kalven & 
Zeisel, 1966). 

Our data also support a central objection 
to nonunanimous juries voiced by the dis- 
senters to the Johnson and Apodaca opinions, 
namely, minority members may be denied an 
opportunity to participate fully. Although 
many of our majority-rule groups continued 
deliberation after the required majority had 
been obtained, most did not. Half of those 
that stopped had been deliberating for some 
time before they achieved a majority; per- 
haps every juror’s views had been fully ex- 
pressed in these groups. But the other half 
ceased deliberation altogether after the first, 
deciding ballot. Further, the minority mem- 
bers of the majority-rule juries were signifi- 
cantly more dissatisfied with their jury’s 
verdicts and method of reaching that verdict, 
This dissatisfaction was not wholly attribu- 
table to their minority status; the effects were 
much weaker for unanimous-rule minority 
jurors. One of the most popular rationales 
for the continued use of the jury (e.g., Frank, 
1973, p. 135) is the assumption that jury 


2A recent study of over 900 hung jury cases in 
10 California counties (Planning and Management 
Consulting Corporation, Note 4), partially supports 
this conclusion. A hung jury defendant was just as 
likely to eventually receive some kind of criminal 
sanction as to receive none (51.1% vs, 48.9%, re- 
spectively). However, many of the defendants from 
the former category pleaded guilty to a lesser charge 
than the one that resulted in a hung jury. Thus, 
74.4 of the hung jury defendants either had charges 
dismissed, were acquitted upon retrial, or pleaded 
suilty to a lesser charge (Standish, Note 5). 
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duty engenders respect for the fairness of the 
judicial system. Our data suggest that this 
assumption may be less valid in nonunani- 
mous than unanimous juries, at least under 
the conditions of this study. 

The social decision scheme analysis in- 
dicated that somewhat different decision 
schemes applied to the unanimous-rule and 
majority-rule juries. The model-fitting ap- 
proach used in this study provided clearer 
insight into the locus of these differences than 
the model-testing approach utilized in pre- 
vious studies. The data indicate that both 
conditions usually decided for that alternative 
for which a 5/6 majority exists at the outset. 
However, the simple majority (Kalven & 
Zeisel, 1966) or 2/3 majority (Davis et al., 
1975) decision schemes were not strictly ap- 
plicable to unanimous-rule juries; they were 
fairly applicable for the majority-rule juries 
(note that for 6-person juries, these two pri- 
mary decision schemes make identical predic- 
tions). In addition, an asymmetry appeared 
in the estimated D for unanimous-rule juries, 
Unanimous juries with 4-2 splits for convic- 
tion frequently acquitted, while no unani- 
mous-rule jury with a 4-2 split for acquittal 
convicted. One explanation of these results 
is that it was easier to change the vote of a 
juror voting for conyiction than one voting 
for acquittal. This could conceivably reflect 
a general bias against convictions among our 
subjects; however, previous research using a 
similar subject population (e.g., Bray, 1974; 
Davis et al., 1975) produced no evidence for 
such an asymmetry. Another explanation lies 
in the relative distances of the advocates of 
conviction and acquittal from the reasonable 
doubt criterion, Although nearly equal num- 
bers of individual jurors advocated conviction 
and acquittal, the distribution of the “proba- 
bility that the defendant committed the 
crime” data was highly skewed; on the aver- 
age, jurors who thought it was very probable 
were much closer to the mean reasonable- 
doubt cutoff (.84) than those who thought it 
much less probable. This explanation would 
predict that the asymmetry noted here would 
disappear and even reverse itself as the rea- 
sonable-doubt criterion became less stringent 
or the prosecution case was strengthened. 
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The small number of groups and the rela- 
tively small effect (.87 vs. .82) of the reason- 
able-doubt manipulation on jurors’ subjective 
criteria preclude a powerful test of the former 
hypothesis in the present data set. But for 
the latter hypothesis, it is interesting to note 
that symmetric decision schemes were opti- 
mal in Davis et al. (1975) and Bray (1974), 
both of which obtained high conviction rates, 
This explanation would also predict a group 
“leniency shift” when the prosecution case 
was weak or the reasonable-doubt criterion 
lax, and a “harshness shift” when the prose- 
cution case was strong or the reasonable- 
doubt criterion stringent; some supporting 
evidence exists for this prediction (Rumsey, 
Laughlin, & Castore, Note 6; Myers & Kap- 
lan, Note 7). 

There seems to be little question from this 
perspective about the verdict to be expected 
from mock juries with initial unanimities and 
strong majorities. Our uncertainty is restricted 
to weak majorities and even splits between 
conviction and acquittal. These “maximum 
conflict” opportunities, with their small dif- 
ferences in power between majorities and 
minorities, are probably less stable in any 
event and are governed by norms or other 
social processes that change with case, popu- 
lation, juror individual differences, and other 
social nuances. Future research should utilize 
methodologies which concentrate experimental 
replications in such cases (e.g., choose cases 
which produce strong disagreement and assign 
subjects to groups such that maximum con- 
flict obtains), 
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Three major hypotheses were tested. Hypothesis 1 was derived from Weiner’s 
theory and proposes that success feedback leads to differential effort attribu- 
tions resulting in increased performance quality for individuals low in achieve- 
ment motivation and in decreased performance quality for individuals high in 
achievement motivation. Hypothesis 2 was derived from Kukla’s theory and 
Proposes that success feedback leads to an initial increase in performance 
quality for all individuals but results in differential asymptotic levels of per- 
formance quality for individuals high and low in achievement motivation, with 
those low in achievement motivation having a higher asymptote. Hypothesis 3 
was also derived from Kukla’s theory and suggests that significant relations 
should obtain among measures of speed of performance, performance quality, 
intended effort, and the individual’s level of achievement motivation, Experi- 
4 ment 1 found that success feedback enhanced performance quality for every- 
one and did not result in differential effort attributions that mediate perform- 
ance quality. Experiment 2 replicated Experiment 1 concerning the effects of 
success feedback on quality of performance but also indicated an early 
asymptote in performance quality for subjects high in achievement motiva- 
tion. Experiment 3 found that intended effort qualified as a more adequate 
explanation of differences in speed of performance and performance quality 


W 


than the environmental factor experimenter-provided feedback. 


Weiner (1972) proposed a cognitive theory 
of performance that incorporates perceptions, 
expectancies, and intentions as explanations 
of performance differences. Following Heider 
(1958), Weiner has suggested effort, ability 
(power), luck, and task difficulty as the per- 
ceived causes of performance at achievement- 
oriented tasks. The model involves three 
stages that resemble an information-processing 
system: (a) task evaluation, (b) goal-directed 
behavior, and (c) outcomes and subsequent 
reevaluation. During the task evaluation 
stage, perceiving the task as achievement 
oriented evokes attributions to the four 
causal factors as potential causes of good or 
bad performance. The nature of these expect- 
ancits determines goal-directed behavior in 
Stage 2 that results in a performance quality 
constituting either success or failure. In 


Experiment 1 and Experiments 2 and 3 were 
presented as two papers at the annual Midwestern 
Psychological Association Meetings, Chicago, May 2, 
1974. 

Requests for reprints should be sent to R. Michael 
Latta, Department of Psychology, Drake University, 
Des Moines, Iowa 50311, 


Stage 3 the causal attributions to effort, abil- 
ity, luck, and task difficulty are reevaluated 
in light of the adequacy of the performance 
level attained in Stage 2. The quality of 
performance attained on subsequent attempts 
of the task is determined by these new 
attributions in a recursive process, 

Research by Weiner and his colleagues has 
shown that attributions about the causes of 
success and failure systematically vary with 
achievement motivation. Specifically, individ- 
uals high in achievement motivation have 
been found to attribute successful perform- 
ance of others to their effort (Frieze & 
Weiner, 1971), whereas individuals low in 
achievement motivation tend to attribute such 
performance to luck or to task difficulty 
(Kukla, 1970). Weiner (1972, p. 395) con- 
tends that these findings may explain other 
findings demonstrating individuals low in 
achievement motivation to exhibit greater 
increments in the quality of performance fol- 
lowing experimenter-provided success feed- 
back relative to individuals high in achieve- 
ment motivation (Weiner, 1966; Weiner & 
Schneider, 1971). Believing that quality of 
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performance varies with effort, individuals 
high in achievement motivation are said to 
“relax” following successful performance and 
reduce their effort. This reduction in effort 
reduces their quality of performance on subse- 
quent attempts at the task. Believing their 
quality of performance does not vary with 
effort, individuals low in achievement motiva- 
tion are said to exhibit renewed goal striving 
following successful performance, and their 
quality of performance increases on subse- 
quent attempts at the task (Weiner, 1972, 
p. 320). Within this analysis, attributions to 
effort arising from the experience of success- 
ful performance are considered an adequate 
explanation of differences in quality of per- 
formance on subsequent attempts at a task 
by individuals high and low in achievement 
motivation. 

Kukla (1972b) offered an alternative cogni- 
tive theory of performance based upon the 
properties of behavioral intentions outlined by 
Dulany (1968) and Fishbein (1967). Unlike 
Weiner, Kukla specified two dimensions of 
performance; speed and quality. Here speed 
of performance (the actual effort exerted at 
a task) as well as quality of performance are 
assumed to be an increasing function of the 
amount of effort a person intends to exert 
at a task (intended effort). A postulate of 
the theory specifies that individuals who per- 
ceive themselves as more capable of doing 
well at the task will generally have lower 
intended effort than will individuals who per- 
ceive themselves as relatively incapable of 
doing well at the task. Since individuals high 
in achievement motivation report a higher 
estimate of their ability relative to individ- 
low in achievement motivation after 


uals 

having worked on an identical task (Kukla, 
1972a, Experiment 1), the theory makes the 
further assumption that individuals high in 
achievement motivation will generally have 
lower intended rt than will individuals low 
in achieveme notivation because of greater 


Thus, achievement motiva- 
bility) influences speed of 
rformance quality through 


perceived abil 


tion (perceived 


performance an 
the behavioral intention intended effort. 
In Kukla’s theory of performance, per- 


formance speed quality are also influ- 
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enced by perceptions of task difficulty. Here, 
perceived task difficulty is assumed to be re- 
lated to intended effort by an inverted-U 
function (Weiner, Heckhausen, Meyer, & 
Cook, 1972). The same relation holds for speed 
and quality of performance and perceived 
task difficulty (Atkinson, 1958), presumably 
because intended effort directly determines all 
aspects of performance. Hence, maximal in- 
tended effort, speed of performance and qual- 
ity of performance occur for tasks perceived 
to be of moderate difficulty. Analysis of 
perceived task difficulty also points out the 
difference between speed of performance and 
performance quality. Although a given level 
of intended effort and speed of performance 
are considered adequate for good performance 
from the individual’s point of view, actual 
performance quality may be low due to a mis- 
perception of task difficulty. In general, it is 
assumed that speed of performance and qual- 
ity of performance will be highly correlated. 
Although these two cognitive theories are 
similar in many respects, they permit differ- 
ential predictions about the quality of per- 
formance exhibited by individuals high 
and low in achievement motivation following 
experimenter-provided success feedback on a 
task perceived as moderately difficult. Weiner 
(1972) predicted increments and decrements 
in the quality of performance for individuals 
low and high in achievement motivation, 
respectively. Here, success is assumed to affect 
attributions to effort that produce differences 
in performance quality. Kukla (1974b), on 
the other hand, predicted an initial increment 
in the quality of performance for everyone 
who receives success feedback followed by a 
lower asymptote in the quality of perform- 
ance for individuals high in achievement mo- 
tivation compared with individuals low in 
achievement motivation, Here, the experience 
of success is generally assumed to alter per- 
ceived task difficulty from difficult to easy, 
at least for laboratory tasks such as anagrams 
and digit-symbol substitution. Thus, repeated 
successful performance results in a general de- 
crease in perceived difficulty, leading first to 
maximal intended effort, then to minimal 
intended effort. A lower asymptote in quality 
of performance should occur for individuals 
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high in achievement motivation because per- 
ception of the task as easy should usually 
occur earlier for them due to greater per- 
ceived ability, and perceiving the task as easy 
results in lowered intended effort. 

Three experiments were conducted to test 
the ability of these two theories to predict 
differences in performance quality (number of 
correct solutions) on a digit-symbol substitu- 
tion task under uniform laboratory condi- 
tions. No-feedback control groups were em- 
ployed in each study to determine whether 
increments or decrements in the quality of 
performance of individuals high and low 
in achievement motivation following experi- 
menter-provided success feedback are greater 
than those attributable to practice or loss 
of interest. 


EXPERIMENT 1 


Experiment 1 was designed to test the 
assumptions of Weiner’s theory of perform- 
ance regarding the relationships of effort 
cognitions to achievement motivation and 
performance quality. Specifically, individuals 
high in achievement motivation should be 
more likely to attribute their own successful 
performance to their own effort than indi- 
viduals low in achievement motivation. This 
was done because previous research either did 
not ask the participants to make judgments 
about their own performance (Frieze & 
Weiner, 1971), used a task that was ambigu- 
ous as to the causes of success and failure 
(Kukla, 1970), or used failure feedback 
rather than success (Meyer, 1970). 

Since Weiner (1972) considered effort cogni- 
tions to be causally related to performance 
quality, attributions to effort made by par- 
ticipants about their own behavior should 
predict performance quality. Because effort is 
decreased following success by individuals 
high in achievement motivation, the subse- 
quent performance quality of these individu- 
als should immediately start to decline below 
a quality attributable to loss of interest. 
Because renewed effort follows success by 
individuals low in achievement motivation, 
the subsequent performance quality of these 
individuals should immediately begin to rise 
above a quality attributable to practice. 
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Method 


A 2X22 factorial design with two levels of 
achievement motivation (high-low), two levels of 
feedback about performance (success-no feedback), 
and two levels of causal attribution (requested-not 
requested) was employed. 

Subjects. Scores from the short form of the 
Mehrabian (1968) scale of resultant achievement 
motivation were used to block subjects.! This scale 
contains 26 items to be answered on a —4 (disagree) 
to 4 (agree) scale. Subjects were 160 males selected 
from a group of 240 introductory psychology stu- 
dents tested 2 weeks prior to the experiment. The 
80 high-achievement-motivation subjects had a mean 
score of 16.6 and constituted the upper third of the 
original distribution of 240 scores, while the 80 low- 
achievement-motivation subjects had a mean score of 
—14.2 and constituted the lower third of the original 
distribution. 

Procedure. Subjects were randomly assigned to 
conditions and were administered one practice trial 
followed by six 30-sec test trials of a digit-symbol 
substitution task. This task involved substituting the 
highly similar symbols 
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for the digits 1 through 6. No memory was involved, 
since an example of the proper matches was always 
visible. Each trial consisted of the subject attempting 
to fill a single worksheet of 50 possible substitutions. 
The performance measures taken on this task were 
speed of performance (the total number of at- 
tempted substitutions per trial) and performance 
quality (the total number of correct substitutions per 
trial). The digit-symbol substitution task and the 
performance measures were chosen for their sensi- 
tivity to the effects of motivation and their use in 
previous research (Weiner, 1972, p. 240). 

Following Weiner (1966), subjects in the four 
success feedback conditions were given success feed- 
back on even-numbered trials by false norms indi- 
cating that the subject was doing twice as well as 
most college students attempting the task. Following 
each trial, subjects in the attribution conditions were 
requested to check on each of six paired-comparison 
scales the extent to which effort, ability, luck, and 
task difficulty influenced performance quality. These 
causal attributions were made in response to the 
question, “Which of these factors influenced your 
performance on the last trial to a greater extent?” 
The paired-comparisons format also included a cer- 
tainty rating. Thus, if the subject was uncertain as 
to which of the two factors in a comparison was 
most important, he checked at the midpoint of the 
scale that was anchored with the word uncertain, 


1For the male version, Mebrabian (1968) has 
reported low correlations with the social desirability 
scale of Crowne and Marlowe (1960), a 10-week 
test-retest reliability coefficient of .78, and reasonable 
validity coefficients. See also Mchrabian (1969), 
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TABLE 1 
MEAN ATTRIBUTION SCORES ror INDIVIDUALS HICH 
AND LOW IN ACHIEVEMENT MOTIVATION 
IN EXPERIMENT 1 AS A FUNCTION 
OF FEEDBACK 


Causal factor> 


Task 

diffi- 

Groups Effort Ability Luck culty 

High-feedback 94.5 95.8 1.5 71.8 

High-no feedback 96.4 92.0 1.0 78.0 

Low-feedback 87.2 91.8 13.5 63.0 

Low-no feedback 90.5 90.5 11.6 59.5 

aN = 20. 

b The larger the score, the greater is the importance assigned 

to the causal factor as a determinant of performance, 


The scales were 10 inches (3 m) long and were 
numbered from 1 to 99 in multiples of 10. 


Results and Discussion 


Altributions. Using the paired-comparisons 
technique, it was possible to derive a separate 
attribution score for each of the four causal 
factors effort, ability, luck, and task di ficulty. 
This resulted in four scores for attribution 
(one for each causal factor) for each of the 
80 subjects who were requested to attribute 
to causal factors. The first hypothesis pre- 
dicted that individuals high in achievement 
motivation generally attribute their success to 
their own efforts more so than individuals low 
in achievement motivation. Table 1 presents 
the mean attribution scores for effort, abil- 
ity, luck, and task difficulty after averaging 
over trials, 

A multivariate analysis of variance was per- 
formed on the average attribution scores 
using achievement motivation and feedback 
as classification variables and attribution to 
the four causal factors as correlated depen- 
dent measures. An overall test (Hotelling- 

Lawley’s trace) indicated that differential at- 
tributions were associated only with achieve- 
ment motivation, F (4,73) =2.91, p< .03. 
Univariate te revealed subjects low in 
achievement motivation +v consider luck more 
igh in achievement 
54, p< .003. The 
1 for attributions 
= 3.76, p < .05, 
ment motivation 
be less of an 


important than subje 
motivation, F (1, 7¢ 
opposite pattern 5 
to task difficulty, 
with subjects low 
considering task d 


influence on their performance than subjects 
high in achievement motivation. Contrary to 
the first hypothesis, attributions to effort 
were not found to be significantly related to 
level of achievement motivation. Attributions 
to ability were also not significantly related 
to level of achievement motivation. Neither 
the overall test nor the univariate tests indi- 
cated attributions to vary with feedback or 
the interaction of achievement motivation 
with feedback. 

The means in Table 1 suggest that most 
people view the internal factors effort and 
ability to determine success much more so 
than the external factors duck and task difi- 
culty. This similarity of attribution to effort 
for the high- and low-achievement-motivation 
subjects is surprising, given the large differ- 
ences in average level of achievement motiva- 
tion for the subjects categorized as high 
and low in achievement motivation in this 
experiment. 

Relation between attribution to causal fac- 
tors and perjormance quality. Total scores for 
attributions to the four causal factors, speed 


of performance, and performance quality were — 


found by summing across the six trials. The 
second hypothesis derived from Weéiner’s 
theory stated that attributions to effort should 
generally predict performance quality. To test 
this hypothesis, a multiple regression analysis 
was conducted to find the best predictor 
among the causal factors with performance 
quality and speed as criterion variables, 
Achievement motivation was employed as a 


continuous variable, while feedback was : 


scored dichotomously and the interaction term 
was generated by multiplication (Kerlinger & 
Pedhazur, 1973). Only attribution to effort 
was found to be significantly related to speed 
of performance, F(1,78) = 3.60, p< .05. 
Contrary to the second hypothesis, none of 
the causal factors were significantly related 
to performance quality. The same solutions 
resulted using a forward, backward, and step- 
wise procedure (cf. Draper & Smith, 1961), 
which indicated that order of entry was not 
important. 

Effects of achievement motivation, feed- 
back, and attribution on performance quality. 
The third hypothesis tested in this experiment 
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predicted the performance quality of individ- 
uals high in achievement motivation to decline 
below a level attributable to a loss of interest 
following success, whereas the performance 
quality of individuals low in achievement mo- 
tivation was predicted to rise above a level 
attributable to practice following success. An 
analysis of variance conducted on the per- 
formance quality measure with achievement 
motivation, feedback, and attribution as 
between-subjects factors and trials as a within- 
subjects factor indicated a Feedback X Attri- 
bution interaction, F(1, 152) = 4.0, p < .01, 
and a four-way Achievement Motivation x 
Feedback x Attribution x Trials interaction, 
F(5,760) = 2.23, p < .05. Table 2 presents 
the performance quality of subjects in the 
eight groups as it changes over trials. The 
general increase in performance quality over 
trials is probably due to a practice effect but 
does not obscure group differences. 

In Table 2, Lines 2 and 4, and 6 and 8, 
indicate no support for Weiner’s hypothesis 
concerning the differential effects of success 
feedback on the subsequent performance qual- 
ity of subjects high and low in achievement 
motivation. Not only did success feedback 
alone produce an increment in performance 
quality for subjects low in achievement mo- 
tivation, but it also produced an increment in 
performance quality for subjects high in 
achievement motivation. These increments are 
both greater than any increases in perform- 
ance quality attributable to practice, since 
both groups’ performance quality was greater 
than that of the no feedback—no attribution 
controls. Comparing Lines 1 and 4 of Table 2 
shows that attributing to causal factors in the 
presence of success feedback increases the per- 
formance quality of subjects high in achieve- 
ment motivation. If, as Weiner suggested, 
success leads to differential effort attributions 
resulting in differential performance qualities 
for subjects high and low in achievement mo- 
tivation, the attribution process should not 
have facilitated the performance quality of 
these subjects. 


EXPERIMENT 2 


The results of Experiment 1 were consistent 
with Kukla’s theory in that success feedback 
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TABLE 2 


QuAtiry ror INpivipvALs HIGH AND 
tT MOTIVATION AS IT CHANGES 
XPERIMENT | AS A FUNCTION 

2K AND ATTRIBUTION 


PERFORMAN 


Triano. 

Group® 1 2 3 4 5 6 
High-feedback- 

attribution 23.4 23.9 25.6 25.7 27.5 27.5 
High-feedback— 

noattribution 23.9 24.6 25.6 26.9 26.6 26.8 
High-no feed- 

back-attribu- 

tion 23.8 25.4 261 265 27.4 27.3 
High-no feed- 

back-no at- 

tribution 22.6 23.0 24.3 25.7 25.6 26.3 
Low-feedback— 

attribution 23.8 24.5 25.9 26.1 26.2 26.3 
Low-feedback— 

noattribution 24.7 25.4 28.9 27.9 28.8 29.7 
Low-no feed- 

back~attri- 

bution 23.7 23.7 25.4 26.7 26.1 26.9 
Low-no feed- 

back=no at- 

tribution 23.2 23.7 25.0 26.0 26.6 26.9 
“N = 20, 


produced an increase in subsequent perform- 
ance quality for both categories of achieve- 
ment motivation. Experiment 2 was designed 
to test the joint effects of feedback and 
achievement motivation on performance qual- 
ity predicted by Kukla’s theory. Specifically, 
Kukla’s theory predicts that success feedback 
leads to an initial increase in performance 
quality for all individuals but differential 
asymptotes in performance quality for indi- 
viduals high and low in achievement motiva- 
tion. Here, the inverted-U function relating 
perceived task difficulty to performance qual- 
ity suggests that individuals high in achieve- 
ment motivation (high perceived ability) have 
a lower performance quality asymptote than 
individuals low in achievement motivation 
(low perceived ability) when given success 
feedback. This effect of success feedback is 
due to individuals high in achievement moti- 
vation perceiving the initially difficult task as 
easy sooner than individuals low in achieve- 
ment motivation under conditions of continu- 
ous success. Thus Kukla’s theory predicts an 
overall positive influence of success feed- 
back on performance quality and differential 
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asymptotes for individuals high and low in 
achievement motivation. 


Method 


Subjects. Subjects were 80 males categorized as 
high or low in achievement motivation, selected for 
Experiment 2 from an initial sample of 130 intro- 
ductory psychology students following the procedure 
used in Experiment 1. The mean achievement moti- 
vation scores for the high and low groups were 15.3 
and —14.8, respectively. 

Procedure. From each achievement motivation 
category, 20 subjects were randomly assigned to the 
success feedback or no-feedback control groups and 
were administered the digit-symbol substitution task 
following the procedure of Experiment 1. As in Ex- 
periment 1, performance quality was measured by 
taking the number of correct substitutions per trial. 


Results and Discussion 


The results of a 2X26 repeated- 
measures analysis of variance performed on 
the performance quality scores indicated a 
main effect for feedback, F(1, 76) = 4.71, 
p < 03, and a main effect for trials, F 
(5,380) = 31.43, p < .0001, as the only sig- 
nificant sources of variation in the perform- 
ance quality scores. Figure 1 presents the 
mean quality of performance for subjects high 
and low in achievement motivation as it de- 
velops over trials as a function of feedback. 

Inspection of Figure 1 indicates that sub- 

jects high in achievement motivation receiving 
success feedback reached their performance 
quality asymptote on Trial 3, whereas sub- 
jects low in achievement motivation receiving 
success feedback continued to improve in per- 
formance quality from Trials 3-6. Simple- 
effects tests using Sheffés method indicated 
that subjects low in achievement motivation 
receiving success feedback had a significantly 
higher performance quality than their cor- 
responding no-feedback control group from 
Trial 3 to Trial 6 (p< 05). The other 
three groups did not differ significantly in 
performance quality at any of the six trials, 
Thus the effects of the feedback variable on 
performance quality predicted by Kukla was 
found in Experiment 2. 


EXPERIMENT 3 


Since Experiments 1 and 2 provided evi- 
dence consistent with Kukla’s interpretation 
of the effects of successful performance on 
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subsequent performance quality, Experiment 3 
was designed to test Kukla’s assumptions 
concerning achievement motivation, intended 
effort, speed of performance, and quality of 
performance. Recall that Kukla’s cognitive 
theory of performance makes the following 
assumptions; (a) An individual who is low in 
achievement motivation should have greater 
intended effort than an individual who is high 
in achievement motivation because the former 
has a tendency to attribute relatively little 
ability to himself compared with the latter. 
Thus, the individual low in achievement mo- 
tivation thinks that trying hard will compen- 
sate for his lack of ability. (b) Intended 
effort is positively related to speed of per- 
formance. This assumption reflects the per- 
former’s belief that performance has two 
aspects, quantity and quality. Hence, as an 
individual’s perception of his own ability de- 
creases, he attempts to do well at a task by 
means of being highly productive, regardless 
of the quality of any single attempt. (c) 
Speed of performance is positively related to 
performance quality. Trying hard often leads 
to an overall quality of performance that 
cannot be reached any other way. This 
assumption reflects the individual’s belief that 
indolence rarely pays off. (d) Finally, an indi- 
vidual’s intended effort should predict both 
speed of performance and performance qual- 
ity. This assumption is similar to that out- 
lined in (c) and reflects the fact that inten- 
tions are many times realized.? 


Method 


Subjects. Subjects were 40 males classified as high 
or low in achievement motivation, selected from the 
top and bottom thirds of an original distribution 
of 125 introductory psychology students following 
the previously described procedure. The average 
achievement motivation scores for the high and low 
categories were 29 and —19, respectively, 


*It should be pointed out that these theoretical 
relations among intended effort, performance speed, 
and performance quality probably hold only for 
many simple laboratory tasks. For example, maximal* 
intended effort probably leads to impaired perform- 
ance quality at complex tasks, and fast performance 
may tend to impair performance quality at these 
tasks. Hence, these predictions Probably do not apply 
to artists, architects, or for that matter, authors. 
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; Procedure. Following random assignment of sub- 
jects to success feedback or no-feedback control 
groups, the procedure was identical to that used in 
Experiments 1 and 2, except that all subjects were 
requested to estimate their intended effort following 
the usual practice trial. As in Experiments 1 and 2, 
the number of correct digit-symbol substitutions 
served as the measure of performance quality. For 
this study, the number of attempted digit-symbol 
substitutions was also recorded and constituted the 
measure of performance speed. 


Results and Discussion 


| Performance measures. The variables total 

speed of performance and total performance 

quality were formed by averaging over trials. 

The group means for these variables appeat 

in Table 3. A 2 X 2 multivariate analysis of 
x variance with total speed of performance and 
total performance quality as correlated depen- 
dent measures was conducted to test the hy- 
pothesis concerning the effects of achievement 
motivation and success feedback on perform- 
ance. This analysis indicated a significant 
overall effect of achievement motivation on 
performance, F(2,35) = 4.08, p < -03, that 
only applied to performance quality ($ < .02) 
and a marginally significant overall Achieve- 


e LOW ACHIEVEMENT MOTIVATION FEEDBACK 
© LOW ACHIEVEMENT MOTIVATION NO FEEDBACK 
m HIGH ACHIEVEMENT MOTIVATION FEEDBACK 
O HIGH ACHIEVEMENT MOTIVATION NO FEEDBACK 


MEAN PERFORMANCE QUALITY - 


22 | 
TRIALS 


FIGURE 1. Mean performance quality as a function 
of achievement motivation and feedback in Experi- 


ment 2. 


TABLE 3 


Meran Toral SPEED OF PERFORMA) 
TOTAL PERFOR! CE QUALITY 1I 
AS A FUNCTION OF ACHIEVEM 
MOTIVATION AND FEEDBACK 


Total per- 
formance 


Group™ formance quality 
Low achievement motivation- 
feedback 149.5 135.6 
Low achievement motivation- 
no feedback 160.6 135.3 
High achievement motivation= 
feedback 159.7 132.9 
High achievement motivation- 
no feedback 145.0 116.4 


aN = 10. 


ment Motivation X Feedback interaction, F 
(2,35) = 2.82, P < .07, that applied strongly 
to performance speed ($ < .02) and weakly 
to performance quality ($ < 08). 

The most striking aspect of the means re- 
ported in Table 3 is the deviance of the high- 
P hievement-motivation-no-feedback group. 
The large increase in performance quality 
produced by success feedback given to indi- 
viduals high in achievement motivation sug- 
gests that these individuals are dependent 
upon success feedback in the sense that the 
absence of such feedback impairs the quality 
of their performance compared with individu- 
als low in achievement motivation. Indi- 
viduals high in achievement motivation seem 
to require such feedback, probably because its 
absence lowers their effort. This explanation 
is supported by the significant Achievement 
Motivation X Feedback interaction for total 
speed of performance where individuals high 
in achievement motivation lower their speed 
of performance in the absence of success 
feedback. 3 

Accuracy of Kukla’s assumptions. Further 
analyses were conducted to check the accu- 
racy of the assumptions of Kukla’s theory. As 
suggested in Assumption (a), subjects low in 
achievement motivation exhibited greater in- 
tended effort (X= 48.2) compared with sub- 
jects high in achievement motivation (£ = 


42.8), but the difference was not statistically 
1(36) = 1.0, p < 33. Intended ef- 


to be positively related to 
r(38) = 36, P < 


significant, 
fort was found 
total speed of performance, 
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TABLE 4 


DIFFERENTIAL TESTS or Two THEORIES 


Weiner's theory 


Kukla’s theory 


Finding 


1. Attributions to effort should 
predict performance quality. 


2. Success enhances performance 
quality for individuals low in 
achievement motivation and 
impairs it for individuals high 
in achievement motivation, 


3. Success produces immediate 
increments and decrements in 
performance quality for indi- 
viduals low and high in achieve- 
ment motivation, respectively, 


- Ratings of intended effort 


should predict both perform- 
ance speed and quality. 


. Success enhances performance 


quality regardless of level of 
achievement motivation. 


- Success initially increases per- 


formance quality for everyone 
followed by a lower perform- 
ance quality asymptote for 
individuals high in achievement 


1. Attributions to effort predicted 
speed but not quality of per- 
formance (Experiment 1). 
Ratings of intended effort pre- 
dicted both performance speed 
and quality. 


te 


- Success enhanced performance 
quality regardless of level of 
achievement motivation 
(Experiments 1 and 2). 


3. Success initially increases per- 
formance quality regardless of 
level of achievement moti- 
vation, followed by a lower 
performance quality asymptote 


motivation. 


for individuals high in achieve- 
ment motivation (Experiment 


-02, and total speed of performance was found 
to be positively related to total performance 
quality, r(38) = -69, p < .0001, as stated in 
Assumptions (b) and (c). 

A multiple-regression analysis was con- 
ducted on total speed of performance and 
total performance quality scores using achieve- 
ment motivation, feedback, and intended ef- 
fort as possible predictors. This analysis was 
conducted following the procedure outlined in 
Experiment 1. In accordance with Assump- 
tion (d), intended effort was found to be the 
best predictor of both total speed of perform- 
ance, F(1,38) = 5.82, Ż < .05, and total per- 
formance quality, F(1,38) = 11.9, p < .002. 

Evidently cognitions about effort, whether 
they be attributions occurring after perform- 
ing or behavioral intentions occurring before 
performing, are more useful in predicting 
speed of performance and performance qual- 
ity than are environmental variables such as 
experimenter-provided feedback. 


CONCLUSIONS AND IMPLICATIONS 


The major hypotheses and related findings 
of this investigation are presented in Table 4. 
Generally, but with some exceptions, the find- 
ings outlined in Table 4 confirm Kukla’s 
theory and disconfirm Weiner’s. However, 


since the present investigation tested the two 
theories for a relatively simple task and under 
a relatively uniform set of experimental con- 
ditions, it would be Premature to draw con- 
clusions beyond the general conclusion that 
these initial studies provide more support for 
Kukla’s theory. The two theories are suffi- 
ciently precise to permit differential predic- 
tions, and this is encouraging for future 
research along these lines, 

The findings of Experiment 1 suggest 
that Weiner’s predictions about the relations 
among achievement motivation, effort attri- 
butions, and performance quality following 
experimenter-provided success feedback are 
not accurate. Experiment 2 results supported 
the prediction from Kukla’s theory of differ- 
ential performance quality asymptotes for 
individuals high and low in achievement mo- 
tivation following repeated success. Thus 
Kukla’s assumption of an inverted-U function 
relating perceived task difficulty to perform- 
ance quality was Partially confirmed. Finally, 
the results of Experiment 3 suggest that the 


8 These are pooled within correlations, The pooled 
within correlation for Speed of Performance and 


= .89, p < .0001, while that of Experiment 2 was 
r(78) =.97, p < 001. 
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cognitive variable intended effort is an ade- 
quate predictor of differences in both speed 
of performance and performance quality. 
These findings suggest that the class of inter- 
vening variables called behavioral intentions 
can explain performance differences, 

Since Kukla’s theory specifies that per- 
ceived task difficulty combines with perceived 
ibility in determining intended effort, which 
in turn determines speed of performance and 
performance quality, further research is re- 
quired. For one thing, a variety of task dif- 
ficulties and abilities need to be employed to 
determine whether different levels of intended 
effort, speed of performance, and performance 
quality result. Kukla (1974) has provided 
some initial evidence concerning this question 
by showing that individuals high in achieve- 
ment motivation exhibit higher performance 
quality than individuals low in achievement 
motivation when the participants were told 
that the task was difficult. The converse was 
found to hold when the participants were 
told that the task was easy. However, in that 
study as well as in the present study, percep- 
tions of task difficulty were not independently 
checked. These perceptions must be measured 
to certify that perceived difficulty corresponds 
to the difficulty level that the experimenter 
as stated and to be certain that perceived 
difficulty interacts with perceived ability in 
producing intended effort. Finally, more ade- 
quate attention should be given to partici- 
pants’ perceptions of their own performance 
in relation to experimenter-provided feedback. 
If repeated success feedback does not produce 
the perception of task ease, the inverted-U 
unction relating perceived task difficulty to 
performance speed and quality would not 
apply. What determines perceptions of ease 
or difficulty is probably much more than 
success; it includes the ups and downs of ex- 
pectancies of success and, probably, the level 
of aversive affect encountered in attempting 
the task. 

Turning to Weiner’s theory, little support 
is indicated in Table 4. One disconfirming 
finding came from Experiment 1 where attri- 
butions to effort did not vary with achieve- 


» ment motivation, regardless of the presence 


or absence of success feedback, A second 


disconfirming finding from that experiment 
was that attributions to effort did not predict 
performance quality as Weiner’s theory sug- 
gests. In order for success feedback to affect 
performance quality in the manner specified 
by Weiner’s theory, it is essential that attri- 
butions to effort vary with achievement mo- 
tivation and predict performance quality. It 
could be argued that the digit-symbol substi- 
tution task is too easy and thus an insuff- 
cient means for properly evaluating Weiner’s 
theory. Weiner has contended (Weiner et al., 
1972) that tasks of intermediate difficulty 
are most likely to elicit effort attributions 
that influence performance quality, Estimates 
of the perceived difficulty of the digit-symbol 
substitution task were obtained from an inde- 
pendent sample of 60 male students following 
a practice trial. The estimates were made on 
a 1 (extremely easy) to 99 (extremely diffi- 
cult) scale. The results indicated that the 
task was perceived as intermediate in diffi- 
culty (£ = 61), the most advantageous level 
of task difficulty for evaluating Weiner’s 
theory. In defense of Weiner’s position, it 
should be pointed out that effort attributions 
did predict speed of performance in Experi- 
ment 1 and that the effects of failure may 
be mediated by the attribution process as 
specified in his theory.’ Future research may 
confirm that portion of the theory. To test 
completely the portion of Weiner’s theory 
investigated in these experiments, a variety of 
tasks are required, since task difficulty prob- 
ably influences the first stage in the attribu- 
tion process as suggested by Kukla’s theory. 

Inconsistent with Weiner’s theory, a third 
finding in Experiments 1 and 2 showed that 
success feedback enhanced performance qual- 
ity regardless of level of achievement motiva- 
tion. Once again in defense of Weiner, the 
possibility exists that success feedback works 
in the manner that he has specified only in 
ambiguous situations where subjective success 
rather than experimenter-provided success is 
salient. As with any theory incorporating a 
personality variable, powerful situational fac- 
tors such as experimenter-provided feedback 


4 Weiner and Sierad (1975) have demonstrated the 
usefulness of Weiner’s theory in predicting the effects 
of failure feedback on speed of performance. 
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about performance may obscure the effects 
of individual differences in the tendency to 
respond to subjective success specified in 
Weiner’s theory. This is especially true, since 
Weiner’s theory deals with subjective success 
that may or may not be congruent with 
experimenter-provided feedback. 

Future research should clarify the limi- 
tations of these two cognitive theories of 
performance, 
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The Power Motive in the Dating Couple 
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Zick Rubin 
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The links between power motivation and satisfaction, anticipation of problems, 
and longitudinal stability of dating couples’ relationships were investigated. 
Hope of Power in men was found to be positively associated with dissatisfac- 
tion and with the anticipation of problems in the relationship, particularly 
problems relating to interpersonal conflict and impulsivity. Couples in which 
the men were high in Hope of Power were relatively likely to break up and 
unlikely to marry. Women’s Hope of Power was not related to dissatisfaction 


or to the couple’s stability. 


In the last decade or so, the problem of 
power in intimate relationships between men 
and women has become increasingly salient 
and explicit. In both the theoretical and em- 
pirical literature, this problem has been ap- 
proached from the viewpoint of real power, 
felt power, and social and legal power (see, 
e.g., Bullough, 1973; Kenkel, 1957; Olson & 
Rabunsky, 1972; Strodtbeck, 1951; Waller 
& Hill, 1951). Our research focused instead 
on links between individuals’ desire to have 
power (power motivation) and the nature of 
their relationships. Specifically, we consider 
the effects of both men’s and women’s desire 
to have power on satisfaction, anticipated 
problems, and long-term stability in opposite- 
sex relationships. 

In a recent monograph, Winter (1973) con- 
ceptualized an individual’s need for power 
(measured by a semiprojective test) as a 
stable tendency to seek impact on others, This 
impact may be sought through direct, force- 
ful actions to control others or through more 
indirect efforts to influence, persuade, charm, 
or even help others. As reported in Winter 
(and as summarized in Winter & Stewart, in 
press), the need for power (n Power) is re- 
lated to leadership in small groups, organiza- 
tions, and institutions; to athletic participa- 
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tion; and to efforts to manipulate and per- 
suade others. There is reason to believe that 
power motivation may also be related to cer- 
tain difficulties in interpersonal relationships. 
In friendly relations, Winter found that 
power-motivated men tend to seek alliances 
with weaker subordinates who will follow 
them and to avoid carefully any possibility 
of conflict. In opposite-sex relationships, they 
seem to prefer wives who are dependent and 
submissive. Other findings that seem directly 
relevant to the study of intimate relationships 
include the fact that men high in n Power 
tend to be interested in “vicarious” or por- 
nographic magazines and tend to have more 
sexual partners. Indeed, Winter chose the 
figure of Don Juan as the archetypal repre- 
sentation of the power-motivated male. 

Other researchers have found that men who 
are high in n Power, if they show some other 
evidence of developmental immaturity, are 
more likely to have been physically aggres- 
sive toward women, to express views indicat- 
ing fear of and hostility toward women 
(Slavin, Note 1), and to be separated or 
divorced from their wives (McClelland, 
1975). Thus there is some evidence that men 
high in power motivation tend to have dif- 
ficulty in their relationships with women and 
perhaps tend toward an exploitative or de- 
rogatory view of women. 

No research has been reported in this area 
using female subjects. Some early results sug- 
gest that many of the correlates of n Power 
in men also hold for women (Stewart, Note 
2; Winter, Note 3; Winter & Stewart, in 
press). However, there are no results to date 
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indicating that women high in power motiva- 
tion tend to have difficulties in opposite-sex 
relationships analogous to those found among 
men. 

The major aim of the present research was 
to consider systematically the relationship 
between power motivation and close opposite- 
sex relationships. This study is the first to 
consider the effect of both partners’ power 
motivation within the dyadic context. In ad- 
dition, the longitudinal nature of the research 
made it possible to test causal hypotheses 
more adequately than in previous research. 
We predicted that power motivation would 
be associated with relative dissatisfaction 
with the current relationship, anticipation of 
problems in the relationship in the near fu- 
ture, and long-term instability of the rela- 
tionship. We expected individuals high in 
power motivation to be especially likely to 
anticipate particular sorts of problems. Be- 
cause power motivation (in men) had been 
linked in previous research to avoidance of 
conflict, to physical aggression when conflict 
arises, and to a preference for submissive 
friends, we predicted that individuals high in 
power motivation would be especially likely 
to find differences in interests and attitudes 
difficult to tolerate (internal strains). We 
also anticipated that men high in n Power 
would find it harder than others to ignore 
or suppress their desires for independence 
from the relationship or for a relationship 
with another person (independence pres- 
sures). On the other hand, we did not expect 
power-motivated individuals to be more or 
less concerned than others with possible prob- 
lems that were “external” to the relationship, 
such as physical separation, parents’ objec- 
tions, or differences in social background 
xternal pressures). 
These hypotheses are derived from research 


(é 


conducted mainly with male subjects. In the 
absence of any clear findings from previous 
research, we formed no definite hypotheses 


about sex differences in these predictions. 


METHOD 


Sub je 

were 63 dating couples, all of whom were 
in a larger longitudinal study of op- 
lationships. At least one member of each 
ied one of four different colleges in the 


Subject 


posi 
couple at 
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Boston area. The modal couple consisted of a male 
junior and a female sophomore who had been dat- 
ing for about a year. Almost all of the couples were 
dating each other exclusively (or “going together”), 
but relatively few had any concrete plans about mar- 
riage. Further details of the composition of the 
sample may be found in Rubin, Peplau, and Hill 
(Note 4). 


Procedures 


The couples were tested in small groups, in class- 
room settings. Boyfriends and girlfriends attended 
the same session but were seated separately. They 
were given a five-cue “test of insight” (a Thematic 
Apperception Test, or TAT-type test, substituting 
verbal stems for the usual TAT pictures; see French, 
1958), with 20 sec for reading each cue and 4 min- 
utes for writing each story. The experimenter in- 
dicated the time intervals to the subjects. 

In addition, a long questionnaire was administered, 
including the following questions considered in this 
analysis: “All in all, how satisfied would you say 
you are with your relationship with ( )” and 
“All in all, how satisfied would you say ( ) is 
in his or her relationship with you” + Each of these 
questions was followed by a response continuum 
ranging from 1 (not satisfied at all) to 9 (extremely 
satisfied). A list of possible problems in opposite- 
sex relationships was also included, and subjects 
were asked: “For each of the phrases, circle the 


number which indicates the extent to which that > 


factor is likely to lead to difficulties in your rela- 
tionship with ( ) during the next year.” 

Subjects were then asked to rate the following 
areas on a scale from O (not a factor or not at all 
likely to lead to difficulties) to 2 (likely to lead 
to major difficulties). The list of problems, reordered 
to conform to the three theoretical categories de- 
scribed above (internal strains, independence pres- 
sures, and externa] pressures) is presented in Table 2. 
The questionnaire also included Rubin’s (1970) love 
and liking scales. 


Two years after the initial testing, the couples ¢ 


were recontacted by mail or by telephone to deter- 
mine which of the couples were still dating, had 
broken up, or had gotten married. We were able to 
contact at least one member of 59 of the 63 couples 
to establish the current status of the relationship. 


Scoring 


Stories written by the subjects were scored for 
n Power, according to the “revised m Power scoring 
system” described in Winter (1973, Appendix 1) 
This is an experimentally derived measure of the 
desire to influence, control, or have direct or indi- 
rect impact on others. The basic scoring criteria for 
n Power are that a character in a story must either: 
(a) take a strong, vigorous action that affects an- 
other, (b) express concern with his reputation a 


1 The symbol ( 


) refers to the subject’ 
friend or girlfriend. ject’s boy- 


prestige, or (c) arouse a strong emotional response 
in others. Further subcategories are scored only if 
one of these three criteria is met. The psychometric 
properties and validational research with the mea- 

Sure are discussed extensively in Winter (1973), and 
“Axcomplete scoring system is presented there. 
measure is partitioned into two component 
power motives: Hope of Power (a positive desire for 
ower, the approach motive) and Fear of Power 

negative salience of power, the avoidance mo- 
& e). Since Fear of Power predicts stress, discom- 

rt, and avoidance behavior in situations of insti- 
tutional power (see Winter & Stewart, in press) 
rather than the pursuit of direct, interpersonal 
power, we did not expect it to be related to be- 
havior in intimate relationships (as, indeed, it was 
not). Therefore, we limit ourselves to the approach 
motive, Hope of Power, and hereafter the terms 
Hope of Power and power motivation are used in- 
terchangeably. In fact, Hope of Power and overall 
n Power are highly correlated (usually, r = .8 or .9), 
$ whereas Fear of Power is not highly correlated with 
Hope of Power and is uncorrelated with # Power. 
Further details of the separate correlates of Hope 
x of Power and Fear of Power are presented in Winter 
(1973) and Winter and Stewart (in press). 

All stories were scored blind by the first author, 
who has a demonstrated reliability (rho) of .98 
in scoring » Power. A random subsample of 30 
stories was scored by another expert scorer, and 
the two scorers’ agreement (rho) was .96. 


E à ' 
` There were no differences in the average 
level of Hope of Power in the male and fe- 
male subjects, In addition, there was no asso- 


ciation between the motive scores of the two 
A members of a given couple (xê < 1). 


RESULTS 


EE, TABLE 1 


x Power MOTIVATION AND OVERALL SATISFACTION 
WITH RELATIONSHIP 


Male Hope 
of Power 


Female Hope 
of Power 


Variable (n = 63) (n = 628) 
faction —.3599* —.01 
ction, as 
E by partner —.26* —.07 
Partner's satisfaction — 28 05 
Partner’s satisfaction, as 
estimated by self —.20"* O1 
Average score of antici- 
pated problems oot 21* 
Average score of prob- 
lembs anticipated by 
partner —.04 —15 


Note, All tests of significance are two-tailed, 
-= One female sub) = been omitted because she did not 
write pcompiets Thematic Apperception Test protocol. 
<.10. 
*p < .05. 
e+p <0, 
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TABLE 2 


Correlation with: 


Male Hope Female Hope 
of Power of Power 


Variable (n = 63) (n = 624) 
Internal strains 
Differences in interests A Drina 03 
Becoming bored with 
the relationship Ore .28"* 
Conflicting sexual 
attitudes 2 fae 23" 
Conflicting ideas 
about marriage 21* 09 
Different intelligence AS 10 
Independence pressures 
(——)’s desire to be 
independent —.01 ye 
My desire to be in- 
dependent 20 14 
(——)’s interest in 
someone else 19 20 
My interest in some- 
one else 22* 17 
External pressures 
Living too far apart Hey a — 18 
Pressure from (———)’s 
parent’s 07 Ad 
Pressure from my 
parents —.15 26" 
Differences in back- 
grounds 08 —.17 
Note, (—) refers to boyfriend or girlfriend. 


as been omitted because she did not 
Apperception Test protocol, 


a One female sul 

write a complete Them 
*p <10. 
p< .05. 
"+p < 01, 


Satisfaction with the Relationship 


The correlations between power motivation 
and satisfaction with the relationship are sum- 
marized in Table 1. The average level of satis- 
faction reported by the subjects was quite 
high. As predicted, however, Hope of Power 
among the men was inversely related to their 
own satisfaction with the current relationship 
(r = —.35, p < .01). Indeed, the satisfaction 
of the girlfriends of these men was also nega- 
tively associated with male Hope of Power 
(r= —.28, p < .05). In addition, men high 
in power motivation are especially likely to 
anticipate problems in the relationship in the 
next year (r= .39, p < .01), although their 
partners are not. Men’s Hope of Power was 
also inversely correlated with their scores on 
scales (Rubin, 1970) assessing their love (r = 
—.26, p < .05) and liking (7 = —.20, p ~ .10) 
for their partners. The women’s Hope of 
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TABLE 3 


Power MOTIVATION AND THE FUTURE COURSE OF 
THE RELATIONSHIP 


% Married 


% Broken up 
after 2 years 


Hope of Power after 2 years 


Men 
High (n = 32) 50 9 
Low (n = 27) 15 52 
Women* 
High (n = 25) 36 32 
Low (n = 33) 33 27 


a One female has been omitted because she did not write a 
complete Thematic Apperception Test protocol. 


Power, in contrast, was not related to their 
own or their partner’s reported satisfaction 
with the relationship, nor to their scores on 
love or liking for their partners. 


Specific Problem Areas 


The correlations between power motivation 
and specific anticipated problems are pre- 
sented in Table 2. As predicted, men’s Hope 
of Power was significantly correlated with 
their expectation that internal strains, includ- 
ing differences in interests and attitudes as 
well as “becoming bored,” might threaten the 
relationship. The only exception to this pat- 
tern was the nonsignificant correlation with 
“difference of intelligence.” In the area of in- 
dependence pressures, there was a near-signifi- 
cant tendency for power-motivated men to be 
especially likely to view their own interest in 
“someone else” as a potential problem. Fi- 
nally, there was an unpredicted correlation 
between men’s Hope of Power and the expec- 
tation that “living too far apart” (an external 
pressure) might threaten the relationship. 
‘There were fewer significant correlations be- 
tween women’s Hope of Power and their an- 
ticipation of problems in the relationship, and 
hey formed a less consistent pattern. Sig- 
icant or near-significant correlations were 
nd between women’s Hope of Power and 
expectation that becoming bored, con- 

s sexual attitudes, the partner’s desire 
lependence, and parental pressures 

sent difficulties for the relationship. 


f the Relationship 


» years, 34% of the couples had 
31% were married, and the re- 
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maining 35% were still dating. As Table 3 
indicates, the breakup rate was as high as 
50% in couples in which the man was high 
in Hope of Power (divided at the median) 
compared with 15% for couples in which the 
man’s Hope of Power was low (x? = 6.6, p < 
.01). Similarly, only 9% of the couples in 
which the man was high in Hope of Power 
got married compared with 52% of the 
couples in which the man’s Hope of Power 
was low (x? = 10.89, p < .001). These statis- 
tical relationships did not hold for female 
Hope of Power. 


Discussion 


The data strongly support the view that 
power motivation in men is a source of rela- 
tive dissatisfaction and stress in intimate re- 
lationships. In accord with previous findings, 
the data suggest that power-motivated men 
are especially likely to encounter difficulty in 
establishing a harmonious relationship with a 
woman. 

Supporting evidence for this view may be 
found in the fact that the men high in power 
motivation reported that during the 2 years 
before the relationship under study began, 
they “seriously” dated more different girls 
than did other men (7 = .37, p < .05). This 
suggests that these men have already estab- 
lished a pattern of relatively serious involve- 
ments in relationships that do not last. These 
findings do not hold for women high in power 
motivation, 

One possible interpretation of the pattern 
of unstable relationships among men high in 
Hope of Power is that these men tend to set 
higher standards for themselves either for 
their lives as a whole or in relationships in 
particular. However, there is no evidence that 
men high in Hope of Power actually have 
such high standards. Men’s Hope of Power 
was unrelated to their plans for further edu- 
cation, career aspirations, or degree of ad- 
herence to the “romantic ideal.” ? 

It was found that internal strains and dis- 
agreements were especially likely to be seen 
as potential problems by power-motivated 


2Precise details of these measures of aspiration 
may be found in Rubin, Peplau, and Hill (Note 4). 
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men. This may be due to their relative lack 
of ability to handle conflict or to their over- 
weening need for total agreement (see Winter, 
1973, pp. 115-116). Future research to locate 
the precise sources of these difficulties is 
needed. 

Most strikingly, strong support was found 
for the prediction that power motivation in 

ven is associated with long-term instability 
if opposite-sex relationships. Highly power- 
motivated men were especially unlikely to get 
married over the 2-year period of the study. 
Thus there may be tendencies among power- 
motivated men both toward destruction of the 
relationship and against long-term commit- 
ment, as the archetype of the Don Juan fig- 
ure and the previous findings of sexual pro- 
miscuity would suggest. Only future research 
can establish whether these two findings— 
breakup and lack of marriage—are opposite 
poles of the same dimension or two separate 
tendencies of men high in power motivation. 

The data for women were less striking or 
consistent. Power-motivated women were 
slightly more likely than others to expect dif- 
ficulties in their relationships, but their part- 
ners were not. There was no tendency for 
power-motivated women or their partners to 
find their current relationship “unsatisfying,” 
and their relationships did not in the long 
run tend toward dissolution or away from 
marriage. 

The lack of confirmation of the men’s re- 
sults in comparison with those of women may 
reflect a fundamental problem with the mea- 
sure of power motivation, which was derived 
from male subjects and has been used almost 
exclusively in research with men. Recent re- 
search suggests, however, that the power mo- 
tive is aroused in women under the same 
conditions as for men (Stewart & Winter, in 
press) and that it is related to many of the 
same behaviors for women as for men (Win- 
ter, Note 3). It seems appropriate, therefore, 
for other more substantive explanations for 
the differences between the sexes in these data 
to be considered. It may be, for example, 
that whereas men learn to seek power by 
means of short-term, serial conquests of 
women (the Don Juan syndrome), women 
are more inclined to seek lasting relationships 


with men as power or prestige “possessions.” 
As a result, women high in power motivation 
may be relatively unlikely to report difficul- 
ties or to want to dissolve an intimate rela- 
tionship. Another possible explanation is that 
power-motivated women may be better able 
to manage conflict than power-motivated men, 
reflecting a general tendency for women to be 
socialized to resolve conflicts and to reduce 
interpersonal tension. 
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Two experiments were conducted to examine the relationship between frustra- 
tion and coping strategies in two delay-of-gratification situations: self-imposed 
delay (optional waiting for a preferred reward) and externally imposed delay 
(required waiting for a preferred reward). It was hypothesized that the frus- 
tration generated by the two situations would evoke different coping strategies. 
Grade-three children were found to spend Jess time attending to reward-relevant 
cues in self-imposed delay than in externally imposed delay. It was also ob- 
served that the impact of physically present, deferred rewards on attentional 
behavior was dependent on the delay situation. Individuals engrossed them- 
selves more in a reward-irrelevant activity in self-imposed delay when the 
reward was physically present than when it was absent, whereas the opposite 
pattern emerged under externally imposed delay. A second study indicated that 
increasing the time individuals expected to wait resulted in decreased attention 
to reward-relevant stimuli in self-imposed delay but not in externally imposed 
delay. The importance of these findings for an understanding of the cognitive 


and motivational processes involved in delay behavior is discussed. 


While the development of the ability to de- 
lay gratification has been considered by many 
authorities to be critical to successful sociali- 
zation (e.g. Freud, 1911/1959; Jones & Ger- 
ard, 1967; Mowrer & Ullman, 1945), only 
recently have the cognitive and motivational 
variables relevant to this ability been sys- 
tematically investigated (Mischel, 1974). 
Mischel and his colleagues have conducted 
a series of studies with young children in 
which they have observed how long a child 
will wait for a preferred reward when he has 
the option of terminating the delay and re- 
ceiving a nonpreferred reward (e.g., Mischel 
% Ebbesen, 1970; Mischel, Ebbesen, & Zeiss, 

72; Mischel & Moore, 1973). One of the 

i intriguing results to emerge from the 
sf Mischel is the finding that the physi- 
ence or imaginal representation of a 
eward increases the frustration asso- 

w h delay and precipitates the volun- 
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tary termination of the delay period. This 
finding is particularly interesting because it 
appears to contradict the Freudian hypothesis 


that generating hallucinatory images of the ~ 


delayed goal object enables the child to “bind 
the time” till the object is delivered or be- 
comes accessible, 

A recent study by Miller and Karniol 
(1976) endeavored to illuminate the appar- 
ent discrepancies between the empirical find- 
ings of Mischel and his colleagues and the 
theorizing of Freud and others. Miller and 
Karniol observed that Mischel and his col- 
leagues have typically employed situations 
where the delay interval is under self-control, 
whereas Freud was typically concerned with 
externally imposed delay situations. Freud’s 
prototypical example of the delay process, for 
example, is that of the young child deprived 
of his mother’s presence, In order to tolerate 
her absence, an absence which the child can- 
not terminate, Freud contends that the child 
comes to generate images of her breast. 

Miller and Karniol (1976) empirically 
demonstrated the importance of the distinc- 
tion between self-imposed and externally im- 
posed delay in a study which manipulated 
the salience of contingent rewards, in addi- 
tion to the nature of the delay situation. Con- 
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sistent with the work of Mischel, Miller and 
Karniol found that children in a self-imposed 
delay situation experienced more frustration 
when a delayed reward was physically present 
during the delay interval than when it was 
absent. On the other hand, when the delay 
was externally imposed, children experienced 
ss frustration when the delayed reward was 
physically present than when it was absent. 
‘The purpose of the present investigation is to 
examine more closely the role of attentional 
mechanisms in the two types of delay situa- 
tions. 

It has been reported by Mischel and his 
colleagues that directing the child’s attention 
to reward-relevant and reward-irrelevant stim- 
uli can significantly affect the child’s volun- 
tary delay time. When the child is instructed 
to focus, either visually or imaginally, on 
stimuli relevant to the deferred reward, he or 
she tends to wait for a shorter duration than 
when instructed to focus on reward-irrelevant 
stimuli (e.g., Mischel & Baker, 1975). Mi- 
schel has summarized this research as follows: 


The successful maintenance of such [self-imposed] 
delay, depends on cognitive and overt activities to 
reduce the aversiveness of the self-imposed frustra- 
tion, One helpful strategy here is for the individual 
to “tune out” on the actual goal objects. . . . Self 
generated distractions from the rewards may serve 
is an extremely valuable cognitive strategy for de- 
lay. (Mischel, 1974, p. 287) 


Miller and Karniol, on the other hand, have 
argued that when the individual has no choice 
but to endure the delay interval, it is adap- 
tive to focus on rewards or cues that remind 
him or her of the reward. It is their conten- 
tion that delay frustration in an externally 
imposed delay situation is greatly abated if 
the individuals believe that their waiting is a 
means to an end and if they have a vivid 
image of what that end is. Evidence con- 
sistent with Miller and Karniol’s hypothesis 
can be found in a series of human learning 
experiments which indicates that an effective 
means of maintaining instrumental activity 
in the absence of immediate reward is to have 
the individual focus on stimuli which are as- 
sociated with the delayed outcome (e.g., 
Fagan & Witriyol, 1966; Strickland & Gaito, 
1967; Wright & Smothergill, 1967). 


EXPERIMENT 1 


One general hypothesis that can be drawn 
from the Miller and Karniol (1976) study is 
that individuals’ attention to reward-relevant 
and reward-irrelevant stimuli might be dis- 
tributed quite differently in the two types of 
delay situations. Specifically, individuals in 
self-imposed delay (SID) should focus more 
on reward-irrelevant stimuli and less on re- 
ward-relevant stimuli than individuals under 
externally imposed delay (EID). The suc- 
cessful tolerance of an SID requires that indi- 
viduals divert their attention from cues that 
remind them both of the deferred reward 
and the temptational dilemma which it pro- 
duces. When the delay period is externally 
imposed, with no opportunity for self-termi- 
nation, focusing on cues that enhance the 
salience of deferred rewards does not increase 
temptation but rather reduces frustration by 
reminding the individual that his suffering 
has an instrumental function. 

The physical presence or absence of con- 
tingent rewards is also expected to affect 
the degree of attention directed at reward- 
relevant and reward-irrelevant cues in the two 
types of delay situations. Since delay frustra- 
tion is thought to be greater in SID when the 
reward is physically present, it is hypothe- 
sized that as a means of coping with the 
increment in frustration, individuals will di- 
rect more attention to reward-irrelevant stim- 
uli when the reward is present than when it is 
absent. In EID, on the other hand, delay 
frustration is thought to be greater when the 
reward is absent than when it is present. Ae 
a means of coping with the greater frustra- 
tion, individuals in EID are expected to direct 
more of their attention to reward-relevant 
stimuli in the reward-absent condition than 
in the reward-present condition. 


Method 


Subjects. Subjects were 82 male and female chil- 
dren recruited from second- and third-grade classes 
in a middle-class area in London, Ontario. 4 

Apparatus. The study was conducted in a mobile 
trailer parked on the school grounds. In addition to 
a table and two chairs, the trailer contained a large 
portable clock placed directly behind the children 
and a block puzzle game which was placed on the 
table in front of the subjects. Since time helped 
define the delay contingency for the subjects, the 
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clock served as a reward-relevant stimulus in the 
study. The blocks served as the reward-irrelevant 
stimulus. In some conditions, an electric buzzer was 
also present in the trailer. 

Procedure. The experimenter brought the children 
individually to the trailer allegedly “to play games.” 
Once inside the trailer, the subject and the experi- 
menter sat down at a small table and the experi- 
menter produced a block puzzle game. The game 
required the child to put blocks of different shapes 
and sizes through matching holes in a plastic con- 
tainer. When the child had succeeded in placing all 
of the blocks in the container (approximately 3-5 
minutes), the experimenter pointed to a covered pie 
plate saying, “Well, let’s see what’s under here.” The 
pie plate contained two adjacent marshmallows and 
an eraser. The subject was asked to indicate 
which of the items he or she preferred. All subjects 
expressed a preference for the marshmallows. 

The experimenter next informed the subject that 
she had “some work to do in the back room.” 
Subjects in the SID condition were told, “If you 
wait till I come back, you can have the two marsh- 
mallows. But if you cannot wait till I come back, 
you can push this buzzer and I will come back.” 
The buzzer was demonstrated a number of times at 
this point to ensure that the child understood how 
to summon the experimenter. The child was then 
informed that if he or she did summon the experi- 
menter, he or she would receive the nonpreferred 
reward (the eraser) instead of the preferred reward 
(two marshmallows). In the EID condition, sub- 
jects were simply told that when the experimenter 
returned they would receive the two marshmallows. 
The buzzer was not present in this condition. 

In both conditions, the subjects were directed to 
a clock on a ledge behind them. The clock was 
always set at 12:00. The subjects were told by the 
experimenter that she would return in 5 minutes or 
when the “big hand gets to 1.” To ensure that the 
subject comprehended the instructions, she asked the 
child to repeat the temporal contingency. 

Depending on the reward condition, the experi- 
menter either took the pie plate with the rewards 
out of the room (reward absent) or left the plate 
on the table facing the child (reward present). In 
all conditions, as the experimenter was leaving the 
room, she spread the blocks out on the table and 
said to the subject, “You can play with these while 
I am gone if you would like to.” 

The experimenter then went into the other com- 
partment of the trailer and observed the child 
through a one-way mirror. The experimenter and 
another observer each held two stop watches on 
which they recorded the amount of time the sub- 
ject spent watching the clock and playing with 
the blocks.* 

Because the clock was located behind them, the 
subjects had to turn their heads in order to see the 
clock, which made it relatively easy to determine 
when they were actually focusing on the clock. 
Block playing was defined as time spent touching 
one or more of the blocks. 
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The experimenter returned to the experimental 
compartment after 4 minutes had elapsed. Pretesting 
had indicated that 4 minutes was the longest dura- 
tion that the preponderance of children in the self- 
imposed delay condition could wait before pushing 
the buzzer. The experimenter began by querying the 
subjects about the details of her absence. Specif- 
ically, the child was asked, “When did I say I was 
coming back?” and “Do you remember what you 
get now that I have come back?” Once the subject’s 
responses to these questions were recorded, the 
experimenter escorted the child back to the classroom. 


Results and Discussion 


One subject was dropped from the analy- 
sis because he terminated the delay interval 
in the choice condition before the experi- 
menter returned, A second subject was elimi- 
nated from the analysis because he could not 
accurately identify the time at which the 
experimenter was to return. The elimination 
of these 2 subjects left 20 subjects in each 
of the four conditions. A 2 x 2 analysis of 
variance was conducted on the data after a 
log transformation was performed to eliminate 
heterogeneity of variance (Myers, 1972). 

The first hypothesis was that individuals 
in SID would focus less on stimuli that 
evoked the deferred goal and the delay exi- 
gencies (the clock) than would be the case 
in EID situations. This hypothesis received 
strong support from the results of Experi- 
ment 1, F(1, 76) = 7.78, p < 001. 

A second hypothesis concerning clock- 
watching behavior focused on the impact of 
reward salience under externally imposed 
delay. It was hypothesized that subjects in 
the EID situation would look more at the 
clock (goal-relevant stimuli) when the reward 
was absent than when it was present. A 
planned comparison of the relevant conditions 
(see Table 1) failed to support this hypothe- 
sis (F <1). 

Block Playing. To correct for heterogeneity 
of variance, a log transformation was also 


1 Like the experimenter, the observer was blind 
to the experimental hypotheses, but unlike the ex- 
perimenter, she was also blind to the experimental 
conditions. After the attentional behavior of 10 sub- 
jects in each cell had been recorded, the records of 
the observers’ behavior were compared. The reliabil- 
ity of their judgments was extremely high (.91), and 
it was decided to continue the experiment employ- 
ing only the experimenter as the observer, 
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conducted on the time spent playing with the 
blocks. The transformed data were analyzed 
'] by a 2 X 2 analysis of variance. 
i Since distraction from the delay situation 
; thought to be an effective means of coping 
ith an SID but not an EID, it was hypothe- 
; ized that subjects in the SID condition 
) would spend more of the delay period engaged 
@ T reward-irrelevant activity (block playing) 
ian would individuals in the EID situation. 
the analysis of variance did not reveal a 
main effect of delay situation, but did reveal 
a Delay Situation X Reward Salience inter- 
A action, F(1, 76) = 15.29, p < .001. The pat- 
TA tern of means in this interaction indicated 
that subjects in the SID condition played 
more with the blocks than subjects in the 
$ SID condition when the reward was present, 
ut they played less with the blocks when 
the reward was absent. The means relevant 
to this interaction are shown in Table 2. 
A second hypothesis concerning block play- 
ing focused on the impact of reward salience 
in the SID situation. It was hypothesized that 
individuals in the SID situation would play 
a longer with the blocks when the reward was 

present than when it was absent. The data 
confirmed this hypothesis, F(1,76) = 6.00, 
$< .025. Surprisingly, reward salience also 
affected block playing behavior in the EID 
condition. Subjects in the EID condition 


y played with the blocks significantly longer 

} when the reward was absent than when it 
was present, F(1,76) = 5.72, p < .025. 

/ ‘The finding that individuals in the EID 


condition played more with the blocks when 
the rewards were absent than when they 
were present was unexpected but may help 
explain why individuals in this condition did 
not attend more to the clock, as was pre- 
dicted. It was believed that if subjects were 
more frustrated when the reward was absent, 


TABLE 1 


“MEAN TIME IN SECONDS SPENT IN CLOCK WATCHING AS 
A FUNCTION or DELAY SITUATION AND 
REWARD SALIENCE 


Reward Reward 

; Delay situation present absent 
Self-imposed 83.75 96.75 

Externally imposed 138.50 129.50 
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TABLE 2 


MEAN Time IN SECONDS SPENT IN BLOCK PLAYING AS A 
FUNCTION OF DELAY SITUATION AND 
REWARD SALIENCE 


Reward Reward 

Delay situation present absent 
Self-imposed 91.25 27.50 
Externally imposed 15.25 62.25 


they would attempt to reduce this frustration 
by focusing on a cue (in this case, the clock) 
that would remind them of the incipient 
goals. The clock-watching data did not re- 
flect this difference. Subjects may, however, 
have coped with their greater frustration in 
the reward-absent condtion by playing with 
the blocks. 


EXPERIMENT 2 


Experiment 1 assumed that the differences 
between the two delay situations on clock 
watching and block playing behaviors reflect 
different levels of frustration and different 
means of coping with the frustration. It may 
be desirable, however, to have further evi- 
dence that these behaviors are affected by 
frustration. If the patterns we have observed 
do reflect differential levels of frustration and 
coping strategies, then these differences should 
be magnified by experimentally increasing the 
frustration level. 

A problem with attempting to increase the 
frustration level in both types of situations is 
that what increases frustration in one situa- 
tion may not increase frustration in another 
situation. Indeed, some variables (e.g., reward 
salience) seem to have antithetical effects in 
the two situations. One variable which woukl 
appear to be positively related to experienced 
frustration in both situations is the expected 
length of the delay period. Whether the delay 
is self- or externally imposed, the longer the 
required delay, the more frustrated the 
individual should become and the more nec- 
essary are coping strategies. The longer the 
individual thinks he or she must endure a 
delay before a desired object can be obtained, 
the more likely he or she will be to engage in 
behaviors that reduce frustration. This should 
be the case in both SID and EID situations, 
although the particular coping strategies may 
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TABLE 3 


MEAN Time IN SECONDS SPENT IN CLOCK WATCHING AS 
A FUNCTION OF DELAY SITUATION AND 
EXPECTED DURATION 


Expected duration 


Delay situation 5 minutes 10 minutes 
Self-imposed 131.33 77.66 
Externally imposed 140.33 128.33 


be quite different in the two situations. Ex- 
periment 2 was conducted to test these 
hypotheses. 


Method 


Subjects. Subjects were 60 male and female third- 
grade children enrolled in public schools in London, 
Ontario. 

Procedure. The procedure and experimental design 
closely paralleled that of Experiment 1. The proce- 
dure was identical up to the point where the experi- 
menter instructed the subject as to how long she 
would be gone from the room. In one condition, the 
children were told that the waiting interval would 
be 5 minutes. In the other condition, the children 
were told that it would be 10 minutes. 

The manipulations thus involved the type of delay 
situation (SID or EID) and expected length of delay 
interval (5 or 10 minutes), Reward salience was 
not manipulated in this study as all rewards were 
taken from the room when the experimenter left. 
As with Experiment 1, the experimenter recorded 
the attentional behavior of the child. In all condi- 
tions, the experimenter returned after 4 minutes 
and asked the subjects the same questions as in 
Experiment 1. 


Results and Discussion 


To correct for heterogeneity of variance, a 
log transformation was conducted on both the 
clock-watching and block-playing data. Subse- 
quently, 2 X 2 analyses of variance were con- 
ducted on both dependent measures. 

Clock watching. It was hypothesized that 
attention to reward-relevant cues increases 
frustration in SID but not under EID. Con- 
sistent with this hypothesis, an interaction of 
Situation X Expected Duration of Delay was 
found on the clock-watching measure, F- 
(1, 56) = 4.38, $ < .05. As Table 3 shows, 
expected duration of delay did not affect clock 
watching behavior under EID, (F < 1) but 
significantly reduced clock watching under 
SID, F(1, 56) = 6.79, p < 025. 
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Block playing. It was hypothesized that the 
longer the expected delay, the greater would 
be the time spent engaged with the blocks (a 
reward-irrelevant activity). The results sup- 
ported this hypothesis, but an inspection of 
Table 4 and a marginally significant Delay 
Situation x Expected Duration interaction, F 
(1,56) = 3.2, p < .10, reveal that this effect 
was much more pronounced in the EID 
condition, F(1,56) = 14.20, p < .001, than in 
the SID condition (F <1). The interaction 
means are shown in Table 4. 

Overall, the results of Experiment 2 sup- 
port the assumption that in the present 
paradigm, the dependent measures of clock 
watching and block playing are highly sensi- 
tive to manipulations of delay frustration. 
Moreover, Experiment 2 substantiates the 
hypothesis that delay frustration induces dif- 
ferent types of coping strategies in the two 
delay situations and indicates that the greater 
the frustration experienced in the situations, 
the more prevalent the use of such coping 
strategies. 


GENERAL Discussion 


Employing time estimates of the delay 
period as the index of delay frustration, 
Miller and Karniol (1976) found that indi- 
viduals experienced (a) greater frustration 
(longer time estimates) the more physically 
prominent the delayed reward in SID, and (b) 
less frustration (shorter time estimates) the 
more prominent the delayed reward in EID. 
The present studies underscore further the 
importance of considering type of delay situa- 
tion in any attempt to understand the cog- 
nitive and motivational processes involved 
in delay behavior. It appears that not only 
do situational manipulations (e.g., reward 
salience) affect delay frustration differen- 


TABLE 4 


MEAN Tim 
or DELAY SITUATION AND EXPECTED 
DURATION or DELAY 


Expected duration . 


Delay situation 5 minutes 10 minutes 
Self-imposed 29.00 37.33 
Externally imposed 40.00 84.00 


; SPENT IN BLOCK PLAYING, AS A FUNCTION 
j 


a 


COPING STRATEGIES AND ATTENTIONAL MECHANISMS IN DELAY 


tially in the two contexts but the different 
situational dyanmics are reflected in coping 
strategies. 

Individuals in SID spent less time looking 
at reward-relevant stimuli (a clock) than did 
individuals in the EID condition. Presumably, 
attention to reward-relevant cues in choice 
situations is dysfunctional as it increases the 
desire for the delayed object as well as the 
frustration associated with the delay interval 
(Miller & Karniol, 1976; Mischel, 1974). 
However, the temptational element is absent 
from the externally imposed delay situation, 
and attention to reward-relevant cues may 
actually facilitate the endurance of the delay 
interval by providing the individual with in- 
formation as to the instrumental value of his 
waiting behavior. Further evidence that atten- 
tion to reward-relevant cues increases delay 
frustration under SID but not EID was found 
in Experiment 2. When individuals in a choice 
situation expected the delay to be 10 minutes, 
they looked at the clock less than when they 
expected it to be 5 minutes. The expected 
duration of the delay interval did not affect 
the clock-watching behavior of individuals in 
the EID context. 

Mischel (1974) has argued that an effective 
means of coping with the frustration gener- 
ated by SID is to distract oneself from all 
thoughts or stimuli that remind the individ- 
ual of the exigencies of the delay situation. 
This strategy is thought to be particularly 
necessary when the cues that evoke the temp- 
tational dilemma are plentiful. Accordingly, 
it was hypothesized that when a reward is 
present in SID, individuals will focus more 
on irrelevant stimuli than when the reward is 
absent. The results supported this hypothesis 
as the subjects spent significantly more time 
playing with the blocks (reward-irrelevant 
stimuli) when the rewards were physically 
present than when they were absent. 

It was hypothesized that individuals in 
EID would find it less necessary to engage in 
reward-irrelevant activity than subjects un- 
der SID. This only appeared to be the case 
in the reward-present condition. When the 
reward was absent, subjects in the EID con- 
dition engaged in more reward-irrelevant ac- 
tivity (block playing) than their SID coun- 
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terparts. The finding that EID subjects en- 
gaged more in a reward-irrelevant activity 
when the reward was absent than when it was 
present was not predicted. 

Experiment 2 supported the inference that 
when the reward was absent, subjects under 
EID coped with their frustration by directing 
their attention to reward-irrelevant stimuli. 
When individuals in the EID (reward-absent) 
condition anticipated a long as opposed to a 
short delay interval, they did not change their 
behavior with respect to clock watching, but 
they did increase the amount of time spent 
in block playing. 

The pattern of results, thus, provides evi- 
dence that individuals under EID are more 
frustrated when a deferred reward is physi- 
cally absent than when it is present, but there 
is no evidence to suggest that they cope with 
this additional frustration by attending to 
reward-relevant stimuli. 

A possible reason for the failure of sub- 
jects under EID to cope with their frustra- 
tion in the expected fashion may lie with the 
specific reward-relevant stimuli employed in 
the present experiment. The clock was con- 
ceptualized as a reward-relevant stimulus 
because it helped define the conditions neces- 
sary to attain the reward. It also provided 
cues as to the subjects’ temporal proximity 
to the goal object. It did not, however, con- 
stitute a representation, abstract or other- 
wise, of the delayed reward. In other words, 
individuals in the EID condition may indeed 
combat their delay frustration by generating 
images of the delayed objects, but attending 
to a clock does not seem to serve the same 
purpose. Attention to the clock did seem to 
reflect frustration level in the SID situation, 
however, so there is evidence that, in that 
context at least, it did fulfill its function as 
a reward-relevant stimulus. Future research 
should manipulate the type of goal-relevant 
stimuli employed in the two types of situa- 
tions. A fruitful beginning in this regard may 
be to explore the distinction made by Mischel 
(1974) between the informational or “cue” 
and “consummatory” value of stimuli. 

In conclusion, the present results corrobo- 
rate and extend Miller and Karniol’s finding 
(1976) that the nature of the delay context 
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greatly affects the dynamics of the delay proc- 
ess. In particular, the present findings sug- 
gest a possible means by which attentional 
and coping strategies may mediate the impact 
of motivational and contextual variables in 
delay situations, 


REFERENCES 


Fagan, J. F., & Witriyol, S. L. The effects of instruc- 
tional set and delay of reward on children’s learn- 
ing in a simultaneous discrimination task. Child 
Development, 1966, 37, 433-438. 

Freud, S. Formulations regarding the two principles 
of mental functioning. In Collected papers (Vol. 4). 
New York: Basic Books, 1959. (Originally pub- 
lished, 1911.) 

Jones, E. E., & Gerard, H. B. Foundations of social 
psychology. New York: Wiley, 1967. 

Miller, D. T., & Karniol, R. The role of rewards in 
externally and self-imposed delay of gratification. 
Journal of Personality and Social Psychology, 1976, 
33, XXX-XXX. 

Mischel, W. Processes in delay of gratification. In L. 
Berkowitz (Ed.), Advances in experimental social 
psychology (Vol. 7). New York: Academic Press, 
1974, 


DALE T. MILLER AND RACHEL KARNIOL 


Mischel, W., & Baker, N. Cognitive appraisals and 
transformations in delay behavior. Journal of Per- 
sonality and Social Psychology, 1975, 31, 254-260, 

Mischel, W., & Ebbesen, E. B. Attention in delay of 
gratification. Journal of Personality and Social 
Psychology, 1970, 16, 329-337, 

Mischel, W., Ebbesen, E. B., & Zeiss, A. R. Cognitive 
and attentional mechanisms in delay of gratifica- 
tion. Journal of Personality and Social Psychology, 
1972, 21, 204-218. 

Mischel, W., & Moore, B. Effects of attention to 
symbolically presented rewards on self-control, 
Journal of Personality and Social Psychology, 1973, 
28, 172-179, 

Mowrer, O. H., & Ullman, A. D. Time as a determi- 
nant of integrative learning. Psychological Review, 
1945, 52, 61-90. 

Myers, J. L. Fundamentals of experimental design. 
Boston: Allyn & Bacon, 1972. 

Strickland, L. H., & Gaito, F. W. Temporal presenta- 
tion of winning symbols and slot machine playing. 
Journal of Experimental Psychology, 1967, 74, 10- 
133 

Wright, J. C., & Smothergill, D. Observing behavicur 
and children’s discrimination learning under de- 
layed reinforcement. Journal of Experimental Child 
Psychology, 1967, 5, 430-440, 


(Received October 20, 1975) 


Manuscripts Accepted for Publication in the 
Journal of Personality and Social Psychology 


Physical Attractiveness and Marriage Adjustment in Middle-Aged Couples. Bernard Mur- 
stein (Box 1581, Department of Psychology, Connecticut College, New London, Con- 


necticut 06320) and Patricia Christy. 


Children’s Preferences for Individualistic, Cooperative, and Competitive Outcomes. Charles 
G. McClintock (Department of Psychology, University of California, Santa Barbara, 
California 93106) and Joel M. Moskowitz. 

Effects of Field Dependence on Affective Reactions and Compliance in Dyadic Interactions. 
Les R. Greene (Division of Mental Health, University of California at Davis, Sacra- 
mento Medical Center, 2315 Stockton Boulevard, Sacramento, California 95817). 

Communication Modality as a Determinant of Message Persuasiveness and Message Com- 
prehensibility. Shelly Chaiken and Alice H. Eagly (Department of Psychology, Univer- 
sity of Massachusetts, Amherst, Massachusetts 01002). 

Social Facilitation of Word Association: Further Questions. Thomas O. Blank, Ilene Staff, 
and Phillip Shaver (Department of Psychology, New York University, 4 Washington 


Place, New York City, New York 10003). 


Differential Social Perception and Attribution of Intergroup Violence: Testing the Lower 
Limits of Stereotyping of Blacks. Birt L. Duncan (University of California, Berkeley, 
Health and Medical Sciences Program, Room 106 T-7, Berkeley, California 94720), 

Compensatory Aspect of Dreams: An Empirical Test of Jung’s Theory. George Domino 
(Department of Psychology, University of Arizona, Tucson, Arizona 85721). 

Outcome Alternatives and Value as Criteria for Multistrategy Evaluations. Edward J. Lawler 
(Department of Sociology, University of Iowa, 204 Macbride Hall, Iowa City, Iowa 


52242) and Samuel B. Bacharach, 


Frequency Perception of Successes as a Function of Results Previously Obtained by Others 
and by Oneself. Robert Vreven and Joseph R. Nuttin (Laboratory of Experimental 
Psychology, University of Leuven, 102 Tiensestraat, B 3000, Louvain, Belgium). 

Order of Consensus, Distinctiveness, and Consistency Information, and Causal Attributions, 
Diane N. Ruble (Department of Psychology, Princeton University, Green Hall, Prince- 
ton, New Jersey 08540) and Nina S. Feldman. 


(Contents continued) 


Noncontingent Warmth Induction, Cognitive Style, and Children’s Imitative Donation 
and Rescue Effort Behaviors/Carol S. Weissbrod .........0200000c0cceesveuee 
Guilt Beyond a Reasonable Doubt: Effects of Concept Definition and Assigned Deci- 
sion Rule on the Judgments of Mock Jurors/Norbert L. Kerr, Robert S. Atkin, 
Garold Stasser, David Meek, Robert W. Holt, and James H. Davis ............ 
Differential Tests of Two Cognitive Theories of Performance: Weiner Versus Kukla/ 
R; Michael: Latta- oa i isc steteatone alee peed be ease tom ane each elie ee 


Coping Strategies and Attentional Mechanisms in Self-Imposed and Externally Im- 
posed Delay Situations/Dale T. Miller and Rachel Karniol ...... wie in ieee die kis 


WMaxuscripts x Accented Fon PuUDHCAUDN. or ae oau afm eaan endear ete eee sterner eterna reer 


274 


282 


... like a preliminary map of 
a recently explored territory 


The Psychology of 
ADULT DEVELOPMENT 
and AGING 


surveys what is known 
and outlines the unexplored regions 
of gerontological psychology 


A major product of the American Psychological Association's 
Task Force on Aging, this rich guide to gerontological research 
summarizes findings of outstanding leaders in. the field and 
provides a fruitful source of testable hypotheses for investiga- 
tors in almost every psychological specialty. Editors Carl 
Eisdorfer and M. Powell Lawton cochaired the APA task force, 
which was organized in response to a request for representation 
at the White House Conference on Aging (held November 1971). 


The Psychology of Adult Development and Aging amply re- 
affirms the realization that chronological age is a poor indicator 
of an individual's cognitive and emotional characteristics. 
Emphasis is on identification of the contribution of other vari- 
ables, demonstrating experimental, developmental, and clinical 
approaches. Topical coverage is unusually broad, ranging from 
animal and psychophysiological research to problems of 
education, housing, and transportation. (718 pages: $11) 


To obtain copies, write to the Order Department, American 
Psychological Association, 1200 17th St., N.W., Washington, 
D. C. 20036. Prepayment must accompany all orders amount- 
ing to less than $15. 


D 


Journal of Personality 


» and Soci eN 


JLUME 34, NU oh me G SEPTEMBER 1976 
SS As 
ye pE 


| ’ ind Social Class as Determinants of Future Orientation (Time Perspective) in 
Adolescents/Helmut Lamm, Rolf W. Schmidt, and Gisela Trommsdorf ........ 317 

> \ The Effects of Prior Reward and Punishment on Subsequent Reward and Punishment: 
N Guilt Versus Consistency/John P. Keating and Timothy C. Brock ............. 327 

1 Boys Meet Girls: Patterns of Interaction and Attribution in Heterosexual Attraction/ 
j John DCU ERGIE TT cares acs oe ioen npn onana inean see Ea A e 334 
The Generosity Shift/J. T. Muehleman, Charles Bruker, and Clara M. Ingram ...... 344 


Disinhibition Versus the Cathartic Effect: Artifact and Substance/Vladimir J. Konečni 
and Ebbe B. Ebbesen o. issss nosire ld att anA aa pate Nea eaa Sala T Aira AAR 352 


Í. Basking in Reflected Glory: Three (Football) Field Studies/Robert B. Cialdini, 
Richard J. Borden, Avril Thorne, Marcus Randall Walker, Stephen Freeman, and 
Lloyd Reynolds Sloan. ; s sostaner odd a ck 9 N NS ALAA E eS 366 


Awareness and Attitude Change in the Forced-Compliance Paradigm: The Importance 


of When/D. R. Wixon and James D. Laird 376 
\ 
1 Duration of the Effect of Good Mood on Helping: “Footprints on the Sands of Time”/ 
y Alice M. Isen, Margaret Clark, and Mark F. Schwartz .......+.+-00sevusesaees 385 
j Test of Attentional and Interpersonal Style/Robert M. Nideffer .........-+..+++++ “394 
Attributional Antecedents of Verbal and Vocal Assertiveness/Daphne Blunt Bugental, 
] Barbara Henker, and Carol K. Whalen is .a sasso sneesuetnea notae nt tian in ai 405 
Some Effects of Congruency of Member Characteristics and Group Structure upon 
Group Behavior/Marvin E. Shaw and Blaze Harkey .........0000+veveseeeees 412 
Comparative Effects of the Reward Stimulus and Its Cognitive Representation in Vol- 
‘untary Delay/Bert Moore, Walter Mischel, and Antonette Zeiss .....-.+++++++ 419 


(Continued on inside back cover) 


A, PUBLISHED MONTHLY BY THE AMERICAN PSYCHOLOGICAL ASSOCIATION 


PAPERBACKS 


SIMON AND SCHUSTER 
630 FIFTH AVENUE 
NEW YORK, N. Y. 10020 


ZR AR AR AR 
AN AN AR AN 
AN AR AR AN 
AR AR AN AN 
AR AR AR AN 


AR AR AN AR 


AR AR AN AN 


AN AR AN AN 


AR AN AN AN 


AR AR AR AR 
For complimentary examination copies 
and catalogs, write on your departmental 
letterhead to School and Library Ser- 
vices, address above, Include details on 
course in which you are considering using 


title requested, and advise us where you 
have seen announcement 


| 


AR AR AR AR 


aE a T 


Lifton, Robert Jay 
DEATH IN LIFE: Survivors of Hiroshima 
Winner of the National Book Award 


“The primary thrust of Lifton’s book comes from his study of 
hibakusha, the tainted survivors of Hiroshima. . .. Death in 
Life is a magnificent book on several grounds. . . Its implica- 
tions for future psychosocial research are also significant... . 
His literary skill and his psychological acumen blend harmoni- 
ously, as he formulates what is clearly beyond formulation.”— 
Avery Weisman, M.D., Psychiatry and Social Science Review. 
“This overwhelmingly compelling book is the work of a man 
with great capacity for introspection, for empathy and vi- 
cariousness, and for comparative analysis.”—David Riesman, 
Dissent, “This work is a major conttibution towards the meta- 
psychology of traumatology . . . the work is well thought out 
and provocative . . Psychoanalytic Quarterly. “. . . it is the 
depth, intensity, and a kind of awful and agonised confusion, 
of the experience which Dr. Lifton describes which give ‘his 
book its distinction . . . perhaps the greatest value of Death in 
Life is that it is not only the hibakusha’s experience which it 
helps us to understand; it is our own.”"—Encounter, “This is an 
impressive work of awful beauty written. with compassion and 
scholarly grace, It has great meaning for 20th-century man. 
Recommended reading for every college student.'"—Choice. 
(0 22276 $5.95 


Lifton, Robert Jay 
BOUNDARIES 
Psychological Man in Revolution 


The extremity of recent experience has shattered our tradi- 
tional boundaries, most especially “the boundary between the 
living and the dead that man has struggled so desperately to 
maintain,” The sense of immortality that we have maintained 
over the centuries — through our children, our belief in an 
afterlife, our lasting works, our ties to nature — is broken. 
“... Lifton has also raised some challenging questions about 
the nature of man’s life and how he responds to it in contem- 
porary society, . . . Lifton has very accurately described a 
number of the problems of modern society . . .”—Harvard 
Crimson. “Brilliant and concise, his book is also profoundly 
religious.”—The Nation, “Right on target.”—Christian Century. 
“... Boundaries seems most readable and provocative,’— 
Library Journal. 

022277 $2.95 


Lifton, Robert Jay 
THE LIFE OF THE SELF 
Toward a New Psychology 


“Contemplating his past work in psychohistory, he discerns 
a theme of death and the continuity of life, This work presents 
Lifton’s personal examination of that theme and his attempt to 
develop an understanding of history and individual suffering 
through the central image of the ‘survivor’. Referring to his 
previous books on the Hiroshima survivors and on Vietnam 
war veterans, and structuring his argument on comparisons 
with the psychological models of Freud and Erikson, Lifton 
outlines his concept of ‘psychic numbing’ as the nuclear age’s 
gift to our emotional lives. The author looks to Camus and 
Vonnegut for a sense of the disintegration the survivor must 
combat. He sees in their works and in the current search for 
alternative lifestyles an attempt to find an understandable 
form of existence, His thesis is persuasive and disturbing. . . . 
Recommended for philosophical and theoretical psychology 
collections.’—Library Journal. G} 
022425 $2.95 


Newman, Ruth G. 

GROUPS IN SCHOOLS 

A Book About Teachers, P. i 

A BookAnourTea i arents and Children 
Lifton, Robert Jay 

HOME FROM THE WAR: Transformations of 


Vietnam Veterans 
021727 $395 


( 


Volume 34 


Journal of eas. nee 


Editor: 
Associate Editors: 


Irwin Altman 

University of Utak 
Norman H. Anderson 

University of California, San Diego 
Albert Bandura 

Stanford University 
Daryl J. Bem 

Stanford University 
Seymour M. Berger 

University of Massachusetts 
John C, Brigham 

Florida State University 
Urie Bronfenbrenner 

Cornell University 
J. Merrill Carlsmith 

Stanford University 
Isidor Chein 

New York University 
Robert B. Cialdini 

Arizona State University 
Thomas Cook 

Northwestern University 
Walter H, Crockett 

University of Kansas 
James E. Dittes 

Yale University 
Norman T. Feather 

Flinders University oj South 

Australia, Bedford Park, Australia 

Kenneth Gergen 

Swarthmore College 
Alan Gross > x 

University of Missouri 
William Haythorn 

Florida State University 


Vernon Allen 
University of Wisconsin—Madison 


Jack Brehm 
Duke University 


Norman Cliff 
University of Southern California 


Ebbe B. Ebbesen 
University of Cop foreets San Diego 


William Griffitt 
Kansas State University 


Charles Gruder F 
University of Illinois at 
Chicago Circle 


Samuel Himmelfarb 
Tel Aviv University 


September 1976 


John T. Lanzetta, Dartmouth College see 
Alice H. Eagly, University of A E Annan 
Anthony G. Greenwald, Ohio State University 

Albert Pepitone, University of Pennsylvania 

William Scott, James Cook University, Townsville, Australia 


Consulting Editors 


Clyde Hendrick 
Kent State University 


Louise Kidder 

Temple University 
Robert E. Kleck 

Dartmouth College 
Milton E. Lipetz 

University of Colorado 
Clifford E. Lunneborg 

University of Washington 
R. Clark McCaule; 

Bryn Mawr College 
Albert Mehrabian 

University of California, Berkeley 
Gerald A. Mendelsohn 

University of California, Berkeley 
David Messick 

University of California, 

Santa Barbara 

Judson Mills 

University of Maryland 
Herbert L. Mirels 

Ohio State University 
William N. Mortis 

Dartmouth College 
Richard Nisbett 

University of Michigan, Ann Arbor 
Jane A. Piliavin 

University of Wisconsin—Madison 
Dean Pruitt 

State University of New York 

at Buffalo 


Issue Consultants 
B. 


nt Houston, 
University oj Kansas 
Paul D, Isaac 

Ohio State University 


George Levinger 
University of Massachusetts— 
Amherst 


Clara Mayo 
Boston University 
Clark McCauley 
Bryn Mawr College 


Robert McPeek 
Ohio State University 


Martin T. Orne 
University of Pennsylvania 


Milton E. Rosenbaum 
University of Iowa 
David J. Schneider 
Brandeis University 
Marvin Shaw 
University of Florida 
Lawrence Stricker 
Educational Testing Service, 
Princeton, New Jersey 
Loya H. Strickland 
‘arleton University, Ottawa, Canada 
Fred L. Strodtbeck 
University of Chicago 
Peter Suedfeld 
University of British Columbia, 
Vancouver, Canada 
James T, Tedeschi 
State University of New York, 
Albany 
Harry Upshaw 
University of Illinois at Chicago 
Circle 
Elaine Walster 
University of Wisconsin—Madison 
Ladd Wheeler 
University of Rochester 
Robert S. Wyer, Jr. 
University of Illinois at 
Urbana-Champaign 
Robert Ziller 
University of Florida 
Philip Zimbardo 
Stanford University 


Harold Raush 
University of Massachusetts— 
Amherst 


John W. Reich 
Arisona State University 


Stephen Reisman 
University of Massachusetts— 
Amherst 


Sidney Rosen 
University of Georgia 


Harold Sigall 
University oj Maryland 


James Uleman 
New York University 


THE AMERICAN PSYCHOLOGICAL ASSOCIATION, Inc. 


Copyright © 1976 by the American Psychological Association, Inc, 


and Social Psychology publishes original research reports in the arcas 
ality dynamics. The Journal includes articles on the following topi 
itude change; social neat pons ee ang aonyerpal comm 

a R, behavior; person perception; conformity and deviation; and perso: 
Pees sca ayvanics, Methodological articles, studies primarily concerned with the develo; 
of measuring instruments, replications, and reports of negative results or failures to replicate 
lished work are acceptable if they are judged to make a substantial contribution to knowledg: 
priority is given to papers concerned with personality assessment or the development and va! 
of assessment instruments. Manuscripts in these latter categories should be brief, not more th 
typewritten pages, The Journal does not include studies of abnormal processes or pop 
Manuscripts submitted after April 1, 1976, should conform to the policies of the Editor-: 
stated in his editorial in the January 1976 issue. 


Journal of Personality 
“social psychology and personality 
social motivation; attitudes and atti 


Manuscripts: Submit manuscripts in quadruplicate, prepared in accordance with the P» 
Manual of the American Psychological Association (2nd ed.) and the instructions st 
Editor-elect’s editorial in the January 1976 issue of the Journal (Vol. 33, pp. 1-7). 
be highly legible. Only the original will be returned to the author. Mail manuscripts to th: 

elect, Anthony G. Greenwald, Ohio State University, 404C West 17th Avenue, Colum! ) 
43210. Required supplementary information: A submitted manuscript will not be reviewed k 
lication without supplementary information in the following categories: (a) a cover letter 

dicates the relation (or lack of any relation) of the submitted manuscript to other currer ) 
mitted, in-press, or recently published papers of the same research program; (b) one copy h 
such currently submitted, in-press, or recently published paper; (c) separately from any 1 n 
the manuscript, one copy of a statement of the research procedures in sufficient detail to AS 
a guide to conducting an exact replication; and (d) also separately from any report in t! 1 
script, one copy of the summary tables (e.g., correlation tables, analysis of variance summa or 
the major analyses reported in the manuscript. This supplementary information is solely for se 
si the parton in determining comprehensiveness of the manuscript, and it will not be retu o 
the author, 


ipii l 


After a manuscript is accepted for publication, authors will be expected to provide assura ia 
their raw data are available for reanalyses by other investigators and that their research pr 5 
were consistent with the American Psychological Association’s Ethical Principles in the Ce af 
Research with Human Participants (1973), except insofar as may be detailed in the manuscri 


Copyright and Permissions: All rights reserved. Written permission must be obtained from the Americ O- 
logical Association for copying or reprinting text of more than 500 words, tables, or figures, Permissions « 
granted contingent upon like permission of the author, inclusion of the APA copyright notice on the first pax 
duced material, and payment of a fee of $10 per page of APA material. Permission fees are waived for author h 
to reproduce their own articles, Address requests for permission to the Permissions Office, American Ps; al 
Association, 1200 Seventeenth Street, N.W., Washington, D.C. 20036, 


Subscriptions: Subscriptions are available on a calendar year basis onl: y 
a k s y (January through December), N er 
rates for 1976: $60 domestic, $64 foreign, $6 single issue. APA member rate: $25, Write to Subscription Se 


Reprints: Authors may order reprints of their articles from the printer when they receive galley proofs. 


ie Baos RG and Back Volumes: For information regarding back issues or back volumes write to Orde t- 
, APA, 


Microform Editions: For information regarding microform editions write to any of the following: Johnson A 8, 


Inc., P.O, Box 1017, Greenwich, Connect O H H i i 
Mlctoflns, PHASEN NES TEAN hese 16830; University Microfilms, Ann Arbor, Michigan 48106; or P. on 


and a recent mailing label to the attention of ti 
the Subscription 

3 ne address. APA will not replace Faced eoii esulting fror 

hanges; journals will be forwarded only if subscribers notify the Tai Er i in Slag. in i oe 


Published monthly in two volumes per year, six į 
1200 Seventeenth Street, N.W., Wantleaton, DIC AoE oe a in 
ton, D.C., and at additional mailing offices, ae 


by the American Paychological Associati 
the U.S.A. Second-class Postage paid at Wa: 


Inc, 


Journal Staff 


Anita DeVivo, Executive Editor 
Kendall C. Patterson, Production Supervisor 


Rita Jaskowitz and -Dinah Sunday, 
Production Editors 


Robert J. Hayward, Advertising Representative 
Anne Redman, Subscription Manager 


Jå 


Journal of Personality and Soci 
1976, Vol. 34, No. A gt Seat pit as 


Sex and Social Class as Determinants of Future Orientation 


(Time Perspective) in Adolescents 


Helmut Lamm, Rolf W. Schmidt, and Gisela Trommsdorft 
Universität Mannheim, West Germany 


Hypotheses concerning the effects of sex and class on future orientation were 
tested on 100 boys and girls between the ages of 14 and 16, Future orientation 
was measured in various domains of life, constituting either private or public 
areas of concern (e.g, family, environment), on the dimensions of density 
(number of hopes and fears voiced by the subject), extension, and optimism- 
pessimism. As predicted, middle-class adolescents, as compared with lower- 
class adolescents, voiced more hopes and fears relating to public life and fewer 
in the private sphere, and they manifested a more extended future orientation 
in the private as well as the public spheres. They also judged the distant future 
more optimistically in two out of three public areas of concern. The lower-class 
girls were more optimistic than the middle-class girls in the occupational 
domain, As predicted, the lower-class boys voiced more hopes and fears in the 
occupational domain and manifested a more extended future orientation than 
the lower-class girls. On the other hand, the girls of both classes voiced more 
hopes and fears in the private sphere than the boys of either class. The results 


are interpreted through theories of socialization and role behavior, 


Time perspective may be defined as an indi- 
vidual’s cognitions concerning the content, 
placement, and unfolding of relevant events 
in his past, present, or future. Prior research 
has attempted to establish relationships be- 
tween time perspective (especially as directed 
toward the future) and societal type (tradi- 
tional vs. modern: Kluckhohn & Strodtbeck, 
1961; Ménks, 1967), social subgroups (¢.g., 
deviance: Barndt & Johnson, 1955; Jessor, 
Graves, Hanson, & Jessor, 1968) and person- 
ality characteristics (Platt, Eisenmann, De- 
lisser, & Darbes, 1971; cf. also Doob, 1971). 

In the body of literature just mentioned, a 
frequent assumption has been that social and 


This research was carried out at the Sonderfor- 
schungsbereich 24 (Sozial- und wirtschaftspsycho- 
logische Entscheidungsforschung, Universität Mann- 
heim) with funds from the Deutsche Forschungs- 
gemeinschaft. The data were collected and in part 
analyzed by Henning Eckel, who reported about 
them in his diplom (master’s) thesis at the Univer- 
sität Mannheim. The three authors have contributed 
equally to this article. We thank Henning Eckel for 
his collaboration and his discussion contributions in 
the earlier phases of this study. Further, we are 
grateful to W. A. Scott and to an anonymous re- 
viewer for their helpful comments. 

Requests for reprints should be sent to Helmut 
Lamm, SFB 24, Universität, 68 Mannheim, West 
Germany. 


cultural conditions affect a person’s time per- 
spective as well as his social behavior. In par- 
ticular, future time perspective has been seen 
as a cause or correlate of the tendency or 
capacity to delay rewards (cf. Mischel, 1973) 
and of achievement motivation (cf. Heck- 
hausen, 1967). The capacity to delay rewards 
in turn has been considered a precondition 
for social success and upward mobility, a 
view that has not remained unchallenged (cf. 
Kasakos, 1971; Nowotny, 1975). 

In the present authors’ view, an important 
though indirect contribution to the discussion 
about the “functional” meaning of future 
time perspective (e.g., its relevance for suc- 
cess in society) would be to investigate future 


time perspective empirically by taking into - 
account a number of relevant psychological — 


aspects or dimensions, and in particular, by 
including an affective component (ie, the 
evaluative-emotional meaning an individual’s 
future has for him). Thus this article focuses 
on some of the-dimensions of future orienta- 
tion and how these are influenced by sex and 
social class, that is, by special social roles 
that have become effective as a result of cer- 
tain socialization experiences. 7 
Future orientation here refers to an indi- 
vidual’s attitude—in pårticular, his cognitions 
and feelings—toward his future. The nar- 


317 


318 

rower term future time perspective refers to 
the cognitive component (e.g., density and ex- 
tension, described below) of future orienta- 
tion, which is in line with its use in the rele- 
vant research literature. For the most part, 
the present article uses the term future time 
perspective when referring to literature in 
which this term was used. 

This study includes the following aspects 
of future orientation (as dependent varia- 
bles): (a) content-structural (density), (b) 
temporal-structural (extension) » and (c) af- 
fective (optimism-pessimism). 

The term density refers to the number 
of events whose occurrence a person hopes or 
fears in various domains of life? By exten- 
sion we mean the length of the future time 
span being envisaged by a person. It is op- 
erationalized as that event which the subject 
places farthest in the future, This definition 
is identical with that of Wallace (1956, pp. 
240-241). 

Optimism is defined as the positive differ- 
ence between the evaluation of the future and 
the evaluation of the present, a negative dif- 
ference (pessimism) indicating a pessimistic 
view of the future. This definition is in line 
with that of Teahan (1958, p. 38). 

An additional dependent variable, termed 
internal—external control, is defined as a per- 
son’s belief as to whether events are con- 
trolled by himself or by external forces, 

A more detailed analysis of the concept of 
future orientation and its various dimensions 
can be found in Trommsdorff and Lamm 
(1975). 

One of the first investigations of future time 
perspective has already dealt with its depen- 
dence upon social class as a consequence of 
differential socialization. LeShan (1952) in- 
vestigated the effects of class-related parental 
reward and punishment methods, concluding 
that children from the lower class are more 
present oriented and children from the middle 
class are more future oriented, LeShan asked 

“middle- and working-class children to invent 
Stories and found that the stories of middle- 
Class children covered a longer period of time 
and extended more into the future than did 
those of working-class children, There have 
_ numerous challenges of this investiga- 


A 
É 
X 


LAMM, SCHMIDT, AND TROMMSDORFF 


tion, and aspects of both its content and its 
method have been criticized. 

Greene and Roberts (1961) faulted Le- 
Shan’s (1952) statistical procedure and came 
to the conclusion that his results were not 
clear. Kendall and Sibley’s (1970) analysis 
elicited the fact that the length of the sto 
that the children in LeShan’s study had : 
write down had an influence on the temporal 
extension of those stories: The lo ger the 
story, the greater was the extension the 
future time perspective. Those authors con- 
cluded that the relationship between social 
class and extension of future time perspective 
in LeShan’s study was artificial, Ellis, Ellis, 
Mandel, Schaeffer, Sommer, and Sommer 
(1955) and Judson and Tuttle (1966) were 
not able to replicate LeShan’s findings despite 
improved methods of statistical analysis, In 
contrast, Graves (cited in Lessing, 1968) re- 
Ported a difference between the two social 
classes with regard to the extension of futu 
time perspective, which is in line with X 
Shan’s (1952) findings. 

There are undoubtedly many re. 
the lack of clear-cut evidence on the 
social class. One of them may be the 
assumption that the structure of the 
al’s future orientation (and future time t 
spective) is the same in all domains of | 
However, Frank (1939) has already pointed 
out that a person can develop multiple ori 
tations toward the future that ar: 
each case toward various aspects of hi 
vate or public life, a point that has been 
widely disregarded in research up until now. 
One important advance that we hoped to ai 
complish through the present research was to 
obtain support for our assumption that future 
orientation consists of various dimensions, 
each of which has to be measured separately. 
These various dimensions—density, extension, 
and optimism—as well as the additional mea- 
Sure, perceived control, may be affected in 
different ways by the social attributes under 
investigation. 


1 See Kastenbaum (1961), who limited his defini- 
tion of density to the quantitative aspect. 

? Lessing (1971) recently tried to distinguish be- 
tween private and public future orientation, 
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Social Class and Future Orientation 


We assume that future orientation is de- 
termined by past and present experiences. Ac- 
cordingly, the structure and the content of 
future orientation must be dependent upon 
the frequency (quantity) and structure of 
past and present relationships of interaction 
between the individual and his environment, 
In the lower class—more so than in the mid- 
dle class—these relationships are character- 
ized by many factors that together could be 
described as a syndrome of family orienta- 
tion: nonmembership in secondary groups and 
organizations, helplessness and uncertainty in 
contact with local authorities and social insti- 
tutions, and apathy in connection with events 
and developments in the public domain. A 
person of the lower class looks for and finds 
protection in the particularized solidarity of 
primary relationships within the family 
against a universalistically organized outside 
world (Oevermann, 1969; Popitz, Bahrdt, 
Jiires, & Kesting, 1967). The fact of a fam- 
ily-centered value orientation within the lower 
class (Neidhardt, 1968; Rosen, 1956) can be 
seen as a correlate of this social withdrawal. 

On the basis of these observations, it can be 
assumed that the members of the lower class 
primarily focus on private concerns in their 
future orientation and that persons of the 
middle class turn comparatively more of their 
attention to public concerns. Thus, we pro- 
pose the following hypothesis: 


Hypothesis 1: In comparison with middle- 
class adolescents, more of the lower-class ado- 
lescents’ hopes and fears are concerned with 
private matters, 


In his well-known study of the “achieve- 
ment syndrome” Rosen (1956) showed that 
lower-class children have a passive, family- 
oriented, and present-oriented value system. 
In contrast, children of the upper classes were 
described as being active, individualistic, and 
future oriented, all of which are favorable pre- 
requisites for the development of greater 
achievement motivation. 


A large number of other findings concern- 


ing class-dependent socialization are in ac- 
cord with Rosen’s results. Pearlin and Kohn 
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(1966) showed that in comparison with the 
child-rearing practices of the lower class, 
those of the middle class are aimed more 
toward strengthening the self-confidence and 
self-responsibility of the child. In addition, 
with regard to the development of cognitive 
style (Witkin, Dyk, Faterson, Goodenough, & 
Karp, 1962) and cognitive abilities (Hess & 
Shipman, 1965) as well as language behavior 
(Bernstein, 1959; Bock, 1972; Oevermann, 
1972), the socialization milieu in middle- 
class families is relatively better suited for 
developing abilities and promoting goal striv- 
ing. These class-related behavioral differences 
are reinforced even further by secondary 
socialization in the school (Deutsch, 1965), 

The above observations suggest that sociali- 
zation practices (and learning experiences) in 
middle-class, more so than in lower-class, 
milieus are calibrated toward long-term goal 
setting (cf. Heckhausen, 1967), a stronger 
sense of control over events (cf. Lefcourt, 
1972, p. 26), and greater optimism as to the 
realization of one’s goals. 

Thus we propose the following hypotheses: 


Hypothesis 2: In general, middle-class ado- 
lescents have a more widely extended future 
orientation (i.e., in the private as well as the 
public domain) than do lower-class adoles- 
cents. (This represents an attempt, with im- 
proved procedures, to replicate the findings of 
LeShan, 1952, and others cited.) 

Hypothesis 3: Middle-class adolescents are 
more prone to believe that the realization of 
their hopes and fears depends on themselves 
(instead of external factors) than are lower- 
class adolescents. 

Hypothesis 4: Middle-class adolescents are 
more optimistic about their future than are 
lower-class adolescents. 


x wT 
Sex and Future Orientation 


The lower the social status of a family, the 
more the partner relationships are oriented 
toward the prevailing sex role stereotypes 
(Lehr, 1972; Mollenhauer, 1969). The man 
is regarded as the “instrumental leader” of the 
family (Parsons, 1964) in the occupational 
and public domains, whereas the woman is 
seen as competent in the role of housewife and 
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child rearer (Myrdal & Klein, 1956). This 
division of roles and abilities affects child 
rearing: The child is subjected to sex-specific 
role expectations and reinforcements (cf. 
Maccoby & Jacklin, 1974). (The latter do not 
necessarily remain stable over the entire life 
span, depending on learning experiences fol- 
lowing childhood, cf. Ahammer, 1973; the 
success of the Women’s Movement makes 
this abundantly clear.) 

Lehr (1968, 1969) reported that the career 
takes on a central position in (German) men’s 
autobiographies, whereas women—even those 
with an occupation—mainly emphasize the 
events and problems of their private lives in 
their life portrayals. This corresponds with 
the observation that more girls than boys 
renounce or give up an occupational education 

favor of an early marriage (Lessing, 1968; 
\ivrdal & Klein, 1956). Thus a further hy- 

esis of our study may be stated as fol- 


livpothesis 5: (a) Female adolescents 
nanifest fewer occupational but more family- 
related hopes and fears than do male ado- 
lescents. (b) Female adolescents have a more 
extended future orientation in the family do- 
main and a shorter one in the occupational 
domain, 


With hi social status of parents, the 


alization patterns referred to 


sex-specific sac 

above are less prevalent (see Lehr, 1972; 

Mollenhauer, 1969). Therefore, Hypothesis 5 

is confined | wer-class adolescents, 
{eTHOD 

Sample 


und 50 female adolescents 
between the ages id 16 who were selected 
from either of two s of German schools—a 
“Hauptschule” or a iere Schule’—in a West 
German industrial city ‘he former kind of school 
is the direct extension o elementary school: Pupils 
go through nine “classes”; after graduation the stu- 
dents obtain a full-time job or an apprenticeship. 
The latter kind of school corresponds to the United 
States high school, preparing pupils for a univer- 
sity.) Subjects participated in response to adver- 
tisements posted in the schools, in which § Deutsche 
Marks were promised per hour of participation, The 
lower-class sample (half of the subjects) consisted 
exclusively of Hauptschule students, where the head 
of family had solely a Hauptschule education. The 


Subjects were 


a 
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middle-class sample consisted of high-school students 
where the head of family had completed high school 
or university. 


Design 


The first two independent variables were sex and 
social class, with two levels each, yielding four cells 
of 25 subjects. The two-fold measure of optimism- 
pessimism (difference between present and future) 
provided a third (within-subjects) factor (near vs. 
distant future), with two levels (in 5 years, in 20 
years). The factors were all “fixed”; within-subjects 
measurement existed in the case of near versus dis- 
tant future, 


Dependent Variables 


The dependent variables were measured through 
questionnaires that the subjects had to complete in 
group sessions of about three persons each. 

Density. On the first page of the questionnaire the 
subjects listed their hopes, and on the second page, 
their fears. In each case the answering period was 
limited to 10 minutes. These raw data were cate- 
gorized with the help of Cantril’s (1965) scheme 
(agreement in 86% of the cases) by two independent 
judges. The original 23 categories of Cantril’s scheme 
were reduced to 3 major categories in the private 
sector (family, occupation, personal development) 
and 3 in the public sector (economy, politics, en- 
vironment). The number of listed hopes and fears in 
the 6 areas, divided by the total number of hopes 
and fears, served as the index of density, These 
proportions, since they were not normally distributed, 
were transformed with arc sin Vx to permit an 
analysis of variance (Winer, 1962, p. 221), 

Extension. As a further assignment, each subject 
had to indicate for each of his hopes and fears how 
old he thought he would be at the time of the 
materialization of the corresponding hope or fear, 
The maximal difference between the subject’s per- 
ceived future age and his current age for each given 
category (public and private domains and their sub- 
divisions) was then used as the index of extension, 
These scores were transformed with log x (to yield 
normal distributions) to justify analyses of variance. 

Internal versus external control. Next, each sub- 
ject indicated, for each of his hopes and fears, 
whether he thought its fulfillment depended more 
on himself or on external factors. For this, four 
answer categories were provided: (a) “depends on 
me only,” (b) “depends more on me than on chance 
and external circumstances,” (c) “depends more on 
chance and external circumstances than on me,” and 
(d) “depends on chance and external circumstances 
only”. The proportion of hopes and fears that a 
subject categorized as internally controlled (Cate- 
gories a or b), relative to the whole number of 
hopes and fears, was used as his score for data analy- 
sis. This score was calculated for both the private 
and the public areas as well as for the total number 
of hopes and fears. Because of a lack of normal dis- 
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tribution, these scores were transformed with arc sin 
Vx to permit analyses of variance. 
Optimism—pessimism, On the last two pages of the 
questionnaire the subject had to list, for each of 10 
private and 10 public areas of concern, his position 
“at the present,” “in 5 years” (near future), and in 
“20 years” (distant future). The positive extreme 


-of the 11-point scale was 10 (best conceivable situ- 


ation), its negative extreme 0 (worst conceivable 
situation). This was an adaptation of the aspiration 
ladder developed by Cantril (1965). 

For the purposes of data analysis, the 20 items 
were condensed into three categories of the private 
domain (labelled family, occupation, and personal 
development) and three public categories (econom- 
ics, politics, and environment), which in terms of 
content were designed to correspond roughly to the 
six previously mentioned areas of the density and 
extension variables. 

The 10 “private” items were as follows: “my per- 
sonal relations within my family,” “my sexual rela- 
tions,” “my interpersonal contacts” (subdomain fam- 
ily and interpersonal relations); “my health,” “my 
independence,” “my inner satisfaction,” “the realiza- 
tion of my real abilities” (subdomain personal de- 
velopment and self-actualization); “my financial 
situation,” “my attainments in school or occupation,” 
and “free time to practice interests not related to 
school or occupation” (subdomain occupation). 

The 10 “public” items were as follows (all relating 
to West Germany, unless otherwise noted): “eco- 
nomic capacity” (subdomain economics); environ- 
mental pollution (subdomain environment); “rela- 
tions with Eastern European countries (including 
East Germany),” “equal chances of education and 
advancement for all Germans,” “criminality,” “inte- 
gration of Western European countries and peoples,” 
“freedom of expression of opinion,” “peace in world,” 
“mutual helping and considerateness,” and “toler- 
ance and acceptance of minorities (e.g., foreign 
workers, homosexuals)” (subdomain political and 
social concerns). 

In each of the subdomains and in the two main 
groups, which is equivalent to the sum of all 10 
private or public items, the mean difference between 
the future (near or distant) and the present was 
calculated. Positive differences indicated an optimistic 
judgment, negative differences, a pessimistic one. 
The distribution of these data was normal, thus 
permitting 2X2X2 (Sex X Class X Near Versus 
Distant Future) analyses of variance. Near versus 
distant future constituted a within-subjects factor. 


RESULTS 


Tables 1 and 2 contain the means and 
standard deviations for the dependent varia- 
bles and the significant results of the analy- 
ses of variance. In the case of significant in- 
teraction effects, simple-effects F values were 
calculated (Winer, 1962, p. 174 f. and p. 
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340). The degrees of freedom for all listed F 
values are 1 and 96. 


Results For Class 


Density, Hypothesis 1 was confirmed: The 
middle-class adolescents listed more (i.e., 
greater proportions of) public (p< .005) 
and fewer private (p < .005) hopes and fears 
than did the lower-class adolescents. Further- 
more, lower-class subjects listed more con- 
cerns in the occupational sphere than did 
middle-class adolescents (p < .05). It should 
be remembered that this refers to relative 
measures of public and private concerns (i.e., 
the proportion of concerns, relative to total 
concerns, voiced in the respective categories). 

Extension. In addition, hypothesis 2 was 
confirmed: Middle-class subjects had a more 
extended future orientation than did the 
lower-class subjects in the private (p < .005) 
as well as in the public category (p < .005). 

Internal versus external control. Hypothe- 
sis 3 was confirmed only in the public cate- 
gory, with greater internal control indicated 
by middle-class ‘persons than by lower-class 
persons (p < .005). 

Optimism—pessimism. Hypothesis 4, as 
originally stated, was on the whole not con- 
firmed by the data. Instead, a more differen- 
tiated statement has to be made. Middle-class 
adolescents judged the distant future more 
optimistically in the public subdomains of 
politics (simple-effects F = 34.41; p < .005) 
and environment (simple-effects F = 12.32; 
p < .005) than did lower-class adolescents. 

With regard to the private sphere, there 
was a significant (p< .01) Sex X Social 
Class interaction in the occupational domain: 
For this category lower-class girls judged the 
future more optimistically than did middle 
class girls (simple-effects F = 8.79; p< 
.005); this was not the case for boys. 

An additional finding that may be noted 
here is that both class groups face the distant 
future with greater optimism than the near 
future (p < .005, in the private as well as 
public domains). 


3 The 10 private items and the 10 public items cor- 
respond to the future-related concerns that were 
most often voiced in surveys by Lamm (Note 1) and 
Trommsdorff (Note 2). 
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Results for Sex 


Density, Hypothesis 5a was confirmed in 
the occupational subdomain: Males voiced 
more hopes and fears than did females (p < 
005; see Table 1). The female adolescents 
of both classes voiced more hopes and fears 
in the private domain than did the male ado- 
lescents of both classes (p < .005). Surpris- 
ingly, the females—independently of class 
level—voiced more hopes and fears in the 
public areas of politics ( < .05) and en- 
vironment (p < .05) than did the males. 

Extension, Hypothesis 5b was confirmed in 
the occupational area: Lower-class boys had 
a more extended future orientation than did 
lower-class girls (simple-effects F = 9.87; p 
< .005). 

Internal versus external control. For the 
total of hopes and fears, males perceived more 
internal control than did females (p < .05). 
This difference was also significant in the pri- 
vate category (p < .05) but not in the public 


category 


ssimism, Sex-related differ- 
g optimism-pessimism were 


ences concernin 
not manifested in any of the life domains, 
except with respect to the effect of social class 
(see Discussion section). 


Discuss1on 
Effect Of Social Class On Future Orientation 


Our hypotheses concerning the effects of 
social class on the three dimensions of future 
orientation—density, extension, and opti- 
mism—pessimism—were confirmed by these 
data: Middle-class adolescents are more long- 
term directed in their future orientation than 
are lower-class adolescents, and this holds for 
hopes and fears related to private as well as 
public concerns.* In general, the middle-class 

is are more intensively concerned 
y ments and processes in the pub- 
lic are the lower-class adolescents 
ane to believe that the ma- 
terial hopes and fears is de- 
pendent oi 

In regai 
class adol 
sector, but 65 
tation is less cxi 
class counterpa 


ire orientation, lower- 
focus on the private 
a their future orien- 
hat of their higher- 
. It may well be that this 


shortness of time perspective is to a consid- 
erable extent responsible for the “restricted 
planning behavior” (Mollenhauer, 1969) of 
the lower class. 

Effects of social class on optimism were 
found in the environmental and political sub- 
domains: Middle-class persons are more op- 
timistic but only in regard to the distant 
future. This lack of confirmation of our 
“total” hypothesis may be explained by the 
relatively young age of the subjects. It is 
likely that the influence of class on optimism- 
pessimism becomes (more) evident at a later 
age. In the case of the age level tested here, 
the background of experience may still be 
relatively similar for the two classes. The 
amount of frustration in connection with goal 
striving is presumably still relatively low so 
that scarcely any negative changes would 
likely be anticipated in the (immediate) fu- 
ture. As they get older, lower-class persons 
will probably be more pervasively hampered 
in their goal-reaching efforts (e.g., because 
of lower incomes) and have more barriers to 
overcome than middle-class persons, with the 
possible consequence of greater pessimism 
and/or a lowering of aspirations. 

A class-related difference concerning opti- 
mism, which holds only for girls, will be con- 
sidered below. 


Influence Of Sex Role On Future Orientation 


With respect to the lower-class adolescents, 
the present findings confirm earlier evidence 
that the training for an occupation and 
further education play a more important role 
in the plans of boys than in the plans of 
girls. This is shown by the fact that more of 
the hopes and fears of boys concern their oc- 
cupation and that the future orientation of 
boys with respect to the occupational area is 
more extended. Considering the middle-class 
adolescents, the occupational future time per- 


4The rank-order or product-moment correlations 
were calculated between the dependent variables of 
density and extension as well as between extension 
and optimism-pessimism, For density versus exten- 
sion, they were significant only in some areas, and 
for extension versus optimism—pessimism, there was 
no correlation within the four groups of subjects. 
The correlation tables are available from the authors. 
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spective of boys is about as extended as that 
of girls; with respect to density, however, boys 
again concentrate more on the occupational 
domain than do girls (main effect of sex role). 

What has not been hypothesized is an in- 
teraction between sex and social class with 
regard to the occupational future: Lower-class 
girls (but not boys) are more optimistic than 
their (same-sex) middle-class counterparts (p 
< 005). It is possible that the latter are 
more realistic in their assessment of the bar- 
riers that stand in the way of a woman’s 
taking up and practicing an occupation, An 
alternative interpretation is that possibly mid- 
dle-class girls have higher aspirations than 
lower-class girls: They might consider occu- 
pations that are more difficult to prepare for, 
enter, and/or succeed in. Here again, the 
greater pessimism of middle-class girls would 
represent a realistic assessment of future pos- 
sibilities.’ 

As noted above, in the occupational domain 
girls voice fewer hopes and fears than boys, 
whereas in the family domain girls voice more 
hopes and fears than boys. Apparently, based 
on the traditional distribution of roles and 
abilities, the family is still the domain in 
which the woman takes (or is compelled to 
take) the leading role. This offers a second 
possible explanation for the comparatively 
pessimistic attitude of middle-class girls 
toward developments in the occupational area: 
On the one hand, they make an effort toward 
pursuing an occupation, while on the other 
hand, they realize that in the end, they will 
still be held responsible for household and 
children. This anticipation of a double load 
may have a negative influence on their as- 
sessment of their career chances. 


Concluding Comments 


The findings reported above are based on 
a (West) German sample. We have inter- 
preted them by means of reference to empiri- 
cal and theoretical analyses from North 
American and German research literature. 
However, caution should be exeréimdr regard- 
ing the crogs-cultural validity of the present 


5 We are indebted to an anonymous reviewer for 
suggesting this possibility. 


findings as well as of the theoretical analyses 
referred to above. The same socioeconomic 
group may be characterized by different typi- 
cal socialization patterns for Society (Coun- 
try) A in comparison with Society (Country) 
B; if these socialization patterns are causal 
to future orientation (or some dimension or 
rrelate of it), then different empirical rela- 
tionships may be expected for thie two socie- 
ties. Thus, even in two countries that are 
relatively close in their cultural outlook such 
as West Germany and the United States, 
there is a fair possibility that the sex-related 
findings of the present study would not hold 
for United States adolescents, given the earlier 
and, presumably, more powerful impact of the 
Women’s Movement in the United States. 
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The present study tested a consistency formulation of the transgression-com- 
pliance effect, In the first series of trials, subjects were directed to either punish 


or reward victims at a specified magni 


itude. In subsequent trials, subjects were 


free to choose the magnitude of reward or punishment they delivered, As was 
hypothesized, regardless of the direction (reward or punish) of prior behavior, 


subsequent behavior was consistent 


with the magnitude of prior behavior. 


Although not hypothesized, it was found that the consistency effect was in- 


tensified when the mode of behavior 


was negative. The findings are discussed 


in terms of guilt, equity, disinhibition of anti- and prosocial behavior, and 


consistency explanations. 


A number of studies have demonstrated 
increased compliance subsequent to transgres- 
sions. While transgression has been opera- 
tionalized in a variety of ways, for example, 
upsetting ordered index cards (Freedman, 
Wallington, & Bless, 1967) or administering 
shock (Carlsmith & Gross, 1969), a basic 
paradigm has been common to these studies: 


The individual is induced to commit behavior which 
ostensibly results in some kind of damage to an- 
other’s property or to another person (the victim), 
then the individual, the transgressor, is requested to 
comply in carrying out an additional task that will 
benefit the victim or someone associated with the 
victim or recommended by him (Brock, 1969, p. 
138). 


Most transgression-compliance studies in- 
terpret the compliance effect as an attempt to 
assuage guilt allegedly experienced by the 
transgressor (Carlsmith & Gross, 1969; Dar- 
lington & Macker, 1966; Freedman et al., 
1967; Wallace & Sadalla, 1966). Freedman 
et al. (1967) viewed the diversity of trans- 
gression manipulations that result in compli- 
ant behavior as buttressing a guilt interpre- 
tation. They concluded that “any alternative 
to a guilt interpretation has the difficult 


The authors wish to thank Thomas M. Ostrom, 
Alan E. Gross, and Debbie Davis for reading the 
first draft of this manuscript. 
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Keating, Department of Psychology NI-25, Univer- 
sity of Washington, Seattle, Washington 98195. 


task of providing a parsimonious explanation 
for compliant behavior resulting from such 
diverse manipulations” (p. 122). 

Walster, Berscheid, and Walster (1973) 
regarded the “guilt” that motivated compli- 
ant behavior in such studies as a symptom of 
distress that individuals experience after in- 
flicting injury. Such distress, symptomatic of 
inequitable relationships, motivates the harm- 
doer to restore equity between himself and 
the victim. The harm-doer can accomplish 
this restoration by either compensating the 
victim for the wrong done or distorting 
reality (e.g, derrogating the victim) to 
reestablish psychological equity in assessing 
the relationship. Increased compliance after 
transgression can be interpreted as an example 
of such restoration of equity. 

While compensation after transgression has 
frequently been reported (Berscheid & Wal- 
ster, 1967; Berscheid, Walster, & Barclay, 
1969; Brock & Becker, 1966; Carlsmith & 
Gross, 1969; Freedman, Wallington, & Bless, 
1967; Walster, Walster, Abrahams, & Brown, 
1966; Walster & Prestholdt, 1966), it ap- 
pears that harm-doers are not as vigilant in 
initially preventing the creation and escala- 
tion of inequitable situations. Studies in 
which subjects use a Buss (1961) “aggres- 
sion machine” to shock confederates have 
commonly reported a main effect of increased 
intensity of delivered shocks over trial blocks 
(e.g., Berkowitz, Lepinsky, & Angulo, 1969; 
Geen & Stone, 1973). Recently Goldstein, 
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Davis, and Herman (1975) tested theoret- 
ical explanations for this effect using a modi- 
fied verbal operant-condition situation. After 
ordering the intensity of both positive and 
negative verbal reinforcement statements on 
a 10-step scale, they found that subjects esca- 
lated the intensity of both types of reinforce- 
ment over four blocks of trials. This result 
was interpreted as demonstrating a disinhibi- 
tion of anti- and prosocial behavior over time. 
Considerations from equity theory suggest 
that such escalation of intensity would be 
avoided in order to maintain an equitable 
relationship and ward off distress that accom- 
panies inequitable situations (Walster et al., 
1973), unless subjects saw the victim as in- 
creasingly more deserving of the reinforcement 
administered. 

Brock (1969) offered an alternative to 
the guilt and equity interpretations of the 
transgression—compliance phenomenon. The 
dynamic construct of his assumption was de- 
rived from cognitive consistency principles. 
The major hypothesis of Brock’s “fate con- 
trol” interpretation is as follows: “An indi- 
vidual who has affected the fate control of 
another person in a certain magnitude will 
repeat that magnitude of control over the 
person (or a similar person in a similar role) 
if an opportunity to do so presents itself” 
(p. 143). Two basic assumptions underlie this 
interpretation: (a) The transgression ma- 
nipulation is sensed by individual subjects as 
their control over the fate of another in a 
clear-cut fashion, and (b) individuals strive 
to behave consistently toward others. Two 
dimensions of the control exercised by sub- 
jects should be identified: the direction of the 
control, which can be positive or negative, 
anti- or prosocial, and the magnitude of con- 
trol, that is, how much help or harm was 
administered. 

fhe {ate control interpretation applies to 
bo o itive and negative behavior. It pre- 
dic hat the magnitude of subsequent be- 
others will be similar to the 


havior towarc 

amount helping or harming, regard- 
less of t! ction of the prior behavior. 
For exam} an individual punishes an- 
other (e.g, ers shock) when given a 


subsequent opportunity to reward the other 
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(e.g., give money), he would do so with an 
intensity similar to the intensity of the initial 
punishment. This sequence of interactions 
would be similar to the transgression—compli- 
ance phenomenon and could be interpreted as 
an attempt to restore equity in the rela- 
tionship as well as an effort to control con- 
sistently the fate of the other. The fate con- 
trol interpretation, however, also predicts that 
if a person initially punishes another and is 
given a second opportunity to punish the 
other again, he would do so with an intensity 
similar to the first punishment. Provided that 
justification or other equity restoration proc- 
esses are curtailed or precluded, equity the- 
ory implies that a person will decrease subse- 
quent punishments in such a situation in 
order to reduce the distress that would be 
experienced from the inequity in it (Walster 
et al., 1973). In contrast with either the dis- 
inhibition or equity interpretations, the fate 
control explanation predicts a stable intensity 
of either anti- or prosocial behavior over time. 

McMillen (1971) tested the fate control 
hypothesis. Using a transgression manipula- 
tion similar to that used by Freedman et al. 
(1967, Experiment 1), he induced subjects to 
lie to the experimenter. Half of the subjects 
were able to use information about which 
they lied in subsequent tests, whereas for the 
remaining half of the subjects, the information 
they lied about contained nothing relevant to 
the subsequent tests. McMillen found that 
subjects in the lie-relevant condition volun- 
teered to help the experimenter in a phone 
campaign significantly more than those in the 
lie-irrelevant condition. Although all of the 
subjects should have felt guilty about the lie, 
only those who affected the “fate” of the 
experimenter by falsifying the experimental 
results through their lie displayed compliant 
behavior, 

The present experiment examined in greater 
detail the fate control or consistency of be- 
havior hypothesis. This experiment was not 
primarily aimed at yet another documenta- 
tion of the well-demonstrated transgression— 
compliance effect but was rather an attempt 
to test competing interpretations of this ef- 
fect. After rewarding a confederate of the 
experimenter with monetary rewards or pun- 
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ishing him with electrical shocks of specified 
magnitudes, subjects were subsequently al- 
lowed to choose the magnitude of reward of 
punishment for the same confederate. The 
general hypothesis of this experiment followed 
directly from the fate control interpretation 
of the transgression—compliance literature: 
Regardless of the direction of previous behay- 
ior, subsequent punishment or reward would 
correspond to the magnitude of the prior 
behavior, 


METHOD 


A 2X2X2 factorial design was used in this ex- 
periment, with initial behavior (reward-punishment), 
level of initial behavior (high-low), and subsequent 
behavior (reward-punishment) as factors. Subjects 
who role played teachers in a learning experiment 
were randomly assigned to administer specific levels 
of either high or low punishment or reward during 
the first series of teaching trials, During a second 
series of trials, subjects were instructed to administer 
either punishment or reward but were free to deter- 
mine the level of punishment and reward they gave. 
The mean amount of reward or punishment deliv- 
ered during the second series of trials constituted the 
dependent measure. 

A pretest was conducted to approximate an equiv- 
alent distance between low-high shock and low-high 
reward. Twenty-eight subjects were administered 10 
shocks in varying sequences from a Hunter Model 
350, Series D shock generator. They were asked to 
indicate a fair price to pay other volunteers for 
receiving each level of shock. From this pretest, in 
correspondence with low reward of $.25, low shock 
for the experimental manipulation was defined at 
Level 3 (65 V ac, .9 mA). High shock, defined as 
Level 8 (105 V ac, 2.2 mA), corresponded with a 
high reward of $.90. 


Procedure 


Subjects. Subjects were 65 male and 31 female 
students of introductory psychology at the Ohio 
State University; 24 subjects were randomly assigned 
to each level of initial behavior. Half of the sub- 
jects from each level of initial behavior were 
assigned during the second series of trials to pun- 
ishment, the remaining half being assigned to the 
subsequent reward condition. Subjects were individu- 
ally escorted into a room and seated in front of a 
wooden panel behind which was a confederate of 
the experimenter. 

Apparatus. The apparatus used in the experiment 
was a Buss (1961, pp. 47-49) aggression machine 
modified for purposes of the manipulation. Encased 
in a 3X 2.5 foot (.9 7.5 m) grey wooden panel 
was a control board on the subject’s side containing 
10 shock keys labeled from 1 to 10, a switch marked 
“power,” and two buttons labeled “fuse.” On top 


of the control box was a panel containing four 
lights: one green labeled “correct,” one red labeled 
“wrong,” and two white lights, one labeled “55 
seconds” and the other “1 minute.” Next to the 
panel was a money changer (Brandt Automatic 
Cashier, F-120) similar to the kind used in theater 
ticket offices. By depressing one key, the subject 
could reward the confederate from $.01 to $1.00. 
The money would slide down a delivery shute from 
the subject’s side to the confederate’s side, making 
an audible “clanking sound.” On the confederate’s 
side of the panel were switches that controlled the 
lights on the subject’s side, a well to receive the 
money rewarded, and an opaque screen on which 
flashed the number of the shock level delivered by 
the subject. 

Manipulation. Subjects were instructed to assume 
the role of teacher in an experiment investigating 
the effects of programmed amounts of reward and 
punishment on the facilitation of learning. Printed 
instructions informed subjects that the student on 
the other side of the panel was recruited from the 
school of education because of his/her interest in the 
nature of the experiment, To establish the confed- 
erate’s ability to perform the task quickly, he/she 
was described as well-practiced in discerning correct 
and incorrect mathematical sequences, examples of 
which were provided to the subject. A sequence 
would be flashed to the confederate at 6-sec inter- 
vals, and if he/she accurately judged the sequence 
as correct or incorrect, the green light (“right”) 
would flash on the subject’s panel; if he/she erred 
in judgment, the red light (“wrong”) would flash 
on the panel. This procedure would be repeated for 
8 minutes, with the confederate purportedly per- 
forming 10 recognition trials per minute, Subjects 
were instructed to try to time their punishments or 
rewards so that the student would be motivated to 
try harder to judge accurately the relationship of 
the numbers. 

The second page of instructions contained the 
experimental manipulation, All subjects were told 
that they must reward or punish the confederate 
once, and only once, any time during each of the 
8 minutes. If they had not previously done so, when 
the 55-sec light flashed, they were required to re- 
ward or punish the confederate within the next 
5 sec. The 1-minute light would flash to signal the 
beginning of a new reward-punish time period. 
Subjects in the reward conditions were instructed to 
give the confederate $.25 (low reward) or $.90 
(high reward) once each minute by pressing the 
appropriate key on the money changer. Subjects then 
practiced once by pressing the key and heard the 
money delivered to the confederate’s side of the 
panel, Subjects in the punish condition were in- 
structed to deliver a Level 3 shock (low punish) or 
a Level 8 shock (high punish) once each minute to 
the confederate. To enable the subjects to feel the 
intensity of the shock they would deliver, a con- 
ductor was attached to their little finger, and sub- 
jects shocked themselves once prior to the learning 


ye 


trials with either a L 
depending 


! 3 or Level 8 shock, 
condition. 

vcre used, one male and one 

ith approximately half of the 

`i confederate and subjects varied 


independe 'y. The confederates were trained to in- 
alicate 12 cor nd 8 wrong answers in prepared 
2-minute s throughout the entire experi- 
ment, To emp ty of the shock, subjects 
in the punish ¢ aw the experimenter attach 
e ieee or h previously had been attached 
Q ne OWT 


r during the practice trial, to the 


extended fir e confederate. Once the outlet 
Be beer the confederate unobtrusively 
shut off 

ae pen », After the first 8 minutes, the 
teaching t 


opped, and subjects read addi- 
informing them that during the 
the experiment, the effects of non- 
£ reward-punishment on learning 

tudied. The teaching trials would be 

© minutes, but this time all subjects 
to use whatever level of reward (from 
15) or punishment (from Level 1 to 
y chose. As in the previous trials, 

were obliged to reward or punish the con- 
© with some money or shock once each min- 
ing the second 8 minutes, being warned by 
1 lights when each minute had passed, 

The confederate collected and recorded the amount 
of money or recorded the level of shock delivered 
cach minute. The average amount of money or level 
of shock delivered per trial constituted the intensity 
of subsequent behavior exhibited by the subjects. 
To enable subjects in the subsequent punish condi- 
to feel the range of shock they could deliver, 
in outlet was attached to their little finger, and 

hjects shocked themselves once before the trials 
h both a Level 3 and a Level 8 shock. 
; th w the experimenter attach the shock 

the extended finger of the confederate. 
aft the second eight trials, subjects 
i “effectiveness of teacher” question- 
designed to test the effect of the 
i by the subjects if they felt that 
vention helped the student to improve his 
rning. Subjects were then debriefed and 
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story effectively concealed the 
periment. During debriefing, 
inaware that their own behav- 
n the “learning” of the con- 
ng studied. Even though the 
formance remained the same 
16 minutes of the experiment, 
that the rewards and pun- 
ated to the confederates’ 
g. This was true for all 


relationship was not affected by t 
of reward or punishment they had i 
administered. ee 
Analyses of variance were 
performed for subsequent reward 
ment. Additionally, standard score 
puted for each subject in the 
ward and punish conditions, and 
of variance was appa e alee 
using these scores, Special analyses indicated 
that subjects’ sex and sex of udek di 
not have a significant effect into 
any significant interactions, refor 
these variables were excluded 
the design. - 


Subsequent Reward 
As can be seen in Table 1, subjects 


were assigned to the reward conditions during 
the first half of the experiment rewarded the 
confederate during the second 8 minutes with 
averages of $.47 and $.58 per trial 
low reward ($.25) and high reward 
respectively. Subjects in the punish conditions — 
during the first 8 minutes of the experiment 
distributed averages of $.25 per trial for ini- 
tial low punish (Level 3 shock) and $.53 per 
trial for initial high punish (Level 8 shock) 
during the second half of the experiment, 
Analysis of variance yielded a significant — 
effect of magnitude of prior reward and pun- 
ishment on the amount of subsequent reward 
distributed, F(1, 44) = 20.40, p < .001, with 
those in the high conditions giving signifi- 
cantly more reward than those in the low 
conditions. Initial reward conditions gave 
significantly more money than initial punish- 
ment groups, F(1,44) = 8.44, #< 01, and 
the Reward-Punishment x High-Low Groups 
interaction was also significant, F(1,44) = _ 
6.41, < 025. Contrasts for various individ. 
ual cell differences yielded nonsignificant re- 
sults for the high-reward versus hi; ii 
and high-reward versus low-reward 
Significant contrasts were fo 
punish versus low-reward conditi 
= 14.80, ~<.01, and low- 
high-punish conditions, F(1, 44 
< .01, with the low-reward and high 
conditions distributing more reward. No dif- 


nditions, 
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ferences were found for the amount of reward 
subjects distributed during the first half com- 
pared with the second half of the trials. 


Subsequent Punishment 


As can be seen in Table 1, subjects who 
were assigned to the initial reward conditions 
gave average shock levels of 2.29 and 3.90 per 
trial for low and high initial reward, respec- 
tively. Subjects in the punishment conditions 
gave average shock levels of 2.49 and 5.04 per 
trial for low and high initial punishment, 
respectively. An analysis of variance yielded 
a significant effect of magnitude of prior re- 
ward and punishment on the amount of sub- 
sequent shock distributed, F(1,44) = 39.23, 
$ < .001, with those in the high conditions 
giving significantly higher subsequent shock. 
A significant difference was also found be- 
tween the reward and punishment groups and 
subsequent shock distributed, F (1,44) = 4.10, 
p<.05, with the punishment conditions 
giving more shock. The Prior Assignments X 
Subsequent Punishment interaction was non- 
significant, F (1,44) = 2.04, p < .05. 

Contrasts for the differential effects of the 
various experimental conditions on the subse- 
quent punishing behavior were performed. 
Subjects in the high-punish conditions deliv- 
ered significantly higher shock than subjects 
in the high-reward group, F(1,44) = 5.96, 
p< .02, while a nonsignificant, F(1,44) = 
18, p= .02, difference between low-reward 
and low-punish groups was found. As in the 
subsequent reward condition, no differences 
were found for the amount of shock distrib- 


uted during the first four trials compared with 


TABLE 1 


MEAN Amount OF SUBSEQUENT REWARD AND 
PUNISHMENT DISTRIBUTED PER TRIAL BY 
INITIAL CONDITION 


Subsequent condition 


Initial condition Reward® Punishment 


Low reward 47 2.29 
Low punishment 25 2.49 
High reward 58 3.90 
High punishment 53 5.04 


nm = 12 cell. t 
RA feward (in dollars) per trial. Possible range = $,01- 


$120 an punishment per trial. Possible range: 1-10. 


TABLE 2 


ANALYSIS OF VARIANCE FOR STANDARD SCORES OF 
Born PUNISHMENT AND REWARD 


Source df MS F 
Initial behavior (A) 1 AL Al 
Subsequent behavior (B) 1 01 .01 
Level (C) 1 DAT woo nom 
AXB 1 8.88 9.24" 
AXC 1 8.17 8.50" 
BXC 1 6.65 6.92* 
AXBXC 1 OL O01 
Error 88 96 
*p <.001. 


the amount of shock given during the second 
four trials. Subjects’ sex and the sex of the 
confederate did not affect the amount of 
shock delivered. 


Combined Analysis 


Standard scores were computed for each 
subject within each of the subsequent condi- 
tions; these scores were then analyzed by an 
analysis of variance. As can be seen from 
Table 2, significant interactions were found 
between prior rewarding or punishing and 
subsequent punishment or reward and be- 
tween previous and subsequent assignment to 
rewarding or punishing and level of initial 
assignment. However, as predicted by the 
fate control interpretation, a strong main ef- 
fect for level (low-high) of initial assignment 
was also found. 


Discussion 


The hypothesis that regardless of the direc- 
tion (anti- or prosocial) of previous behav- 
ior, subsequent punishment or reward would 
correspond to the magnitude of prior behav- 
ior was partially supported by the present 
study. The very strong effect found in the 
combined analysis for level of assignment in- 
dicated that overall, subjects did behave in 
a manner consistent with the magnitude of 
their prior “fate control.” As can be seen 
from the reported individual contrasts, low 
and high level of assignment yielded differ- 
ences in all conditions except the prior re- 
ward-subsequent reward groups in which the 
low-reward cell gave more reward than pre- 
dicted. However, the Initial Behavior X Sub- 


332 JOHN P. KEATING AND TIMOTHY C. BROCK 


sequent Behavior interaction indicated that 
modification of a consistency interpretation is 
needed, The interaction is mainly accounted 
for by the fact that subsequent behavior 
tended to be greater if that behavior was in 
the same mode as the prior behavior. Stimu- 
lus and/or response generalization would ac- 
count for this effect. Consequently, the rela- 
tionship of the modes of prior and subse- 
quent behavior will affect the magnitude of 
the subsequent behavior, intensifying it when 
the modes are similar. 

Intensified subsequent behavior in the same 
mode is an interesting finding, especially 
when the mode of behavior is antisocial. This 
effect was most obvious in the punishment 
conditions where high-prior-punishment sub- 
jects delivered significantly more shock than 
the high-prior-reward group. An equity or 
guilt interpretation suggests that lessening 
subsequent shock would be one way subjects 
could avoid escalation of an inequitable situa- 
tion (Walster et al., 1973). Although sub- 
jects did not use this mode to avoid inequity, 
it is possible that some other means like 
self-justification maintained perceived equity. 
Even though subjects did not derrogate the 
victim, they could have viewed the “improved” 
learning as due to their own effective admin- 
istration of reinforcements. Consequently, de- 
livery of subsequent shock of similar intensity 
to initial shock would be justified. However, 
although fate control does not predict the 
increased intensity of same modes of behay- 
ior, it does clearly account for the consistent 
levels of shock delivered by the punishment 
conditions, 

Although a consistency interpretation is 


supported by the difference obtained between 
levels of assignment, the heightening of this 
effect fur the same mode of behavior suggests 
the | by which such consistency is 
achie\ ice subjects universally reported 
that thi their teaching strategies were 


effective should not be expected to alter 
their su nt behavior when perceiving 
their pric avior as effective. This would 
be more readily perceived when the two be- 


haviors were ii: the same mode. Additionally, 
a significant change of subsequent behavior 
would threaten the rationale for performing 


the prior behavior, especially when such be- 
havior was harmful. This justification process 
does not conflict with a fate control inter- 
pretation but complements it by suggesting 
the process by which consistent behavior is 
maintained. 

The Initial Behavior X Level of Assignment 
interaction was largely due to prior punish- 
ment leading to greater high-low differences 
than prior rewarding behavior. Although this 
finding does not argue against either a fate 
control or guilt interpretation, the question 
of why punishment should be more sensitive 
to high-low differences remains unanswered. 
Perhaps the process of justification to main- 
tain consistency becomes more important to 
subjects when initial behavior was antisocial, 
resulting in relatively more consistent behav- 
ior under punishment conditions. It should be 
noted that this interaction could be partially 
due to the inexact correspondence of the re- 
warding and punishment measures. The final 
Subsequent Behavior X Level of Assignment 
interaction was mainly due to the greater 
differences between subsequent punishment 
by levels of assignment regardless of the 
direction of prior assignment. This finding 
highlights the fact that the consistency ex- 
planation seems most valid when negative or 
“guilt-producing” behavior is in question, 

No evidence was found to support a dis- 
inhibition of anti- or prosocial behavior over 
time. In all conditions but the low-prior- 
reward-subsequent reward cell, subjects de- 
livered less punishment and reward during 
the second half of the experiment, Addition- 
ally, no differences were found between the 
first 4 and second 4 minutes of the non- 
specified reward-punishment part of the study, 


Conclusion 


The compliant behavior obtained in trans- 
gression-compliance studies is usually ex- 
plained as an attempt to restore equity in a 
relationship and thereby reduce guilt, This 
experiment tested a consistency of fate con- 
trol interpretation first proposed by Brock 
oe x me argues that one see! ; 
tain the level or degrees of con 
has previously used over E ae 
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suggest that one does attempt to make subse- 
quent behavior correspond in magnitude. 
What is most surprising is that one seems 
more able to formulate and justify patterns of 
negative behavior to maintain consistency 
than positive behavior, The self-consistency 
mechanism, experimentally documented in 
the present study, may well account as well for 
other cross-situational behavioral invariances 
that have heretofore been attributed to 
“personality” or other dispositional constructs. 
For example, the well-known foot-in-the-door 
effect may require that initial compliance cor- 
respond in magnitude to further compliance 
(Seligman, Bush, & Kirsch, 1976). 
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Boys Meet Girls; Patterns of Interaction and Attribution 
in Heterosexual Attraction 
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Kelley’s concept of causal schema was used to elaborate five potential patterns 
of interaction among two males and two females that constitute different con- 
textual bases for heterosexual attraction, Attraction based on one-to-one male- 
female conversational specificity (pairing) was predicted as a stronger per- 
ceived basis for a future relationship than that based on mutual popularity or 
attractiveness (a target property), which in turn was predicted as more power- 
ful than mutual friendliness (an actor property). The patterns were opera- 
tionalized as differential rates of initiation and reception of conversational 
statements in hypothetical scripts portraying getting-acquainted encounters. 
Except for friendliness-based couples, subjects rated the high-interaction couples 
in each pattern more likely to pair up than the others, as predicted. Contrary 
to prediction, popularity-based couples were perceived as having a closer rela- 
tionship in future than pairing-based couples, Implications of the target and 
pairing patterns for the attraction-theory phenomena of physical attractiveness 
and “love,” respectively, are discussed, 


Studies of heterosexual attraction (i.e., rela- 
tionships potentially leading to courtship and 
marriage) have been concerned predominantly 
with structural variables, such as demographic 
characteristics (Udry, 1971) and physical at- 
tractiveness (Berscheid & Walster, 1974), 
which are properties of the particular indi- 
viduals involved. Beyond the generalities of 
attitude similarity (Byrne, Ervin, & Lam- 
berth, 1970) and personality complementarity 
(Kerckhoff & Davis, 1962), however, there 
has been a paucity of research on properties 
of the relationship itself, rather than of indi- 
vidual differences. The present study argues 
that, by means of the attribution-theory con- 
cept of causal schema (Kelley, 1972), several 
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Requests for reprints should be sent to John Cun- 
ningham, School of Behavioural Sciences, Macquarie 
University, North Ryde, New South Wales 2113, 
Australia. 


different sorts of attraction relationships can 
be distinguished, and these relationships will 
have different implications for the future of 
the couples involved. 

Conversation in face-to-face encounters in- 
volves reciprocal phases of overt behavior and 
attribution. Each individual emits verbal 
statements and nonverbal cues and perceives 
others’ evaluations of him or her through their 
overt responses to his behavior; in turn, these 
perceptions, organized cognitively in attribu- 
tions, guide the individual’s further behavior, 
Behavioral cues are translated into inferences 
about personality traits (e.g., friendliness) 
and stimulus properties (e.g., attractiveness) ; 
this information is then used to make predic- 
tions about the benefits of continued inter- 
action with those individuals. Attributions, or 
causal inferences, about who is attracted to 
whom in getting-acquainted situations—and 
why—will direct the individual’s attraction 
toward potential opposite-sex partners, 

The concept of causal schema Specifies a 
framework within which an individual is said 
to organize information about effects in order 
to infer their causes. Attraction can be consid- 
ered an effect produced by one or more Causes. 
Heterosexual attraction can conveniently be 
represented by means of a social schema in 
which males are aligned along the vertical 
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Target Pattern 


Actor Pattern 


Pairing Pattern 


Ficure 1. Simple social schemata. 


axis, and females along the horizontal axis 
(see Figure 1). The entries in these cells rep- 
resent the satisfaction that each person pro- 
vides to, and obtains from, each other person. 
Entries in the upper-right corner of each cell 
refer to rewards which the specified male 
provides the indicated female; those in the 
lower-left corner are rewards which the speci- 
fied female provides the indicated male. High 
rewards are regarded as an accurate index of 
attraction here because persons reward those 
whom they like and like those whom they 
reward (Berscheid & Walster, 1969). Mutual 
attraction, designated by the reciprocal ex- 
change of high outcomes (e.g., the Dave-Sue 
dyad of Figure 1), is presumably the essen- 
tial ingredient for future interaction, increas- 
ing intimacy, and the perceived formation of 
a “couple” relationship. The different contexts 
in which mutual attraction may occur and 
which may affect its perceived origins and 
implications will now be examined. 

Within a group composed of two males and 
two females, the potential exists for the for- 
mation of from zero to four heterosexual pair- 
ings. Pairings can eventuate from any of sev- 
eral possible patterns of social interaction. It 
has been shown that young persons usually 
prefer dates of high social desirability and 
attractiveness, regardless of their own attrac- 
tiveness (Berscheid & Walster, 1974). Social 
consensus probably plays an important role in 
judgments of attractiveness. If so, the attrac- 
tive or popular person is the one who attracts 
the most attention from those of the opposite 
sex. The first matrix in Figure 1 represents 
the case in which Dave is receiving more re- 
wards, or attention, from the two females than 
Ron, and Sue is attracting more than Pam 


from the two males. One might infer that 
Dave and Sue are the more attractive, charm- 
ing, or popular persons and, given the mutual 
attraction that exists betweer them because 
they receive more attention, they would prob- 
ably pair off together. As an example of en- 
tity or target attribution (Kelley, 1967), this 
matrix is referred to as the target pattern, 
There exists a high degree of both consensus 
(e.g., both Dave and Ron behave in a similar 
fashion) and distinctiveness (they behave this 
way toward Sue but not toward Pam). 

A similar line of reasoning may be applied 
to another positively valued attribute—friend- 
liness or outgoingness. The actor pattern gen- 
erates one close couple, the two members of 
which are the more friendly male and female 
(i.e., Rick and Karen). Actor properties may 
be operationalized in terms of the sheer 
amount of rewards or attention that one per- 
son provides about equally to both members 
of the opposite sex. This pattern would con- 
stitute an instance of actor attribution in that 
it is one person of either sex (low consensus) 
who behave rewardingly with both those of 
the other sex (low distinctiveness). Thus, in 
the actor pattern, Mı and F; (see Figure 1) 
are predicted to be seen as more friendly 
than Mə and Fs, respectively, but no more 
popular. In contrast, in the target pattern, 
Mı and F; are predicted to be seen as more 
popular than Mə and Fs, respectively, but no 
more friendly. (Table 1 provides a summary 
of predictions as well as results.) 

A pattern that does not permit social com- 
parison between persons of the same sex, but 
rather yields evidence for the formation of 
two couples, is presented in the right-hand 
matrix of Figure 1. In this pairing pattern, 
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there is no possibility of an actor attribution 
or target attribution because there is simply a 
one-to-one specificity of behavior from each 
single actor toward a single target. These 
patterns have elsewhere been termed “emer- 
gent effects” (Cunningham & Kelley, 1975), 
events which are unpredictable from other 
attributes of the two persons involved and 
which are perceived as characteristic of rela- 
tionships involving extreme liking, or love. If 
all we know is that Jim likes and rewards Gail 
a lot, and she him, we have little basis for 
inferring whether these persons would feel and 
behave similarly toward Mary and Bob, re- 
spectively. Thus, in the pairing pattern no 
differences between same sex persons are 
predicted in judgments of friendliness or pop- 
ularity. 

It is also possible to fuse the essential ele- 
ments of all three patterns into one divergent 
pattern (see Figure 2), so called because the 
target person of each sex is different from the 
actor person. It is apparent that Steve and 
Linda are more frequently the targets but that 
John and Nancy are more frequently the ac- 
tors. Thus, Mı and Fy are predicted to be 
seen as more popular but less friendly than 
Mz and Fy, respectively, Recognizing the pat- 
terns of mutual attraction that target and 
actor properties have produced, it is probable 
that Steve and Linda would pair off, as well 
as John and Nancy. 

The divergent pattern compels the observer 
to distinguish between a friendly person and 
a popular person of the same sex, as if these 
attributes were independent. However, be- 
cause both attributes are evaluated positively 
(Anderson, 1968), observers may have a 
tendency to infer that friendly persons are 


Linda (F;) Nancy (F2) Ann(F;) Joan(Fe) 


Nat PNE 
Biavel|) Noh Mike] + | \ + | 
a Ce ae i Ne 
O — —— 
John + ore eats | 
(M2) Md $ | 


Divergent Pattern Convergent Pattern 


Ficure 2, Complex social schemata. 
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also popular, and vice versa (an example of 
the well-documented “halo effect” of person 
perception [Thorndike, 1920]). To assess the 
contribution of the halo effect, it is necessary 
to include an additional pattern in which the 
target and actor of each sex are the same. 
The right-hand pattern of Figure 2 is termed 
convergent because the frequencies of origina- 
tion and receiving converge. Thus, Mike and 
Ann are more often both targets and actors 
than the other members of their respective 
sexes and for both reasons might be expected 
to pair up together. Here, then, M, and Fy 
are predicted to be seen as both more popular 
and friendly than Ms and Fs, respectively. 
Within each pattern, the M,F; couple is 
predicted to be seen as more likely to pair up 
than the other potential pairings (MyF2, 
M.F;, and M.F2), except that in the pairing 
pattern the M,F; and MzF. couples will be 
perceived as pairing equally strongly. In the 
divergent pattern, two couples are also pre- 
dicted—the popular male and female (M,F,) 
and the friendly male and female (M.F»). 
However, because popular persons are more 
selective in their attention giving, popularity 
is predicted as a stronger basis for pairing 
than friendliness, which is more indiscrimi- 
nate in its target. In the convergent pattern, 
in addition to the stronger pairing of MAF, 
the low-interaction couple (M2F2) is pre- 
dicted to be seen as pairing less strongly than 
the other three couples. (Table 2 provides a 
summary of these predictions as well as re- 
sults.) 
; As for comparisons among the five patterns 
it is predicted that couples in the pairing pat 
tern will evidence the greatest pair bonding of 
all because neither partner showed much 
interest in, or received much attention from 
the other opposite-sex person, In other words. 
each member of a pair is a necessary cause 
for oh other’s positive outcomes, as in the 
multiple necessary cause MNC 
(Kelley, 1972). The E A 
the target and divergent patterns) are pre 
dicted next highest in closeness. Here each 
member of the predicted pairing is merel oni 
of two sufficient causes (MSC schema) fi : 
the other’s positive outcomes: air bo: an 
will therefore be weaker th e ponding 
an.in the pairing 
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pattern. As in likelihood-of-pairing predic- 
tions, the friendly couples (in both the actor 
and divergent patterns) ought to follow the 
popular couples in perceived degree of close- 
ness. The convergent pattern couples are pre- 
dicted weakest in bonding; though M; and 
F, are both more friendly and more popular, 
this is true because three of the four pairwise 
interactions are equivalent in frequency of 
statements emitted and received. 


METHOD 
Subjects 


Subjects were 25 male and 25 female undergradu- 
ates at the University of California, Los Angeles, 
who participated in partial fulfillment of the require- 
ments in an introductory psychology course. Sub- 
jects, working individually, completed their tasks in 
groups of two to nine. 


Stimulus Materials 


Scripts depicting conversations among two males 
and two females presented the five patterns of inter- 
action in a fairly naturalistic manner while minimiz- 
ing the role of conversational content. Social reward 
was construed as the number of statements ad- 
dressed to each person. To minimize the confounding 
effects of assumed role prescriptions and previous 
interaction, the protagonists were placed in an 
expressive setting and were specified as just having 
met, that is, as college freshmen around a dormitory 
dining-hall table at the beginning of the academic 
year. Their “small tal ” centered on such noninti- 
mate topics as sports, parties, courses, jobs, travel, 
and dormitory life; politics, sex, religion, and phi- 
losophy were never broached. 

Each plus (+) sign in Figures 1 and 2 was rep- 
resented in the scripts by five statements directed 
by one person toward the other, distributed through- 
out any one script. Each statement consisted of 
either one or two complete sentences or fragments 
that were either declarations or questions. Any 
empty cell in the Figures was represented by only 
one statement from one person to the other. The 
five-versus-one rule was admittedly arbitrary; how- 
ever, in pilot work it was found that subjects could 
identify the patterns of attraction fairly accurately 
from scripts based on this rule. The speaker and 
person addressed were jdentified for each statement 
with each participant’s name written in a different 
color so that the pattern of who spoke to whom 
would be discerned more easily. Scripts for Figure 1 
patterns contained a total of 24 statements, and 
Figure 2 patterns contained 32 statements. 

To prevent possible confounding between script 
and pattern of interaction, 5 scripts of 32 statements 
each were constructed. To operationalize the con- 
vergent and divergent patterns, the identity of the 
initiator of each statement was randomly assigned 
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to one of the four script characters, subject to the 
five-versus-one constraints of each pattern. The 
recipient of each statement was then randomly des- 
ignated as one of the two characters opposite in sex 
to the initiator, within the pattern constraints. The 
three Figure 1 patterns, each containing 24 state- 
ments, were obtained by random deletion of eight 
statements from the 32-statement scripts and by 
random reassignment of the identities of initiator 
and recipient. Any awkward conversational sequences 
that resulted from these procedures were explained to 
subjects as a result of noisy dining-hall conditions, 
in which certain missing statements could supposedly 
not be heard by eavesdroppers, as seen in the fol- 
lowing typical excerpt from a script: 


Mike—>Karen Maybe I'll get some plants. The 
rooms are so dull here; they need 
something to brighten them up. 


Karen—Mike You might want to stop by the 
place where they sell plants in front 
of the bookstore. 

Karen—Tom That sounds like a good idea. 

Tom—Joan That’s where I found my philo- 
dendrons. They have all sorts of 
plants and pots there. 

Joan—Tom Are there any soft drink machines 


in the dorm? 


Each subject received the five interaction patterns, 
each embedded in a different script. Patterns were 
first counterbalanced in scripts and then randomized 
in terms of their order of presentation to each sub- 
ject. The pairing between pattern and script was 
balanced so that each pattern appeared with each 
script once per 25 script-pattern combinations. 


Questionnaires 


The first question sheet contained items asking 
which male or female was more attractive or popu- 
lar, and which was more friendly or outgoing. The 
7-point scales were anchored at each end by the 
name of one of the two script characters of the 
same sex and the word definitely; the midpoint was 
identified by the word equally. The likelihood-of- 
pairing questions, which provided comparisons among 
pairings within any one pattern, asked how likely it 
was that each of the four potential pairings of males 
and females would become better acquainted over 
time. These 7-point scales were anchored at each 
end with the words not at all likely and extremely 
likely and at the midpoint with moderately likely. 

The second question sheet included five bipolar 
semantic-differential rating scales selected to tap dif- 
ferences in the connotative meaning of the relation- 
ships emerging in the different patterns. Prefaced 
with the question, “What kind of relationship do 
you think (male’s name) and (female’s name) 
would have?”, the 7-point scales were anchored at 
either end by the concepts “temporary-lasting,” 
“superficial-deep,” “ ‘platonic’-physically intimate,” 
“a friendship-a love affair,” and “just one of several 
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similar relationships-unique among their current re- 
lationships.” The second concept in each pair con- 
notes a stronger degree of pair bonding (closeness, 
depth of relationship). In the questionnaire itself, 
the second and third scales were reversed in order 
to reduce the possible effects of response set. For 
each pattern, a randomly selected sample of half of 
the subjects used these scales to rate a couple pre- 
dicted to pair up strongly; each of the nonpredicted 
couples was rated by equivalent numbers of subjects 
drawn from the other half, Asking the subject to 
rate the predicted couple in every pattern would 
have been tantamount to divulging that couple’s 
identity, a revelation which might, over five pat- 
terns, enable the subject to deduce the experimental 
hypotheses. 

As checks on the manipulations (to determine 
whether the appropriate persons were indeed per- 
ceived as speaking more often or as being addressed 
more often) subjects were asked, after they had 
finished the fifth and last pattern, which male and 
female in that script had spoken more often and 
which had been spoken to more often. Subjects re- 
sponded on a 7-point scale, anchored at each end 
by the name of one of the two script characters of 
the same sex and at the midpoint by equally, 


Procedure 


When all subjects had arrived for a session, the 
experimenter delivered his oral instructions and then 
distributed the first script-pattern combination. Five 
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minutes later, he removed the scripts and distributed 
the first question sheet. The two question sheets 
were distributed and collected separately. When all 
subjects had completed and returned the second 
question sheets for their scripts, the second script- 
pattern combination was distributed. This procedure 
was repeated until subjects had read and answered 
questions about all five patterns. The sheet of ques- 
tions which contained checks on the manipulations 
was distributed at the end because the questions 
might have enabled the subjects to intuit the experi- 
mental hypotheses. When all subjects had completed 
these questions, the experimenter explained the pur- 
pose of the study. 


RESULTS 
Checks on the Manipulations 


Predictions for the checks on the manipu- 
lations were identical to those for the friend- 
liness and popularity ratings. In Table 1 the 
plus and minus signs refer to whether the 
relevant mean is predicted to be significantly 
above or below zero. Each mean represents a 
positive or negative deviation from zero, taken 
as the midpoint of the 7-point scales. The $ 
values indicate the degree to which the means 
are significantly different from zero, derived 
from an analysis of variance made separately 


TABLE 1 


CHECKS ON THE MANIPULATIONS AND FRIENDLINESS/POPULARITY RATINGS 


Pattern 
Target Actor Pairing Divergent Convergent 
Pre- Ob- Pre- Ob- Pre- Ob- Pre- 
Comparison dicted tained dicted tained dicted tained dicted tained dice tae 
Mi vs. Ms 
Spoke more 
often o 1.20 + 210%% o .90* => S 
Spoken to ih + 2o% 
more often + 1.80% 0 1.50** 0 60* + 90 i 1.50% 
Fi vs. Fa ; 
Spoke more : 3 
often o 1.50* 2,30% o = 
Spoken to BU) + 80 
more often + 2.00%" 0 1.20* 0 —.60 + 90 Es 60 
F values 1.23 4.04 <1 2.30 a 
Mı vs. M: 
Friendly 0 60% + 1.9644% 0 04 - -02 
Popular + 110e o 1.02 0 102 m 76744 a fice 
è 02 
Fi vs. F» 
Friendly 0 36 + 1.73668 o —04 egy 
Popular + 14o 0 136 0 “48% F e i 44 
148* 
F valueb 46,9 17.0%% 1.32 18.61 
<1 


Note. + and — indicate whether the relevant mean is predicted to be significantly aboye or belos 
W zero. (M and F 
Tefer to 


male and female members of Dyads 1 and 2), 
8. 


adf = 1, 
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for each judgment and reflecting the devia- 
tion of the grand mean of the analysis from 
zero. All four positive deviations from zero 
predicted for the manipulation checks in the 
target and actor patterns were obtained. How- 
ever, in three of four instances in which no 
effect had been predicted for these patterns, 
it was found that the male or female who 
spoke more often was also perceived to have 
been spoken to more often. None of the pre- 
dicted effects in the divergent pattern were 
obtained, though all means were in the pre- 
dicted direction. In the convergent pattern, 
both predictions for males were confirmed, but 
neither emerged to a significant degree for 
females. 

The F ratios beneath the checks on the 
manipulations represent main effects for the 
“spoke-versus-spoken to” ratings within each 
pattern in an analysis of variance with four 
factors: Sex of Subject (2) X Subjects Within 
Cell (5) X Spoke Versus Spoken to (2) X Sex 
of Characters (2). This effect indicates 
whether within the target, actor, and diver- 
gent patterns subjects were cognizant of the 
asymmetry of conversation (i.e., where the 
signs of the predicted deviations from zero 
differed). For none of these patterns did this 
effect reach statistical significance, once more 
indicating that the more frequent initiators of 
comments were assumed to be the more fre- 
quent recipients, and conversely. 


Friendliness, Popularity, and Likelihood 
Ratings 


Table 1 also contains subjects’ ratings of 
script characters’ comparative friendliness and 
popularity.? All four positive deviations from 
zero predicted on the friendliness and popu- 
larity ratings in the target and actor patterns 
were obtained. However, for male characters, 
friendliness seemed to imply popularity, and 
vice versa. In the divergent pattern, popular- 
ity was perceived in line with prediction bet- 
ter than friendliness. In the convergent pat- 
tern, only males’ friendliness and females’ 
popularity were significantly perceived as 
predicted. 

The F ratios beneath the friendliness and 
popularity means represent main effects for 
friendliness-versus-popularity ratings within 


each pattern in an analysis of variance with 
five factors, analogous to that for the checks 
on the manipulations but with the addition of 
a trials factor. This effect indicates whether 
within each pattern subjects were differenti- 
ating between the inferred traits of friendli- 
ness and popularity. In all five patterns this 
effect was obtained where it was predicted 
and was not obtained where it was not pre- 
dicted. 

In Table 2 the F ratios beneath the likeli- 
hood-of-pairing means represent interactions 
between male script characters versus female 
script characters in an analysis of variance 
with five factors: Trials (5) X Sex of Subject 
(2) X Subjects Within Cell (5) x Male 
Characters (2) X Female Characters (2). 
Within each pattern, “high” indicates couples 
predicted to be rated higher in likelihood of 
pairing than those designated “low”, with 
“intermediate” representing an intermediate 


1No predictions were made for the effects of sex 
of subjects on the dependent measures. Effects that 
did emerge were sporadic, inconsistent, and difficult 
to describe, much less interpret. In the Male Char- 
acter X Female Character analysis for the pairing 
pattern, sex of subject interacted with trials and sex 
of character; in the same analysis for the convergent 
pattern, sex of subject interacted with sex of char- 
acter. The trials factor refers to the serial position 
of the particular script-pattern combination in each 
subject’s sequence of five. As for the semantic dif- 
ferential ratings, sex of subject interacted with trials 
on the “superficial-deep” scale; in the analysis of 
the divergent pattern alone this interaction also 
emerged on three of the five scales and on the 
summed scores yet indicated an entirely different 
configuration of means than in the aforementioned 
analysis. 

2 Before the data could be pooled over scripts, it 
was essential to discover whether any systematic 
differences existed among them, based on subjects’ 
ratings. These data were subjected to analysis of 
variance with five factors: Patterns (5) X Scripts 
(5) X Trials (5) X Sex of Subject (2) X Friendliness- 
Popularity Ratings (4). There were no overall dif- 
ferences among the scripts on these ratings. How- 
ever, scripts interacted complexly with patterns and 
sex of subject jointly, F(16, 750) = 2.45, p<.01. No 
script yielded consistently stronger results than the 
others over the five patterns. Because no systematic 
configuration seemed to exist, it was deemed appro- 
priate to pool the data over all scripts. The fact that 
the same analyses on likelihood of pairing ratings 
showed no main effects or interactions for the scripts 
factor supported this decision to pool the data. 
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TABLE 2 
RATINGS or LIKELIHOOD or PAIRING 
Pattern 
Target Actor Pairing Divergent Convergent 

Com- Pre- Ob- Pre- Ob- Pre- Ob- Pre- Ob- Pre- Ob- 
parison dicted tained dicted tained dicted tained dicted tained dicted tained 
M:Fı high 5.261 high 4.46: high 5.30: high 5.061 high 4.221 
MF: low 3.644 low 4.06 low 3.365 low 3.904 high 4.601 
M:Fi low 3.764 low 3.642 low 3.724 low 4.044 high 4.261 
MF; low 3.604 low 3.382 high 5.021 intermediate 4.404 Tow 3.244 

F values 9.29% <1 27,8** 13,14 16,4 


Note, Within-patterns means are significantly different by ¢ test for related measures at p <.05, p < .01, and $ <.001 when 
subscripts differ by 1, 2, and 3 or more, respectively (df = 49, one-tailed test for planned comparisons). (M and F refer to male 


and female members of Dyads 1 and 2.) 
adf =1, 40, 
* p <01. 
* p <.001. 


case in the divergent pattern. The proper sta- 
tistical effect to indicate differences among 
the four potential pairings is the reported 
interaction F. This interaction was obtained 
in all patterns except the actor pattern, in 
which the friendliness of M, was the only 
factor that contributed to the likelihood of 
pairing, F(1,40) = 5.90, p<.025. Sub- 
scripted ¢-test results represent comparisons 


among means within each pattern. Except for 


the actor pattern, which produced a nonsig- 
nificant interaction, differences among means 
in each pattern conform closely to prediction. 


Semantic Differential Ratings 


In order to assess differences across pat- 
terns, semantic-differential data on predicted 
couples underwent analysis of variance with 
four factors: Patterns (5) x Trials (5) x 
Sex of Subject (2) X Subjects Within Cell. 
Weak main effects for pattern were found for 
the “superficial-deep,” “friendship—love af- 
fair,” and “just one of several relationships- 
unique” scales, F(4,75) = 2.17, p< .08; 
F(4, 75) = 2.24, p < .08; F(4,75) = 1.99, p 
< .11, respectively. On all five scales the rank 
order of means for the five patterns was al- 
most identical. Contrary to prediction, in 
every case the target pattern showed greatest 
strength of pair bonding, followed in order by 
the pairing and divergent patterns, with the 
convergent and actor patterns alternately indi- 
cating the weakest or next-to-weakest bond- 


ing. 


High correlations among the five semantic 
differential scales over all patterns (Pearson 
rs ranged from 45 to .76; n = 250; all ps < 
-001) supported the decision to sum each sub- 
ject’s ratings for each pattern and to perform 
the same analysis on these summed scores, A 
weak main effect for pattern emerged again, 
F(4,75) = 2.22, p< -08; the rank ordering 
of means was the same as that found for the 
individual scales, mentioned above, The New- 
man-Keuls method for probing the differences 
among pattern means revealed that target 
couples showed greater pair bonding than 
actor or convergent couples ( $ < .05 in each 
case). By the less conservative £ test, target 
couples were regarded as having a closer rela- 
tionship than pairing and divergent couples 
as well, ¢(48) = 1.69 and 1.95, respectively; 
$ < .05 in each case. 

Because predicted pairs in the divergent 
pattern may be based on mutual attributes of 
popularity or friendliness, these data under- 
went the same analysis of variance indicated 
previously, except that the pattern factor had 
only two levels (for the popularity-based pair 
and the friendliness-based pair). Weak main 
effects for basis of pairing again emerged on 
the “temporary—lasting” and “friendship—love 
affair” scales, F(1, 28) = 3.41, p< 08; F(1 
28) = 3.23, p< -08, respectively. On every 
scale, however, the popularity-based couples 
were given stronger pair bonding ratings than 
friendliness-based couples, This effect reached 
statistical significance on the summed scores 
F(1, 28) = 4.65, p < .04, ~ 
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Discussion 


2 Though the traits of popularity and friend- 
liness in script characters were varied inde- 
pendently across the patterns, subjects tended 
to assume that the two belonged together, 
Where there was a manipulated basis for des- 
ignating one male and one female character 
more friendly, subjects were very likely to 
designate those characters the more popular as 
well, and conversely. Correlational data also 
support these findings: Over all five patterns, 
ratings of male friendliness and popularity 
correlated (250) = .29, p < .001, and rat- 
ings of female friendliness and popularity cor- 
related r(250) = .16, p < .005. This effect 
emerged not only on the mere inferences of 
friendliness and popularity but also on most 
of the checks on the manipulations, which 
concern matters of fact. That is, the male and 
the female who actually spoke more often 
were also judged to have been spoken to more 
often, and conversely, even when this was 
simply not true. Correlations between these 
judgments were even higher than those be- 
tween popularity and friendliness—For males 
and females, 7(50) =.58 and .49, p < .001, 
respectively. These unexpected covariations 
between judgments of friendliness and popu- 
larity and between judgments of who spoke 
more often and who was spoken to more often 
account for most of the important deviations 
from the predictions. 

Note that both types of correlations above 
are somewhat higher for judgments of males 
than of females. This is consistent with the 
evidence from the actor and target patterns 
in Table 1 that for male characters, friendli- 
ness implies popularity, and conversely, but 
this is not the case for female characters. It 
seems that the data for males follow stereo- 
typic expectations that ‘friendliness and popu- 
larity covary. Data for females, on the other 
hand, deviate from these expectations in the 
direction predicted by the friendliness and 
popularity patterns. This effect is puzzling 
because statements in the scripts were ran- 
domly assigned to male or female characters. 
One might argue that these results reflect sex- 
role stereotypes to the extent that in the “real 
world,” friendliness/outgoingness is associated 
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with popularity/attractiveness more strongly 
in males than in females, In males, cues other 
than physical appearance may contribute to 
judgments of popularity (eg, an assertive, 
outgoing orientation toward others), Given 
the strong, though changing, norm that the 
female ought to be the passive partner in 
heterosexual relationships, her talkativeness 
might be regarded as counternormative and 
therefore undesirable, detracting from judg- 
ments made of her popularity. Subjects may 
have inferred that an unpopular female must 
be friendly or outgoing in order to elicit the 
same amount of male attention as a passive 
but popular (possibly physically attractive) 
female. 

In distinguishing friendliness from popu- 
larity, results for the divergent pattern are 
crucial; here, the two traits were contrasted, 
Provocatively, this was the only. pattern in 
which none of the checks on the manipula- 
tions was successful. Subjects’ expectations 
(at least for male characters) that talkative 
people will be spoken to were violated by this 
pattern. Inspection of the means for the 
divergent pattern in Table 1 reveals, however, 
that subjects were somewhat more accurate 
in identifying the more frequent recipients of 
comments than the more active initiators. 
Similarly, popularity was perceived more ac- 
curately than friendliness. It appears, then, 
that subjects found it easier to discern who 
receives attention than who gives it. This 
effect has important theoretical implications 
because it addresses the distinction in attribu- 
tion theory referred to as person versus stimu- 
lus attribution (McArthur, 1972; Orvis, Cun- 
ningham, & Kelley, 1975), In the present 
study, instances of target or stimulus attribu- 
tion (popularity) were more readily identified 
than instances of actor or person attribution 
(friendliness). These findings seem to contra- 
dict the indicated studies, which report a 
predominance of actor or person attributions. 
However, the interpersonal context within 
which the present findings emerged may ex- 
plain this contradiction. McArthur found a 
predominance of stimulus attribution for the 
“emotion” verb category (e.g., “Sue is afraid 
of the dog”). Liking or attraction—another 
kind of emotion—characterizes the context of 
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the present study, It may be the case that 
subjects summarize several instances of “Jack 
speaks to Jill” as “Jack likes Jill.” Speaking 
is a “manifest” verb (Abelson & Kanouse, 
1966) that evokes more actor attributions and 
less target attributions than “subjective” 
verbs (McArthur, 1972), such as liking. If 
several specific acts of speaking are inter- 
preted as an index of a general disposition of 
liking and if this liking is what subjects re- 
member, then the predominance of target at- 
tribution found here corresponds to the pre- 
dominance of stimulus attribution found for 
McArthur’s emotion verbs. 

On the likelihood-of-pairing ratings, the 
predicted couple was distinguished success- 
fully from the other potential pairings in all 
but the actor pattern. Similarly, the friendli- 
ness-based couple in the divergent pattern 
was rated much less likely to pair up than the 
popularity-based couple and no more likely to 
do so than the two nonpredicted couples, Evi- 
dently, mutual friendliness in a couple is re- 
garded by others as a tenuous basis for a 
continuing relationship unless it is accompa- 
nied by popularity, as in the convergent pat- 
tern, The friendliness-based couples of the 
actor and convergent patterns, moreover, were 
consistently rated weakest of couples in all 
patterns in pair bonding on the semantic dif- 
ferential ratings. 

The target and pairing patterns are par- 
ticularly relevant to broadly conceived phe- 
nomena of theories of attraction—physical 
attractiveness and love, respectively. Popular- 
ity, the ability to attract the attention of the 
opposite sex, is undoubtedly tied to physical 
attractiveness (Berscheid & Walster, 1974). 
The unique one-to-one specificity of interac- 
tion in pairing-pattern couples, on the other 
hand, reflects the MNC schema, which has 
been shown to characterize relationships of ex- 
treme liking, or “love.” The budding exclusiv- 
ity of the pairing couples may be regarded as 
a precursor of the attachment, caring, and 
intimacy that are said to define relationships 
of love rather than of liking (Rubin, 1973). 
However, contrary to prediction, on the se- 
mantic differential ratings, subjects forecast 
the popular couple to bond more strongly 
than the pairing couple on every scale and 
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significantly so on the summed scores. In an 
observer context, then, it appears that popu- 
larity or attractiveness is a stronger basis for 
a relationship than pairing (or love)? A 
large grain of salt is in order, however, be- 
cause of the superficial nature of a getting- 
acquainted situation. Proponents of love will 
immediately counter that love requires consid- 
erable time and emotional investment before 
it blooms. Barring the probably spurious no- 
tion of love-at-first-sight, the phenomenon 
created in the pairing pattern might better 
be labeled a temporary infatuation. 

Tt may be an academic exercise to differen- 
tiate between friendliness and popularity as if 
they were orthogonal factors. Using photo- 
graphs as stimuli, Dion, Berscheid, and Wal- 
ster (1972) showed that attractive persons of 
both sexes were perceived to be more sociable 
and outgoing than less attractive persons, as 
well as more sexually warm and responsive, 
sensitive, kind, interesting, strong, poised, and 
modest. Outside the laboratory, then, attrac- 
tive persons may also be friendlier because 
their greater popularity, even as early as 
nursery school (Dion & Berscheid, Note 1), 
ensures that others will initiate friendly inter- 
action with them, 


3In another experiment, each subject was placed 
among three confederates who attempted to enact 
either the target, actor, or pairing pattern in a 10- 
minute face-to-face getting-acquainted situation in- 
volving two males and two females (Cunningham, 
1975). Contrary to the findings of the reported 
scripts experiment, pairing-based couples emerging 
among the confederates were rated somewhat more 
likely to have both a more intense and a more 
relaxed conversation in subsequent interaction than 
popularity- or friendliness-based couples. Unlike the 
target and actor patterns, predicted pairing couples 
were also rated more likely to pair up than non- 
predicted couples in the same pattern. These results 
would imply that in actual interaction, pairing con- 
stitutes a more powerful basis for a future relation- 
ship than target or actor properties. However. 
though the differential popularity or friendliness of 
the confederates was perceived as predicted, checks 
on the manipulations (judge's ratings of the fre- 
quency and directionality of utterances from audio- 
tapes of the experimental sessions) generally failed 
because confederates were unable to maintain the 
5:1 conversational asymmetry required in the M,F, 
and MF; cells of the target and actor patterns, 
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The present experiments substantiate the 
finding that attractiveness is an important 
asset for both sexes in their heterosexual in- 
teractions but is more important for females 
(Berscheid, Dion, Walster, & Walster, 1971). 
Couples who included the popular female were 
rated as pairing more strongly (target pat- 
tern: F(1, 40) = 19.3, p < .001). In compari- 
son, popularity of the male yielded a slightly 
weaker effect, (1,40) = 13.6, p < .001. Ad- 
ditionally, in the divergent pattern, the popu- 
lar female generated greater likelihood-of- 
pairing than the friendly female, F(1,40) = 
9.50, p < .005. On the other side of the coin, 
the friendliness of the male contributed to 
the likelihood-of-pairing in the actor pat- 
tern, as reported above, and friendliness of 
the female did not. These, then, are the 
threads of some consistent sex-role differences 
which are assumed to affect the perceived 
success of pairing of males and females of 
differing traits. To the degree that the manip- 
ulated differences represent stable attributes, 
the present college freshman and sophomore 
subjects perceived those couples most likely to 
succeed in which the male is friendly or out- 
going and the female is popular or attractive 
(ie., when their behavior is in accord with 
conventional cultural norms). 


REFERENCE NOTE 


1. Dion, K. K., & Berscheid, E. Physical attractive- 
ness and social perception of peers in preschool 
children. Unpublished manuscript, University of 
Minnesota, 1972, 
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The Generosity Shift 


J. T. Muehleman, Charles Bruker, and Clara M. Ingram 
Murray State University 


Two studies were conducted to extend Brown’s value theory and Levinger and 
Schneider’s conflict compromise theory of the risky shift phenomenon to the 
area of generosity, Study 1 was designed to establish that questionnaire-mea- 
sured generosity was positively valued and that subjects saw themselves as 
above average in generosity. The subjects responded to the questionnaire with 
(a) their own choice, (b) what they believed their peers would choose, and 
(c) the choice they most admired. The results indicated that subjects presented 
themselves as more generous than their peers and that subjects admired re- 
Sponses more generous than theirs. Study 2 tested subsequent predictions from 
value theory and conflict compromise theory, Eight groups of experimental 
subjects filled out the questionnaire three times in one session: (a) individually, 
(b) unanimously as a group, and (c) individually after group discussion. The 
control subjects filled out the questionnaire two times individually with inter- 
vening time. As predicted, the experimental group showed a generosity shift 
from the first individual to the group decision as well as from the first to sec- 
ond individual administration. Statistical tests also revealed a predicted rela- 
tionship between experimental subjects’ initial relative position within each 
group (as per the initial private generosity) and the amount of shift from the 


first to the second private responses. The results are discussed in terms of the 
generality of value theory and conflict compromise theory, Other theories that 
have been advanced to account for the risky shift phenomenon are also dis- 


cussed. 


The “risky shift” phenomenon has been 
reliably found by a number of investigators, 
and a number of theories have been con- 
structed to account for the findings (see 
Cartwright, 1971, for a review). The findings 
are seen by a number of investigators as a 
particular instance of a more general phe- 
nomenon called choice shift or group-induced 
shift (Brown, 1974; Levinger & Schneider, 
1969; Pruitt, 1971). Briefly, risky shift labels 
the finding that a group discussion leads to 
a unanimous group decision which is, for some 
situations, more risky than the prediscus- 
sion individual decisions (Cartwright, 1973; 
Kogan & Wallach, 1967; Stoner, 1961; Teger 
& Pruitt, 1967; Wallach, Kogan, & Bem, 
1962). It is also found that postdiscussion 
individual decisions are more risky than pre- 
discussion decisions. The Choice Dilemmas 
Questionnaire (Wallach et al., 1962) has been 
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used to assess the riskiness of subjects before 
and after group discussion as well as the 
groups’ unanimous decisions. 

Brown (1974) has generalized his value 
theory of the risky shift, and he states the 
conditions under which other shifts toward 
increased virtuousness should occur. Specif- 
ically, he has hypothesized that in certain 
situations, people want to see themselves and 
be seen by others as above average in virtue, 
In some situations, people do not necessarily 
have accurate normative data to decide how 
to be moderately above average in the par- 
ticular valued areas. A group discussion would 
allow persons to gather such data and Brown’s 
theory predicts that the group’s unanimous 
decision would be more virtuous than the pre- 
vious individual member’s before the discus- 
sion. Also, there would be a shift toward more 
virtuousness in the postdiscussion individual 
choices in order for the participants to see 
themselves as moderately virtuous. 

Levinger and Schneider (1969) have pre- 
sented a similar analysis for the phenomenon 
which they labeled choice shift. They have 
hypothesized that a choice shift would occur 
whenever three conditions are met: (a) A 
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a 


GENEROSITY SHIFT 


&roup member has an ideal preference which 
differs from his or her expectations of the 
stoup standard; (b) the member’s own choice 
concerning the situation lies between his or 
her ideal preference and Perception of the 
group’s standard; and (c) subsequent infor- 
mation (e.g., from a group discussion) reveals 
that the group standard is much closer to the 
ideal than was earlier believed. Levinger and 
Schneider’s conflict compromise model would 
appear to make the same predictions as 
Brown’s. Both have predicted that in any 
valued choice situation such as altruism, 
honesty, or generosity, there would be a 
choice shift. 

A study by Myers and Bishop (1970) ex- 
tended the study of the choice shift phenome- 
non to the area of racial attitudes, They were 
able to predict differences in the direction of 
shifts in racial prejudice following relevant 
group discussion in groups of high-school 
students with high, medium, and low preju- 
dice. The present research consisted of two 
studies designed to extend the study of the 
choice shift into another value area, gener- 
osity. In order to test for a choice shift, 
we developed a questionnaire somewhat anal- 
ogous to the Choice Dilemmas Question- 
naire. Study 1 was similar to Levinger and 
Schneider’s (1969) investigation of the Choice 
Dilemmas Questionnaire, The study was de- 
signed to establish that generosity, as mea- 
sured by the generosity questionnaire, was 
positively valued by our population of sub- 
jects and that subjects were motivated to per- 
ceive themselves as above average in gener- 
osity. Study 2 was designed to test for a 
choice shift toward increased generosity after 
group discussion and to test other predictions 
made by value theory. 


Development of the Generosity 
Questionnaire 1 

Nineteen situations, each with five alterna- 
tive responses, were generated on the basis of 
face validity to measure generosity. These 
items were administered to 30 volunteer intro- 
ductory psychology students, and Pearson 
product-moment correlations were calculated 
between scores on individual items and total 


- test scores. The 10 items with the highest cor- 


relations with total test scores were selected; 
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the range of coefficients was from 47 to 
-66. The following two items are given as 
representative samples of the 10 items: 


1. In the city where you live, many places of resi- 
dence have been recently damaged by a series of 
explosions due to some kind of malfunctioning of the 
city’s gas system. As a result, there are a number 
of families currently without adequate shelter, The 
mayor has requested that those whose homes have 
not been damaged volunteer to take their less fortu- 
nate fellow citizens in for a time, Your home has 
not been damaged, and you have room enough for 
a family of four to stay with you for awhile, How 
long would you agree to have one of these families 
stay with you? 

A. not at all 

B. up to a week 

C. more than a week, but not more than two weeks 

D. more than two weeks, but not more than a month 

E. more than a month 


2. You're attending school in Florida and you've 
made plans to go home to New York for Christmas, 
You’ve found three other students to ride with you 
and share expenses, and everything is set, Then, 
about a week before vacation begins, you happen to 
learn that an acquaintance of yours, also from New 
York, won't be able to go home because he’s com- 
pletely broke, You've thought about asking him to 
go with you, but he couldn't share expenses, and 
his presence would make traveling conditions more 
crowded, What is the probability that you would 
invite him to ride with you? 

A. 0-20% 

B. 21-40% 

C. 41-60% 

D. 61-80% 

E. 81-100% 


Srupy 1 


The first purpose of Study 1 was to es- 
tablish that generosity, as measured by the 
questionnaire, was positively valued or seen 
as a virtue. The reasoning behind the method 
was that whenever generosity was positively 
valued, the alternative that the subjects ad- 
mire most would be more generous than their 
own choice. 


The second purpose of Study 1 was to test 


one of Brown’s (1974) hypotheses that in 
many situations involving virtuousness, per- 
sons are motivated to see themselves as more 
virtuous, in this case more generous, than 
the average person. From this it was pre- 
dicted that whenever items were used to mea- 


1 Copies of the generosity questionnaire may be 
obtained from the first author, 
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sure a positive value, subjects ought to per- 
ceive others as less generous (virtuous) than 
themselves. 


Method 
Subjects 


The subjects were 153 male and female students 
enrolled in undergraduate psychology classes. The 
subjects volunteered for the study and completed 
the questionnaire during their regular class period. 


Procedure 


Each subject responded to each of the 10 situations 
on the generosity questionnaire under the three sets 
of instructions. Instructions for the three administra- 
tions were as follows: 


1. The following items are concerned with gener- 
osity in various situations. You are asked to read 
each item carefully, try to put yourself in each 
situation and pick the one which most closely 
resembles what you yourself would do in that 
situation, 

2. You have just finished describing what you 
would do in the various situations. Now reread 
each situation and mark how you believe a major- 
ity of your fellow students would behave in each 
situation. 

3. Please reread each situation for a third and 
last time. This time indicate the choice you would 
most admire, regardless of what you or most others 
might prefer. 


Results 


Table 1 presents the means for all 10 items 
under each of the three instructional sets 


TABLE 1 


MEANS FOR Amount or GENEROSITY CHOSEN UNDER 
THE THREE INSTRUCTIONAL SETS AND MEAN 
DIFFERENCES BETWEEN FELLOW STUDENT 
Vs. SELF AND ADMIRED Vs. SELF 


Difference 
Instructional set 
Fellow 

Fellow Most student Admired 

Item Self student admired vs. self vs. self 
1 3.67% 2.86 4.65 —.81** I8 
2 2.30 2.03 4.14 +27" 1.848% 
3 3:33 152.65 4.46 —.67** 1.14** 
4 4.06 3.24 4.56 = 82 a 
5 4.25 3.39 4.82 —.86**  .58** 
6 3.78 3.14 4.83 —.63** 1.05** 
7 3.50 2.88 4.66 —.63** 1.16** 
8 4.05 3.12 4.63 —.93** 508 
9 3.69 2.98 4.54 —.71**  84** 
10 3.59. 2.76 4.39 —.83**  .80** 


Note. The p values were determined by £ tests of difference be- 
tween correlated means for a sample of 153 subjects. 
a Alternative among five given; the larger the number, the 
more generous the choice, 
*p <.0: 


** p < 1001. 


as well as the mean differences, to be dis- 
cussed below. The first column of mean differ- 
ences is between subjects’ own responses and 
their perceived fellow students’ responses. 
The negative differences indicate that the 
average subject saw himself/herself as more 
generous than the majority of his/her fellow 
students. A ¢ test of difference between 
correlated means was calculated for each dif- 
ference and indicated that for each item 
the differences between responses under two 
instructional sets were significant. 

The second column of mean differences in 
Table 1 is between subjects’ own responses 
and those marked as admired most. The sta- 
tistically significant difference for each item 
indicates that the subjects consistently ad- 
mired responses more generous than those 
which they had chosen as their own. 


Discussion 


The results of Study 1 were as predicted 
by Brown’s value theory. First it was estab- 
lished that generosity as measured by the 
questionnaire was positively valued, that is, 
subjects admired responses more generous 
than their own, The questionnaire was thus 
a method of measuring what our subjects 
perceived as virtuousness. 

Given this, Brown (1974) would predict 
that subjects would be motivated to be above 
average in generosity. This prediction was 
confirmed in that the subjects perceived them- 
selves as significantly more generous than 
their fellow students. 

These findings, then allow a test of 
Brown’s value theory as well as Levinger and 
Schneider’s conflict compromise theory of the 
risky shift pheomenon. Given the above find- 
ings, it would be possible for Brown’s value 
theory to predict a generosity shift, given the 
appropriate conditions. Specifically, if subjects 
were to record their responses initially in 
private and then undergo a group discussion 
of the items, followed by a second private 
recording, there should be a shift to increased 
generosity, 


Srupy 2 


: Study 2 was designed to test predictions de- 
rived from Brown’s value theory and Levinger 
and Schneider’s (1969) conflict compromise 


GENEROSITY SHIFT 


theory, The experimental group received all 
conditions that are hypothesized by Brown 
and by Levinger and Schneider as sufficient for 
a generosity shift. The results of Study 1 
indicated that the generosity questionnaire 
measures a positively valued behavior and 
that persons (erroneously) perceive others as 
less generous than themselves on the ques- 
tionnaire. Given this, the experimental group 
filled out the questionnaire three times: first, 
in private, individually (Iı); then, in groups, 
they discussed the items and came up with 
unanimous group decisions for each item (G); 
third, following the group discussion, subjects 
filled out the questionnaire again in private 
(I2). Control subjects filled out the question- 
naire twice individually (I, and Is) with a 
free-time interval between administrations. 
The value theory and conflict compromise 
theory make the following predictions from 
this experiment. 


Prediction 1 

The first prediction is that the experimental 
subjects will show a generosity shift from I; 
to G. Within each group the individual with 
the most generous personality will set the 
generosity norm for the other members and 
cause them to shift for the G decision. 


Prediction 2 
The second prediction states that there will 
be a significant generosity shift from I; to Tz 
after the group discussion. The group discus- 
sion serves to provide the average group 
member with information that allows him or 
her to discover that compared with the others, 
he or she is not as generous as had been 
ly. In order to maintain 


believed previous 
the image of self as moderately above average 
in generosity, the subject may select more 


generous alternatives on To. 


Prediction 3 


Prediction 3 sugges 
inverse relationship be 


ts that there will be an 
tween a group ng 
i sit sition within her 
ee ie anal of shift from I, to Io, 
with the most generous member showing » 
change or 4 shift toward less mo z 
Vidmar (1970) derived and y 
prediction from Brown’s (1965) value 
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for the risky shift situation, Brown (1974) 
has stated clearly that to be virtuous is to be 
different from the mean but in the right direc- 
tion and to the right degree, To the extent 
that a person is low in generosity as com- 
pared with fellow group members, we would 
expect that individual to shift and become 
more generous in order to regain that self- 
image as a more generous than average per- 
son. However, the person in each group who is 
initially most generous might find herself too 
different from the other group members. So 
if there is any change at all in these per- 
sons, we would expect it to be toward less 
generosity on Is. 


Method 
Subjects 


Subjects for Study 2 were 73 female students en- 
rolled in an introductory psychology course. Subjects 
volunteered for participation in an experiment con- 
cerning generosity and received class credit for their 
participation. Initially, 80 subjects were randomly 
assigned to experimental and control conditions of 
40 each. The experimental subjects were then ran- 
domly assigned to the eight 5-person groups. Two 
experimental subjects failed to appear, leaving two 
groups with only 4 members; 5 control subjects 
failed to show up, leaving 35 controls, 


Procedure 


Experimental condition. Experimental subjects were 
tested in a seminar room containing a long table, 
with no more than five subjects present for any one 
session. The initial individual testing (11), group dis- 
cussion with unanimous decision (G), and second 
individual testing (Is) were conducted with the 
experimenter present in a single 14-hour session. 

For the J; administration, the 5-person groups 
were asked to complete the generosity questionnaire 
under standard instructions (see Study 1, first in- 
structional set). Without having any prior expecta- 
tion regarding open discussion of their decisions, the 
subjects were then asked to assemble jinto a group 
at the end of the table and were read the following 
instructions: 


This questionnaire is the same one which you 
just finshed completing. You were asked to fill it 
out the first time in order to familiarize yourself 
with all the situations and to give you some idea 
where you might stand on each. Now you are 
requested as a group to discuss each item in turn 
and arrive at a unanimous decision on each. 

This time you will not be allowed to return toa 
problem, but rather must discuss each one till 
a group decision is reached before going on to 
the next one. When the group reaches its decision 
on a question, each of you is asked to mark it on 
your own questionnaire with a check mark, in 


348 


order to have a record. As you may have surmised 
by now, this discussion will be taped. The purpose 
of this is to allow the communication process to 
be analyzed later. 

Again there will be no time limit—take as long 
as you like to discuss the items. (It takes most 
groups about 40 minutes.) The group will be com- 
pletely on its own, the experimenter not partici- 
pating in the discussion at all. He will be right 
outside the room and would like for one of you 
to let him know when you have arrived at your 
unanimous decision for all ten items. 

Remember, out of the five alternatives presented 
for each item, you are to pick the one which most 
closely resembles what you as a group would do in 
that situation. 


The groups took approximately 45 minutes to 
complete the task. Immediately upon its completion, 
I; instructions were given as follows: 


You are now requested, once again, to go back over 
the situations, read each of them carefully, and 
indicate your own present personal decision with 
a “P.” Once again, there will be no time limit, 
and this time you may return to an earlier item if 
you wish to do so. 

The experimenter is not interested in testing your 
memory, your consistency, or your conformity to 
the group. So, whether or not you adhere to your 
original private decisions is not important. It is 
also unimportant whether you agree or disagree 
with any of the group's decisions. It is important 
only that you choose the alternative which most 
closely resembles how you, at the present time, 
think that you would behave in each situation, 


Upon completion of the Ie decision, subjects were 
dismissed from the experiment. 

Control condition. Subjects in the control condi- 

tion met in a classroom setting in two groups of 
approximately 20 each. They were initially (L) 
administered the generosity questionnaire under 
standard instructions (see Study 1, first instructional 
set). 
After completing the questionnaire, the subjects 
were instructed to simply wait for the next part of 
the experiment but not to talk to each other (they 
had previously been instructed ot bring along mate- 
ials with which to occupy some time). The waiting 
was approximately the same as the group 
times in the experimental groups. 


iod 


out new copies of the generosity 
ng with the following instructions: 


This questi ire is the same one you completed 
a little while . You are now requested to go 
back over the uations and to reconsider them, 
The experime is not interested in testing your 
memory or consistency, but rather wants you to 


truly reconsider each item. Whether or not you 
adhere to your original decision is not important. 
It is important only that you choose the alterna- 
tive which most closely resembles how you, at the 
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present time, think that you would behave in each 
situation. 


The instructions were designed to allow change, 
but it was neither encouraged nor discouraged. 


Results 


Table 2 presents the means of the generosity 
scores for the experimental subjects at Iı, G, 
and I, and for the controls at I, and I». In- 
cluded in Table 2 are the mean shifts from 
I, to G in the experimental group and from 
I, to Iz for both experimental and control 
groups. 

The first prediction stated that the experi- 
mental subjects would show a generosity shift 
from I, to G. The prediction was supported; 
there was a significant shift from the predis- 
cussion individual decisions to the unanimous 
group decisions in the direction of increased 
generosity, (37) = —3.460, p < .01. 

Prediction 2 was that there would be a 
significant generosity shift from Iı to Is in 
the experimental condition, The negatively 
signed mean change score was sufficiently large 
to be significant, (37) = —2.639, p < .01, 
and indicates that experimental subjects were, 
on the average, more generous on Iz than on 
I. As expected, there was not a significant 
difference between the I, and I2 generosity 
scores of the control group to reach statistical 
significance, 

Prediction 3 suggested that there would be 
an inverse relationship between initial level 
of generosity in the experimental groups and 


TABLE 2 


Means or GENEROSITY DECISIONS OF EXPERIMENTAL 
AND CONTROL SUBJECTS AND MEAN SHIFT SCORES 


Subject 
LL 
N 

Mean and mean shift score (n= 38) CES 
Generosity decision 

Initial individual (T,) 36.34 36.26 

Unanimous group (G) 39.74 

Second individual (Ip) 38.76 36.69 
Mean shift 

IL-G —3.39* 

I-i: —2.42* —.43 


Nolte. Higher scores indicate greater generosity, 
*p <.01, 


Ww 


GENEROSITY SHIFT 


te ee ey 
Bats: ee ae ae Janie relative to 
the oan n each group decreases, 
: shift toward increased generos- 
ity on Iz ought to increase. An individual 
generosity shift score was calculated for each 
subject by subtracting her Is from I, total 
generosity score. Relative position within each 
experimental group was determined by the I; 
scores. Subjects in each group were arranged 
in decreasing order of generosity, with S; indi- 
cating the most generous person within each 
group initially. Seven 5-person groups were 
randomly formed from the 35 control subjects. 
These subjects were arranged into relative 
generosity positions in the same manner as the 
experimental subjects. These groups were es- 
tablished as a control for possible regression 
toward the mean. Table 3 presents the mean 
shift scores for the most through the least 
generous members (Sı through Ss) for the 
experimental and control conditions. 

A 2 X 5 (treatment X member relative gen- 
erosity position) analysis of variance* was 
performed on the shift scores. The effect of 
relative initial generosity level was significant, 
F(4, 63) = 5.56, p<.01. Table 3 clearly 
shows that in the experimental condition, the 
least generous persons exhibited the greatest 
amount of shift from I, to Is. As the value 
hypothesis would predict, the most generous 
members in the experimental condition did 
not shift toward greater generosity. In fact, 
a £ test was computed to determine if there 
was a difference in mean shift of the experi- 
mental Sı subjects as compared with the con- 
trol Sı subjects. The difference between the 
mean shifts was statistically significant, t(13) 
= 1.98, $ <.05, suggesting that compared 
with the control Sı subjects, the experimental 


TABLE 3 


RELATIVE INITIAL GENEROSITY IN Group 
AND MEAN I-I SHIFT 


Group member Experimental Control 
Sı 3.12 —.85 
S 25 28 
S —2. f 
Si —6.19 — 92 
Ss = 742 =1.50 


res, the greater the generosity 


ve ‘hange sco! 
ee ode A S+ the least generous 


shift. (Sı indicates the most generous ani 
member.. 
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TABLE 4 
MEAN GENEROSITY SinrT SCORES FOR ITEMS 
Teele 

Item Esperimental. Control 

1 17 F 

—.4/ 

2 —.26 

3 06 

4 03 

5 00 

6 = 11 

d 09 

8 —A7 

9 20 

10 09 
Nole. Abbreviations: I = individual decision; G 

= unanimous group decision; Iz = second individual decision, 

*p <.05. 
p <1, 


Sı subjects shifted to less generosity from 
I, to I. 

The main effect for treatment was signifi- 
cant, F(1, 63) = 11.92, p < 01, and indicates 
more overall generosity shift for the experi- 
mental as compared with the control subjects. 
This could be seen as another test of Predic- 
tion 2. As expected, there was also a signifi- 
cant interaction of treatment and initial 
relative generosity in groups, F (4, 63) = 3.77, 
p < .01, which is attributable to the orderly 
increase in the amount of shift with de- 
creasing initial position in the experimental 
subjects that was not present in the control 
subjects. 

As a further test of the value theory, it 
was decided to test for significant generosity 
shifts on individual items. Table 4 presents 
the mean shift scores of the experimental and 
control groups on individual items. Individual 
t tests of difference between correlated means 
were calculated for each shift score for caghs 
item, Ten of the 20 ¢ tests for the experiment 
were statistically significant, whereas none 


2 All of the assumptions required for an analysis 
of variance are not met by the data, Clearly, sub- 
jects were not randomly selected for the levels of 
relative risk position factor (B), but rather were 
assigned as a function of their I, scores. Also, mem- 
bers of each group shared an experience which was 
unique to that particular group; therefore, Positions 
S, through Ss are not independent. Any common 
variance resulting from unique group experience 
would make the test for main effects of the member 
position factor more conservative because it would 
reduce between-levels variance. 
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theory. In the case of the S, subjects, it a 
would be assumed that they initially filled out 
3 the generosity questionnaire as a compromise 
PEAS between two conflicting pressures: (a) what 
the individuals value or desire, and (b) what 
they consider realistic or reasonable. Their 
perception of other persons served as a refer- | 
ence or standard by which to judge what was 
realistic for each item. The S, subjects dis- 
covered during the group discussion that the 
other members were further from the ideal or 
valued point than was earlier believed (on Iņ). 
The S; subjects, therefore, used the informa- 
tion from the group discussion to make a new 
compromise between their ideal and the newly 


of the 10 ¢ tests for the control groups were 
significant. 


The results of Study 2 are consistent with 
both a value theory and a conflict-compromise 
model of the choice shift phenomenon, 

Predictions 1 and 2 may be discussed to- 
gether. It was predicted that there would be 
evidence of generosity shifts from I, to G and 
from I, to Iz. Both predictions were con- 
firmed. These results thus add support for 
the generalizability of the risky shift phe- 
nomenon and the ability of Brown’s value 
theory and Levinger and Schneider’s conflict- 
compromise theory to account for this more discovered realistic point. If these S, subjects 
abstract process. strike a new compromise, it would have to be 

Prediction 3 proposed that relative I; gen- back toward what was perceived as more rea- 
erosity position within groups would be in- sonable and away from the ideal. As subjects’ 
versely related with the amount of shift from relative position in the group decreased (i.e; 

I to Iz. The prediction was confirmed, with Se through S5), subjects would see themselves 
the most generous subjects in the discussion further from their ideal and others as rela- 
groups actually shifting toward less generos- tively closer to it. The orderly increase in 
ity, as compared with the control S; subjects. shift toward generosity (which is the ideal or 
In addition, there was an orderly increase in valued choice) is clearly predicted by this x 
the amount of generosity shift as relative po- model as relative initial position decreases. 
sition decreased. These results were predicted Other theories which have been advanced to 
and are explained by Brown’s value theory in account for the risky shift phenomenon in- 
the following way: In some situations persons clude leadership, diffusion of responsibility, 
are motivated to be above average but not so and familiarization. There are at least four 
much as to appear overly righteous. The re- versions of leadership theory (Pruitt, 1971), 
sults of Study 1 indicate that taking the gen- none of which could have predicted the find- 
erosity questionnaire is an example of such a ings of the present study, Two will be con- 
situation, The S; subjects, those who initially sidered here. One leadership theory would 
had the highest scores within each group, assume that highly generous persons are in- x 
were unquestionably well above average for herently more persuasive, The analogous as- 
their own group with regards to how they sumption for the risky shift phenomenon has 
presented themselves on the questionnaire. not been supported, and there appears little 
Through the group discussions, these persons justification for the assumption. Leadership 
may have found that they were too generous confidence theory would hold that persons 
initially (i.e., more than moderately virtuous) who initially score high on the items are 
and shifted toward moderate generosity on more confident in their position, that this con- 
I>. An alternative explanation is that this shift fidence translates itself into more assertive- 
was a result of a regression to the mean, since ness, and therefore they have more social in- 
the Sı subjects in the study were selected by fluence in the group discussion. This hypothe- 
their initially high scores. This explanation sis was not directly tested in the present 
seems unlikely because the S; subjects in the study, but it appears incapable of accounting 
established control groups who did not en- for the shift toward less generosity by the S, A 
gage in group discussion did not shift to less subjects. If these persons were the most con- 
generosity at Iv. fident, why would they shift their positions 
The conflict compromise model offers essen- to more moderate generosity on Is? 
tially the same explanation as Brown’s value Diffusion of responsibility theory as an 
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oles of the risky shift holds that group 
‘Xperience reduces anxiety about potential 
negative consequences associated with risky 
decisions, This reduced anxiety makes it pos- 
sible to accept the risky alternative at a lower 
probability of success. This theory appears 
quite inappropriate when applied to the 
generosity shift found in the present study. 

Familiarization theory (Bateson, 1966) 
holds that during the group discussion, sub- 
jects could be familiarizing themselves with 
the items and that increased familiarity 
makes people more willing to take risks. This 
theory has some support within risky shift 
data but cannot readily be generalized to 
include the present findings involving a 
generosity shift. 

The conflict compromise model may have 
application in understanding and suggesting 
ways to increase prosocial behaviors. Of 
course, as Brown points out, to find that per- 
sons choose more virtuous answers on a ques- 
tionnaire after a group discussion does not 
necessarily mean that any one person would 
actually behave more virtuously. Other rele- 
vant variables, such as response cost, must be 
taken into account. For example, in one of 
the laboratory “emergency” situations used in 
studying helping behavior, it has been found 
that an individual is more likely to offer as- 
sistance when he or she is alone than when 
he or she is with a group. It has been sug- 
gested that the dominant value often aroused 
is “Don’t make a fool of yourself, appear 
cool” rather than “Help others.” Yakimovich 
and Salty (1971) found that when an emer- 
gency situation is made less ambiguous (a 
workman who had fallen called out for help), 
there is a significant increase in helping be- 
havior. Certainly, when the dominant value 
was to help others, prosocial behavior was 
more likely. It is possible that even when the 
dominant value is present, persons do not 
behave helpfully. The conflict compromise 
model assumes that the behavioral choice is 
a compromise between what the individual 
values or desires and what he or she considers 
realistic or reasonable. Further, in deter- 
mining what is realistic, other persons serve 
as a source of information. If we assume that 
initially an individual misperceives others’ 
standards as substantially further from his 
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ideal than they actually are, then any inter- 
vention which would allow the individual to 
discover that fact would allow his/her real- 
istic reference point to shift closer to his/her 
ideal point. Each member in a group would 
be free to make new compromises between 
ideal and newly perceived points of reference, 
and choice could shift toward the ideal. 

More research is required to discover how 
useful this conceptualization will be in other 
areas of prosocial behavior, but it appears 
that value and conflict compromise models 
developed to account for the risky shift are 
applicable in the study of how individuals 
manifest values or virtues in novel situations, 
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Disinhibition Versus the Cathartic Effect: 
Artifact and Substance 


Vladimir J. Koneéni and Ebbe B. Ebbesen 
University of California, San Diego s 


The present research was concerned with the effects of anger and physical ag- 
gression on subsequent physical aggression. Specifically, two experiments ex- 

plored the discrepancy between previous findings that angered subjects’ ag- d 
gression may be reduced by aggression (the cathartic effect), and a recent 

finding of a disinhibitory effect of aggression by Geen, Stonner, and Shope 

(1975). Experiment 1 showed that the Geen et al. results may have been due 

to a “scale bias” inadvertently produced by their operationalization of the in- 
dependent and dependent variables, both involving the Buss “aggression ma- 

chine.” Experiment 2 provided relatively strong evidence that the scale bias 

did not tap disinhibitory processes and that it was an artifact of modest 
substantive interest. Overall, very little support for various versions of the 
disinhibition hypothesis was obtained. In contrast, a strong cathartic effect 

was found after eliminating the scale bias, and with subjects exposed to a 
treatment involving anger-generating insults. The relationship between theoreti- 

cal concepts and the internal/external validity of aggression experiments was 
discussed; some criteria were suggested for distinguishing between the sub- 
stantive, procedurally biased, and artifactual findings. 


Several recent studies have shown that an 
opportunity to “hurt” physically an insulting 
person tends to reduce the amount of sub- 
sequent aggression directed by angered people 
against the annoyer, in comparison to the 
appropriate control groups (e.g., Doob & 
Wood, 1972; Konečni, 1975a; Konečni & 
Doob, 1972). Some extensions of the range 
of applicability of the “cathartic effect” as 
defined above have also been accomplished 
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(e.g., Konečni, Crozier, & Doob, in press). 
In addition, it has been suggested that this 
line of research should be distinguished from 
experiments in which anger is not induced in 
subjects, and/or subjects do not engage in 
physical aggression against the annoyer (i.e., 
studies that involve verbal aggression, non- 
aggressive physical activity, aggression against 
inanimate objects, watching aggressive models 
or violent/erotic films, etc.). Theoretical rea- 
sons for this distinction and reviews of 
the pertinent literature have been provided 
elsewhere (Doob & Wood, 1972; Ebbesen, 
Duncan, & Konečni, 1975; Konetni, 1973, 
1975a). 

In contrast to the previous demonstrations 
of the cathartic effect, Geen, Stonner, and 
Shope (1975) recently reported an increase 
in aggression under circumstances which, at 
first glance, seem to fulfill the requirements 
mentioned above for the occurrence of the 
effect. These authors’ explanation for the dis- 
crepancy focused on the various ramifications 
of the concept of disinhibition, The specific 
issues raised by the Geen et al. (1975) 
methodology, results, and interpretations will 
be discussed in the context of the two experi- 
ments we report here. In addition to con- 
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_ trasting the theoretical positions involved 
our experiments made it possible to address 
some general issues concerning laboratory 
research on human aggression. 


EXPERIMENT 1 


The Geen et al. (1975) 2 X 3 design was 
structurally identical to that used by Doob 
and Wood (1972). Also, the critical 2 (Attack 
vs. Nonattack) X 2 (Opportunity to Aggress 
vs. No Opportunity) portion of the Geen et 
al. design (the four means are presented in 
the top part of Table 1) was conceptually 
equivalent to parts of the designs used by 
Konetni and Doob (1972) and Konetni 
(1975a). One of the findings was that sub- 
jects who had a prior aggression opportunity 
delivered more intense “shocks” than did the 
no-opportunity subjects.* Geen et al. concluded 
that this result represented strong evidence 
against the cathartic effect. 

Alternatively, the Geen et al. finding may 
have been due to a “scale bias” induced by 
the procedure. Whereas few aspects of the 
Geen et al. experiment were conceptually 
novel, the details of the operationalization 
of the independent and dependent variables 
were somewhat different from those used in 
prior studies. The opportunity-to-aggress sub- 
jects were forced by the instructions to give 
the confederate 20 shocks by always pressing 
the number 5 button on a 10-button Buss 
machine, even though the dependent measure 
also involved the Buss machine, and the no- 
opportunity-to-aggress subjects had no experi- 
ence with the response scale.* Thus, as 4 
function of their respective activities in the 
preceding period, the opportunity and no- 
opportunity subjects may have perceived the 
shock-intensity scale rather differently in the 
dependent-measure period. The opportunity 
subjects, having just pressed the number 5 
button many times, may have concluded that 
this part of the scale was “demanded” by the 
experimenter/situation and/or “sanctioned” 
by the experimenter/situation. In either case, 
the opportunity subjects may have adopted a 
“set” for this part of the scale, which, in 
turn, may have had the following behavioral 
manifestations in the dependent-measure pê- 
riod (compared with the no-opportunity con- 
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dition where such a scale bias clearly could 
not have been induced): (a) a higher average 
level of shocks on the first few trials and (b) 
a greater likelihood and frequency of ven- 
turing into the top part of the scale. Either 
of these factors could have inflated the mean 
in the opportunity condition compared with 
the no-opportunity cell. 

If a part-of-scale bias was indeed induced, 
and was due to experimenter-demand factors, 
the Geen et al. results would properly be 
considered artifactual and would have rela- 
tively little to do with the cathartic effect or 
aggressive behavior in general. If, however, 
the bias reflected the perceived experimental 
sanction, the results would still be of theoret- 
ical interest from a disinhibition viewpoint. 
Whereas Geen et al, generally espoused the 
latter viewpoint in their paper (we shall re- 
turn to it later), they did not apply the dis- 
inhibition arguments to the number-5-button 
issue. 


Design, Rationale, and Predictions 


Experiment 1 was designed primarily to 
investigate the possibility that a scale bias 
was induced by the Geen et al, (1975) proce- 
dure. However, some preliminary tests of the 
artifact versus disinhibition interpretations of 
the bias were also made possible by the design. 

To the extent that the Geen et al, results 
were due to a scale bias (regardless of its 


11n addition, attacked subjects gave shocks of 
higher mean intensity than did the nonattacked ones, 
When the full 2 X3 design is taken into account, 
there was also a significant interaction, However, 
the latter result seems to have been due to the two 
additional cells, which are irrelevant for the preseat. 
discussion, Our calculations, based on the data and 
tests reported, and on the assumption of homogene- 
ity of cell variances, led us to the conclusion that 
the interaction was entirely negligible for the 2 X2 
design under consideration (sce the top part of 
Table 1). 

2Prior to the dependent-measure period (which 
involved the Buss machine), the no-opportunity-to- 
aggress subjects must have either pressed none of 
the buttons on the Buss machine, or pressed the 
number 5 button without being told that it pro- 
duced shocks (which of these alternatives was actu- 
ally used is not clear from the Geen et al. report). 
Thus, the no-opportunity subjects had either had 
no experience with the scale, or no experience with 
it as a shock-intensity scale. 
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source), one should be able to increase or 
decrease the score of attacked-opportunity 
subjects simply by shifting the location of the 
bias up and down the scale. In Experiment 1, 
some attacked subjects pressed a high-level 
button (number 8) 20 times, whereas others 
pressed a low-level button (number 2) the 
same number of times. The addition of an 
attacked/no-opportunity-to-aggress group and 
three parallel nonattacked conditions com- 
pleted the 2 (Attack vs. Nonattack) x 3 
(Opportunity-to-Aggress/Button 8; Opportu- 
nity-to-Aggress/Button 2; No Opportunity) 
factorial design. The design was identical to 
the four critical conditions in the Geen et al. 
study, except that the button-8 and button-2 
conditions replaced the button-5 treatment. 
To demonstrate scale bias, it was necessary 
only to obtain the button-8 > button-2 pat- 
tern for attacked subjects. However, if scale 
bias is an artifact, the button-8 versus 
button-2 bias induction should affect both at- 
tacked and nonattacked subjects. In addi- 
tion, since the button-2 subjects should be 
“anchored” to the bottom of the scale, the 
score of the no-opportunity subjects should 
fall somewhere between the scores of the 
button-8 and button-2 subjects in both the 
attack and nonattack conditions. 
The vagueness of the disinhibition hypoth- 
esis makes the prediction process rather dif- 
ficult. One possible interpretation of the 
hypothesis is that it applies only to attacked 
subjects, as there is nothing to be disinhibited 
in nonattacked subjects. According to this 
hypothesis, in attacked subjects only, a 
button-8 scale bias actually taps a disinhibi- 
tory process (these subjects presumably capi- 
talize on the sanction to deliver high-intensity 
punishments). Thus, the button-8 > button-2 
result should be obtained. In addition, the 
presumably large disinhibitory effect of having 
delivered 20 punishments (regardless of inten- 
sity) should cause even attacked/button-2 
subjects to deliver punishments of higher 
intensity than would attacked/no-opportunity 
subjects. 

In contrast to the above (“attack-only”) 
version, some authors (e.g., Goldstein, Davis, 
& Herman, 1975, p. 166) suggest the possibil- 
ity of the aggression-produced “disinhibition 
of an aggressive drive” in nonattacked sub- 


jects (cf. Bandura, 1973, and Konetni, 1975a, . 
for a critique of such a view). Zimbardo 
(1969) also discusses the escalation of ag- 
gression that is independent of the target's 
behavior both before and during an aggressive 
“bout.” Therefore, whereas the predictions 
based on the two versions of the disinhibition. 
hypothesis are identical with regard to the 
behavior of attacked subjects (attack/but- 
ton 8 > attack/button 2 > attack/no oppor- 
tunity), the Goldstein et al./Zimbardo ver- 
sion predicts a similar pattern of results in 
conditions not involving attack. 

In summary, the form of the opportunity 
(button 8, button 2)/no-opportunity main ef- 
fect should distinguish the artifact interpreta- 
tion of scale bias from interpretations based 
on either version of the disinhibition hypoth- 
esis. The pattern of results in the attack 
versus nonattack conditions should distin- 
guish between the two versions of the latter 
hypothesis, 

The further elaboration of predictions re- 
quires that the differences between Experi- 


ment 1 and the Geen et al. (1975) study be * 


noted in detail. First, female experimenters, 
confederates, and subjects were used in Ex- 
periment 1, whereas Geen et al. used male 
confederates and subjects. In fact, one of their 
two (dis)inhibition arguments, which was 
used to explain the difference between their 
results and those of Doob and Wood (1972), 
focused on the sex-of-subjects issue. Doob 
and Wood used a female confederate and 
both male and female subjects. Citing Buss 
(1966b), Geen et al. stated that the Doob- 
Wood subjects may have been inhibited from 
attacking the female confederate (unlike 
their own study where males were supposedly 
disinhibited by attacking males). However, 
even if the Doob-Wood subjects were thus 
inhibited, this grand-mean argument is inade- 
quate to explain either the Doob-Wood cell- 


means differences, or the Geen et al. cell- 


means differences, or the opposite direction 
of the differences in the two studies. Geen et 
al. did not discuss any higher-order interac- 
tions needed to make the argument tenable 
and also neglected to cite the Konetni-Doob 
(1972) findings, even though this study fully 
replicated the Doob-Wood results, used male 
and female subjects and male and female con- 


erates, and cast serious doubt on both the 
bition and retaliation reinterpretations of 
e Doob-Wood data. (The introduction of 
e Geen et al. article consists almost en- 
rely of such reinterpretations.) On the basis 
_Previous work (eg., Konecni, 1975a), 
ich showed a main effect of sex (males 
nish more than females) and no interac- 
ons, we expected simply that the results of 
xperiment 1 would be depressed overall in 
bmparison to the Geen et al. results. 

Second, because of some restrictions im- 
osed by the local human-subjects commit- 
e, we used noise wherever Geen et al. used 
ocks. Again, shock versus noise may result 

a grand-mean difference across experi- 
ents, but it would be difficult to use this 
rocedural difference to explain a different 
attern of cell means. Moreover, Koneéni 
1975a) found that subjects perceived the 
unishing potential of “relatively painful 
ocks” and “very loud blasts of noise” as 
bout equal. 

Third, blood pressure was not recorded 
ecause the Geen et al. blood pressure data 
ully supported the cathartic effect by repli- 
ating the results of Hokanson and Shetler 
1961), Hokanson, Burgess, and Cohen 
1963), Baker and Schaie (1969), and Gam- 
aro and Rabin (1969), among others. 


ethod 


Subjects. Subjects were 48 experimentally naive 
male undergraduates who were teimbursed for 
eir participation. They were randomly assigned 
experimental conditions. 
male students who were all assigned to various 
nditions about equally often. Experimenters were 
females in their early twenties. Each was ,responsi- 
le for one third of the subjects across experimental 
~¢.—fonditions. 


Procedure. The rationale given to subjects, the 


seating arrangement, and the attack/nonattack ma- 
nipulation were identical to those of Geen et al. 
(1975, PP- 722-723). The result of a rigged draw 
was said to “fully specify” what the two subjects 
would: do throughout the experiment. The subject 
was given 12 cards, each bearing a statement one 
controversial social issue, and said “I agree” or I 
disagree” (depending on her views) after reading 
each statement aloud. The confederate, who was in 
auditory but not visual contact with the subject, 
either said “I agree” (meaning agreement with the 
subject’s opinion) oF delivered an aversive 97- 
dB(A) blast of 350-Hz square-wave noise for 4 
seconds (meaning disagreement). The subject knew 


nae 


Confederates were 10, 
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that the confederate had to deliver noise to express 
disagreement and that she had no option regarding 
its kind or loudness. Subjects in the attack condi- 
tions received 10 blasts of noise, whereas those in 
the nonattack cells received 2 blasts of noise, Except 
for the replacement of shocks by noise, these treat- 
ments were quite similar to those of Geen et a 
The manipulation of the opportuni 
variable was also very similar to the C 
(1975, p. 723) procedure. Each of the confed- 
erate’s 10 trials on a “maze-learning” task consisted 
of 3 “choice-points,” and the confederate made 1-3 
errors per trial, for a total of 20. On each of these 
20 occasions (signaled by a light), subjects in the 
button-8 condition pressed the number 8 button on 
a 10-button Buss machine, as they had been in- 
structed to do. Subjects in the button-2 condition 
pressed the number 2 button 20 times, following their 
instructions. In both the button-8 and button-2 
conditions, subjects believed that the button they 
pressed delivered a “loud, but quite safe, blast of 
noise” to the confederate. Because of the previously 
mentioned ambiguity in the Geen et al. report (see 
Footnote 2), the no-opportunity subjects were di- 
vided into two subgroups. Half of the subjects 
simply monitored the confederate’s progress over 
trials; neither noise nor the Buss machine was men- 
tioned. The other half pressed the number 5 button 
on the Buss machine for each error, but there was 
no mention of noise. These subjects believed that 
pressing the button registered the confederate’s er- 
rors “for subsequent computer analysis,” The cables 
and electrical apparatuses surrounding the subject, 
and a PDP-12 computer next door, supported the 
cover story. 
The dependent measure was obtained by means of 
a procedure highly similar to that of Geen ct al. 
(1975, p. 723). The subject keyed in trigrams (on 
an apparatus not resembling the Buss machine), and 
the confederate made 10 “errors” over 16 “code- 
learning” trials. The subject had to press one of the 
10 buttons on the Buss machine for each error, but 
she could freely choose the intensity of “loud blasts 
of noise” supposedly delivered to the confederate. 
The subject had been told that noise intensity in- 
creased with the button numbers, but that all levels 
were safe. Before the beginning of the code-learning 
task, subjects overheard the experimenter instruct k 
the confederate to put on the earphones. In the 
case of the no-opportunity subjects who had previ- 
ously pressed the number 5 button to register errors, 7 
the experimenter plugged in a different looking 
cable “explaining” that the machine was now plugged 
into the noise generator. 1 
After the dependent-measure task, the subject 
responded on several scales described by Geen et al. 
(p. 723). A thorough debriefing session was then 
carried out. 


Results and Discussion 


Ratings. On a 100-mm scale prefaced by 
“How did you feel during the Opinion Simi- 
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larity Task?” (not angry-angry), attacked 
subjects rated themselves more angry (M = 
22.2) than did nonattacked ones (M = 8.9), 
F(1,42) = 4.14, p < .05. In addition, on a 
scale anchored by “I liked her very much” 
(0) and “I did not like her at all” (100), 
attacked subjects (M = 46.4) indicated that 
they liked the confederate significantly less 
during the opinion similarity task than did 
nonattacked subjects (M = 29.6), F(1, 42) 
= 10.75, p<.01. Finally, on a “happy” 
(0)—“not happy” (100) scale, attacked sub- 
jects (M = 50,1) were less happy than were 
nonattacked subjects (M = 42.7), but not 
significantly so, F(1,42) = 3.18, p < .10. No 
other main effects or interactions on these 
three scales, or any effects on the other three 
scales used by Geen et al. (1975) even ap- 
proached statistical significance. Overall, 
these results indicated that the attack manip- 
ulation was reasonably successful. 


TABLE 1 


MEAN INTENSITY OF PUNISHMENTS ON A 10-BUTTON 
Buss MACHINE BY EXPERIMENTAL CONDITION 


Mean intensity of shocks (Geen et al., 1975) 


Initial treatment of 


Subjects’ subject by confederate 
interpolated 
activity Attack Nonattack 
Shocks 
(button 5) 6.655. 3.9250 
No shocks 5.20, 3.20, 


Mean intensity of noise blasts (Experiment 1) 


Initial treatment of 


Subjects’ subject by confederate 
interpolated -— 
activity Attack Nonattack 
Noise blasts 6.09, 3.630 
(button 8) (8) (8) 
Noise blasts 210a 1.7404 
(button 2) (8) (8) 
No noise* 449, 1.454 
(10) (6) 


Note. Data in the upper section are from Geen, Stonner, and 
Shope (1975, Table 1). Cells having a common subscript, 
within the two experiments, are not different at the .05 level 
by the Duncan multiple-range test. In the upper section, these 
subscripts are provided for illus ve purposes only because 
{ been computed in the Geen et al. study on the basis 
MSe for the entire 2 X 3 design. In the lower panel, numbers 
rentheses are cell ns. 

a Means for the subgroups were as follows: attack /no pressing 


< (n =5), 4.64; attack/button 5 (# = 5), 4.34; no ck/no 


pressing (n = 3); 1.40; nonattack/button 5 (» = 3), 1.50. 


a 
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Intensity of noise. The means for the basic 
2 x 3 design are shown in the lower part of 
Table 1. A footnote to this table provides the 
means for the subgroups in the no-oppor- 
tunity-to-aggress conditions, Giyen the minute 
differences between the subgroups in both the 
attack and nonattack conditions, respectively, 
the subgroups were combined for the pur- 
pose of statistical analysis. 

Both main effects were significant. Attacked 
subjects delivered noise blasts of greater in- 
tensity than did nonattacked ones, F (1, 42) 
= 14.27, p < 01. Subjects’ interpolated ac- 
tivity (opportunity/no opportunity to ag- 
gress) also influenced their subsequent be- 
havior, F (2,42) = 11.03, p < .01. The inter- 
action between the two variables was not sig- 
nificant, F(2,42) = 2.47, p = .097. (MS, for 
the above effects was 62.86.) 

As is clear from Table 1, the means were 
generally depressed in comparison to those 
in the four critical conditions of the Geen et 
al. (1975) study, which are shown in the top 
part of the table. While comparisons across 
experiments must be made with caution, this 
finding may reflect the fact that females 
aggressed against females in the present study, 
whereas males aggressed against males in the 
Geen et al. experiment. Such considerations 
do not obscure the principal findings of the 
present experiment. The mean intensity of 
noise blasts administered by attacked subjects 
who had repeatedly delivered the button-8 
noise in the preceding period was greater than 
that of attacked subjects who had had no 
prior opportunity to aggress, whereas the 
mean intensity of noise blasts delivered by 
the attacked/button-2 group was smaller 
than that of the attacked/no-opportunity 
subjects. In other words, by having subjects 
press one or another button in the first ag- 
gression period, either a cathartic or a dis- 
inhibitory effect may be readily obtained. By 
their particular operationalization of the op- 
portunity to aggress (i.e., by requiring sub- 
jects to press repeatedly a button—number 
5—well away from the bottom of the scale), 
which was followed by a dependent-measure 
task involving the same scale, Geen et al. 
may have inadvertently increased the odds of 
not obtaining the cathartic effect. Their data, 
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TABLE 2 
PERCENTAGE oF RESPONSES N DIFFERENT AREAS or tie Noise-Inrensiry SCALE 


BY 


PERIMENTAL CONDITION 


Initial treatment of subject by confederate 


a Attack No attack 
Subjects’ interpolated i Bu ~ Bu B ons Button 
ts’ nterpolate Buttons Buttons Buttons Buttons Buttons Buttons 
Noise blasts (button 8) 12 4b H 56 33 ' a1 p 
Noise blasts (button 2) 81 18 1 91 “d “0 
No noise 29 55 16 95 5 0 


Note. The percentages are based on the total number of responses in a given cell, m X 10 (10 being the number of “error” triale 


on the dependent-measure task). Cell ns are given in T: 
ask). 5 a able 1 
total of 80 (8 X 10) responses were on buttons number 1, 2, or 3, 


therefore, are not convincing evidence against 
such an effect. 

The pattern of the six means, the form of 
the opportunity (button 8, button 2)/no- 
opportunity main effect, and the absence of 
a significant interaction not only supported 
the notion of scale bias, but also suggested 
that the bias may be an artifact of only mod- 
est significance for aggression theory, rather 
than a finding that captures an underlying 


" disinhibitory process. Both versions of the 


disinhibition hypothesis fared poorly. Neither 
version can account for the fact that the 
attacked/button-2 subjects delivered punish- 


` ments of lower intensity than did the at- 


tacked/no-opportunity subjects. In addition, 
whereas the attack-only version cannot ac- 
count for the nonsignificant overall interac- 
tion—as well as the nonsignificant 2 x 2 
attack/nonattack X button-8/button-2 inter- 
action, F(1,42) = 2.76, p = .104—the Gold- 
stein et al./Zimbardo version cannot explain 
the fact that the nonattacked/button-2 sub- 
jects delivered noise blasts of just as low 
intensity as did the nonattacked/no-oppor- 
tunity subjects. While some isolated aspects 
of the results may be interpreted as support- 
ing one or the other version of the disinhibi- 
tion “hypothesis, the overall pattern did not 


* favor either version. 


Distribution of responses. The way in which 


-, subjects in different conditions distributed 


their responses over various parts of the 
noise:intensity scale seems as important as 


' the overall cell means. For this purpose, the 


scale was divided into three parts, such that 


or example, in the attack/button-® condition, 10 (12%) of the 


responses in the two critical areas (around 
buttons number 2 and 8) could be distin- 
guished from each other and from the middle 
area of the scale. 

Several aspects of the data shown in Table 
2 are of interest. In only three conditions did 
any subjects venture into the very top of the 
scale (buttons number 7-10), and two of 
these were the attacked/ and nonattacked/ 
button-8 groups. In the attack/button-8 con- 
dition, 44% of all responses were in this area, 
and of these, pressing the number 8 button 
was the most frequent response (13 out of 35, 
or 37%). In contrast, the attack/no-noise 
subjects pressed buttons in the top area rela- 
tively infrequently (less frequently than the 
nonattack/button-8 group) and concentrated 
instead on the middle area, especially, for 
some reason, on the number 4 button (53% 
of their responses were in the buttons number 
4 through 6 range). In the nonattack/but- 
ton-8 group, responses at the top of the 
scale were relatively frequent, in sharp con- 
trast to the other two groups of nonattacked 
subjects and the attacked/button-2 subjects. 

Thus, pressing the number 8 button 20 
times in the preceding period appeared to 
lead both attacked and nonattacked subjects 
to spread their responses over the entire scale 
and to place a relatively large number of 
their responses at the top of the scale. On the 
other hand, pressing the number 2 button 20 
times in the preceding period anchored sub- 
jects to the bottom of the scale. These find- 
ings further corroborated the conclusion that 
differential scale biases had been successfully 
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induced in Experiment 1 and suggested that 
a similar bias may have been induced in the 
Geen et al. (1975) study. 

Effects of trials on intensity of noise. Sub- 
jects’ behavior over the 10 dependent-mea- 
sure trials—divided into 5 blocks of 2 trials 
each for the purpose of analysis—was also of 
considerable interest. The main effect of 
trials was significant, (4,168) = 11.33, p < 
.01, revealing an increase in the intensity of 
noise over trials. The significant interaction 
of Attack/Nonattack x Blocks, F(4, 168) = 
6.18, p < .01, indicated that the increase over 
trials was due largely to the behavior of at- 
tacked subjects. However, the significant 
triple interaction, F(8, 168) = 2.75, p< .01, 
MS, = 2.39, modified the interpretation of 
both of the above effects. In Figure 1 are 
presented the overall Attack/Nonattack x 
Trials interaction and its profiles at the 3 
levels of the opportunity /no-opportunity vari- 
able. Note that the form of the triple inter- 
action did not dispute the conclusions reached 
earlier on the basis of the overall 2 X 3 
means. In fact, from Figure 1 (Panels b, c, 
and d), it may be seen that the pattern of 
the means for Block 1 (first 2 trials) in the 6 
basic conditions was quite similar to the pat- 
tern of the overall means (see Table 1). 
Thus, the intensity levels of initial noise 
blasts (“starting points”) were affected by 
scale bias, just as were the overall means and 
the response distributions. 

As can be seen from Panels b and c of 
Figure 1, the lines representing attacked and 
nonattacked subjects are essentially parallel. 
Thus, the button-2 and button-8 conditions 
did not contribute much either to the Attack/ 
Nonattack X Blocks interaction or to the tri- 
ple interaction. In the case of the button-2 
groups, there was neither an effect of attack/ 
nonattack nor of trials, F(4,168) < 1, for 
the simple-interaction effect. For the button-8 
groups, there was an effect of attack/non- 
attack, but no differential effect of trials, 
F(4, 168) = 1.76, p = .139, for the simple- 
interaction effect. Thus, both the Attack/ 
Nonattack x Blocks and the triple interac- 
tions were largely produced by the no-noise 
groups, F(4,168) = 9.26, p< .01, for the 
simple-interaction effect (see Panel d). 
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The absence of an increase over trials in 


both the attack/ and nonattack/button-2 be 


conditions, and the parallel lines representing 
the behavior of the attacked/ and nonattacked/ 
button-8 subjects, were consistent with the 
artifact interpretation of scale bias. In con- 
trast, the attack-only version of the disinhi- 
bition hypothesis could not account either for 
the lack of an increase over trials in the 
attack/button-2 condition or for the trials 
effect in the nonattack/button-8 cell. The 
Goldstein et al./Zimbardo version could not 
explain the lack of an increase over trials for 
both attacked/ and nonattacked/button-2 
subjects, as well as a similar absence of a 


» 


trials effect in the nonattack/no-opportunity * 


cell.* 

However, data for the no-opportunity con- 
ditions may be regarded as evidence for the 
attack-only version of the disinhibition hy- 
pothesis. No scale bias had been induced in 
these conditions; yet, the mean intensity of 
noise blasts increased over trials for attacked, 
but not for nonattacked, subjects. Neverthe- 
less, it would seem unreasonable to interpret 
this differential behavior over trials as evi- 
dence against the cathartic effect. It could 
hardly be expected that the effect would 
emerge within the 10 trials of which the de- 
pendent-measure task consisted, that is, that 
a decrease would be observed after a “bout” 
containing only a few aggression opportuni- 
ties. Of course, there is also the possibility 
that aggression escalates within a bout, 
whereas the cathartic effect occurs across con- 
secutive bouts (the latter being a reliable 
finding in the Doob-Wood, 1972, Konetni, 
1975a, and Koneéni—Doob, 1972, studies). 


3 These data were actually analyzed as forming a 
part of a 2 X 3 X 5 factorial design. Since Geen et al. 
(1975) had made no mention of the effects of trials, 
results for the basic 2 X 3 design of Experiment 1 
were presented separately to facilitate a comparison 
with their study. 


4It may seem that the absence of a trials effect ™ 


in the nonattack/no-opportunity condition represents 
a failure to replicate some previous work, for ex- 
ample, a recent study by Buss, Booker, and Buss 
(1972). However, a close examination of their results 
(see Figures 1 and 2 of the Buss et al. report) shows 
that there was, in fact, mo increase over the first 10 
trials for the control subjects. 
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Ficure 1, The overall (Panel a) and simple (Panels b, c, and d) Attack/No Attack X Blocks 
of Trials interactions (Experiment 1), The value given for a block is the mean of two consecutive 


trials, 


An alternative explanation for the escala- 
+ tion over trials observed in the attack/no- 
opportunity cell is that these subjects had an 
implicit starting-point bias. Although this 
group had no prior experience with the scale, 
their response variability in the very first 
block of trials (SD = 1.25) was even some- 
what smaller than that of the presumably 
biased attack/button-8 subjects (SD = 2.00). 
Given a starting-point bias located in the low 
part of the scale, and on the reasonable as- 
sumption that the attacked/no-opportunity 
subjects disliked the confederate and regarded 
low intensities as inadequate punishment, it is 
not surprising that an increase over trials was 
observed. To the extent that such low-start- 
ing-point biases exist, an increase over trials 
(“disinhibition”) would be a built-in feature 
of the Buss aggression machine.° 


An additional point concerning the Buss 
machine seems appropriate, The use of this 
machine in a learning-task context has re- 
cently been criticized on the grounds that 


ë Buss (1966a, p. 160) made a somewhat similar 
point: Subjects start low and then explore the scale, 
which yields an “upward drift.” Goldstein et al, 
(1975) found, with a verbal analogue of the Buss 
machine, that nonangered subjects, forced to use the 
equivalent of the number 10 button on the first trial, 
displayed an upward drift over trials. These authors 
concluded that Buss had been wrong, since only a 
downward drift would supposedly indicate scale 
exploration, given the button-10 start. Actually, the 
means for the successive blocks of $ trials were 5.07, 
5.11, 5.09, and 6.27; Not only was the effect of 
trials due solely to the last block, but the results 
could actually be interpreted as showing a downward 
drift from the number 10 button, implying explora- 
tion of the scale. 
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subjects’ delivering shocks (noise) of higher 
intensity may sometimes be an index of their 
desire to help the confederate learn, rather 
than an index of aggression (e.g., Baron & 
Eggleston, 1972; Rule & Hewitt, 1971; Rule 
& Nesdale, 1974). Results for the no-oppor- 
tunity groups did not support these criticisms, 
unless it is somewhat implausibly assumed 
that attacked subjects were more motivated 
to help the confederate than were the non- 
attacked ones. 


EXPERIMENT 2 


Experiment 1 offered more than enough 
support for a scale-bias interpretation of the 
Geen et al. (1975) results to suggest that 
such bias ought to be eliminated in order to 
examine the cathartic and disinhibitory ef- 
fects adequately. In an attempt to attain this 
objective in Experiment 2, a dependent mea- 
sure that did not involve the Buss machine 
was used, and a comparison was made be- 
tween the Geen et al. attack procedure and a 
procedure that seemed more likely to lead to 
anger, 


Rationale and Predictions 


A simple way of eliminating scale bias 
seemed to be through the use of a dependent 
measure that did not involve either the Buss 
machine or intensity of punishments, Three 
conditions of Experiment 2 involved an exact 
replication of the attack/button-8, attack / 
button-2, and attack/no-opportunity condi- 
tions of Experiment 1, except that the total 
number of blasts of noise delivered by sub- 
jects in the context of a “creativity test” 
(e.g., Konecni & Doob, 1972) served as the 
dependent measure. This measure consists 
of many aggression opportunities, each of 
which the subject may choose to take or not 
take by pressing or not pressing a button. 

Note that whereas the use of a different 
dependent measure may be expected to elimi- 
nate the effect of scale bias, it should not pre- 
vent the emergence of a disinhibitory effect. 
Neither version of the disinhibition hypothe- 
sis is Presumably restricted to the Buss-ma- 
chine operationalization of aggressive behavior 
or dependent on a procedural artifact (scale 
bias). As in Experiment 1, both versions pre- 


dicted the following pattern of results: at- | 


tack/button 8 > attack/button 2 > attack/ 
no opportunity.® 

A quite different pattern would support the 
cathartic effect: attack/no opportunity > at- 
tack/button 2 > attack/button 8. However, 
we were reluctant to predict that this effect 
would emerge in the three cells described so 
far, since the research to date has shown that 
anger—a pronounced negative emotional 
state—is necessary for its demonstration 
(Buss, 1961; Doob & Wood, 1972; Koneéni, 
1975a; Koneéni & Doob, 1972). The attack 
manipulation used by Geen et al. may not be 
powerful enough to result in anger; its ef- 
fects may be mainly cognitive, consisting of 
a negative evaluation of the confederate. One 
may disagree with people, especially concern- 
ing social issues known to be controversial, 
and even dislike them, without experiencing 
much emotion,’ 

An important feature lacking in the at- 
tack manipulation used by Geen et al. (1975) 
is insult. Buss (1961, 1966a) has suggested 
that insult may be a major antecedent of 
anger. Indeed, some form of severe, arbitrary 
insult has been used in a large number of 
studies successfully, judging by behavioral 
(aggression), physiological, verbal self-rating 
(anger), and rating-of-confederate criteria 
(e.g., Doob & Wood, 1972; Gambaro & 
Rabin, 1969; Hokanson & Shetler, 1961; 
Hokanson et al., 1963; Koneéni, 1975a, 
1975b; Koneéni et al., in press; Konečni & 
Doob, 1972). In view of these studies, the 
second of the two disinhibition arguments 
offered by Geen et al. to explain the differ- 


6 It was not possible to predict whether the inten- 
sity of punishments delivered (button 2 vs. button 8) 
would have a differential effect in addition to the 
simple distinction of having versus not having 
delivered punishments. 

7 The diastolic-pressure increase following attack, 
observed by Geen et al. (1975), may have been due 
to the shocks subjects received and attributed by 
them to the shocks (subjects knew that the con- 
federate had to administer shocks in order to show 
disagreement). Moreover, using a similar attack/non- 
attack procedure, Zillmann, Katcher, and Milavsky 
(1972) found, in contrast to Geen et al., that the 
physiological indices increased for both attacked and 
nonattacked subjects following the attack/nonattack 
manipulation, and about equally so, 
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ence between their own and the Doob-Wood 
results is surprising (the sex-of-subject/con- 
federate argument has already been dis- 
cussed), According to Geen et al., Doob and 
Wood were able to obtain the cathartic effect 
partly because the subjects had only been 
exposed to the confederate’s “verbal badger- 
ing and insult” (p. 722). Subjects thus an- 
gered, who had had a prior aggression oppor- 
tunity, supposedly obtained a low aggression 
score because they felt that the insulting con- 
federate had already been “repaid.” We con- 
cur that insult (and the resulting anger) was 
one of the key features of the Doob-Wood 
procedure. However, the available evidence 
suggests that the cathartic effect obtained by 
Doob and Wood was not due to inhibitory 
processes (Koneéni & Doob, 1972). Instead, 
it may have been due to the anger-reducing 
properties of insulted subjects’ aggression (in 
the first aggression period), leading to a de- 
crease in the amount of their subsequent ag- 
gression. (See Konetni, 1975a, for an elab- 
oration of this position, that is, a reinterpre- 
tation of the cathartic effect in terms of a 
relationship of bidirectional causality between 
the degree of anger and the amount of physi- 
cal aggression expressed.) 

Such considerations clearly warranted that 
the attack procedure used by Geen et al. be 
contrasted with a manipulation containing 
insult. Therefore, we included three additional 
conditions in the design of Experiment 2. 
These conditions in every way paralleled the 
three conditions already described, except 
that an insult procedure replaced the attack 
manipulation. We predicted that insult would 
prove to be more powerful than attack (in 
terms of various ratings), and that the ca- 
thartic effect, rather than a pattern consistent 
with either version of the disinhibition hy- 
pothesis, would be obtained in the cells in- 
volving insult. Control groups of subjects who 
experienced neither insult nor attack were 
also included in the design. 


Method 


Subjects. Subjects were 58 females drawn from 
the same population as those in Experiment 1. The 
10 confederates and 3 experimenters were those used 
jn Experiment 1. 

Procedure. The design was a 2 (Attack vs. Insult) 
x3 (Opportunity-to-Aggress/Button 8; Oppor- 
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tunity-to-Aggress/Button 2; No Opportunity) fac- 
torial, with two additional contro) conditions. Sub- 
jects assigned to the three attack conditions were 
treated exactly as the attack subjects had been 
treated in Experiment 1. The procedure used in the 
three insult conditions had been described in detail 
elsewhere (Konetni, 1975a; Koneéni & Doob, 1972), 
A subject and a confederate, seated at the same 
table, worked independently for 7 minutes on identi- 
cal, difficult anagrams. The confederate finished her 
anagrams quickly and proceeded to insult the sub- 
ject in a standardized manner. She criticized the 
subject's slowness, alternately told her to hurry up 
and to give up, and repeatedly expressed doubts 
about the subject's intellectual ability, Ten different 
confederates were used to increase generality. 

In the next period, the opportunity/no-oppor- 
tunity variable was manipulated exactly as in Ex- 
periment 1. Both the attack/ and insult/no-oppor- 
tunity groups were again subdivided as in Experi- 
ment 1, Since the differences were negligible, the re- 
spective subgroups were combined for statistical pur- 
poses. 

The dependent-measure setting, task, and instruc- 
tions have been described in detail elsewhere (Ko- 
netni, 1975a; Konečni & Doob, 1972). The task 
supposedly involved “creative imagination,” The 
subject read words from a 30-word list, and the 
confederate provided “creative” responses, On a 
two-button box, the subject pressed the button la- 
beled “good” whenever she thought a response was 
creative, and pressed a button labeled “noise” one or 
more times for each uncreative response, Confed= 
erates’ responses were standardized. Subjects be- 
lieved that the noise button delivered a “loud, but 
safe” blast of noise to the confederate. During this 
task, the subject and confederate were in auditory, 
but not visual, contact. The dependent measure was 
the total number of blasts of noise delivered by a 
subject.® 

The two control groups differed from each other 
only in the initial treatment, which was either no 
attack (as in Experiment 1) or no insult (as in 
Koneéni & Doob, 1972). In the latter case, the con- 
federate treated the subject neutrally during the 
anagram task. Subjects in both conditions then 
monitored the confederate’s work on the maze task 
without pressing any buttons, Because the differences 
were negligible, the two control groups were com- 
bined for statistical purposes. Finally, all subjects 
responded on the scales used in Experiment 1 and 
were debriefed, 


Results and Discussion 


Ratings. An analysis of variance (2x43, 
excluding the control group) revealed that 


*In an attempt to replicate the Konetni-Doob 
(1972) results, the same stimulus words, responses, 
and order were used. Because the original list had 
not been divided into sublists equated for creati 
the effect of trials could not be examined. 


â 
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insulted subjects had been considerably more 
angry in the first part of the experiment than 
attacked ones had been, F(1,46) = 8.06, $ 
< .01 (Ms = 23.3 for attack and 42.9 for in- 
sult, where 0 = “not angry” and 100 = “an- 
gry”). Attacked subjects, in turn, were more 
angry than controls (as revealed by a one- 
way analysis of variance), F(3,28) = 3.01, 
Ż < .05 (M = 9.2 for the controls). In addi- 
tion, insulted subjects disliked the confederate 
far more than attacked subjects did, F(1, 46) 
= 20.48, p < .01 (Ms = 41.0 for attack and 
64.8 for insult, where 0 = “I liked her very 
much”), whereas attacked subjects disliked 
the confederate more than controls did, F(3, 
28) = 6.57, p < .01 (M = 20.8 for the con- 
trols). Finally, whereas insulted subjects were 
Not significantly less “happy” than attacked 
ones were, F(1,46) = 2.98, p< .10 (Ms = 
55.1 for attack and 66.0 for insult, where 0 = 
“happy”), they were less happy than controls, 
F(3, 28) = 3.11, p < .05 (M = 35.0 for the 
controls). In contrast, attacked people were 
not significantly less happy than controls, F (3, 
28) = 1.48. No other effects on these three 
scales, or on other scales used by Geen et al., 
approached statistical significance, 

In summary, whereas the attack versus no- 
attack differences found in Experiment 1 were 
replicated, the results indicated that the insult 
procedure was far more powerful than at- 
tack. This is important in view of the evi- 
dence suggesting that a successful experi- 


TABLE 3 


Mean Numaer or Noise BLASTS ON THE “CREATIVITY 
TEST” By EXPERIMENTAL Conprrion 


Initial treatment of subject 
by confederate 


jnaubiects 
interpolat No attack 
activity. Attack Insult N inate 
Noise blasts 11.25, 9.88), — 
(button 8) (8) (8) 
Noise blasts 10.25, 10.38, — 
(button 2) (8) (8) 
No noise 10.70, 14.90, 7.00 
(10) (10) (6) 


4 Maes Cells having a common subscript are not different at 


no subscript is shown for the control condita Numbers in 
Parentheses are cell ns, 
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mental induction of anger is indispensable for 
the demonstration of the cathartic effect. 

Number of noise blasts. A 2 X 3 analysis 
of variance (excluding the control condition; 
MS. = 7.71) revealed a nonsignificant main 
effect of attack versus insult, F(1,46) = 
1.62, a significant main effect of the oppor- 
tunity/no-opportunity variable, F(2,46) = 
4.16, p < .05, and a significant interaction, 
F(2,46) = 4.62, p < .05. As can be seen 
from the pattern of the means in Table 3; 
both significant effects were due to the large 
number of noise blasts delivered by insulted 
subjects who had no prior opportunity to 
aggress. The one-way analysis on all 7 means 
revealed a significant effect of treatments, 
F(6,51) = 5.93, p< 01, MS. = 7.50. By 
Dunnett’s test, the control group did not dif- 
fer significantly (by two-tailed tests) from 
insult/button 8, attack/button 2, and insult/ 
button 2, but it was significantly smaller 
than attack/no noise (p < .05), attack/but- 
ton 8 (p< .05), and insult/no noise (p< 
01). 

Thus, a strong cathartic effect was ob- 
tained in the three insult conditions: Insulted 
subjects who had had a prior aggression op- 
portunity subsequently delivered fewer pun- 
ishments than did the equally insulted sub- 
jects without a prior opportunity. In fact, 
the opportunity conditions (button 8, button 
2) did not significantly differ from the con- 
trol condition. These results fully replicated 
the findings of Doob and Wood (1972), Ko- 
necni and Doob (1972), and Koneéni 
(1975a); moreover, this was achieved with 
the Buss-machine operationalization of the 
opportunity-to-ageress variable, a procedure 
different from that used in earlier studies. In 
short, with subjects who had been insulted 
(and thus presumably made quite angry), and 
when scale bias had been removed by a 
change in the dependent measure, the ca- 
thartic effect was obtained, rather than a dis- 
inhibitory one. Both versions of the disinhibi- 
tion hypothesis, of course, predicted an in- 
crease in aggression in the opportunity groups 
in comparison to the no-opportunity group, 

The three attack conditions did not differ 
from each other. These results represent an- 
other failure of both versions of the disinhi- 
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bition hypothesis and stand in sharp contrast 
to the findings in the three attack cells of Ex- 
periment 1 and the two attack conditions of 
the Geen et al. study. The purpose of using a 
different dependent measure in Experiment 2 
was to eliminate scale bias. However, this 
minor procedural change eliminated all ef- 
fects of the opportunity-to-aggress variable on 
attacked subjects. These findings comple- 
mented the evidence gathered in Experiment 
1; The scale bias effect seems to be an arti- 
fact of only minor substantive interest; it 
does not tap an underlying disinhibitory 
process. 


GENERAL Discussion 


The converging evidence from Experiments 
1 and 2 suggests that the disinhibition-like 
effect obtained by Geen et al. (1975), and 
their failure to replicate the previous results, 
may have been due to a procedural artifact 
and a somewhat weak induction of anger. 
Predictions based on the various versions of 
the disinhibition hypothesis received no con- 
sistent support. In contrast, the cathartic 
effect was obtained after removing scale bias, 
and only with insulted subjects. The latter 
finding supported earlier suggestions that a 
relatively powerful method of inducing anger 
is necessary for the demonstration of the 
cathartic effect (e.g., Buss, 1961; Doob & 
Wood, 1972; Koneéni, 1975a). 

We are obviously not suggesting that dis- 
inhibition is an invalid concept. The disin- 
hibitory (facilitating) effects on angered and 
nonangered subjects’ aggression of watching 
aggressive models and films, as well as of 
other activities mentioned in the introduc- 
tion, have been well documented. However, 
disinhibition apparently does not occur in 
situations where (a) the emotion of anger 
has been induced, (b) subjects had a prior 
opportunity to hurt (physically) the annoyer, 
and (c) the dependent measure involves 
physical aggression. Under such circum- 
stances, a decrease in aggression, the ca- 
thartic effect, is obtained instead. 

The theoretical reasons for the occurrence 
of the cathartic effect in experiments involy- 
ing anger and physical aggression have been 
discussed in detail elsewhere (Konecni, 


1975a), and need not be reiterated here, It 
also seems clear that the cathartic effect is 
not necessarily due to an inhibitory tendency 
such as “guilt” or “aggression anxiety,” as 
Berkowitz (1962) and Geen et al, have sug- 
gested. Koneéni and Doob (1972) and Frost 
and Holmes (Note 1) found that nonangered 
subjects who had a prior opportunity to hurt 
a confederate (which should have aroused 
guilt) were subsequently no less aggressive 
against a different confederate than were non- 
angered subjects without a prior aggression 
opportunity (who presumably felt less guilty). 
Koneéni and Doob (1972) and Koneéni 
(1975a) obtained the same result even when 
the two-opportunities nonangered group al- 
ways had the same confederate as a target, 
Holmes (1972) found that angered subjects 
felt no more guilty after either direct or dis- 
placed aggression than did controls. In short, 
the cathartic effect obtained in Experiment 2 
and previous studies seems to be relatively 
independent of the (dis)inhibition factors, 
In exploring the Geen et al, (1975) results, 
the present experiments simultaneously ad- 
dressed some more general methodological and 
substantive issues. Criticisms of aggression 
research have often focused on the “meaning” 
of laboratory measures of aggression (¢.g., the 
“teaching artifact” mentioned earlier), Such 
concern is highly justifiable. However, the 
issue of procedural bias, which seems just as 
important, has received very little attention, 
The term procedural bias, as used here, de- 
notes the effect of secondary components of 
an independent variable, namely those which 
have not been designated as the key features 
of the operationalization. (Thus, Geen et al, 
defined their conditions as “shocks vs, no 
shocks,” not “shocks-button-5 vs. no shocks.” 
Presumably, they did not regard the number 
5 button as the critical feature of the oppor- 
tunity-to-aggress variable.) This type of pro- 
cedural bias is clearly due to the relationship 
between the independent and dependent vari- 
ables, rather than to the latter alone, Although 
this problem is not confined to the research 
on the cathartic effect, it is particularly sali- 
ent in this area, because subjects typically 
engage in aggression on two successive occa- 
sions. This requirement makes some catharsis 
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experiments open to alternative explanations 
based on procedural-bias issues (cf. Kahn, 
1966; Konečni, 1975a). 

Of course, a, procedural bias may not nec- 
essarily be an artifact, by which we mean an 
effect irrelevant for aggression. Had we 
found that scale bias tapped a disinhibitory 
process, we would have considered it an in- 
direct confirmation of an aggression-relevant 
concept, rather than an artifact. This point 
emphasizes the need for careful analyses of 
procedures used in aggression research. All 
too often, criticisms of a particular procedure 
or finding are not backed by experiments de- 
signed to separate artifact from substance. 

However, the criterion for the accuracy of 
the type of analyses we are recommending 
must be the real-life patterns of aggressive 
behavior, rather than the current theoretical 
concepts. For example, whether or not 
the “many-punishments” standard (Konecni, 
1975a) is an artifact depends on the likeli- 
hood that people continue to aggress in real- 
life interpersonal situations after their anger 
has subsided. Very little is presently known 
about the relationship between the degree of 
anger, and the duration and other character- 
istics of real-life bouts involving physical 
aggression. Similarly, the present conclusion 
that the scale bias effect is an artifact ulti- 
mately rests on the assumption that compara- 
ble biases do not govern aggressive behavior 
in real life; such an assumption may be in- 
correct, 

Nevertheless, even when data collected in 
the field are in agreement with laboratory re- 
sults, this does not preclude the possibility 
that the latter have been produced by pro- 
cedural biases. For example, in a widely cited 
laboratory experiment, Kahn (1966) found 
that angered subjects’ verbal aggression was 
increased by their prior verbal aggression, 
and suggested that this may have been due to 
cognitive consistency: Having once verbally 
expressed their dislike for ‘he annoying con- 
federate, subjects felt com l to do so 5 
minutes later irrespective of true opinion 
at that time. If Kahn’s exp! n is correct, 


his results may be viewed a o a type of 
procedural bias, because cog -onsistency 
hether or 


was “imposed” by the proce 
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not such a bias is an artifact depends on the _ 


degree to which real-life verbal aggression is 
governed by cognitive consistency. In a field 
experiment, Ebbesen et al. (1975) replicated 
Kahn’s results, but the data offered little sup- 
port for a cognitive consistency explanation. 
To the extent that one considers naturalistic 


data as standards against which laboratory 2 


data should be measured, the conclusion 
would be that in the laboratory “wrong” pro- 
cedures are sometimes used to obtain “right” 
results. In other words, sound intuitions con- 
cerning external validity may sometimes, 
paradoxically, contaminate the internal valid- 
ity of laboratory experiments and also lead to 
inadequate theoretical explanations. 
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The tendency to “bask in reflected glory” (BIRG) by publicly announcing 
one’s associations with successful others was investigated in three field experi- 
ments. All three studies showed this effect to occur even though the person 
striving to bask in the glory of a successful source was not involved in the 
cause of the source’s success. Experiment 1 demonstrated the BIRG phenome- 
non by showing a greater tendency for university students to wear school- 
identifying apparel after their school’s football team had been victorious than 
nonvictorious. Experiments 2 and 3 replicated this effect by showing that stu- 
dents used the pronoun we more when describing a victory than a nonvictory 
of their school’s football team, A model was developed asserting that the BIRG 
response represents an attempt to enhance one’s public image. Experiments 2 
and 3 indicated, in support of this assertion, that the tendency to proclaim a 
connection with a positive source was strongest when one’s public image was 


threatened, 


It is a common and understandable tend- 
ency for people who have been successful in 
some positive way to make others aware of 
their connection with that accomplishment. 
However, there also appears to be a seem- 
ingly less rational but perhaps more inter- 
esting tendency for people to publicize a 
connection with another person who has been 
successful. This latter inclination might be 
called the tendency to bask in reflected glory 
(BIRG), That is, people appear to feel that 
they can share in the glory of a successful 
other with whom they are in some way asso- 
ciated; one manifestation of this feeling is 
the public trumpeting of the association, Such 
a phenomenon is not hard to understand when 
the one wishing to share in another’s success 
has been instrumental to that success. How- 
ever, the more intriguing form of the phe- 
nomenon occurs when the one who basks in 
the glory of another has done nothing to 


„Requests for reprints should be sent to Robert B. 
Cialdini, Department of Psychology, Arizona State 
University, Tempe, Arizona 85281. 


bring about the other’s success. Here, a 
simple case of affiliation or membership is 
sufficient to stimulate a public announcement 
of the critical connection. 

There does seem to be abundant anecdotal 
evidence that people try to make us cognizant 
of their connections with highly positive or 
successful others. The forms of these connec- 
tions are varied. For example, they may im- 
ply similarity of residence, past or present: 
States and cities like to list the names of 
famous entertainers, statesmen, beauty con- 
test winners, etc., who live or were born 
within their boundaries; the state of Indiana 
has even gone so far as to brag that more 
vice-presidents of the United States have 
come from Indiana than any other state. 
Other such connections involve ethnic or 
religious affiliation: Italians speak proudly of 
the ethnic background of Marconi, and Jews 
refer to Einstein’s heritage. Still other con- 
nections reflect physical similarities: “Napo- 
leon was short, too.” Sexual identity may also 
give rise to the BIRG phenomenon: At a 
women’s movement forum attended by one 
of the authors, there was a round of feminine 
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applause when it was announced that Ma- 
dame Curie was a woman and Lee Harvey 
Oswald was not. Finally, connections suitable 
for BIRGing may be as tenuous as an inci- 
dental contact: We all know people who de- 
light in recounting the time they were in the 
same theater, airplane, or restroom with a 
famous movie star. 

While there appears to be rich informal 
support of the sort described above for the 
existence of a BIRG phenomenon, there seem 
to be’no experimental investigations of the 
effect. Thus, it was the purpose of this series 
of studies to examine this tendency to bask 
in the reflected glory of another or group of 
others. In so doing, it was hoped to (a) reli- 
ably demonstrate the existence of the phe- 
nomenon, (b) establish its generality over 
experimental contexts and measures, (c) de- 
termine a mediating process for its occurrence, 
and (d) discover some of its limiting con- 
ditions and thereby gain further information 
as to its nature. 

One of the most obvious arenas for the 
working of BIRG effects in our society is the 
athletic arena. Fans of championship teams 
gloat over their team’s accomplishments and 
proclaim their affiliation with buttons on their 
clothes, bumper stickers on their cars, and 
banners on their public buildings. Despite the 
fact that they have never caught a ball or 
thrown a block in support of their team’s 
success, the tendency of such fans is to claim 
for themselves part’ of the team’s glory; 
it is perhaps informative that the chant is 
always “We’re number one,” never “They're 
number one.” : 

It was our view that a sports context 
would be ideal for a test of some of our 
notions concerning BIRG effects. Our expec- 
tation was that an individual would attempt 
to bask in the glory of, an associated, suc- 
cessful source by publicly announcing his or 
her affiliation with the source and that this 
effect would obtain even when the affiliation 
was clearly irrelevant (i.e., noninstrumental) 
to the success of the source. In order to 
gather data relevant to the above hypothesis, 
an experiment was simultaneously conducted 
at seven universities with powerful intercol- 
legiate football teams during part of the 
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1973 football season, It was predicted that 
students at these schools would be more likely 
to announce publicly their connection with 
their universities after the varsity football 
teams had been successful than after the 
teams had not been successful. We decided 
to measure students’ tendency to announce 
their university affiliation by means of an 
examination of wearing apparel, The fre- 
quency with which students wore apparel that 
clearly identified the university that they at- 
tended was hoped to be a subtle yet sensitive 
measure of the willingness to declare publicly 
a university affiliation. 


EXPERIMENT 1 
Method 


Procedure. From the third week of the 1973 col- 
legiate football season through the last week of 
regular play, the apparel of students enrolled in 
sections of introductory psychology courses at seven 
large universities was covertly monitored. At each 
school, three types of data were recorded in the 
same classes every Monday during the season: (a) 
the number of students present in the class, (b) the 
number of students with apparel identifying the 
school of attendance, and (c) the number of stu- 
dents with apparel identifying a school other than 
the school of attendance, Data recorders at each 
place received the following definitions prior to data 
collection: 


Apparel identifying the school of attendance is 
identified as apparel which unambiguously identi- 
fies your school through names, insignia, or em- 
blems. Examples would be buttons, jackets, sweat- 
shirts, tee shirts, etc. which display the school 
name, team nickname or mascot, or university 
insignia, Apparel which appears school-related 
solely through the use of colors would not qualify. 
‘Also excluded are utilitarian objects which an- 
nounce a university affiliation such as briefcases, 
notebooks, or bookcovers. Apparel identifying a 
school other than the school of attendance are 
those which meet the same criteria for inclusion 
as above but which identify a school other than 
your own, 


The data recorders were not members of the classes 
they monitored. 


Results 

Over all schools and across all weeks, an 
average of 176.8 students were present in the 
monitored classes; an average of 84% of 
these students wore apparel identifying the 
university of attendance, while 2% of them 


y 
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wore apparel identifying a school other than 
the university of attendance, Because of huge 
differences among the schools in absolute 
amounts of these two kinds of apparel wear- 
ing and in order to make comparisons be- 
tween the universities as well as between the 
types of apparel wearing, standardized indexes 
of relevant apparel wearing were considered 
necessary. The standard we decided on was the 
highest percentage of relevant apparel wear- 
ing that occurred on any Monday during 
the season; this standard was simply com- 
puted as the number of students wearing rele- 
vant apparel that day divided by the number 
of students in class that day. The percentages 
of apparel wearing on all other Mondays of 
the season were scored as proportions of the 
highest percentage. So, the Monday with the 
largest percentage of relevant apparel wearing 
was scored as 1.00, and any other Monday 
percentage was scored as a fraction (pro- 
portion) of that standard. This procedure was 
performed on the data from each school for 
the two relevant categories of apparel wear- 
ing: school-of-attendance apparel wearing and 
school-of-nonattendance apparel wearing. A 
mean proportion for each category was ob- 
tained for Mondays following a team’s wins 
and nonwins; these are the mean proportions 
presented in Table 1. As can be seen from 
Table 1, these indexes showed a generally 
consistent tendency for students to wear 
school-of-attendance apparel more after vic- 


TABLE 1 


School-of- 
onattendance 
rel wearing 


attendance 
apparel wearing 


School ins Nonwins 
Arizona State 61 (1) 58 (5) -68 (1) 
Louisiana VERS, rs 

3) 58 (5) -51 (9) 
pase State ) (1) 56 (4) -%4 (1) 
Notre Dame 9 (1) 62 (7) 52 (1) 
Michigan 3(1) .20.(5) 00 (0 
Pittsburgh 7 (2) 31 (4) -50 (2) 
Pe oroli aw -17 (6) o% (1) 

M 63 d 3 4 
ae ~ she number ofgames 


parentheses represe: 


A zes for each school. 
Ad nonwins categories for © ba 


Note. Numbers in 
that fell into wins a 
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tories than nonvictories; but this was not 
the case for school-of-nonattendance apparel. 
Because of the non-normality of the propor- 
tion data, the scores were converted to ranks, 
and Wilcoxon matched-pairs signed-ranks tests 
were performed using school as the unit of 
analysis.* Despite the conservativeness of such 
an approach (for this mode of analysis, m is 
only 7), the Wilcoxon T reflected a conven- 
tionally significant difference on the school- 
of-attendance measure (T = 2, p < .05, two- 
tailed). This result indicated, as predicted, that 
Mondays following football victories ranked 
significantly higher in school-of-attendance 
apparel wearing than Mondays following non- 
victories.2 The mean rank for victories was 
3.2, while that for nonvictories was 4.9. A 
similar test for school-of-nonattendance ap- 
parel did not show any effect; the victory 
and nonvictory mean ranks for this measure 
were 3.4 and 3.7, respectively. This latter 
result suggests that the obtained effect on the 
school-of-attendance measure is not attribut- 
able to a simple tendency to wear clothing of 
a certain type (eg., athletic team jackets, 
sweat shirts, tee shirts, etc.) after an athletic 
team victory. 


Discussion 


In all, we found support for our expecta- 
tions concerning the BIRG phenomenon. 


1In any conversion of parametric data to ranks, 
the possibility exists that the ranked scores will not 
fully reflect the character of the parametric data. 
In order to examine such a possibility with respect 
to our results, a correlational analysis was performed 
on the standardized index scores and their derived 
scores. A highly similar relationship (r = —.83) 
» the two forms of scores was found; the 
ity of this correlation is simply due to the 
the better ranks of those of lower 
l value. 
2ye may be instructive to note that the single 
exception from this pattern in Table 1 occurred at 
the University of Michigan as a direct result of a 
10-10 tie with Ohio State University in a game 
for the Big Ten Conference Championship. Most 
observers, especia the Michigan supporters, felt 
that the Michigan hed outplayed Ohio State 
that day and that t demonstrated Michigan's 
superiority. Howe game, constituted 
Michigan’s only entry our nonwin category, 
resulting in the only reve rsal in our data. 
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Students chose to display more apparel indi- 
cators of their academic affiliation after their 
university’s varsity football team had re- 
cently been successful. It appears, then, from 
these data and from numerous anecdotal re- 
ports that people desire to make others aware 
of what seem to be their causally meaning- 


“ less associations with positive sources.* Why? 


what do they intend to get from it? Perhaps, 
the answer has to do with Heider’s balance 
formulation (1958). Heider discussed two 
types of perceived relations between things: 
sentiment relations, which imply a feeling 
state between stimuli, and unit relations, 
which merely imply that things are connected 
in some manner. It is the unit relationship 
that seems akin to the noninstrumental con- 
nection that people tend to publicize between 
themselves and a successful or otherwise posi- 
tive source, The results of the present experi- 
ment could well be seen as consistent with 
balance theory. For example, if observers 
perceive a positive unit relationship (e.g., uni- 
versity affiliation) between a student and a 
successful football team and if observers gen- 
erally evaluate successful teams positively, 
then in order to keep their cognitive systems 
in balance, the observers would have to evalu- 
ate the student positively as well. Hence, we 
might expect the student to want to make 
the unit connection evident to as many ob- 
servers as possible, in this case, through the 
wearing of university-identifying clothing. 
The process whereby one publicly seeks to 


associate himself or herself with a successful’ 


other, then, may be reinforced by the tend- 
ency of observers to respond in a similar 
fashion to associated stimuli. 

Indirect evidence that tends to support this 
hypothesis comes from research concerning 
the transmission of positive and negative in- 
formation. Manis; Cornell, and Moore (1974) 
have shown that one who transmits informa- 
tion that the recipient favors is liked more 
by the recipient than one who transmits in- 
formation that the recipient disfavors and 
that this liking occurred even though it was 


` understood that the transmitters did not 


necessarily endorse the communicated infor- 
mation. Like the royal messengers of old 
Persia who were feted when they brought 
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news of military victory but killed when they 
brought news of defeat, the transmitters in 
the Manis et al, (1974) study acquired the 
valence of the message with which they were 
simply paired. Moreover, there is evidence 
that people recognize this generalization effect 
and tend to take actions that connect them, 
in the eyes of observers, with positive rather 
than negative news, For example, Rosen and 
Tesser have repeatedly shown (e,g., Rosen & 
Tesser, 1970; Tesser, Rosen, & Batchelor, 
1972; Tesser, Rosen, & Tesser, 1971) that 
people prefer to be connected with the com- 
munication of good news to another than with 
the communication of bad news. Investigating 
the basic effect, Johnson, Conlee, and Tesser 
(1974) found their subjects reluctant to com- 
municate negative information not because 
they felt guilty about transmitting bad news 
but because they feared that they would be 
negatively evaluated by the recipient of such 
news; again, this was true even though all 
concerned knew that the communicators had 
in no way caused the bad news. Thus, it ap- 
pears from these data that: first, individuals 
who are merely associated with a positive or 
negative stimulus (in this case, favorable or 
unfavorable information) will tend to share, 
in an observer’s eyes, the affective quality of 
the stimulus; and second, at some level indi- 
viduals seem to understand the workings of 
this phenomenon and make use of it in the 
ways they present themselves to others, We 
wish to interpret the results of Experiment 1 
in terms of this formulation, Students at our 
seven monitored universities chose to wear 
school-of-attendance apparel after football 
team victories in order to display their con- 
nection with the successful team and thereby 
to enhance their esteem in the eyes of ob- 
servers to the connection, However, another 
explanation of our findings exists as well. 
Perhaps the tendency to wear university- 


3 It might be argued that some subjects felt that 
their presence in the stands on the day of a game 
directly contributed to their team’s success, This 
seems an unlikely explanation for the obtained re- 
sults, as an analysis of the data of Experiment 1 
showed an equally strong BIRG effect for home and 


away games. 
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self in the tende 
apparel. To test 


and to establish the general of the tend- 
ency to BIRG in a xperimental 
situation than that of Exp: at 1, a second 


experiment was conducted. 


EXPERIMENT 2 

The major distinction between the com- 
peting interpretations described above is the 
contention of the BIRG model that students 
wore school-of-attendance clothing after vic- 
tories in order to publicize their university 
affiliations and hence increase their prestige 
in the view of others. The “heightened at- 
traction” formulation makes no such claim: 
One is simply seen to like the school more 
following victories, and this, rather than the 
possibility of increased interpersonal prestige, 
is said to stimulate the wearing of relevant 
apparel, We decided to test these explanations 
by way of an examination of the pronoun 
usage of university students describing the 
outcome of one of their school’s football con- 
tests. Earlier in this article we alluded to the 
tendency of athletic fans to crowd in front 
of television cameras, wave their index fingers 
high, and shout, “Were number one!” The 
choice of this pronoun seemed to us a very 
good measure of the tendency to BIRG. By 
employing the pronoun we, one is publicly 
able to associate oneself with another person 
or group of persons. Through the use of some 
other designation, for example, they, one is 
able to distance oneself from (i.e., to weaken 
the perceived association with) another per- 
son or persons. It was our feeling that in 
order to BIRG a successful football team, 
students would be more likely to describe the 
outcome of a team victory using the pronoun 
we than they would a team nonvictory Tis; 
it was our expectation that this tendency to 
connect oneself with a positive source but 


distance oneself from a negative source would 
influence subjects to use the term, “We won,” 


to describe a team win but use the third - 


person (e.g., “They lost”) to describe a team 
loss.* Further, in line with our BIRG model, 
it was expected that this differential use of 
language would be most pronounced when the 
subject’s esteem in the eyes of an observer 
had been recently lowered. That is, if we are 
correct in proposing that one proclaims a con- 
nection with a positive source in an attempt 
to raise one’s esteem in the view of others, 
then one should be most likely to declare such 
a connection when that esteem has recently 
been jeopardized. Thus, if we were to create 
experimentally in subjects a need to bolster 
esteem in the eyes of an observer, subjects 
should be most likely to announce publicly 
(through use of the pronoun we) a connection 
with a successful team and be least likely to 
publicize a connection with an unsuccessful 
team. On the other hand, subjects who have 
less need to elevate an observer's evalua 
of themselves should show a lesser effect. 


The 


simple “heightened attraction” model would _ 


not make such a prediction, since one’s pres- 
tige in the eyes of others is not a critical 
variable in that formulation. 


Method 


Subjects. The subjects were 173 undergraduates 
at a large state university with a nationally ranked 
footbali team. Subjects were randomly selected from 
student listings in the university phone directory. The 
sample included approximately equal distributions of 
males and females, 

Procedure. During a 3-day period midway through 
the 1974 football season, subjects were contacted on 
the phone by one of 16 experimenters (eight males 
and eight females) identified as an employee of a 
“Regional Survey Center” with headquarters in an 
out-of-state city. The caller explained that he (she) 
was conducting a survey of college students’ knowi- 
edge of campus issues and was in town that day 
calling students at the subject's university, Subjects 
agreeing to participate (93%) were then asked a 
series of six factually oriented forced-choice ques- 
tions about aspects of campus life (e.g, “What per- 
cent of students at your school are married? Would 
you say it’s closer to 20% or 35%?”). Following 


4It should be evident to the reader that the 
general statement of the BIRG formulation includes 
not only the tendency to bask in reflected glory but 
also the tendency to distance unattractive sources. 
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the subject’s sixth response, the caller administered 
the first manipulation, Half of the subjects were 
told that they had done well on the test, and half 

. were told that they had done poorly. Specifically, 
subjects were told: 


That completes the first part of the questions. 
The average student gets three out of six correct. 
You got [five; one] out of six correct. That 
nfeans you [did really well; didn’t do so well] 
compared to the average student. 


Subjects were then told that there were a few 
more questions and that the first concerned students’ 
knowledge of campus athletic events. At this point 
the second experimental manipulation occurred. Half 
of the subjects were asked to describe the outcome 
of a specific football game; their school’s football 
team had won this game. The other half were 
asked to describe the outcome of a different game; 
this was a game that their team had lost. The 
question was phrased as follows: 


game of the season, your 
the University of 
tell me the outcome 


In the [first; third] 
school’s football team played 
[ Houston, Missouri]. Can you 
of that game? 


If a subject’ did not know the results of the game, 
a new subject was called. Otherwise the subject's 
verbatim description of the game outcome was 
recorded, At the end of the interview, all subjects 
were fully debriefed. 

Independent variables. Two factors were manipu- 
lated; a subject’s personal outcome on the survey 
task (success or failure) and the affiliated football 
team’s outcome in the game described (win or non- 
win). These factors combined to produce a 2X2 
factorial design. 

Dependent variables. Subjects’ tendency to use a 
we'or non-we response in describing a team outcome 
constituted our dependent measure. Descriptions such 
as “We won,” “We got beat,” etc., were considered 
we responses. All other descriptions (¢.g-, “The score 
was 14-6, Missouri.” “They lost.”) were classified 
as non-we responses. 

Predictions. Two predictions were made. First, it 
was hypothesized that subjects would emit more we 
responses in describing a team victory than a team 
defeat. Second, it was expected that the effect of 
Hypothesis 1 would be greatest for subjects who 

“had “failed” the survey test. The latter hypothesis 
was based on the assumption that subjects in the 
personal failure conditions would attempt to asso- 
ciate themselves publicly with a positive event or 
distance themselves from a negative event through 
Janguage usage in order to bolster or salvage their 
damaged images in the eyes of the caller. Subjects 
in the -personal success conditions were not expected 
to show a similar sized effect, as their prestige had 
„already been ensured via their successful task per- 
formance. Evidence that public success and failure 


on an experimental task leads to differential tend- 
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encies for social approval has been offered by 
Schneider (1969). He manipulated success and failure 
and found failure subjects to present themselves 
more favorably to an observer who could provide 
an evaluation, Thus, if the BIRG phenomenon is 
indeed an attempt to gain social approval, we should 
see our failure subjects BIRG more than our success 
subjects. 


Results 


Of the 173 subjects, the data of 5 were 
discarded because they clearly reported the 
game results incorrectly. For example, the 
description “We won” was not counted if in 
fact the subject’s team had lost the game in 
question, The percentages of we responses 
emitted in the four experimental conditions 
are presented in Table 2. The first prediction, 
that we usage would be greater in the descrip- 
tions of team victories than team defeats, was 
tested by comparing the team win conditions 
against the team nonwin conditions. A sig- 
nificant effect was obtained, x*(1) = 4,20, 
„p < .05, confirming Hypothesis 1. The second 
prediction, that the tendency for we responses 
to attend victory rather than defeat descrip- 
tions would be strongest after a personal fail- 
ure, was tested as an interaction of the two 
major independent variables. The resultant 
statistic, suggested by Langer and Abelson 
(1972) for testing interactions within a 
2 x 2 contingency table, just missed conven- 
tional significance levels, Z=1,75, p < 08, 
two-tailed. Tests of the simple main effects of 
the interaction strongly supported Hypoth- 
esis 2. The difference in we responding be- 
tween the team success and team failure cells 
of the personal failure condition was highly 
significant, x*(1) = 6.90, p <.01. The com- 
parable test within the personal success Con- 
dition did not approach significance, XU) 
= .07, ns. There were no significant sex effects 
in the data. 

TABLE 2 
Sunjecrts Usine Wr", 
ERIMENT 2 


PERCENTAGE OF 


Eh Personal outcome 


Team 


outcome Success Failure Mean % 
> Win 24 (11/45) 40 (16/40) 32 (27/85) 
Nonwin 22 (9/41) 14 (6 42) 18 (15/83) 
— nns 
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Discussion 


The data of Experiment 2 seem clearly to 
support the general BIRG formulation. Sub- 
jects used the pronoun we to associate them- 
selves more with a positive than a negative 
source, and this effect was most pronounced 
when their public prestige was in jeopardy. 
We interpret these results as evidence for our 
contention that people display even the most 
noninstrumental connections between them- 
selves and the success of others so as to re- 
ceive positive evaluations from the observers 
of those connections. 

It should be evident that the observer's 
tendency to assign positivity to one who is 
associated with positive things is crucial to 
our hypothesizing about the BIRG phenome- 
non. It follows from our previously stated 
assumption that if a person understood that 
a given observer did not value the success of 
a specific source, that person would be less 
likely to try to BIRG that source to the 
observer. So, if one of our subjects knew that 
an observer abhored successful college ath- 
letic programs, we would predict that there 
would be little likelihood of the subject at- 
tempting to make visible a connection with 
a winning football team. But this is a fairly 
obvious example; few people would predict 
otherwise. A more subtle and perhaps more 
informative demonstration might be obtained 
through a somewhat different manipulation 
of the observer’s relationship to the connec- 
tion. When an observer to a highly positive 
association can also lay claim to the associa- 
tion, the prestige of the connection is diffused 
and, consequently, reduced for anyone at- 
tempting to bask in its glory. It is when one’s 
bond to a positive source is not shared by 
an audience that its prestige value is optimal. 
Thus, when everyone has a similar positive 
characteristic, there is no special prestige in- 
volved in possessing it, and_the likelihood 
that any one person will boast about that 
quality should be reduced. For example, a 
resident of California is less likely to brag to 
fellow Californians about the favorable cli- 
mate than to geographically distant others, 
especially those who cannot claim similarly 
pleasant weather. It is our hypothesis, then, 
that the tendency to BIRG a positive source 


CIALDINI, BORDEN, THORNE, WALKER, FREEMAN, AND SLOAN 


should occur most often when one’s connec- 
tion with the source is stronger than the 
observer's. A third study was conducted to 
test this contention. 


EXPERIMENT 3 


In Experiment 2, it was shown that a per- 
sonal failure experience increased our sub- 
jects’ tendency to associate themselves with 
a positive source and decreased their tend- 
ency to associate themselves with a negative 
source. We have argued that this result oc- 
curred because the failure experience lowered 
perceived prestige and motivated subjects to 
try to either bolster their images in the eyes 
of others or prevent them from being further 
degraded. Central to this argument is the 
assumption that one’s simple, noninstrumen- 
tal connections are seen to influence observers’ 
personal evaluations. If so, it should be the 
case that in addition to their use as dependent 
measures, such connections could be used as 
effective independent variables, That is, it 
should be possible to influence subjects’ be- 
havior by publicly connecting them with 
either positive or negative events. In fact, if 
we are correct in our assumption, manipu- 
lating one’s public connections with good or 
bad things should have the same effect as 
manipulating one’s personal success or failure 
experiences. For example, just as Experi- 
ment 2 showed that subjects who failed a task 
increased the tendency to affiliate themselves 
with a winner and decreased the tendency to 
affiliate themselves with a loser in the eyes 
of an observer to their failure, it follows from 
our formulation that subjects who are merely 
publicly connected with a negative event 
should emit comparable BIRG onses in 
the presence of an observer to i... connec- 
tion. Experiment 3 was designe: to test 
this possibility and represented a conceptual 
replication and extension of Experiment 2. 


5 We do not wish to suggest that the tendency to 
BIRG a positive source never takes place when the 
observer's association to a successful other is as 
strong as one’s own but only that the prestige to be 
derived from a unique (vis-a-vis the observer) 
connection is relatively more desirable. 
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Method 


Subjects. The subjects were 170 undergraduates at 
a large state university with a powerful football 
team. The university was not the same as that of 
Experiment 2; however, subjects were selected for 
participation in a fashion identical to that of 
Experiment 2. 

Procedure. Following the completion of play for 
the university’s football team, subjects were called 
on the phone by 1 of 18 experimenters (11 males 
and 7 females) identified as an employee of either 
the “university's Survey Center” located on campus 
or the “Regional Survey Center” located in an out- 
of-state city. Subjects were told that a survey was 
being conducted of “undergraduates’ knowledge of 
university athletic events.” Those agreeing to par- 
ticipate (96%) were asked to describe the outcome 
of first one, then another of their football team’s 
Jast games of the year. One of the games constituted 
an important victory, and the other an important 
nonvictory in the team’s season. Half of the subjects 
were first requested to describe the nonvictory game 
and, having responded, to describe the victory game. 
The other subjects had the requests put to them 
in reverse order. The subjects’ verbatim descriptions 
of the game results were recorded. 

Independent variables. Two factors were orthogo- 
nally varied: the strength of the subject's affiliation 
to the university team compared with that of the 
observer (same as observer’s or stronger than ob- 
server’s) and order of presentation of the games to 
be described (victory game description requested 
first or nonvictory game description requested first). 

Dependent variables. The dependent measure was 
the pattern of we and mon-we usage employed by 
subjects to describe the combination of the victory 
and nonvictory games. Three combinations were pos- 
sible. A subject could have used the same we or 
non-we term to describe both the victory and the 
nonvictory, could have used we to describe the non- 
victory, and non-we to describe the victory, or 
finally, could have used we for a victory and non-we 
for a nonvictory. 

Predictions. It was predicted, first, that there 
would be an overall tendency for subjects to use 
we in their descriptions of a team victory and 
non-we in their descriptions of a team nonvictory. 
Such a finding would replicate the basic BIRG 
effect obtained in Studies 1 and 2. A second hypoth- 
esis was that the tendency to use we for victory and 
non-we for nonvictory descriptions would be greater 
in the nonvictory-description-requested-first condi- 
tions. Such a result would constitute a conceptual 
replication of the second finding of Experiment 2. 
On the basis of the BIRG model, we expected that 
the effect of publicly describing a negative event 
with which one is connected would be equivalent in 
nature to publicly failing on a task. Both Cisse) 
were thought to reduce subjects perceptions of their 
prestige as seen by an obsetver and, hence, to in- 
crease the likelihood of subjects’ attempts to Sata 
the positivity of subsequent evaluations. The thir 
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prediction was that Hypothesis 2 would hold most 
strongly when the observer was identified with an 
off-campus organization. This expectation was based 
on the belief that felt prestige to be gained from 
one’s connections to a source is greater and, thus 
more sought after, when one’s connections to that 
source are stronger than an observer's, Confirmation 
of this prediction would appear as an interaction of 
the independent variables of the study.® 


Results 


As expected and consistent with the results 
of Experiments 1 and 2, the basic BIRG ef- 
fect occurred in Experiment 3 to support our 
first hypothesis. That is, subjects used the 
term we nearly twice as often to describe a 
victory than a nonyictory (26% vs, 13.59%). 
This effect is further confirmed when the data 
are examined in terms of individual subjects’ 
we/non-we usage patterns. The majority of 
subjects were constant in their pattern of re- 
sponding to the two requests for descriptions; 
they consistently used either we or non-we to 
describe both game outcomes, Thus, there was 
a strong tendency for our subjects to be con- 
sistent in their verbal usage patterns for the 
two descriptions. However, in 23 instances 
subjects provided an inconsistent we/non-we 
pattern. In 22 of those instances, the pattern 
supported the BIRG model; the pronoun we 
was used for the victory description and a 
non-we term was used for the nonvictory de- 
scription, Using McNemar’s test for the sig- 
nificance of changes (Siegel, 1956, pp: 63-67) 


6 The experimenters, undergraduate students in a 
laboratory social psychology course, were not fully 
aware of these predictions. In order to test the 
influence of conscious or unconscious experimenter 
bias on the results of this study, the experimenters 
were informed of the nature of Hypothesis 1. How- 
ever, they were blind to the more subtle Hypotheses 
2 and 3. If only Hypothesis 1 were confirmed, the 
data would likely have to be interpreted as poten- 
tially influenced by the experimenter bias artifact, 

The investigation of such a possibility was deemed 
an important one, since in the prior experiments, 
experimenters had knowledge of the experimental 
hypothesis. In Experiment 1, some data recorders 
were unintentionally informed of the major hypoth- 
esis, while others were not. An analysis of the data 
from these two groups found only a minimal dif- 
ference in the data patterns, with the uninformed 
group’s data actually more favorable to prediction 
than the informed group. However, in Experiment 2; 
all experimenters had knowledge of the prediction. 
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TABLE 3 


PERCENTAGE OF SUBJECTS USING BOTH “WE” FOR 
VICTORY DESCRIPTIONS AND “Non-WE” FOR 
NONVICTORY DESCRIPTIONS, 
EXPERIMENT 3 


Order of requesta 


Victory Nonvictory 
Strength of subject's description description 
connection to team requested requested 
relative to observer's first first 
Stronger than observer’s 3 (1/39) 21 (10/47) 
Same as observer’s 11 (4/36) 14 ( 7/48) 
Mean % 7 (5/75) 18 (17/95) 


a Numbers given are percentages. 


and correcting for continuity, the data are 
highly significant, x?(1) = 17.39, p < .001. 
The tests of Hypothesis 2 and 3 were con- 
ducted by considering the distribution (across 
the cells of our design) of the instances of 
we/non-we usage fitting the pattern pre- 
dicted by the BIRG model. Table 3 presents 
these data. 

Hypothesis 2 stated that more subjects 
would use we to describe the victory and 
non-we to describe the nonvictory when they 
were asked to describe the nonvictory first. 
As expected, subjects were significantly more 
likely to so respond in the nonvictory-descrip- 
tion-requested-first conditions, x*(1) = 4.69, 
p < .05. Hypothesis 3 stated that Hypothe- 
sis 2 would hold most clearly when the sub- 
jects were more strongly connected with the 
university team than was the observer. As 


yredicted, Hypothesis 2 was supported to a 

extent when the observer was affili- 

h an off-campus rather tf © a campus 

ever, this tendenc; not quite 

ral levels of icance, Z 

a . As in Ex; it 2, there 

were no s t effects Í f subject. 

aL Di 

Overall, Ex; ients | prowicis 
support for the bil mulation, 

Aa ant tend- 


three experiments sho 


ency for students to strive tc W: ae 
selves publicly with their univ: s ou 
team more after the team had been successtul, 
A striking aspect of the phenomenon n that 
Subj s clai heir affiliation 
bjects sought to proclaim ther 4 j 
with s i hen they in 


with a successful source even W 
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no way caused the source’s success. 
component of the effect suggests a mediator 
consistent with balance theory. It is our 
contention that people make known thei 
noninstrumental connections with positi 
sources because they understand that « 
servers to these connections tend to evalu 
connected objects similarly. It appears tha: “ 
the tendency to BIRG is an attempt to secu 
esteem from those who can perceive the c 
nection. Studies 2 and 3 provided support i 
such an interpretation, Both showed that . 
perimental operations designed to threat: 
subject’s esteem in the eyes of an obse ‘er j 
caused subjects to be more likely to try pub- 

licly to associate themselves with positive 
sources. Intriguingly, it was possible in- 

crease the tendency to BIRG in these ev; ori- 5 
ments either by initially causing the sul ect _ a 
to experience personal failure in an observ \'s 
eyes or by initially causing the subject to be 
noninstrumentally connected with a negative 
event in an observer’s eyes. These manipu 
tions proved functionally equivalent in modi 
fying subject pronoun usage. Thus, in support + 
of our basic argument, being merely associ-. 
ated with someone else’s success and failure 
had much the same effect as personal success 
and failure. Experiment 3 provided evidence ¥ 
in a different way as well that the desirerfor  “ 
prestige is the mediator of the BIRG re- 
sponse. It demonstrated that when subjects’ 
affiliation with a positive source was stronger 
than an observer’s (and therefore carried a 
greater amount of prestige), they were most 
likely to BIRG that source in the presence 

of the observer. 

These studies suggest a way to understand 
how the fortunes of affiliated sports teams can 
cause lavish displays of civic gratitude and ~ 
pride in American cities, or “sports riots” in 
Europe, or murders in South America of 
players and referees whose actions had caused 
a home-team defeat. Through their simple 
connections with sports teams, the personal ? 
images of fans are at stake when their teams 
take the field. The team’s victories and de- 
feats are reacted to as personal successes and 2 
failures. 

Throughout this article we have stressed 
an interpersonal mediator of the BIRG phe- 
nomenon—the perceived esteem of others. We 
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do not wish, however, to preclude the pos- 
sibility of the tendency to BIRG privately, 
That is, for wholly intrapersonal reasons, 
people may draw connections between them- 
selves and positive sources. For example, one 
may well feel an enhancement of self-esteem 
that is unrelated to the assessments of others 
when one is associated with success or posi- 
tivity. Such an_effect could also be interpreted 
in terms of a tendency to respond similarly 
to associated objects. It might be that the 
results of our experiments are, in some degree 
at least, due to a desire to bolster or maintain 
one’s self-concept. The tendency to employ 
appropriate apparel or language in a way that 
connects oneself to something good may in- 
volve an attempt to remind oneself of such 
connections and, thereby, positively affect 
self-esteem. The fact that in Experiment 2 
we were able to influence the BIRG response 
simply by manipulating the characteristics of 
the observer suggests that the BIRG phe- 
nomenon is’ not mediated solely by intra- 
personal phenomena. Nonetheless, it remains 
possible that the tendency to BIRG has its 
basis in a desire to affect self-image as well 
as social image. In fact, since there is evi- 
dence that how we regard ourselves is influ- 
enced by how. we perceive that others regard 
us (e.g., Harvey, Kelley, & Shapiro, 1957), 
these two mediators are not mutually exclusive. 


w 
~ 
ui 
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Awareness and Attitude Change in the Forced-Complian 
Paradigm: The Importance of When 


D. R. Wixon and James D. Laird 
Clark University 


Forced-compliance subjects reminded of their initial attitude changed in one 
previous study and did not in another. This difference was interpreted as due 
to timing of the reminder. Results indicated that both the timing of such a 
reminder and the context immediately prior to such a reminder were impor- 
tant. A significant degree of attitude change occurred for subjects who were 
reminded after they had been led to draw a conclusion regarding their present 
attitude, but no attitude change occurred for those subjects who were reminded 
of their past attitudes prior to being led to draw such a conclusion. 


Recently one group of experimenters (Sny- 
der & Ebbesen, 1972) found that subjects re- 
minded of their original attitudes did not 
change their attitudes following counteratti- 
tudinal behavior, whereas another group 
(Ross & Shulman, 1973) found that reminded 
subjects did change. Because the relation of 
prior attitude and change is important for any 
theory of attitude change, both in this para- 
digm and in general, this study is directed at 
resolving this empirical contradiction. 

The two contradictory studies that gen- 
erated this one had been viewed as “crucial” 
in resolving the controversy between cogni- 
tive dissonance theory and self-perception 
theory (Bem, 1972; Ross & Shulman, 1973; 
Snyder & Ebbesen, 1972). Their crucial status 
seems questionable, however, perhaps be- 
cause of the nature of the theories (Green- 
wald, 1975) but also because crucial tests do 
not occur in any scientific context (Hanson, 
1969; Kuhn, 1962; Popper, 1968). At most, 
one might hope that developing patterns of 
sults will appear to fit one view more com- 

:bly than another. However, evaluation 
two theoretical perspectives is too 

ı task to attempt here, and this 

ıs the empirical question of the 


I wie 
tween awareness of original 


rela 
attitud quent attitude change. 

If neith ler and Ebbesen (1972) 
nor Ross an (1973) results are due 
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to artifact, then subtle diffe: 
methodologies must be respo: 

different outcomes, Two major 

apparent, both of which conc: 

ers. The first concerns the wa, 

jects’ initial attitudes were made mo 

Snyder and Ebbesen simply asked their 

jects to think carefully about their ideas < 
the topic on which they would subsequently 
be writing, whereas Ross and Shulman asked 
their subjects to copy their earlier responses 
to a questionnaire. Clearly, Snyder and Eb- 
besen’s procedure is less “committing” 
(Brehm & Cohen, 1962) than Ross and Shu! 
man’s. Since commitment prevents change, 
one might expect the Snyder and Ebbesen sub- 
jects to change most. However, the results 
were opposite, so commitment differences are 
probably not the source of the different re- 
sults, 

The second major difference in the two 
studies is in the timing of the reminder. There 
are a number of major parts of the usual 
forced-compliance procedure, including (a) 
the identification of the topic, (b) the assign- 
ment of the subject’s position on that topic 
(contrary to his prior attitudes), (c) the ac- 
tual performance of the counterattitudinal be- 
havior, and (d) the final measurement of the 
subject’s attitude. In the Ross and Shulman 
procedure, the reminder came as late as possi- 
ble (between c and d), whereas in the Snyder 
and Ebbesen procedure, the reminder came as 
early as possible (between a and b). 

Thus, the Snyder and Ebbesen subjects 
were aware during their performance that 
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their behavior did not represent their true 
attitudes and may, in effect, not have “be- 
lieved” their own behavior. In contrast, if the 
Ross and Shulman subjects did not originally 
recall their attitudes, they might have be- 
lieved their own behavior, drawn an inference 
based on it, and only then discovered that 
they previously held a different attitude. 

The direct test of this interpretation con- 
sisted of providing the same reminder at dif- 
ferent points in the procedure. The Ross and 
Shulman reminder technique was used. This 
technique did not prevent change in their 
study, so if it did inhibit attitude change 
when administered at other stages in the pro- 
cedure, this result would be the clearest indi- 
cation that the timing was the critical varia- 
ble. 

The interpretation stated above suggests 
that subjects reminded prior to engaging in a 
counterattitudinal speech should not change, 
whereas subjects reminded immediately after 
their counterattitudinal speech should change. 
However, the Ross and Shulman subjects were 
reminded only after a delay, during which 
they searched for their prior questionnaire. 
Thus, they not only performed the counter- 
attitudinal behavior without “disqualifica- 
tion,” but also were provided with a delay 
during which they might consider the atti- 
tudinal implications of their behavior, and 
during the delay they were encouraged to 
think about their position on the issue, since 
they were following instructions to find and 
copy the statements they had made on the 
issue a week earlier. To explore the role of 
these factors, a number of different varia- 
tions of the late-reminder condition were 
employed. The final design consisted of six 
groups. 

The prestatement-reminded group was re- 
minded before the performance of the counter- 
attitudinal behavior. With respect to timing, 
this group represented a replication of Snyder 
and Ebbesen’s design. Second, the no-delay 
group was reminded immediately after the 
counterattitudinal behavior had been per- 
formed. If the only important difference be- 
tween the two earlier studies had been before 
versus after behavior, then this group would 
be expected to change. If delay was necessary, 


then no change would be observed. A third 
group, uncued delay, was reminded after a 
“neutral” delay, during which the issue of 
the subjects’ own opinion on the topic was 
not raised. If delay alone was required, then 
this group might be expected to change. If, 
in addition to delay, the question of the sub- 
ject’s own opinion had to be raised, then only 
the fourth group, cued delay, would be ex- 
pected to change. The cued-delay group most 
closely approximated Ross and Shulman’s con- 
ditions. These subjects were told explicitly 
that they would be recording their old atti- 
tudes, but they were first required to spend 
some time waiting while a search for the old 
questionnaires took place. Finally, two addi- 
tional comparison groups were run. Subjects 
in the between-questionnaires group reported 
their attitudes on the issue on a questionnaire, 
were reminded, and asked to report their atti- 
tudes again. If the preceding analysis of the 
Ross and Shulman procedures was correct, 
this group would be expected to be identical 
to the cued-delay group because the only 
difference was whether the subjects were led 
to conclusions about their own attitudes im- 
plicitly, through the “searching for the ques- 
tionnaire” manipulation, or explicitly, through 
filling out a questionnaire. Finally, the no- 
reminder group consisted of subjects who were 
never reminded but who filled out the same 
two sets of attitude and other measures that 
were filled out by the reminded groups. 


METHOD 

Subjects 

A total of 49 unpaid undergraduate volunteers 
participated. Four were excluded because they ex- 
pressed reservations about the cover story. The 
judgments to exclude these subjects were made by 
two raters who were blind with regard to the hy- 
potheses of the study and the subjects’ experimental 
group. Furthermore, the inclusion of these subjects 
would not appreciably change any of the results, Of 
the 45 subjects included, 16 were men and 29 were 
women; since no difference between the sexes was 
observed on the experimental variables, no further 
reference will be made to this distinction. 


Procedures 


The general for 
employed by Laird 
similar to one repor 
(1968). 


ced-compliance procedure was that 
and Berglas (1975) and was 
ted by Helmreich and Collins 
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A brief in-class preattitude survey asked potential 
subjects how they felt on a 19-point scale ranging 
from “strongly agree” to “strongly disagree” and 
also how important they considered each issue to be 
on a 10-point scale ranging from “not at all” to 
“extremely.” This survey consisted of items of mod- 
erate interest to college students, such as “trial by 
jury is no longer a democratic process,” and “the 
family is no longer an appropriate social institution.” 
The actual item used for each subject’s counteratti- 
tudinal statement was chosen from among those 
which he or she had marked as between 4 and 8 
on the importance ratings and between 2 and 8 or 
11 and 17 on the agree/disagree scale. This tech- 
nique ensured that the attitude items on which 
change is to be studied were similar in their impor- 
tance to the subject and in the degree of polariza- 
tion of the subject’s opinion. Seven different issues 
were employed. On four of these issues, some sub- 
jects were initially positive and were induced to 
make negative speeches, while others who were ini- 
tially negative were asked to make statements in 
favor of them. The numbers of subjects assigned to 
each issue were too small to permit statistical analy- 
sis. However, issues were distributed widely across 
conditions, with no more than two subjects in any 
one condition assigned to any one issue. No differ- 
ences between items in amount of change were ob- 
servable. 

Between 1 and 2 months after the initial survey, 
subjects were asked to help the experimenter “pre- 
pare stimulus materials.” Upon arrival at the ex- 
perimental session, the subject was told that the 
experimenter needed a collection of videotapes on 
various attitude issues for presentation to large 
groups of students on campus. The subjects were 
told that attitude changes would be surveyed re- 
peatedly; if they were asked by anyone about the 
videotapes they had made, they should repeat the 
same arguments they had given on the tape. This 
procedure committed the subjects to a long-term 
public defense of the views they had stated on the 
videotape before learning on what issue they would 
be speaking. 

After reminding the subject that participation in 
the study was entirely voluntary, the experimenter 
explained that the subject would not have to think 
up any arguments because these had already been 
prepared and that the subject would read an initial 
statement asserting the basic argument, which would 
be videotaped. This statement would then be played 
back, allowing the experimenter to check out the 
videotape equipment, before supporting arguments 
were taped. ` 

The experimenter consulted a list of poten 
issues, explained that he had already obta 
number of tapes, and asked if the subjec 
mind making the speech that correspondt 
introductory statement written on the 

subjects were first aware of what speech th 
make when they read the card. The card 
a sentence that began with “I would like to 
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why I believe (or do not believe) that” and ended 
with the exact wording of the relevant issue from 
the subject’s initial attitude survey. All subjects were 
asked to argue the opposite of the position previ- 
ously endorsed. After the videotape was played 
back, the experimenter appeared unable to find the 
arguments supporting the videotaped statement and 
asked the subject to fill out a questionnaire while 
he searched for them. The questionnaire contained 
one filler and three important items, one of which 
measured the main dependent variable—the extent to 
which the subject agreed with the statement he or 
she had made on videotape. Two others asked how 
much “conflict or discomfort” the subject had expe- 
rienced and what his or her attitude had been before 
the experiment. 

Upon returning, the experimenter announced that 
he could not find the supporting arguments but 
asked if the subject would mind filling in several 
other questionnaires while he looked elsewhere. The 
experimenter asked the subject which form he or 
she had just completed. When the subject replied, 
the experimenter expressed some consternation and 
explained that the subject had been given the wrong 
form. The experimenter gave the subject another 
questionnaire and “threw away” the “wrong” ques- 
tionnaire. This procedure was intended to permit 
the subject to answer the second questionnaire with- 
out concern for consistency with the first. The sec- 
ond questionnaire contained seven items, including 
the nonfiller items from the first questionnaire, as 
well as questions asking the subjects (a) how free 
they felt in making the videotape, (b) how long 
they had held their present positions, (c) what 
their positions had been in the past, and (d) to 
comment if they felt they had changed their opin- 
ions. A third questionnaire asked the subjects the 
purpose of the experiment and other questions de- 
signed to test whether the subjects believed what 
they had been told about the experiment. 


Timing of Reminders 


The six experimental groups differed primarily in 
the time at which the subjects were reminded of 
their previous attitude statements. The prestate- 
ment group was reminded before committing them- 
selves to making a counterattitudinal statement on 
videotape. The no-delay group was reminded after 
performing and viewing the counterattitudinal be- 
havior. The uncued-delay group was reminded after 
viewing the playback and after a 1-minute delay. 
The cued-delay group was reminded after a 1-min- 
ute delay during which the question of the subject's 
present beliefs was salient. The between-question- 
naires group was reminded after they had viewed 
the playback and had filled out the first question- 
naire. The nonreminded group was not reminded at 
all. Figure 1 illustrates the timing of the reminder 
for the five reminded groups. After being reminded, 
subjects continued through the experimental pro- 
cedures shown in the figure. 
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ur Timing of ace Non-Delay  Cued and Uncued Between Questionnalres 
Reminders Sst es Group 
[Reminded | (Reminded j |_ i J 
‘Major 
: ee 7 aping) [One-Mincte 
; i of issue and and Playback Delay Questionnaire questionnaire (Form Y) 
subjects’ agree~ ond post-experimental 
ment to make questionnaire, 
video tapes, 
Ficure 1. Timing of reminder for different experimental groups, 
4 
Reminder Technique tions perceived freedom was moderately high 


Tn all but the cued-delay condition, the experi- 
menter asked in an offhand way if the subject re- 
© membered the earlier survey and then explained that 
“the people who took the survey” wanted to score 
the results by computer. Because some of the sub- 
jects who had filled in the survey were also helping 
y out in this study, the people who had taken the 
survey had asked the experimenter to request that 
these subjects transfer responses from their surveys 
to computer cards. In order “to preserve confiden- 
tiality” the previous questionnaires, along with the 
computer cards, had been sealed in manila envelopes 
identified by mailbox number. The subjects trans- 
ferred their responses to the computer cards, re- 
sealed the envelopes, and gave them to a secretary 
when they left. Thus, subjects receiving this re- 
minder technique were not sure until they actually 
saw it whether their previous questionnaire was 
available or whether it contained an item related to 
their counterattitudinal statement. 
Subjects in the cued-delay group were told ex- 
plicitly that their questionnaire was available, that 
- it contained their previous judgment on the issue of 
their counterattitudinal statement, and that they 
would be re-recording that particular issue. Thus, 
the cued-delay group was “cued” to a relationship 
between their counterattitudinal statement and their 
own position on the issue. They then waited for the 
1-minute delay before actually secing their question- 
naire. This reminder procedure was most similar to 
that used by Ross and Shulman (1973). 


RESULTS 


An analysis of variance of initial attitude 
and importance scores revealed no significant 
differences between the six groups [Attitude 
Score, F(5,39) = 2.09, ns; Importance, F(5, 
39) = 1.25, ns]. Thus, any subsequent atti- 
tude change is not attributable to differences 

4 in initial attitudes. 

Variations in perceived freedom to make or 
refuse to make the speech might have led to 
differences in change. However, in all condi- 


(12.2 on a 20-point scale ranging from “not 
at all free” to “completely free”), and there 
were no significant differences across condi- 
tions, F (5,39) = 2.01, ns, 

There were two postmeasures of attitude, 
one from each questionnaire, In all condi- 
tions except the between questionnaires, in 
which subjects were reminded of their prior 
attitudes between filling out the two, these 
questionnaires were answered without any im- 
portant intervening events, Thus, it was ex- 
pected that there would be virtually no dif- 
ferences between the two measures, Table 1 
shows that this was the case. Predictions for 
the between-questionnaires group are not clear 
with respect to the attitude measure, although 
the most probable outcome was that the two 
measures would be identical. This expectation 
was also confirmed. As one would infer from 
the similarity of the means, the analyses of 
the two sets of measures are virtually identi- 
cal, so only the results for the first of these 
will be presented in detail, except where 
otherwise specified. 

Attitude change scores were computed so 
that change in the direction of agreement with 
the counterattitudinal speech was a change 
toward a higher score. A two-way analysis of 
variance of these scores was performed, with 
the reminder group serving as an independent 
factor and premanipulation versus postmanip- 
ulation attitude estimates as a repeated fac- 
tor. Both main effects are significant: for re- 
minder conditions, F (5,39) = 4.35, p < .005, 
and for premanipulation/postmanipulation, 
F(1,39) = 30.27, p < .001. The interaction 
is also significant, F (5,39) = 4.62, p < .002. 

No change occurred in any of the reminded 
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groups except those who received the re- 
minder after a cued delay or after responding 
to a questionnaire (see Table 1). This con- 
trast was tested by comparing change scores 
for the first three groups, the prebehavior, 
immediate-postbehavior, and noncued-delay 
groups, with scores for the cued-delay, be- 
tween-questionnaire, and nonreminded groups. 
This difference was highly significant, F(1, 
39) = 14.64, p < .001, and in fact accounted 
for 58% of the between-groups variance rep- 
resented by the interaction term in the first 
analysis. As a direct test, individual compari- 
sons were carried out between premanipula- 
tion and postmanipulation attitude scores for 
each treatment group. The pre/post scores 
are not different for the premanipulation, 
nondelayed, or uncued-delay treatment groups, 
#(6) = .10, #(5) = 1.28, and ¢(6) = .40, re- 
spectively, all ps = ns, and the pre/post scores 
are significantly different for the cued-delay, 
between-questionnaires, and nonreminded 
groups, #(5) = 3.21, #(8) = 4.47, and #(9) 
= 5.29, respectively, all ps < .03. 

Subjects reported level of discomfort or con- 
flict on both experimental measures. These 
reports may be viewed as reflecting the sub- 
jects’ awareness of the discrepancy between 
their actual behavior and their “real” atti- 
tudes. If this is so, then one would expect 
conflict to be at a maximum in all conditions 
where the subjects were reminded of their 
original attitudes and, hence, were most likely 
to perceive the discrepancy between their at- 
titudes and behavior. One exception would be 
the between-questionnaires group members, 
who first reported the conflict rating before 
being reminded and, hence, might be expected 
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to repeat that rating even though they have 
now recognized that they have changed their 
attitudes. Thus, one would expect high-con- 
flict ratings in the first three groups and low 
ratings in the last two. The cued-delay group 
is less easy to predict, but because they had 
changed their attitudes—apparently because 
of the “cuing” of that condition—they should 
then experience conflict when they discover 
that their attitude endorsements earlier were 
opposite. Thus, it would be expected that the 
cued-delay group would report conflict rat- 
ings at least as high as the first three groups. 

The pattern of means for both conflict 
measures is consistent with these predictions 
(see Table 2). These differences are not sig- 
nificant in an overall one-way analysis of 
variance for either Conflict Measure 1, F(5, 
39) = 2.01, or Conflict Measure 2, F(5, 38) 
= 1.68 (one subject did not fully complete 
the second questionnaire). The contrast that 
was the direct test of the hypothesis was per- 
formed. For the first measure the composite 
mean for the first four groups was 11.2 and 
the composite mean for the last two was 8.4; 
this difference is significant, (44) = 3.05, p 
< .01. For the second measure the same com- 
bination yielded means of 11.2 for the first 
four and 8.6 for the last two; this difference 
was also significant #(43) = 2.70, p < .01. 

In addition, correlations between levels of 
conflict and degrees of attitude change were 
computed for both conflict measures within 
the three groups that had changed signifi- 
cantly. Both correlations are negative (7 = 
—.38, p = .06; r= —.36, p =.08) for Con- 
flict Measures 1 and 2, respectively. 

On the second questionnaire, subjects esti- 


TABLE 1 
MEAN PRIOR AND PosTEXPERIMENTAL ATTITUDE ASSESSMENTS*® 


Postmeasures of attitude 


Recall of 
Prior Measure Measure preexperimental 
Experimental group n attitude 1 2 attitude 
Prestatement reminded 7 or pa si J 
No-delay reminded 6 5. ii 7.5 = 
Uncued delay 7 6.3 5.7 6.0 5.1 
Cued delay ` 6 3:8 9.3 9.3 8.7 
Between questionnaires 9 6.2 a 11.1 6.» 
No reminded 10 6.3 12. 11.3 12.1 
20 = maximum agreement with counterattitudinal statement. 


s Assessements were made on a 20-point scale, where 
b One subject from this failed to answer 


che recall question and was excluded from this analysis, 
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“ had been. By this 
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mated what their preexperimental attitudes 
point, all subjects except 
the nonreminded group had been shown their 
Prior attitude ratings. Results consistent with 
past research (Bem & McConnell, 1970) were 
expected; that is, the nonreminded subjects, 
who have changed their opinions but have 


“ not been informed, should exhibit a high de- 


gree of recall error (i.e,, they recall their past 
attitudes as consistent with their present 
beliefs). On the other hand, subjects who had 
been informed of their prior attitudes and who 
had not changed their attitudes might be ex- 
pected to be most accurate in their recall. The 
results for these groups are consistent with 
this interpretation, Recall scores and original 
scores did not differ significantly for the pre- 
behavior, no-delay, and uncued groups (all ts 
< 1.0), whereas the difference was highly sig- 
nificant for subjects in the nonreminded 
group, 4(9) = 4.92, p < .002. The mean re- 
call of attitudes scores appear in the last col- 
umn of Table 1. r 

The predictions are not as clear for the 
other two groups of subjects. Both groups had 
made counterattitudinal statements, reported 
new “changed” attitudes, and then were 
shown their original questionnaires. When 
asked what they had previously believed, they 
might have reported their old attitudes as 
consistent with their original questionnaires, 
or with their recent self-descriptions. Both 
strategies seem to have occurred. In the cued- 
delay groups, the majority reported that their 
original attitudes were consistent with their 
current self-descriptions, a result which paral- 
lels Ross and Shulman’s (1973). The differ- 
ences between the preexperimental and recall 
scores in this group was significant, (5) = 
3.77, p = 02. In the between-questionnaires 
group a larger proportion of the subjects 
reported their preexperimental attitudes ac- 
curately, perhaps because they had inspected 
their prequestionnaires only a moment before 
responding to this question. Im this group the 
difference between preexperimental and recall 
scores was not significant (£ < 1.0). 


Discussion 


The first simple conclusion is that the 
differences between the Snyder and Ebbesen 
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TABLE 2 
MEANS or REPORTED CONPLICT: FIRST AND 
Seconp MEASURES*™ 
Group "n Measure i Measure 2 
„rem 11.57 11.00 
No delay 6 10,83 10.17 
Uneped delay 7 12.00 12.14 
Cued delay 6 10,33 11.67 
Between * > 
questionnaires 9 8.77 7.77 
No reminder 10 8.00 8.22» 


* Conflict was measured on a 20-point scale, ranging from 1 
(no conflict) to 20 (high conflict), 

b One subject in this group failed to complete the second 
questionnaire, 


(1972) and the Ross and Shulman (1973) 
results were probably due to the timing of 
the reminder, When a reminder similar to 
Ross and Shulman’s was administered at the 
same time as theirs, identical results were ob- 
tained: change. However, when administered 
at the same time as Snyder and Ebbesen’s, 
the same reminder prevented change. 

The implication of this result for the rela- 
tion between awareness of initial attitude and 
change depends on what role the reminder 
plays. Two alternatives seem possible, The 
obvious possibility, of course, is that indi- 
viduals do not recall their initial attitudes and 
when they do, this prevents change, The less 
obvious possibility is that the subjects are 
always aware of their initial attitudes, but 
the reminder procedure itself somehow pre- 
vents change. In particular, the reminder pro- 
cedure might have “committed” subjects to 
their original position, Brehm and Cohen 
(1962) point out that any action, or even 
decision to act, may commit an individual to 
a cognitive element (in this case, the initial 
attitude) and “commitment increases the re- 
sistance to change of an element” (p. 9). It 
could be that these reminders prevented 
change through “commitment” rather than 
through making subjects aware of their initial 
attitudes. 

This interpretation looks less reasonable 
in light of other studies, however. Snyder's 
and Ebbesen’s subjects just thought about 
the issue; absolutely o behavior was elicited, 
yet attitude change was inhibited. Further- 
more, spontaneous recall of one’s initial atti- 
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tude is sufficient to prevent change (Laird & 
Berglas, 1975). Thus, it would seem that 
“just thinking” or spontaneous recall is suffi- 
cient to produce commitment to one’s initial 
attitude. This interpretation may seem rea- 
sonable but means that we must also assume 
that the reason attitudes did change in some 
groups in this study is that these subjects 
were not aware of their initial attitudes and, 
hence, not committed to them. In short, then, 
it seems that we must interpret the effect of 
the reminders as nothing more than remind- 
ers and further conclude that individuals 
often do not recall their initial attitudes with- 
out such a reminder. 

Thus, it seems that if one is aware of one’s 
original attitude early in the process, attitude 
change is prevented. Why this should occur is 
less clear. It could be that the behavior itself 
is performed differently, perhaps half-heart- 
edly, or that although the behavior is not dif- 
ferent, it is experienced differently (e.g., cyn- 
ically) in light of one’s original attitude. 
Neither of these explanations will fit, however, 
since the reminder seems to have exactly the 
same effect even when administered after the 
behavior has been performed. The differences 
seem to be in how the behavior and the in- 
formation about the original attitude are 
integrated rather than in the performance or 
experience of the behavior. Apparently, when 
people clearly know that they believe one 
thing yet have done another, they can more 
readily find an explanation for this behavior 
than imagine that they could change their 
attitude. 

The fact that late reminders do not prevent 
change may be explained by two relatively 
obvious principles. The first is simply that 
when an individual reaches an attitude con- 
clusion (by self-perception or dissonance-re- 
duction processes, etc.), the individual em- 
ploys only that information of which he or 
she is currently aware. This interpretation is 
almost a truism but has received empirical 
support in contexts in which it is not obvious. 
For instance, making subjects more aware of 
the implications of their behavior increases 
the degree to which attitude conclusions are 
consistent with it (Carlsmith, Ebbesen, Lep- 
per, Zanna, Jancas, & Abelson, 1969; Kiesler, 

Nisbett, & Zanna, 1969; Snyder & Ebbesen, 
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1972; Zanna, Lepper, & Abelson, 1973), and 
similarly, increasing the salience of external ~ 
justifications for that behavior decreases the 
degree to which attitude conclusions corre- 
spond to it (Zanna et al., 1973). 

The second principle is equally clear: Once 
the attitude-concluding process has occurred, 
it is irreversible. This proposition has received 
some support in studies in which information 
about external justifications, which would or- 
dinarily reduce attitude change, did not do 
so when presented after attitude conclusions 
had already been drawn (Goethals & Copper, 
1975; Heilmreich & Collins, 1968; Lepper, 
Zanna, & Abelson, 1967), It appears that 
many of the studies originally done in the 
context of the dissonance/incentive theory 
controversy are also amenable to this inter- 
pretation (e.g., Freedman, 1963). 

These principles suggest that the between- 
questionnaires group drew attitude conclu- 
sions for the first questionnaire without the 
initial attitude information, and this conclu- 
sion was then irreversible even though they 
believed the first questionnaire had been de- 
stroyed unexamined. Similarly, it appears 
that in the cued-delay group, the effect of 
the cuing during the delay was to implicitly 
ask the subjects to draw conclusions about 
their current attitudes. Again, the conclusion 
was formed before the initial attitude infor- 
mation was available. 

The fact that the no-delay and uncued- 
delay groups did not change suggests a third 
principle: Attitude conclusions are only drawn 
when somehow instigated. It was not enough 
to have performed the counterattitudinal be- 
havior or even to have had time afterward to 
contemplate it. Rather, there needed to be 
either an implicit or an explicit question 
about attitudes on the issue before the con- 
clusions would be drawn. These conclusions 
would be based only on the information cur- 
rently available and would be irreversible. 

The results with respect to the subsidiary 
measures are all quite consistent with this 
view. First of all, if one assumes (a) that 
“conflict and discomfort about making the 
speech” are produced by the recognition that 
one’s speech is contrary to one’s attitude and 
(b) that subjects recall their initial attitudes 
only if reminded, then it is to be expected, as 
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was observed, that conflict scores would be 
much higher in the four groups that were 
reminded before they responded to the con- 
flict measure. Of course, only one of these, 
cued-delay group, changed because they had 
been reminded before the conflict measure 
was obtained but after they had reached a 
new attitude conclusion. 

In addition, among the last two groups, 
which both changed and reported conflict 
without (or before) a reminder, conflict scores 
were lower than for the other four groups, but 
there were still some variations between indi- 
viduals. If these variations reflect the degree 
to which subjects spontaneously (though per- 
haps not very clearly) recalled their initial 
attitudes and the preceding analysis of the 
conflict results is correct, then a negative 
relation between conflict scores and change 
would be expected. That, of course, was just 
what was observed. Again, the more one 
recalls one’s original attitude, the less one 
changes. 

Finally, the subjects’ reports of their pre- 
» experimental attitudes were consistent with 

previous work but were also of considerable 
interest. The fact that the first three re- 
minded groups reported very accurately is 
hardly surprising, since those subjects had 
seen their previous questionnaire and still 
endorsed that position. The unreminded-group 
members replicated earlier results (Bem & 
McConnell, 1970) and reported that they had 
always held the position which, in fact, had 
been adopted just a moment before. Finally, 
subjects in the cued-delay and between-ques- 
tionnaires groups, who now held a position 
- contrary to their initial one but had seen 
their original questionnaire, seem to have re- 
< sponded in both ways. Earlier reports of sub- 
jects’ failure to recognize attitude change 
y could have occurred either because subjects 
never recalled their initial attitudes or because 
the final stage of the change process was the 
i repression of the memory of the original atti- 
tude position (Bem & McConnell, 1970). 
These results seem to make it clear that the 
£ first interpretation was correct—that unre- 
minded subjects who changed were never 
aware of their original attitude. 

The unreminded subjects apparently did 
not recall their initial attitudes; presumably, 

1 
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however, they could have recalled right up to 

the moment they were asked to draw an atti- 

tude conclusion, as did otherwise identical 

subjects who were randomly assigned to the 

uncued-delay condition, for instance, Having 

drawn an attitude conclusion, however, it was 

striking that they now were unable to recall 

that they had ever held any other conclusion. 

This was especially clear during the debrief- 

ings, when many subjects who had changed 

continued to deny that a change had occurred. 

The most common explanation for the ap- 

parent discrepancy was that “I must have 

misunderstood the question the first time.” 

Another alternative was to assert that the 
change occurred during the interval between 
the first questionnaire and the forced-compli- 
ance procedure. For any of the items which 
had any suggestion of political content, the 
common explanation for these subjects (run 
during the fall of 1974) was that “Watergate 
had changed my mind about that.” Subjects 
are characteristically so persuasive when of- 
fering these explanations that we would be 
seriously tempted to believe them except that 
no corresponding changes occur in control 
groups. These observations suggest that these 
subjects were, in effect, revising their own 
histories to make sense of the change in their 
attitudes. The speed, magnitude, and cer- 
tainty of these revisions was striking. 

In discussing these results, a number of gen- 
eralizations have been offered: (a) Individu- 
als do not seem to recall their attitudes very 
well, (b) they can be induced easily to act 
in ways contrary to their attitudes, (c) they 
will not draw any conclusions about their 
attitudes from their behavior (or other 
“data”) unless instigated in some way, (d) 
when instigated, individuals will base these 
conclusions on whatever information they 
have at the moment, (e) once an attitude 
conclusion is drawn, it is irreversible, and (f) 
once an attitude conclusion is drawn, the indi- 
viduals will revise their histories to fit the new 
position. 

All of these generalizations seem warranted 
by the results of this study and most have 
received at least indirect support in other 
studies. Taken together, they convey a picture 
of our subject’s behavior which is sharply at 
variance with usual assumptions about peo- 


384 


ple. “Attitudes” seem to be extremely fleeting 
things, which neither constrain behavior in 
the ways we usually assume nor have the 
permanence we expect. The only permanence 
is an illusion, constructed after the fact to 
explain our current position. As historians of 
our own lives we seem to be, on one hand, 
very inattentive and, on the other, revision- 
ists who will justify the present by changing 
the past. 
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What is the influence of moods or emo- 
tional states on behavior? This is a question 
that has long interested psychologists but 
that has been eclipsed for some years, in part 
because of the difficulty of establishing the 
presence of a given mood state. Despite this 
difficulty, recently there have been renewed 
attempts to study the effect of feeling state 
on behavior, especially on altruistic or help- 
ful behavior. Several authors, for example, 
have examined the effect of guilt on compli- 
ance with a request for help (Carlsmith & 
Gross, 1969; Freedman, Wallington, & Bless, 
1969), on spontaneous helping (Regan, Wil- 
liams, & Sparling, 1972), or on willingness to 
administer electric shock (termed conscience) 
(Rawlings, 1970). Another sample line of in- 
vestigation has centered on the experience of 
empathy with a person in distress as a deter- 
minant of helping (Aderman & Berkowitz, 
1970; Aronfreed, 1968, 1970). In addition to 
such presumably negative states, positive 
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Two field studies investigated the time course of the effect of feeling good on 
helping. Subjects were given small packets of stationery by a confederate who 
went from door to door. Then, at different intervals, each subject received a 
“wrong number” telephone call during which he or she had the opportunity to 
help. Results showed that subjects who had received stationery helped more 
than did those in either of two control groups. The effect declined gradually 
over time, and by 20 minutes after receipt of the stationery, the experimental 
group did not differ from the control groups. 
in helpfulness and the basic relationship between good mood and helping were 
discussed in terms of cognitive processes. 


The time course of the decline 


moods have been postulated and studied as 
determinants of helping. It is this topic, the 
relationship between good mood and helping, 
that we attempted to investigate in greater 
detail. 

The postulated good mood state has been 
induced in a variety of ways and has been 
shown by more than one investigator to Jead 
to helping in a variety of situations (Ader- 
man, 1972; Berkowitz & Connor, 1966; Tsen, 
1970; Isen & Levin, 1972; Isen, Horn, & 
Rosenhan, 1973; Levin & Isen, 1975; Moore, 
Underwood, & Rosenhan, 1973). Moreover, 
many of the converging studies have served to 
eliminate alternative interpretations of the 
findings. Thus, although never directly moni- 
tored or confirmed, the construct of mood, 
independent of other situational variables that 
might also lead to helping, is gradually receiv- 
ing support as a mediating variable in helping 
through converging operations. Moreover, the 
relationship between good mood and desire to 
help is similarly being established by methods 


akin to those suggested by Campbell and 
(1954), and Garner, 


Fiske (1959), Garner 
Hake, and Eriksen (1956): both converging 
operations and discriminant validation (Isen 
& Levin, 1972). However, the postulating of 


nonobservable intervening variables is not 
feel that such 


without risk. For this reason, we 
hypothesizing is not justified by the comfort- 
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able feeling of “understanding” that it may 
provide, but rather is warranted only insofar 
as it leads to further research and allows us 
to organize a broader array of behavior than 
would reference to the operations alone. Posi- 
tive mood state seems a useful construct at 
this stage of investigation because it does 
promise, when understood, to direct research 
along new lines. Thus, we continue to feel 
that a most important question centers on 
the nature of the postulated relationship: 
Why and how does good mood produce its 
effect on helping? 

A first step in studying this question is to 
establish some characteristics of the relation- 
ship between good mood and helping. One im- 
portant aspect is how long the effect lasts, In 
our studies we have typically dealt with 
what would seem to be rather weak manipu- 
lations, and we have been surprised at the 
power that these simple everyday events 
have in producing kindness. For example, 
sensing that the mood induced by a cookie or 
a dime (Isen & Levin, 1972) might be fleet- 
ing, we have been careful in previous research 
to provide an immediate opportunity for 
subjects to help.? Recently, we have become 
interested in just how long such a mood might 
be effective, in part because this may provide 
some clue as to why or by what process this 
effect occurs, but also to get some idea of the 
amount of time with which we are dealing in 
these studies and to gauge the practical ap- 
plicability of these findings. To some extent, 
but not entirely, the theoretical importance of 
the effect depends on its time course; how- 
ever, the practical significance of the effect is 
even more dependent on its lasting some rea- 
sonable amount of time. Thus, the present 
experiments set out to determine the amount 
of time for which induction of good mood 
can be expected to produce increased “every- 
day” helping. 

STUD} 
Method 


Subjects. Subjects were 
who were at home betwe 
and 4 p.m. and were resid 
of Lancaster, Pennsylvan 

Design and procedure. 
conditions: a control gro 
not induced but subjects 


les and 34 females 
hours of 9:30 a.m. 
ı suburbanlike area 


consisted of four 
hich elation was 
mply given the 
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opportunity to help, and three experimental groups, 
in which subjects were put in a good mood and 
then tested for willingness to help either immedi- 
ately, after 5 minutes, or after 10 minutes. These 
times were selected in accord with our intuitive 
notion that the effect would last only a few minutes, 

Good mood was induced by having subjects re- 
ceive, in their homes, a “free sample” packet of sta- 
tionery. This packet consisted of three note cards 
with envelopes, attractively tied together by a small 
piece of ribbon. The value of this packet was ap- 
proximately 20¢. The stationery was given out by a 
female confederate, who went door to door, pre- 
cisely following a prearranged time schedule for 
reaching each particular house. The confederate had 
no knowledge of the experimental condition into 
which a given house fell, since her schedule simply 
consisted of an address with a corresponding time at 
which she was to knock at the door. The definition 
of the experimental condition remained to be com- 
pleted by the experimenter’s making a telephone call 
that would provide the opportunity for helping; the 
confederate had no knowledge of when this call was 
to come. 

Meanwhile, the experimenter was calling these 
same subjects according to her own prearranged 
schedule of telephone numbers and times. The two 
schedules were designed to dovetail in such a way 
that the experimenter would reach some subjects 
immediately after they had received the free sample, 
some subjects 5 minutes, and some 10 minutes after 
the confederate had come to the door. In all cases, 
following the confederate’s knock at the door, 1 min- 
ute was allowed for the subject to answer the door 
and receive the free sample. Thus, in the immediate 
condition the phone call came 1 minute after the 
knock at the door; in the 5-minute condition the call 
came 6 minutes after the knock, and so on. (Prior to 
the session the experimenter’s and confederate’s 
timepieces had been synchronized and allowed to run 
together for 24 hours in order to ensure that their 
timing could be exact.) As was the case with the 
confederate, the experimenter was unaware of the 
condition of any subject at the time that she was 
interacting with him or her. She knew neither when 
nor whether the confederate had distributed sta- 
tionery to the subject. 

When the telephoning experimenter reached a 
subject, she used a modification of Gaertner and 


2One study in which the helping may actually 
have occurred at some time substantially after the 
mood induction is the “letter” study, in which the 
matter of interest was the subjects’ willingness, after 
having received a dime in the coin return of a pub- 
lic telephone, to mail an apparently forgotten letter 
(Levin & Isen, 1975). The case for persistence of 
the positive mood state is not strong in that study, 
however, because in order to help, subjects had to 
initiate the process immediately by taking the letter, 
and once having done so, they may have been com- 
mitted to mailing it at some later time. 
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Bickman’s (1971) “wrong number technique” to 
Provide subjects an opportunity to help. As the 
conversation developed, the experimenter conveyed 
that she had accidentally reached the wrong house, 
that she must have been given the wrong number, 
and that she had spent her only change in the 
process. The sitaution was not presented as an emer- 
gency, but she asked the subjects whether they 
would look up the number, make the call, and con- 
vey a message for her. The dependent measure was 
whether or not a subject agreed to make the call for 
the experimenter. 

A second confederate was waiting at the correct 
number to receive the calls. He simply listened to 
the subject’s message and thanked him or her for 
calling. All three experimenters recorded the sex and 
any identifying characteristics of the person with 
whom they had interacted. This was done in order 
to be able to eliminate cases where the person who 
received the stationery, the person who answered the 
telephone, and the person who made the helping call 
were obviously different; but in fact such a lack of 
correspondence occurred only very infrequently. By 
the use of this admittedly crude indicator, it seemed 


100 
90 


80 
70 
60 


(5) 


Percentage Helping 


LEVEL OF HELPING 
IN CONTROL GROUPS 
——— (30) NO CONTACT —— (11) DEMONSTRATION 
(41) No CONTACT 


387 


that in the majority of cases the person who an- 
swered the phone was the one who had received the 
stationery. All subjects who agreed to call actually 
did so, and there was never an apparent discrepancy 
between the person reached by the experimenter and 
the person who called Confederate 2. 


Results 


Figure 1 (dotted line) shows the percentage 
of subjects in each condition who helped, and 
(in parentheses) the number of subjects in 
each condition. It should be noted that only 
1 control subject out of 11 helped, whereas a 
substantial percentage of subjects in the three 
experimental conditions helped. It should also 
be noted that the percentage who helped was 
greatest in the 5-minute condition, Tests for 
the significance of the difference between two 
proportions indicated that the level of helping 
in the control condition was significantly 
lower than that in every other condition (p 
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< 025). Subjects in the 5-minute condition 
tended to help more than those in the 10- 
minute condition ( b = .08) ; however, the dif- 
ference between the immediate and 5-minute 
conditions did not reach customary levels of 
significance (p = -16, two-tailed). Over all 
conditions, 62.5% of the males and 55.9% of 
the females helped. 


Stupy 2 


The results of Study 1 indicated that in our 
situation people who had received stationery 
helped more than those who had not, and the 
data seemed to suggest that less helping would 
occur after 10 minutes than after 5. However, 
subjects in even the 10-minute condition 
helped at a significantly greater rate than did 
those in the control condition, This suggested 
that our intuitive expectation that the effect 
would disappear after 5 minutes was incorrect. 
Thus, we designed a second experiment to 
study the effects of periods of time in excess 
of 10 minutes. Moreover, it appeared that 
differential units of 5 minutes might be too 
large. Thus, Study 2 includes conditions where 
helping is measured at 3 minute intervals up 
to 20 minutes after receipt of the free gift. 
Study 2 also introduces a second control 
group, in which subjects are called to the 
door by the confederate but do not receive a 
gift. This condition was instituted in order to 
control for the several possible effects associ- 
ated with the confederate’s presence (e.g., 
interaction with a person just prior to the re- 
quest for help, distraction from previous ac- 
tivity, etc.). In most other ways Study 2 is 
similar to Study 1, 

Method F 


Subjects. Subjects were 110 residents of suburban 
areas of Baltimore, Maryland (46 subjects) or Lan- 
caster, Pennsylvania (64 subjects) who were at home 
between 9:30 am and 4 pm. Of the total, again, 
approximately 80% were female and 20%, male. 
Both blacks (21%) and whites (79%) participated 
in Study 2. 

Design and procedure. The design of Study 2 was 
similar to that of Study 1, except that Study 2 was 

- expanded to include additional time periods and to 
incorporate an additional control group. Subjects 
were called either 1, 4, 7, 10, 13, 16, or 20 minutes 
after receipt of stationery; the second control group 
Was one in which the confederate did ring the bell 
and make contact with the subject but merely “dem- 
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onstrated” what the new stationery looked like and 
asked for the subjects’ opinions. She did not give 
them a free sample. People in this condition were 
telephoned either 4, 7, or 10 minutes later, Again, 
the confederate and the experimenter were unaware 
of the subject’s condition when they interacted with 
him or her, except that the confederate knew which 
subjects were “demonstration” controls, as this 
could not be avoided. The experimenter, however, 
did not know which subjects were in that condition 
when she telephoned. 

The confederate again gave out stationery, this 
time whole packages worth 39¢ each; her procedure 
was similar to that of the first study, except in the 
case of the demonstration control condition, There, 
the occupant was not given a free sample but was 
shown two types of stationery and told that the 
company was interested in having her (or him) see 
these two examples of its new line of stationery. The 
subject was asked which of the two she (he) pre- 
ferred, and the confederate then nodded, smiled, and 
politely took leave of the subject. In all conditions, 
subjects occasionally expressed the expectation that 
the confederate was intending to sell the stationery 
to them. If this occurred, the subject was pleasantly 
assured that this was not the case, 

While the confederate was making her way down 
a street, following her schedule of addresses and 
times, as in Study 1, the experimenter at the tele- 
phone was calling each number on her schedule at 
the exact appointed time. When a subject answered 
the phone, the experimenter asked for Victor. In a 
procedure similar to that of Study 1, when the ex- 
perimenter was told that she had reached the wrong 
number, she haltingly said the following: 


Oh, no . . . my brother is at Victor’s, and I’m 
supposed to pick him up there. He’s Waiting for 
me... . I’m in a phone booth and just used my 
last dime to make this call. , +. Do you think you 
could call him for me? Could you look the num- 
ber up? Apparently Information has given me 
the wrong number. 


A short pause after the statement that she was call- 
ing from a pay phone and had just used up her 
change gave subjects the opportunity to interrupt 
and volunteer to help. As in the earlier study, they 
occasionally did so, at which time the experimenter 
responded appropriately, If subjects did not inter- 
the experimenter continued to the 
end of the statement and then paused to give the 
subject time to answer. (Once the experimenter had 
perfected her timing, before the start of the first 
study, no subject hung up on her before she had 
made her situation known.) Typically, at the conclu- 
sion of the Statement, the subject did one of three 
things: hung up without a word, refused and gave 
an excuse, or agreed to help. If the subject’s reply 
Was negative, the experimenter simply said, “O.K. 
+ + + never mind . . . sorry.” If the subject agreed 
to help, however, the experimenter gave her “Vic- 
tor’s” full name and address so that the subject 


ad 


} 
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could look up the number and make the phone call. 
Then she thanked the subject, hung up, and recorded 
the subject's helping response (i.e, whether he or 
she agreed to call Victor). Estimates of each sub- 
ject’s age, race, and sex, as well as any notable char- 
acteristics of voice,or speech, were also recorded for 
purposes of identification, as in Study 1. There was 
-* “er an apparent discrepancy between the person 
by the experimenter and the person who 

confedefate, and again, as in the first 

bjects who agreed to call actually did so. 


amount of helping and pattern of re- 
sults obtained in Baltimore, Maryland, did 
not differ from those obtained in Lancaster, 
Pennsylvania, Therefore, the data from the 
two locations were combined for analysis. 
Figure 1 (solid line) shows the percentage 
of subjects in each condition who helped and 
(in parentheses) the number of subjects per 
condition.® Since the experimental conditions 
were created by differential amounts of time 
having been allowed to elapse before help 
was sought, Figure 1 represents the amount 
of helping over time. Beginning with the 4- 
minute delay condition, amount of helping 
“appears to decline, and by about 20 minutes 
from the time of mood induction, it has re- 
turned to the baseline (control group) level 
tof about 12%. 
In order to test for a decreasing trend in 
J these data, an analysis of variance, on the 
dichotomous data coded as 0 or 1, was per- 
formed. This analysis revealed a significant 
main effect, F(6,62) = 3.26; p< 007, and 
the trend analysis (unweighted means) indi- 
cated a significant linear component with 
negative slope, F(1, 62) = 13815, p< .001. 
Examining the data, we see that the 1-, 4-, 
and 7-minute conditions appear to cluster to- 
gether at approximately an 83% rate of help- 
ing; the 10-, 13-, and 16-minute conditions, 
; at 50%; and the 20-minute and control con- 
{ ditions, at about 12%. Chi-square tests re- 
a vealed these differences between the grouped 
` conditions, (1, 4, 7) versus (10, 13, 16) and 
(10, 13, 16) versus (20, control, demonstra- 
tion control), to be significant (x? = 5.85, 
12.10; p < .02, < .001, respectively). A fur- 
a? : ther indication that the free-sample conditions 
| (with the exception of the 20-minute condi- 
tion) differed from the control conditions 1s 
revealed by a Fisher exact test, which showed 
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that the 16-minute condition differed from the 
control conditions (p < .025). 

These data include subjects who were not 
originally assigned to the control condition, 
but who were reassigned to that condition, 
when we discovered that the confederate had 
not been able to deliver the stationery as in- 
tended, even though the experimenter had 
succeeded in reaching the subjects by tele- 
phone. In other words, subjects who were 
not at home (or who may have refused to 
answer the door) when the confederate rang 
the doorbell, but who did answer their tele- 
phones when the experimenter called, were re- 
assigned from their originally intended condi- 
tions to the control condition. If examined 
without these reassigned control subjects, the 
data remain essentially the same (control n 
= 20, percentage helping = 10). 


GENERAL DISCUSSION 


The results of these studies indicate that 
people who receive a free sample package of 
stationery are more willing to help a stranger 
by looking up a telephone number and mak- 
ing a phone call for her than are control sub- 
jects who have not been previously contacted 
or who have been contacted but have not 
received a free sample, The results also dem- 
onstrate that this relationship lasts about 20 
minutes. That is, while the amount of helping 
appears to decline gradually over time, it is 
not until 20 minutes between receipt of the 
free gift and the request for aid that an ex- 
perimental group fails to differ from the con- 
trol groups. 

We feel that these studies provide further 
support for the idea that being in a good 


3 As in the first study, subjects were randomly 
assigned to treatments: The unequal number of sub- 
jects per condition resulted from an uneven loss of 
subjects, Factors such as subjects being out when 
the experiment took place, subjects being on the 
telephone when the experimenter tried to telephone, 
or (occasionally) experimenter and/or confederate 
being unable to adhere to the prescribed timetable, 
were responsible for subject Joss. In the first study, 
which was conducted over & number of weeks, only 
a few subjects were run each day. Thus, lost sub- 
jects could be replaced the following day. However, 
in the second study, many subjects were run each 
day, leaving insufficient time to equalize numbers 
of subjects in each condition. 


: 
> 
ci 
: 


390. 


mood can lead to helping, since they involved 
induction of good mood in still another way 
and again demonstrated its effect on help giv- 
ing in a new situation, Equity considerations 
should have been at a minimum in this situa- 
tion, as suggested by Latané and Darley 
(1970), because of the use of the commercial 
free sample. And the failure of the demonstra- 
tion control group to differ from the no-con- 
tact group in helpfulness makes alternative 
hypotheses attributing increased helping to 
mere interaction with another person, or to 
change in focus of attention as a result of 
having been called to the door, seem less com- 
pelling than the mood interpretation. Again, 
however, we must emphasize that the impor- 
tant question is why good mood should lead 
to helping, because the answer to this ques- 
tion promises to direct research along new 
lines. We shall consider this issue below. 
In examining the data of these studies, two 
points should be made, First, the similarity 
of the results of the two experiments should be 
noted. With regard to the specific values ob- 
tained for comparable conditions and the 
overall curve that emerged the two studies 
are consistent with one another. Second, it is 
worth pointing out that in both studies, most 
helping appeared to be obtained in the 4- or 
5-minute condition rather than in the immedi- 
ate or 1-minute condition, as might have been 
expected. Although this difference did not 
reach the customary .05 level of statistical 
significance in the first study, with 10 subjects 
in each condition (the Fisher exact test re- 
vealed = .16, two-tailed), and probably 
would not have been significant if tested in 
the second study, with 12 and 5 subjects per 
condition, one cannot help but notice the 
tendency toward a difference and the persis- 
tence of the pattern in the two studies; some 
discussion of the issue may be of interest. One 
possible explanation for the pattern is that 
subjects whose phones rang within 1 minute 
of their interacting with the person at the 
door were physically more harried than sub- 
jects whose call came a few minutes later. 
Thus, the call and its request of them may 
have seemed more burdensome than it would 
have seemed 2 or 3 minutes later. Or, it may 
be that immediate and 1-minute subjects were 
still psychologically distracted at the time of 
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the call, so that the request fell, to some 
extent, on “deaf ears.” Another possibility is 
that the receipt of the stationery takes some 
small period of time to “sink in” and have its, 
maximum effect. This time period, apparently 
something between 1 and 4 minutes, probably 
does not represent simply the time required 
for taking in the information and storing it in’ 
memory, since this requires at most a few sec- 
onds and since some substantial effect of the 
free gift is observed even in the immediate 
condition. The information, we can assume, 
has already been processed by the time sub- 
jects in the immediate and 1-minute condi- 
tions have answered the telephone. An in- 
triguing possibility is that the period of 2-4 
minutes, which differentiates the immediate 
and I-minute conditions from the 5- and 4- 
minute conditions, is used by subjects to 
rehearse cues or aspects of the events that 
have just happened, and that this rehearsal 
heightens the effect of the event and allows it 
to reach its maximum. There are several 
ways in which this might occur, and we would 
like to develop this issue further, in the 
context of the overall decline in helping. 

One possible explanation for the observed ' 
decline, over time, in helping might be in 
terms of simple memory for the positive 
event: that after 20 minutes subjects no 
longer remembered that they had received ¿ 
free gift. We suspect, on the contrary, that 
this was not the case; that if asked, subjects 
in the various conditions would have probably 
been equally capable of remembering that 
they had received a package of stationery, 


& Isen, 1975; Isen, Note 1), the differences 
between conditions (both the effect of having 
received the free gift and the decline of this 
effect over time) may be due to mood-based \ 
differences in what the person is actively 4 
thinking about, the categories for the process- .}) 
ing of new information that are available or 
salient to him (or her), and his (or her) per-_ 
ception, on that basis, of costs and rewards’ | 
for helping. Pilot studies indicate that a gen- 
eral optimism does seem to prevail during a 


Rather, as suggested in earlier articles (Levin j 


sacos t 
*Deese and Hulse (1967) reported that simple 
nonsense syllables are processed in something under 


1 second. 
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good mood (Isen, Note 1; Isen, Clark, & 
7 Karp, Note 2; Isen & Walker, Note 3, Note 
4), which lends support to our interpretation. 
That is, upon receipt of a free.gift, a person 
may reflect on its positive implications and 
may find other positive associations more 
readily accessible than they otherwise would 
be. This induces and heightens the mood 
state. However, there is a limit to these im- 
plications and associations, the time available 
for thinking about them, or the ability to 
keep associations relevant; and after having 
reflected upon them for as long as they war- 
rant or as is possible, a person’s thoughts turn 
elsewhere. As this occurs, the good mood that 
has been induced gradually dissipates. In ad- 
dition, so does desire to maintain it, all of 
those cognitive processes that result from it, 
and any behavior that follows from it. 
Several questions about the dissipation of 
the helping effect over time are interesting to 
pursue, One such question is whether the in- 
tensity of the mood might affect the time 
course or pattern of the relationship between 
mood and helping. It is possible that greater 
elation could lead to a longer lasting inclina- 
tion to help. It is also possible that the rela- 
tionship is not sensitive to level of mood but 
that beyond a certain threshold level, feeling 
_ good leads to helping, and that once evoked, 
> this relationship lasts for about 20 minutes. 
In order to test such a proposition, one 
would have to have some means of inducing 
moods of varying intensity. One suggestion 
has been to distribute gifts of varying value 
and compare the helping reactions that fol- 
low. There are really two questions here. One 
involves the effect of intensity of mood on 
duration of the inclination to help. The other 
involves the determinants of intensity of 
mood. We presently know little about the 
relationship between the value of the gift and 
the intensity of the mood induced. Presum- 
| ably, more valuable gifts make people feel 
happier; yet there may be other variables 
that intervene to complicate this relationship. 
(For example, if factors such as rehearsal or 
cue salience associated with the mood-induc- 
ing event, as mentioned above, play a role in 
level of mood induced, then attention simply 
to the value of the gift might be misleading. 
Jn addition, from another point of view, gifts 
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of great value may induce a rather negative 
sense of inequity (Walster, Berscheid, & Wal- 
ster, 1973, p. 168) or may result in suspicion 
or perception of ingratiation (Jones, 1964). 
These latter two states may lead to reactance 
(Brehm, 1966).) Thus, the relationship be- 
tween value of the gift and level of mood 
induced should be studied separately from 
the question of the relationship between level 
of mood and duration of the tendency to help. 

In light of our earlier discussion of mood 
as a cognitive process that is influenced by 
other cognitive activities such as rehearsal, 
several other factors suggest themselves as im- 
portant in the relationship between mood and 
helping. If rehearsal, or thinking about posi- 
tive associations, is crucial to mood mainte- 
nance, then the duration of the helping effect 
should be influenced by how much rehearsal 
an event induces, how large a network of 
pleasant associations it has, or the number of 
positive implications that it carries with it 
rather than by simply its value per se. Unex- 
pectedness of the positive event and oppor- 
tunity for rehearsal after such an event also 
emerge as potential variables of interest. 

We would like to consider briefly an alter- 
nate interpretation of these studies: that the 
effect does not actually dissipate with time 
but that the passage of time allows for the 
subject to encounter opportunities to help, 
and that once a person has helped in these 
circumstances, he does not help again when 
the caller reaches him with her request. In 
order for this interpretation to account for 
our results, a substantial number of our sub- 
jects would have had to have encountered, 
accidentally, nonplanned additional oppor- 
tunities to help within the time period studied, 
This does not seem likely to us. In addition, 
underlying this interpretation is the assump- 
tion that once a person has helped, he is 
unlikely to help again; however, the latter 
question is unresolved. There is at least some 
evidence supporting the opposite position, The 
early work by Freedman and Fraser (1966) 
on the “foot-in-the-door” technique tends to 
support the expectation that one who has 
helped would be more likely to help on a 
subsequent occasion. Likewise, a study by 
Weiss, Buchanan, Alstatt, and Lombardo 
(1971) found that altruism is reinforcing in 
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itself; this would imply that one who has 
helped might go on helping. And finally, a 
direct test of the question (Harris, 1972) 
showed that those who agreed to help on an 
initial task were also subsequently more help- 
ful. Although there might be circumstances 
under which helping would reduce the incli- 
nation to help subsequently, such circum- 
stances need to be delineated. Thus, given 
the state of the research on the topic and the 
improbability of subjects (in large enough 
numbers to account for the differences be- 
tween the conditions) encountering opportu- 
nities to help within the time period studied, 
the argument that helping declines, not over 
time, but because of alternative opportuni- 
ties to help, does not appear warranted. 
Finally, we would like to suggest that 
knowledge of parameters such as duration of 
an effect or circumstances that alter relation- 
ships within it may prove helpful in under- 
standing the nature of the effect. That is, it 
may be possible to distinguish between behav- 
iors that appear similar but that in fact have 
different antecedents and consequences and 
are mediated by distinct processes. For exam- 
ple, in addition to the evidence that good 
mood leads to helping, there is also evidence 
that guilt, embarrassment, failure, or other 
“bad moods” can lead to helping, depending 
on the circumstances (Cialdini, Darby, & 
Vincent, 1973; Freedman, Wallington, & 
Bless, 1967; Isen, Horn, & Rosenhan, 1973; 
Regan, Williams, & Sparling, 1972; Staub, 
Note 5). However, helping that results from 
one of such bad moods may be the result of a 
unique process and may differ in specifiable 
ways from helping that results from good 
moods. Studies investigating parameters such 
as the duration of these effects and the extent 
or nature of the helping that results from 
each of these may shed light on this question. 
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Several studies are reviewed that focus on the development and validation of 
the Test of Attentional and Interpersonal Style (TAIS). The TAIS was de- 
veloped in an attempt to respond to the criticisms that assessment procedures 
(a) have little predictive utility, (b) fail to lead to the selection of particular 
treatment approaches, (c) do not take into account situational variables, and 
(d) are not based on a sound theoretical or conceptual base, A summary of 
research efforts indicates that the test has good test-retest reliability as well as 
some construct and predictive validity. An advantage of the TAIS relative to 
other psychological measures is that its test profile describes particular atten- 
tional and interpersonal characteristics, These characteristics are operationally 
definable, and they permit prediction of performance across a variety of life 


situations, 


Over the past decade increasing criticism 
has been leveled at existing assessment in- 
struments. The reliability, validity, and pre- 
dictive utility of test results have been ques- 
itoned by some of the foremost psychologists 
in the country (McClelland, 1973; Meehl, 
1954; Rotter, 1973; Wallace, 1966). One 
major concern is that traditional assessment 
procedures have been oriented to the mea- 
surement or inference of needs and/or traits 
and have not sufficiently taken into account 
the many situational variables that influence 
behavior. Although psychologists are aware 
of these shortcomings, they have done little 
to correct them. 

Rotter (1973), writing about the future of 
clinical psychology, cites several reasons for 
the failure to develop more effective assess- 
ment devices. The proliferation of treatment 
procedures in psychology, the obvious im- 
portance of situational factors in predicting 
behavior, and the lack of a good unifying 
theory have resulted in a stimulus jungle. As 
the complexity of the problem and the de- 
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mand for treatment and research productiv- 
ity increase, fewer individuals are willing or 
able to spend time developing new assessment 
techniques. Clinical psychologists are finding 
themselves forced either to get involved in the 
delivery of health care services or to engage 
in a research program with more immediate 
payoff. This is particularly disheartening be- 
cause unless some methods are developed to 
select and evaluate different kinds of treat- 
ment approaches to different kinds of clients 
and unless some unifying theoretical con- 
structs emerge, clinical psychology cannot 
expect to advance as a science. 

Notwithstanding the difficulties in assessing 
the environmental, interpersonal, and intra- 
personal variables that influence an individu- 
al’s behavior, some concrete suggestions have 
been offered. Illustratively, it has been ar- 
gued that we should define variables more 
explicitly to provide more structure for cli- 
ents and sample the behavioral criteria we 
are attempting to measure. Wallace (1966) 
refers to this as assessing the behavioral 
competency of the individual in particular 
situations. Although this is important, we 
must also be able to generalize lest we end 
up measuring literally thousands of behav- 
ioral competencies that individuals must have 
to function effectively in complex job or life 
situations. 

The concept of personality traits can be 
seen as one way of identifying individual 
characteristics that allows for the prediction 


of behavior across situations. Unfortunately, 
trait concepts, used in the past, have failed 
to provide much predictive validity (Mischel, 
1968). McClelland (1973) has suggested that 
psychologists identify and assess competen- 
cies that are involved in “clusters of life out- 
comes.” Although he does not use the word 
trait, he implies that some of these compe- 
tencies could be identified as “personality 
variables” (e.g., patience). McClelland differs 
from prior approaches by emphasizing situa- 
tional analyses that can help to locate gen- 
eralizable competencies. He also identifies 
interpersonal skills (e.g., communication 
skills) and “operant thought patterns” (e.g., 
need achievement) as two prime areas to ex- 
“amine for generalizability. 

The arguments thus far reviewed lead to 
the position that assessment devices should 
be (a) as situation specific as possible (e.g., 
questions should be phrased to reflect actual 
behavior in particular settings), (b) designed 
in unambiguous ways (i.e., the examinee 

. should know what is being measured and the 
relevance of these measures to his or her life 
[Wallace, 1966]), and (c) useful, subject to 
evaluation, based on good theoretical con- 
structs, and lead to prescriptive statements. 

4 Meeting these requirements will make it 

~ easier to determine the usefulness of assess- 

ment devices in selecting and evaluating 
treatments (Rotter, 1973). Finally, which- 
ever competencies, traits, or personality varia- 

; bles are measured should be generalizable to 

l clusters of life situations (McClelland, 1973). 


The Need for Unambiguous Constructs 


The criticisms mentioned, particularly the 
lack of predictive utility associated with most 
assessment instruments, underlie the present 
attempt to develop a behavioral measure with 
good predictive utility. Some past difficulties 
in using traditional assessment devices predic- 
tively result from limits in their interpreta- 
tion (ie, in applying their implied con- 
structs). For example, scales labeled hysteria, 
psychesthenia, and ego strength often mean 

$ different things to different people. Because 
they all typically include so many elements, 
it is difficult to make reliable predictions of 
a person’s behavior from them (Jackson, 


1971). 
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A requirement for a test’s predictive utility 
is that the variables it presumes to measure 
be directly and unambiguously translatable 
to particular interpersonal or environmental 
situations. Thus, we should be able to look 
at a situation and assess the strength of its 
demand for “hysterical,” “schizophrenic,” or 
“self-actualized” behavior, Unfortunately, be- 
cause most clinical and personality variables 
have surplus meaning, this is a very difficult 
task, 

Given this way of thinking, a logical first 
step in developing a useful test should be to 
identify variables which, as McClelland sug- 
gests, are important across a variety of life 
situations and are as free as possible of am- 
biguity and suxplus meaning, Since most of 
the situations to which we ultimately want to 
predict involve interpersonal interactions, this 
is an area of obvious importance. Beyond 
the interpersonal area, however, there is a 
large body of theoretical and empirical litera- 
ture suggesting that knowledge of an indi- 
vidual’s attentional processes is critical to 
understanding and predicting his or her per- 
formance. 


Attention and Performance 


The importance of attention for under- 
standing and predicting behavior has long 
been emphasized in psychology. Attentional 
constructs have formed the basis of theories 
designed to explain phenomena ranging from 
schizophrenia (Shakow, 1962) to some al- 
tered states considered to reflect “higher lev- 
els of consciousness” (Nideffer, 1974). It is 
hard to imagine a variable more central to 
performance than the ability to direct and 
control one’s attention (Nideffer, in press). 

Attention can be conceptualized on at 
least two dimensions: breadth of focus and 
direction. Attention can vary from very nar- 
rowly focused (filtering out a great deal of 
information) to very broad (Cromwell, 1968; 
Easterbrook, 1959; Wachtel, 1967). We can 
also attend to external (environmental) or 
internal (thoughts, feelings) stimuli (Heil- 
brun, 1972; Shakow, 1962). Most people 
maintain a balance between these two direc- 
tions, shifting from one to the other, Figure 
1 presents a way of conceptualizing these two 
attentional dimensions, At any single time, 
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Ficure 1. Theoretical consequences of particular attentional styles. 


attention can be described on the basis of 
both its width and direction. 

Conceptually, it is relatively simple to ap- 
ply the attentional dimensions of width and 
direction to specific environmental situations. 
Conducting sensitivity training, for example, 
demands a broad external focus, whereas 
working out a computer program calls for a 
much narrower internal focus. Attentional 
competencies, such as controlling width and 
direction, should change adaptively as a func- 
tion of specific environmental demands. 
Knowledge of attentional abilities and en- 
vironmental demands should increase the ac- 
curacy of prediction of task success or failure. 

The remainder of this paper provides a 
description of the development and validation 
of the Test of Attentional and Interpersonal 
Style (TAIS). The presentation is divided 
into the following five sections: (a) test de- 


velopment, (b) group difference studies, (c) 
construct validity, (d) predictive validity, 
and (e) a discussion and summary. 


Test DEVELOPMENT 


The Test of Attentional and Interpersonal 
Style (TAIS) was developed using college 
students as subjects. On the basis of the theo- 
retical formulations of Shakow (1962), Crom- 
well (1968), and Nideffer (1974), and the 
clinical observations of MacKinnon and 
Michels (1971) and Frank (1961), 17 dif- 
ferent aspects of attentional and interpersonal 
behavior were identified. Following sugges- 
tions made by McClelland (1973), Wallace 
(1966), and Jackson (1971), an attempt was 
made to develop a pool of unambiguous items, 
reflecting behavioral experiences, in the 17 
areas considered important for predicting 
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performance and making specific treatment 
recommendations. 

5 Of the 17 TAIS variables, 6 reflect atten- 
tional | processes, 2 reflect behavioral and 
cognitive control, and 9 describe interpersonal 
style. Table 1 defines the TAIS subscales. 

An item analysis procedure based on a 
sample of 302 undergraduates was used to 
develop the 17 rationally defined TAIS sub- 
scales. Each subscale item was correlated with 
its total subscale score, correcting for the 
item’s inclusion in the total score. Items that 
did not exceed a probability value of .01 
were dropped. This procedure identified 144 
items belonging to the 17 subscales. 
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Table 2 reports the number of items on 
each TAIS subscale and the amount of item 
overlap between scales. There is a consider- 
able overlap between two scales measuring 
effective broad external and internal atten- 
tion (BET and BIT) and the one that mea- 
sures the amount of information an individual 
processes (INFP). 

Table 3 presents the TAIS interscale cor- 
relations. These range from .01 to .80 with a 
median of .28. Although it is conceptually 
useful to think of amount of information 
processed independently of attentional width, 
it is unreasonable to assume that these varia- 
bles are, in fact, unrelated, Some of the over- 


TABLE 1 


Tur Test OF ATTENTIONAL AND INTERPERSONAL STYLE (TAIS) Scares 


Description 


Scale Abbreviation 

Broad external attentional focus BET High scores on this scale are obtained by individuals who describe 
themselves as being able to effectively integrate many external 
stimuli at one time. 

Overloaded by external stimuli OET The higher the score, the more individuals make mistakes because 
they become confused and overloaded with external stimuli, 

Broad internal attentional focus BIT High scores indicate that individuals see themselves as able to 
effectively integrate ideas and information from several different 
areas. They see themselves as analytical and yhilosophical, 

Overloaded by internal stimuli OIT The higher the score, the more mistakes individuals make because 
they confuse themselves by thinking about too many things at 
once. 

Narrow attentional focus NAR The higher the score, the more effective individuals see themselves 
with respect to being able to narrow their attention when they 
need to (e.g, to study or read a book). 

Reduced attentional focus RED A high score on this scale indicates that the individuals make 
mistakes because they narrow their attention too much, S 

Information processing INFP High scorers tend to process a great deal of stimulus information. 
Their perceptual-cognitive worlds are busy. k 

Behavior control BCON A high score indicates that individuals tend to be somewhat im- 
pulsive. In addition, they engaged in behavior that could be 
considered antisocial, though not necessarily harmful. 

Control scale CON A high score indicates that individuals are in control of most of the 

' situations, interpersonal and otherwise, they find themselves in. 
It also means they attempt to gain control of situations, 

Self-esteem SES The higher the score, the more highly the individuals think of 
themselves. , A 4 

Physical orientation P/O A high score indicates the individual participates in and enjoys 
compete Ar. h tendency fo ruraioats ane 

si OBS A high score indicates the person as a J 

Oe waty about one particular thing without any real resolution or 
movement, - R 7 

Extroversion EXT A high score indicates the individual is warm, outgoing, needs to 
be with other people, a the life aoe ald sas estore: KO 

i T iah score indicates. the person Ukes o be alone, enjoy’ 

na ao Ny Be pital times, and avoids being the center of attention, 

Intellectual expression TEX A high score indicates that individuals express their thoughts and 
ideas to other people. aF f hel d 

Negative affective expression NAE A high score indicates that individuals express their anger an 
negative feelings to others. | : ` 

PAE A high score indicates the individuals express their feelings of affec- 
ys. 


Positive affective expression 


tion to others in both phy 


ysical and verbal w 
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TABLE 2 
Trem OVERLAP on TAIS SUBSCALES 
Total no. 

Scale BET OET BIT OIT NAR RED INFP BCON CON SES P/O OBS EXT INT IEX NAE PAE items 
BET — o 1 0 0 o CO 1-02 30 OO Reo o o 0 6 
OET in ty Ol y ai 0 0 2 0 OS; 1 0 0 0 0 0 12 
BIT — o0 o -2 6 0 Oe EE E cee: o 0 0 8 
OIT — -i 1 o 0 RAS ER EN OER a 0 0 0 o 9 
NAR = 4 2 0 OL) O I ese le 2, o 0 0 u 
RED — 4 0 OAE eB ie coh an o 0 o 15 
INFP -> o Ose Oras ated. o o 0 19 
BCON — Os” ete T 30, o 3 0 13 
CON Se! AE AN ets eae 1) 4 3 0 20 
SES SS 03 20: sue 1 18 
P/O SO ede ty 0 0 0 0 7 
OBS — o 0 o 0 0 9 
EXT — 0 ixo 3 14 
INT = O =1 11 
IEX — 1 0 8 
NAE _ 0 9 
PAE — 9 

Totalno. 

items A o A E E LE lS) aD A EA A ETRA e) 


Note, TAIS = Test of Attentional and Interpersonal Style. 


lapping items on the three scales are as fol- 
lows: “All I need is a little information and I 
can come up with a large number of ideas” 
(BIT); “Even when I am involved in a game 
or sport my mind is going a mile a minute” 
(BIT, INFP); “It is easy for me to focus on 
a number of things at the same time” (BET, 
INFP); “I am good at quickly analyzing com- 
plex situations around me such as how a play 
is developing in football or which of four or 
five kids started a fight” (BET, INFP). 


Overlap, where it occurs, obviously acts to 
inflate interscale correlations. 

Given item overlap and some high TAIS 
interscale correlations, questions can be raised 
about the independence of the 17 variables. 
Jackson (1971) has suggested one method of 
assessing subscale independence. The mean 
correlation (corrected for bias) between sub- 
scale items and total subscale score is com- 
puted. The remainder of the test items are 
then correlated with total subscale scores to 


TABLE 3 
TAIS INTERSCALE CORRELATIONS (n = 230) 


Scale BETOET BIT OIT NAR RED INFP BCON CON SES P/O OBS EXT INT IEX NAE PAE 


BET == =.26 52 —.26 23° =.28 68 —.15 
a 38 71 —.60 52. =,22 63 

—=23 34 —.48 80 —.14 

= —.49 S5 —.16 54 

= —.07 1A =T 

= = a2 25 

E 05 


51 43 29 —.28 33 15 18. 0 35 
—.26 —4t —22 39 —.05 20 —.13 25 —.13 
58 59 28 —,37 39 —.10 47 —.02 33 
14 —41 = 13 36 —.13 32 —.06 36 —.21 
19 30 16 08 05 —.07 i4 =22 l4 
—.28 —.44 —.09 77 —.16 21 —,24 12 =.18 
61 58 30 —.39 52 — 16 45 02 46 
0S —.27 01 07 02 14 —.01 55 —.10 
— 64 58 =.27 „51 —.08 56 28 35 


Nole. TAIS = Test of Attentional and Interpersonal Style. 


` psychology students (45 
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5 Prepicren 
IN THE 


% of items 
not scored 
with correla. 


Scale items 
mean 


Scale No. items correlation tion > Ẹ 
BET 6 -000 
OET 12 O15 
BIT 8 -000 
OIT 9 007 
NAR 11 007 
RED 15 022 
INFP 19 016 
BCON 13 «000 
CON 20 016 
SES 18 015 
P/O 7 «000 
OBS 9 000 
EXT 14 :007 
INT 11 -000 
IEX 8 .000 
NAE 9 -000 
PAE 9 -000 


Note. TAIS = Test of Attentional and Interpersonal Style, 


determine the percentage that correlate with 
the score as high or higher than the mean 
for the subscale items. Table 4 presents mean 
item-scale correlations for 17 TAIS scales 
and for each scale indicates the percentage of 
nonsubscale items that exceed the mean item- 
scale r. The highest Percentage of overlap is 
only .022 (three items), 


Test-Retest Reliability 


TAIS test-retest reliabilities based on a 2- 
week interval were Studied for introductory 
males and 45 fe- 
males), Correlations ranged from a low of .60 
on the obsessive scale to .93 on physical ori- 
entation, with a median of .83 (Wolfe & Ni- 
deffer, Note 1). 


Group DIFFERENCE STUDIES 


An examination of response differences on 
the 17 TAIS subscales was examined by com- 
paring the scores of 74 male and 123 female 
introductory psychology students. Students 
volurNeered to take the TAIS to fulfill a 
course requirement for participation in some 
experimental studies. The test was admin- 
fstered to groups of subjects (35 to 50), who 
were told that the data were being used to 
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establish norms for the 
Note 2), 

There were significant differences between 
males and females on five of the 17 TAIS sub- 
scales, Males described themselves as (a) bet- 
ter able to broaden and internally focus at- 
tention, (b) better able to narrow attention 
when it is required, (c) more physically ori- 
ented, (d) more intellectually expressive, and 
(e) less expressive of positive affect, These 
results are presented in Table 5, 

The differences between males and females 
can be explained on the basis of social learn- 
ing discrepancies. Men are typically described 
as more physically oriented, more openly 
competitive, and less expressive of positive 
affect than females, Thus, differences on these 
interpersonal scales are not surprising, With 
Tespect to the attentional differences found, it 
can be postulated that the need to be in con- 

trol and competitive causes men to make 
more use of focused ( narrow) and more ana- 
lytical (broad-internal) attentional processes, 
Such an explanation assumes that attentional 
processes can be modified through learning, 
From a therapeutic point of view, one would 


instrument (Nideffer, 


TABLE 5 


GROUP Comparisons 


ale vs. female 
introductory 
psychology 
students 


Police applicants 
vs, male college 
studenta 


t 20> t 20+ 
—=1.19. >.20 2.73 <01 
—1.88 <.10 — 6.68 <.002 
2.64 <.01 1,51 >10 
= 95 >.20 — 6,55 <.002 
2.20 <05 542 <.002 
82 >.20 =314 <.002 
—72 >.20 2.21 <05 
OL >20 S01 <.002 
97 >.20 2.50 <.02 
1.05 >.20 399 <.002 
3.02 <01 3.82 <.002 
0.00 >.80 — 58 >50 
—83 >20 287 <0 
145 >.10 —632 <.002 
IEX 2.00 <.05 240 <.02 
NAE 32 >.50 —3.69 <.002 
PAE 2.39 <.02 4.80 <.002 
Nole. TAIS = Test of Attentional and Interpersonal Style, 


had a higher score on the particular scale than the first 
meses {e.g., male college students scored higher on ORT than 


ice applicants), 
PO lane that the test for significance was two-tailed, 
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hope that this is the case; however, these 
changes remain to be demonstrated, 

Results of a second group difference study 
are also presented in Table 5. Nideffer and 
Wiens (Note 3) compared the scores on the 
17 TAIS subscales of 60 white male police 
applicants in a large metropolitan city with 
those of 97 male college students. The two 
groups differed significantly on 15 of the 17 
subscales, Differences that occurred indicated 
police applicants described themselves as more 
in control of both attentional and interper- 
sonal processes, more extroverted, less intel- 
lectually expressive, less expressive of nega- 
tive affect, and more expressive of positive 
affect. 

The preceding findings seem to be logically 
and internally consistent if it is concluded 
that, as a function of interests and job de- 
mand, characteristic applicants for police 
training should exhibit greater control than 
a group of college males. A potential con- 
found exists, however, since police applicants 
believed the test data would have some influ- 
ence on their selection for training. In con- 
trast, college students had no reason to be- 
lieve that their lives would be affected by 
their test scores. This difference suggests that 
it is appropriate to speculate about the ex- 
tent to which police applicants may have 
faked test scores to appear more competent 
and effective. Although this question has not 
been assessed directly, some indirect evidence 
suggests that faking does not account for the 
observed group differences. 

A large body of experimental literature ex- 
amines relationships between scores on vari- 
ous personality measures and an individual’s 
tendency to give socially desirable responses 
(Edwards, 1957). The Minnesota Multiphasic 
Personality Inventory (MMPI) K scale has 
been found to correlate between .69 and .81 
with various measures of social desirability, 
and high K scores are taken to mean that 
respondents are faking well on the test (Dahl- 
strom & Welsh, 1963). For the applicant sam- 
ple the mean correlation between MMPI K 
and the 17 TAIS scales was .20, suggesting 
that faking was minimal. 

To be useful as a predictor of performance, 
to select individuals for a particular job, or 
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to select treatment approaches, the TAIS 
scale scores must be able to differentiate reli- 
ably between individuals and groups of indi- 
viduals. Data just presented suggest that 
groups who are identifiably different on the 
basis of sex or some occupational and/or atti- 
tudinal variable(s) do respond differently to 
TAIS items. The question of task relevance of 
these differences depends on both the con- 
struct and predictive validity of the instru- 
ment, 


Construct VALIDITY 


Construct validity has been examined by 
correlating TAIS scale scores with the same 
individuals’ scores on other psychological in- 
struments. Nideffer and Wiens (Note 3) cor- 
related the TAIS scores for a group of 60 
police applicants with scores on the MMPI, 
California F Scale, Rotter Internal—External 
(I-E) Scale, Taylor Manifest Anxiety Scale 
(TMAS), Wechsler Adult Intelligence Scale 
(WAIS), and the Maudsley Personality In- 
ventory (MPI). McPherson and Nideffer 
(Note 4) compared scores of 28 female under- 
graduates on the TAIS with their scores on 
the Personal Orientation Inventory (POT; 
Shostrom, 1966) and their scores on the 
State-Trait Anxiety Index (Spielberger, Gor 
such, & Lushene, 1970). Finally, Wolfe an 
Nideffer (Note 1) correlated the TAIS scores 
of 83 introductory psychology students with 
scores on the State-Trait Anxiety Index 
(STAT). 

Looking first at the TAIS attenti 
scales, Table 6 indicates that the pr’ 
relationship between attentional proce 
level of arousal is reflected in the TAIS at- 
tentional subscales. We know, for example, 
that as arousal increases, attention narrows 
and effective cue utilization decreases (Easter- 
brook, 1959). Thus, anxious individuals de- 
scribe themselves on the TAIS as making 
errors of underinclusion. Anxious individuals 
also tend to direct their attention internally 
and to become overloaded by their own 
thoughts and feelings. This tendency is re- 
flected in anxious subjects scoring high on 
the internal overload scale (OIT) and on 
the obsessive subscale (OBS), Relationships 
between scores on the TAIS attentional scales 


” 
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TABLE 6 
RELATIONSHIP BETWEEN THE TAIS AND 
MEASURES OF ANXIETY 


College tory 
students Police paycha 
(Pretest) applicants students 

Scale State Trait TMAS State Trait 

BET 09 —.17 = 38 3 2 

OET. < 005: 48" aim 3 

BIT — A — 60% AS 

OIT 03 43 58 

NAR 20 =—40* 92 

RED 38" 53% 354 

INFP g egi =l4 

BCON 22 63" hd 

CON =09 ~.07 -01 

SES =4 57% =.35% 

P/O —.08 04 =.05 

OBS 22 Alt 288 

EXT -30 —.21 =.16 

INT 29 24 38 

TEX =l = .26 07 

NAE 24 37% 1284 

PAE -32 —,30 Sted bd 


Note. n = 28, 60, and 83 for college students, police appli- 


cants, and intreductory psychology students, respectively, 
(Abbreviations st of Attentional and Interpersonal 
Style; TMAS = | nifest Anxiety Scale.) 
*p <05. 
ED < 0L 


and measures of anxiety in these three stud- 
ies provide some construct validation for the 
TAIS attentional scales. 

Table 7 presents the relationships found 
between the TAIS, the California F Scale, 
Rotter I-E Scale, MMPI, and the MPI for 
the 60 police applicants in Nideffer and 
Wiens’s (Note 3) study. This data, along with 
the correlations between- the TAIS and the 
POI (McPherson & Nideffer, Note 4), pro- 
vide some construct validation for several 
TAIS interpersonal scales. In particular, 
TAIS self-esteem correlates .69 with POI self- 
regard, TAIS positive affect expression with 
POI spontaneity (.42), TAIS introversion 
with neuroticism on the MPI (.36) and with 
spontaneity on the POI (—.61), TAIS be- 
havior control with K on the MMPI (—.32), 
and extroversion on the TAIS with extrover- 
sion on the MPT (.64) and with social intro- 
version on the MMPI (—.46). 

In discussing construct validity of the 
TAIS it is important to consider the content 
Similarity across the ‘various instruments. 


. Dealing first with the TATS attentional scales, 


there is very little overlap across tests. The 
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measures of anxiety deal primarily with bod- 
ily responses (e.g., “I feel tense,” “I have 
frequent headaches”), whereas the TAIS at- 
tentional items reflect cognitive and percep- 
tual experiences (e.g., “When people talk to 
me, I find myself distracted by my own 
thoughts and ideas”). 

There is some content similarity between 
several of the TATS interpersonal scales, the 
MMPI K and S/ scales, the Rotter I-E Scale, 
and the MPI neuroticism and extroversion 
scales, The TAIS scale correlations that may 
be artificially inflated due to the content simi- 
larity are interpersonal control, self-esteem, 
physical orientation, extroversion, and ex- 
pression of positive and negative affect, 

Overlap between other tests and the TAIS 
interpersonal scales is limited to one or two 
items. In addition, each test listed above 
overlaps on several TAIS scales because items 
cover a much broader range of behaviors, For 
example, the SZ scale on the MMPI is com- 
posed of items indicating the individual is 
self-critical, has low self-esteem, shrinks from 
responsibility or a crisis, is sensitive to criti- 
cism, and withdraws, 

In conclusion, the attentional scales on 
the TAIS appear to measure an aspect of the 
individual that is correlated with other mea- 
sures of personality and arousal but that is 
conceptually distinct. In contrast, there are 
some content similarities between TAIS inter- 
personal scales and other conceptually re- 
lated personality measures. Such overlap is 
not unwanted if it is assumed that the other 
instruments reliably measure the same con- 
structs. 


PREDICTIVE VALIDITY 


Two studies have related TAIS scores to 
actual behavior measures, Nideffer (in press) 
correlated a coach’s performance ratings of 
10 male swimmers (all performing regularly 
in intercollegiate competition) on an 11-item 
scale with the swimmers’ TAIS attentional 
scale scores. Swimmers scoring high on the 
TAIS scale measuring the tendency to make 
errors of underinclusion were rated by the 
coach (a) as choking under pressure (r = 
-75), (b) as falling apart if they make early 
performance errors (r = .59), (c) as having 
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to work hard for everything they obtain (r 
= .66), (d) as becoming worried about one 
particular thing and being unable to think of 
anything else (r = .80). In addition, swim- 
mers the coach described as being “very in- 
consistent ranging from looking very good at 
times to being very poor” tended to become 
overloaded with external (r = .60) and in- 
ternal stimuli (7 = .67), to be unable to inte- 
grate thoughts and ideas (r = .63), and to be 
unable to effectively narrow attention (r = 
-63). 

McPherson and Nideffer (Note 4) did a 
second study examining the relationships be- 
tween TAIS scores and performance mea- 
sures at a small, women’s liberal arts college. 
In this study the instructor, without knowl- 
edge of TAIS scores, rated 28 students’ be- 
havior (on a 20-item scale) in a vocational 
counseling course. These ratings were corre- 
lated with students’ TAIS scores. 

A pattern of significant rs between the in- 
structor’s ratings and TAIS scores was found. 
These, rs clustered into two major groups. 
Students judged to contribute to class discus- 
sion, assume leadership, express personal opin- 
ions and not back down, and to seek the in- 
structor out after class to continue discussions 
described themselves on the TAIS as (a) 
able to integrate ideas from different areas 
effectively, (b) able to avoid errors of under- 
inclusion, (c) processing large amounts of 
information, (d) in control of interpersonal 
situations, (e) having high self-esteem, (f) 
being extroverted, and (g) expressive intel- 
lectually and affectively. 

In contrast to the above, students identi- 
fied by the instructor as easily intimidated, 
prone to withdrawal, unable to deal with more 
than one topic at a time, and unable to be- 
have in the ways prescribed by the course 
described themselves on the TAIS as (a) 
having low self-esteem, (b) being out of con- 
trol of both their attentional processes and 
interpersonal situations, (c) processing little 
information, and (d) being introverted. 


DISCUSSION AND SUMMARY 


If assessment is to be effective and clini- 
cally useful, its procedures must change. 
Clinical assessment makes best sense to the 


extent that it serves the following functions: 
(a) shortening treatment duration through 
rapid accurate identification of salient areas, 
(b) providing clinicians with a conceptual 
framework that enables them to select treat- 
ment procedures and facilitates understand- 
ing of the client’s behavior, and (c) improves 
communication both ways, between clinicians 
and patients. The present study summarizes a 
network of research studies constructing and 
validating a test measure designed to serve 
these functions. 

The TAIS consists of 144 items selected 
rationally and grouped into 17 attentional and 
interpersonal subscales. Each of the 17 sub- 
scales was considered on the basis of theo- 
retical formulations and/or clinical observa- 
tions to be potentially useful in predicting 
behavior across a variety of situations, Data 
collected thus far suggest several conclusions 
about the TAIS psychometric properties. 


1, Test-retest reliability for the subscales 
ranges from .60 to .93, with a median of .83. 

2. TAIS items correlate highly (median 
.53) within scales and moderately across 
scales, suggesting that the scales are internally 
consistent. 

3. Group difference studies indicate that 
the TAIS will differentiate between subjects 
differing on some major variable, such as sex. 

4. Correlations between TAIS subscales and 
other psychological measures offer some con- 
struct validation for the following attentional 
subscales: becoming overloaded with internal 
stimuli, making errors of underinclusion, and 
having an effective broad internal focus. Some 
construct validity has also been found for 
the following TAIS interpersonal scales; in- 
terpersonal control, self-esteem, extroversion, 
intellectual expressiveness, and the expression 
of both positive and negative affect. 

5. Actual behavioral correlates of TAIS 
scores were found in two studies. In both 
instances poor attentional control and the 
tendency to make errors of underinclusion 
were found to be associated with performance 


deficiencies. 


A major potential advantage of the TAIS 
when compared with other psychological mea- 
sures is that its test profile describes specific 
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attentional and interpersonal characteristics. 
These characteristics are operationally de- 
finable and they permit prediction of per- 
formance across a variety of life situations. 
Although the present series of studies shows 
that the TAIS has reasonable test-retest reli- 
ability and some construct and predictive 
validity, the measure has not yet been used 
to predict performance on an a priori basis 
or to evaluate therapy approaches. Future 
research should attempt to assess the TAIS’s 
predictive utility for anticipating performance 
and for selecting intervention techniques. To 
date, informal use of the TAIS in this way 
has been promising. 
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The hypothesis was tested and confirmed that attributions of causality mediate 
influence patterns in verbal and vocal channels. Communication assertiveness 
was independently measured for verbal content and vocal intonation (filtered 
speech) during both free interaction and a structured counselor supervision 
task. Participants were 44 adult students in a counselor training program. In 
the free-interaction situation, subjects who expressed internal attributions 
demonstrated significantly greater assertion in vocal intonation than in verbal 
content; subjects who made high external attributions demonstrated signifi- 
cantly greater assertion in content than in intonation. No significant differences 
were found in the counselor supervision situation. Verbal content, which pro- 
vides high internal feedback to the speaker, was interpreted as reflecting the 
degree of assertiveness intended or thought appropriate by the participants, 


In contrast, vocal intonation provides low internal feedback to the 


aker and 


was viewed as “leaking” expectations of one’s own influence capabilities, 


The purpose of this investigation was to de- 
termine the extent to which an individual’s 
attributions of causality are related to his 
interpersonal influence patterns in both verbal 
and vocal channels. We were concerned here 
with differences in communication style be- 
tween individuals who believe that they have 
a high degree of control over their environ- 
ment and those who believe that events are 
determined by such factors as fate, luck, or 
powerful others. In the locus of control litera- 
ture, the former group are typically referred 
to as “internals” and the latter as “externals” 
(Phares, 1973; Rotter, 1966). 

Kelley (1973) has noted that the ultimate 
goal of the attributional process is to assist 
the attributor in the effective management of 
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himself and his environment. There is now con- 
siderable evidence that attributions do indeed 
influence an individual's approach to the world 
and that changing an individual’s attributions 
may enhance his (or her) coping behavior and 
self-mastery (e.g, Davison, Tsujimoto, & 
Glaros, 1973; Dweck, 1975; Ross, Rodin, & 
Zimbardo, 1969; Valins & Nisbett, 1971; 
Weiner & Sierad, 1975). Although numerous 
studies have focused on interactions between 
locus of control and attempts at environmen- 
tal influence (e.g., Gore & Rotter, 1963; Lev- 
enson, 1974; Silvern & Nakamura, 1971), re- 
lationships among causal attributions and 
modes of influence attempts (e.g, communica- 
tion style) have received scant empirical at- 
tention, The evidence that we do have sug- 
gests that individuals who perceive little per- 
sonal control over their outcomes are more 
likely to demonstrate very strong displays of 
assertion than are those who perceive high 
personal causation, Raven and Krugslanski 
(1970) have noted the use of coercive power 
by those in a position of relative powerlessness 
and suggest that use of coercive techniques 
may serve to increase the individual’s self- 
esteem. Kipnis and his associates (Goodstadt 
& Kipnis, 1970; Kipnis & Lane, 1962) found 
that individuals who lacked confidence in their 
ability to influence others were more likely to 
employ coercive control tactics than were 


405 


f 


those who expressed self-confidence. Clearly 
related to the present study is the finding by 
Goodstadt and Hjelle (1973) that when 
placed in a supervisory position, individuals 
classified as externals on Rotter’s (1966) di- 
mension employed more power-assertive con- 
trol mechanisms than did internal individuals. 

The present study focuses on vocal (voice 
intonation) as well as verbal influence as a 
function of attribution. Research by Rosen- 
thal and others (e.g., Chaiken, Sigler & 
Derlega, 1974; Duncan & Rosenthal, 1968) 
on experimenter and teacher biases has dem- 
onstrated the critical importance of nonverbal 
communication and paralinguistic cues in in- 
terpersonal influence attempts. Two related 
notions emerging from the nonverbal com- 
munication literature provide additional bases 
for the present investigation. The first is the 
finding that nonverbal behaviors such as vocal 
intonation (Holzman, 1966) and body move- 
ment (Ekman & Friesen, 1969; 1973) supply 
relatively little internal feedback to the 
speaker and are thus not readily self-moni- 
tored. The second is the hypothesis that a 
speaker discloses or “leaks” his feelings and 
predictions through those communication 
channels—the nonverbal ones—of which he is 
least aware. In the present study we limited 
our analysis to verbal content (high aware- 
ness) and vocal intonation (low awareness). 
Vocal intonation was of particular interest be- 
cause it combines low awareness for the 
speaker with high salience for the listener 
Mehrabian & Wiener, 1967; Bugental, 
1974). 

We predicted an interaction between causal 
schemata, or more specifically internal—ex- 
ternal locus of control, and communication 
style. It was anticipated that an individual 
who holds an external orientation, perceiving 
events as relatively difficult t^ control through 
his own actions, will be fa‘); direct and power 
conscious in verbal com: © ication; more spe- 
cifically, the content of | ‘bal statements, 
a component that he ca monitor quite 
readily, will be assertive. ill, however, 


“leak” his low expectation i's own influ- 
ence capability through tive vocal 
behavior. The opposite ; ‘ern was pre- 
dicted for the individuaı who perceives 


strong. personal causation: He will show 
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low verbal assertion and high vocal assertion, 
“leaking” his high expectations of influence 
effectiveness through the low awareness chan- 
nel. In other words, it was predicted that ex- 
ternal locus of control would be dfirectly re- 
lated to verbal assertiveness and inversely 
related to vocal assertiveness and that the 
reverse pattern would hold for internal locus 
of control. This prediction is consistent with— 
and tests the external validity of—recent re- 
search findings of Bugental and Love (1975) 
that high verbal assertiveness and low vocal 
assertiveness are characteristic of ineffective 
parents, that is, parents who have little in- 
fluence over children who engage in discordant 
behavior. 

As an additional check on the assumption 
that deliberate attempts to influence are 
accompanied by assertiveness in the high- 
awareness (verbal) channel, communication 
patterns were analyzed in two different situa- 
tional contexts, one an unstructured interac- 
tion sequence and the other a structured exer- 
cise where the explicit task was one of 
interpersonal influence, It was anticipated that 
both “internals” and “externals” would be 
verbally assertive in the latter situation, given 
the externally imposed demands to influence 
another person, Individual differences between 
these two groups were anticipated within both 
contexts, but it was expected that they would 
be most apparent during unstructured inter- 
action. In related research (Love & Kaswan, 
1974), stronger individual differences were 
found for parental assertion in free interaction 
than in a structured family discussion. The 
structured task used in the present study was 
a three-person counselor training exercise in 
which the subject, serving as supervisor, pro- 
vided evaluative feedback to a counselor- 
trainee following a simulated therapeutic dia- 
logue between counselor and client, 


METHOD 
Subjects 


The University of California, Los Angeles, Exten- 
sion Counseling Program places trainees into classes 
or groups according to their time of enrollment, Two 
groups participated here, each within 1 month of be- 
ginning the program. There were 33 females and 1 
male in Group 1, which had a mean age of 42 years, 
Group 2, which was run 2 months after Group 1, 
had 23 females and 12 males with a mean age of 39 
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years. Some of the original subjects had to be dropped 
because of poor sound quality on their tapes or be- 
cause of incomplete dependent measures. The final 
number of subjects retained in the study was 19 in 
Group 1 and 25 in Group 2. 


Attribution Measure 


The Levenson (1974) measure of internal-external 
control was administered to all subjects, This instru- 
ment includes scales measuring three dimensions: 
internal control, external control by powerful others, 
and external control by.luck or fate. 


Procedure 


Near the beginning of the 1-year counseling pro- 
gram, the students participated in a set of training 
exercises, Grouped into triads, trainees alternated 
with each other in assuming the role of the counselor, 
the client, and the supervisor. During the exercise 
each trainee assumed each role once, The supervisor 
portions of the triad interaction were the primary 
focus of this study, along with the spontaneous group 
interactions that occurred prior to the training exer- 
cise. The spontaneous interaction period lasted for 
approximately 3 minutes, and the supervisor portion 
of the exercise lasted for 5 minutes. 

The triads met separately in small rooms, each of 
which contained a video camera and an audio cas- 
sette tape recorder. All participants knew that they 
were being recorded. Since several triads met simul- 
taneously, continuous video recordings of all triads 
were not feasible; the video was used mainly to 
monitor the exercise and record samples of each triad 
from a master control center. All data analyses were 
based on the audiotape recordings. 


Selection of Messages 


The number of messages obtained for each subject 
was two during the free-interaction situation and six 
while the subject was the supervisor. These numbers 
were determined by the number of complete messages 
available for the majority of subjects. Using natural 
message units, we defined a message termination as 
the point at which there was a change of speakers 
or a marked change in meaning or affect. For 87% 
of the messages, termination occurred as a result of 
a shifting of speakers; 13% (23 messages) were 
terminated when two of the experimenters agreed 
that there was an affect or meaning shift. 

The tapes for the 44 participants were randomly 
divided into two sets: Set A, comprising 22 subjects, 
and Set B, comprising the remainder. Since each 
subject had 8 messages (2 free responses and 6 
supervisor responses), the 22 subjects in each set 
yielded a total of 176 messages that were to be rated. 
When a subject prodyced more than 8 messages, the 
2 free responses and 6 supervisor responses were ran- 
domly selected. The next step was to arrange these 
176 messages into 8 subsets. This was done by ran- 
domly placing 1 of each subject’s 8 messages into 
one of the 8 subsets, Therefore, each subset con- 
tained 1 message from each of the 22 subjects. Con- 
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sidering both Set A and Set B, the total number of 
subsets was 16, For each of the 16 subsets, Av... e 
+B. » a master tape for the voice judgments 
was prepared, with order again randomized, as well 
as an exact typescript of the 22 messages, 

The original tapes were edited through a band-pass 
filter (Krohn-Hite Model 310-C) to produce a con- 
tent-free voice message. Frequencies below 200 cycles 
per second (cps) were filtered out to eliminate back- 
ground noise. The upper frequency setting was vari- 
able (500-1,000 cps) and was adjusted to suit the 
individual voice, with the filter setting brought down 
as low as necessary to mask the content effectively, 


Judgments of Messages 


Judges were 16 groups of undergraduates, with two 
males and two females in each group, Each judge 
rated two subsets of messages, one from Set A and 
one from Set B, One subset was rated for assertive- 
ness of the verbal content (messages in typescript 
form), and the second subset was rated for assertive- 
ness of vocal intonation (content-filtered taped mes- 
sages). Thus, every judge rated every subject, with 
half of the judgments made for content and the other 
half made for intonation. Order was counterbalanced 
so that half of the judges rated content first and the 
other half rated intonation first. Under this plan, 
each tape and each transcript were rated by four 
judges. 

The training procedures used for the judges were 
similar to those used in previous research (Bugental 
& Love, 1975) in which we obtained interjudge re- 
liabilities of .82 for voice assertiveness and .80 for 
content assertiveness, One group (n=4) of judges 
employed in the present study was randomly selected 
for a reliability check. Following Winer (1962), 
agreement was estimated by means of the formula: 
1— (MS within messages/MS between messages). A 
coefficient of .73 was obtained for verbal content, and 
a coefficient of 68 was obtained for vocal intonation. 
During training, judges received sample messages of 
highly assertive versus submissive verbal content and 
were given similar contrasts for vocal intonation, 
Ratings for both channels, verbal and vocal, were 
made on a 7-point scale ranging from +3 (strong 
and assertive) to ~3 (weak and submissive.) 


Length of Messages 


Mean message time was 3.89 sec (SD = 2.17) for 
free responses and 15.02 sec (SD = 6.37) for super- 
visor responses. Message length was not significantly 
correlated with any of the attributional measures 
(for free response, rs=.12, —.10, and —.25 with 
external luck, external powerful others, and inter- 
nality; for supervisor response, rs = ,09, .19, and —.07 
with the three scales). Message length showed no 
consistent correlations with ratings of assertiveness, 
Correlations of .18 and .18 were obtained between 
message length and content assertiveness of free re- 
sponses and supervisor responses, respectively. Corre- 
lations of —.10 and .37 (p < .05) were obtained be- 
tween message length and vocal assertiveness of free 
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RESULTS 


An initial analysis was done to test for 
global sex differences within Group 2 (which 
contained both males and females), A mixed- 
design analysis of variance was used with two 
between-subjects comparisons (sex, attribu- 
tion) and two within-subjects comparisons 
(channel, situation). No significant main ef- 
fects or interactions were found as a function 
of sex. Correspondingly, data from males and 
females were pooled in the main analysis. 

The central hypothesis was tested by a 
mixed-design analysis of variance that in- 
cluded two between-subjects variables (group, 
attribution) and two within-subjects vari- 
ables, (channel, situation). Subjects were ini- 
tially divided into three attribution groups 
using Levenson’s norms: (a) subjects whose 
standard scores on the attribution measure 
were highest on the internal scale, (b) sub- 
jects whose scores were highest on the external 
luck scale, and (c) subjects whose scores were 
highest on the external powerful others scale. 
Since only two subjects emerged in the third 
category, subsequent analyses were limited to 
the first two attributional categories (the 
eliminated powerful others scale was highly 
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TABLE 2 


SUMMARY OF MEANS AND STANDARD DEVIATIONS 
OF ASSERTIVENESS RATINGS 


Group 1 Group 2 
Variable M SD M SD 
Free interaction 
Internals 
Content —.02 96 —.32 81 
Voice 89.87 AZ 66 
Externals 
Content 45 84 2I RON 
Voice SL 54 15.) T) 
Supervisor task 
Internals 
Content -64 49 33 53 
Voice 24), 217 EA A 
Externals 
Content -53 18 47 AB 
Voice 36 © 70 Py Sis 


correlated with the external luck scale, r= 
-62). The two subjects were regrouped on the 
basis of their scores on the other two scales, 
Mean raw scores for the external group (7 = 
17) were 20.12 (SD = 6.71) on external luck 
and 33.75 (SD = 4.74) on internality, The 
corresponding scores for the internal group 
(n= 27) were 9.81 (SD = 5.66) and 39.48 
(SD = 3.67). Within each communication 
context, the mean assertiveness rating given 
each channel was the unit of analysis. 

A summary of the analysis of variance is 
given in Table 1. The central prediction made 
here, as tested by the Channel x Attribution 
interaction, was confirmed: Assertiveness var- 
ied for channels as a function of the speaker’s 
attributions. Subjects expressing internal at- 
tributions were more assertive in vocal intona- 
tion than verbal content, whereas those ex- 
pressing external attributions showed the 
opposite pattern. The significant three-way 
interaction (Channel x Situation x Attribu- 
tion) indicates that this effect was significant 
only during free interaction and not in the 
supervisory context, as can be seen in Table 
2. Means and standard deviations for all cells 
are provided in Table 2. 

The significant Channel x Situation inter- 
action showed that all speakers used more 
assertive content (but not voice) in the super- 
visor role than in the free-interaction setting. 
This finding supports the assumption that 
deliberate attempts to influence are accom- 
panied by assertion in the high-awareness 
(verbal) channel, 
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An unexpected finding was a significant 
main effect for groups, reflecting stronger 
total assertiveness in Group 1 than in Group 
2, It will be recalled that more than one third 
of the subjects in Group 2 were males, 
whereas there was only one male in Group 1. 
A possible explanation for the significant 
group effect might be that females are more 
assertive in same-sex than in mixed-sex 
groups. However, comparison of mixed-sex 
versus same-sex triads failed to reveal signifi- 
cant differences in assertiveness, F(1,12) = 
.13; thus the group difference remains unex- 
plained. 


5 DISCUSSION 


A strong relationship was found between 
causal attributions and assertiveness in inter- 
personal interaction, and the pattern differed 
for high-awareness (verbal content) and low- 
awareness (vocal intonation) communication 
channels. More specifically, vocal intonation 
was relatively assertive in individuals who 
presented themselves as capable of controlling 
their outcomes (internals) and relatively non- 
assertive in those who viewed events as be- 
yond their personal control (externals). In 
contrast, the verbal content of internal at- 
tributors was relatively nonassertive and that 
of external attributors relatively assertive. 
These findings were interpreted as indicating 
that vocal intonation, which is not easily self- 
monitored, mirrors an individual’s personal 
beliefs about his effectiveness as a source of 
influence, whereas verbal content, more salient 
to the speaker, reflects his beliefs about how 
much assertion is required for effectiveness. 
That is, voice quality mirrors relatively en- 
during self-perceptions, while verbal content 
indexes the individual’s conception of what is 
needed in the current situation. 

Viewing locus of control along a dimension 
of felt power, Seeman (1963) suggested that 
externals are the “psychological powerless.” 
Given this notion, the results of the present 
study dovetail quite nicely with those of 
Goodstadt and Hjelle (1973). In the latter 
study, the external attributors used more 
coercive power tactics than did internals; in 
the present study they used stronger, more 
_ assertive verbal statements than the internal 
attributors.: Another parallel can be seen in 
the recent history of social activism. Good- 
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stadt and Hjelle pointed out that peace pro- 
testors who were initially calm and orderly 
used increasingly coercive tactics when con- 
fronted with government resistance, As their 
expectancies for successful influence de- 
creased, their assertiveness increased. This 
analysis underscores the significance of recent 
findings that externality in college students 
has increased over the past decade (Rotter, 
1975). 

To summarize, the implication of the pres- 
ent findings is that external attributors, view- 
ing themselves as relatively powerless or in- 
effective, elect to use more direct influence 
tactics than internals, Their less assertive 
vocal intonation, however, “leaks” their lower 
expectations of interpersonal effectiveness in 
much the same way as the stronger vocal in- 
tonations of the internals reveal an air of 
assuredness, The following two messages, 
drawn from spontaneous interaction sequences 
by trainees who were discussing their upcom- 
ing role-playing session, illustrate these in- 
ternal and external patterns. The external 
attributor’s message was as follows; “Yeah, 
because there’s not time to go through the full 
‘bit. This is the only other taping. So you get 
to be the client next time.” This message 
yielded a voice assertiveness score of —.25 
and a content assertiveness score of 1.25, The 
internal attributor’s message was the follow- 
ing: “I don’t know. And they don’t know 
either, We all don’t know.” This message 
scored 1,75 on voice assertiveness and —,25 
on content assertiveness. The external at- 
tributor played the “expert” with assertive, 
knowledgeable statements, attenuated by non- 
assertive voice quality, In contrast, the in- 
ternal attributor openly expressed uncertainty, 
but his voice conveyed an underlying air of 
confidence. 

In previous research by Bugental and her 
associates, parents of disturbed children were 


1Jt should be noted that neither of these experi- 
mental paradigms involved extreme situations. People 
who repeatedly experience extreme powerlessness 
might be expected to leave the field rather than to 
increase the assertiveness of their coping efforts, 
Moreover, in a situation where assertion is clearly 
required, such as when a mother is attempting from 
a distance to prevent her toddler from running into 
the street, we would expect strong assertion in all 
communication channels. 
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observed to use weak voices when attempting 
to control their children, which detracted from 
their assertive verbal content (Bugental & 
Love, 1975); parents who demonstrated this 
pattern were also more likely than others to 
be ignored by their children (Bugental, Note 
1). A similar pattern was observed by Ban- 
dura and Walters (1959) among the mothers 
of aggressive children; that is, these parents 
employed an inconsistent influence strategy 
and were ineffective in their obedience de- 
mands, The present findings suggest the value 
of directly assessing the relationship between 
vocal “leakage” and effectiveness in adult- 
adult interactions. 

The observed Channel X Attribution inter- 
action was limited to spontaneous social in- 
teraction, Although these trainees knew that 
they were being recorded throughout the ses- 
sion, it was apparent through the video moni- 
toring that they had understandably assumed 
that the training exercise, and not their pre- 
ceding conversations, was the point of interest. 
Apparently, the demand characteristics of the 
structured supervisor task were strong enough 
to mask out individual differences. The exer- 
cise was a novel and very probably a difficult 
experience. As part of their counselor training 
program, subjects were being asked to observe 
and evaluate their peers. They also knew that 
their peers would serve a similar evaluative 
function for them, Many subjects openly ex- 
pressed uncertainty and anxiety about the 
exercise, and it is likely that most of them 
were at least moderately apprehensive. Given 
these considerations, the high degree of verbal 
assertiveness and the similarity of the inter- 
nals and externals in the supervisory role pro- 
vides some validation for the initial assump- 
tion that people deliberately select assertive 
strategies when they view outcomes as diffi- 
cult to control. 

The similarity of the two groups in the 
supervisory role highlights another feature of 
the internal’s communication style, that of 
situational or role discrimination. The inter- 
nals made didactic, assertive statements when 
the situation required that they do so but did 
not show this pattern during spontaneous in- 
teractions with peers. Their speech styles in- 
dicated a high degree of differentiation be- 
tween roles. In contrast, the externals did not 
modify their general speaking style very much 
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across the two situations. That is, they com- 
municated in the social situation in much the 
same way as they (and the internals) did 
when acting as supervisors. Perhaps ordinary 
interactions have, for externals, psychological 
similarity to situations in which they are 
“supposed to” take a directive role, or perhaps 
their repertoire, in an influence situation, con- 
sists of techniques that are manifestly asser- 
tive. An alternative, but related, explanation 
derives from the assumption that the super- 
visory exercise was an anxiety-inducing experi- 
ence for most participants. Since the free in- 
teraction immediately preceded the structured 
exercise, the external attributors’ similar com- 
munication styles across the two situations 
may reflect “preparatory” anxiety. This spec- 
ulation is supported by frequent reports that 
externals are more anxious and less effective 
in social situations than internals (e.g., 
Phares, 1973). 

Paralanguage has emerged as an increas- 
ingly useful behavioral measure in studying 
the individual’s expression of affect and atti- 
tudes, Because of the low speaker awareness 
of paralinguistic components of communica- 
tion, this channel is of particular importance 
in understanding unintended communication 
of attitudes; that is, measures of paralanguage 
are relatively nonreactive. Duncan and Rosen- 
thal’s research (1968) has demonstrated that 
paralanguage acts as a mediator in experi- 
menter biasing effects, with voice stress in- 
fluencing responses of experimental subjects. 
Milmoe, Rosenthal, Blane, and Chafetz 
(1967) found that the affective quality of 
physicians’ voices was a good postdictor of 
success in referring alcoholics for further 
treatment. Weitz (1972) found that expressed 
friendly attitudes of white subjects to blacks 
was negatively related to voice intonation and 
direct behavioral friendliness to blacks; voice 
and behavior, on the other hand, were posi- 
tively related. Judgments of voice quality in 
the present investigation were on a very global 
level; it will be of interest in future research 
to determine more precisely the specific com- 
ponents of voice quality that relate to the 
speakers’ causal attributions, 

Our results add to the emerging body of 
evidence that causal attributions act to medi- 
ate interpersonal as well as achievement be- 
haviors. The present findings also indicate 
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¿ that attributional effects are more complex 


than is usually demonstrated, The relation 
between attributions and influence patterns 
was found in loosely structured interaction 
but not in a highly structured context, More- 
over, attribution influence differed for com- 


_ ‘munication ‘channels that are high and low in 


reactivity. These data support the emerging 
trend in social psychology of using nonreactive 
behavioral measures and of testing hypotheses 
in more naturalistic settings. 
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Some Effects of Congruency of Member Characteristics and 
Group Structure upon Group Behavior 


Marvin E. Shaw and Blaze Harkey 
University of Florida 


An experiment was conducted to test the hypothesis that group effectiveness 
is a function of the degree to which enacted behaviors correspond to behaviors 
expected on the basis of the role associated with a group position. Individuals 
whose typical behavioral tendencies (as revealed by their scores on the Indi- 
vidual Prominence scale) were either congruent or not congruent with the 
leader role were assigned to the leadership position in problem-solving groups. 
As hypothesized, congruent groups performed more effectively than noncon- 
gruent groups. This effect was interpreted as resulting from difficulty of co- 
ordination of member actions and the consequent interpersonal tension in 
noncongruent groups. In addition, some differences between male and female 


groups were observed and discussed. 


In attempting to understand group behay- 
ior, theorists and researchers alike have re- 
sorted to classifications of variables known 
or predicted to influence group behavior. Al- 
though there is some disagreement about the 
appropriate classification, almost all writers 
have agreed that group behavior is influenced 
by member characteristics and group struc- 
ture. For instance, Cattell (1948) identified 
three “panels” or aspects of the group that 
must be considered in the analysis of group 
behavior: syntality traits (behavior of the 
group as a group), characteristics of internal 
structure, and population traits (definitions 
of the personality of the average group mem- 
ber). Carter (1954) listed the kinds of indi- 
viduals who compose the group, group struc- 
ture, and the group’s task as the significant 
determinants of group behavior. Member char- 
acteristics and group structure are also given 
an important place in the theories proposed 
by Schutz (1958), Thibaut and Kelley 
(1959), and others. 

Numerous studies have been conducted to 
determine the consequences of group struc- 
ture (see Collins & Raven, 1969) and of 
member characteristics (see Mann, 1959) for 
group behavior. However, there has been rela- 
tively little research examining the interac- 
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tion of group structure and member charac- 
teristics with respect to their joint effects on 
group behavior. This is surprising, because 
theoretically, the position that a group mem- 
ber occupies in the group (and consequently 
the role that he or she is expected to play) 
should determine the characteristics required 
for most effective participation in the group. 
Every group member cannot play a given role 
equally well, nor can a given person play all 
roles equally well. That is, individuals vary 
as to the number of different roles in their 
repertory and also as to the adequacy and 
completeness with which roles are enacted 
(cf. Sarbin, 1943). It follows that a group 
member who occupies a position requiring the 
enactment of a role that is congruent with his 
or her behavioral tendencies can perform the 
role requirements more adequately than if the 
role demands were incongruent with his or her 
behavioral tendencies. 

There is another aspect of group structure 
that relates to member characteristics—role 
congruency as a determinant of group effec- 
tiveness. Effective group action requires co- 
ordination of action among group members, 
and coordination is facilitated by the expected 
behaviors associated with positions in the 
group; that is, by the roles of group mem- 
bers. The position that a person occupies in 
the group specifies the role that he or she is 
expected to play. These expectations are 
usually known to group members and provide 
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anticipatory knowledge about the behavior of 
the occupant of the position (cf. Sampson, 
1963). This anticipatory knowledge permits 
group members to adjust their own behavior 
to coordinate with the expected behavior of 
others in the group. Thus, when all group 
members behave in ways that are congruent 
with expectations, coordination is facilitated 
and group performance should also be facili- 
tated. When incongruencies exist, coordina- 
tion may be difficult, resulting in interpersonal 
tension and reduced productivity. 

The leadership position provides a good 
example of role expectations. When a person 
occupies a position of leadership in the group, 
that person is expected to play the leader 
role; that is, he dr she is expected to behave 
in ways that make him or her stand out in 
the group (attempt to get things organized, 
volunteer ideas, initiate discussion, in short, 
take command). To the extent that the occu- 
pant of the leadership position behaves as 
expected, coordination is facilitated, and the 
group should perform efficiently; when leader 
behaviors are not enacted by the occupant, 
coordination is difficult and group effective- 
ness is decreased. 

The purpose of the present experiment was 
to test these theoretical expectations concern- 
ing the consequences of behavior—role congru- 
ency upon group effectiveness. Specifically, in 
some groups persons whose typical behaviors 
were congruent with the leader role were 
assigned to the leadership position, whereas 
in other groups persons whose typical be- 
havior was not congruent with the leader role 
were assigned to the leadership position. It 
was hypothesized that congruent groups would 
be more effective than noncongruent groups. 
In addition, both male and female groups 
were tested to determine whether observed 
effects varied with group composition. Al- 
though no hypothesis was formulated, the tra- 
ditional cultural roles of men and women 
might be expected to have some effect upon 
the differential effects of congruency. 


METHOD 
Experimental Design 


The experimental design was a 2X2 factorial, 
involving two congruency conditions (congruent vs. 
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noncongruent) and two sex compositions (all male 
vs. all female groups). In addition, one person was 
assigned the leadership position in each group; some 
analyses also included the position variable (leader 
vs. follower), 

The congruency variable was manipulated by 
assigning the group member most likely to enact 
behaviors expected by leaders to the leadership posi- 
tion (congruent condition) or to a follower position 
(noncongruent condition). These persons were iden- 
tified on the basis of their scores on the Individual 
Prominence (IP) scale (Shaw, 1959, 1960), This scale 
measures the degree to which an individual engages 
in behavior that makes him or her stand out in the 
group and reflects many of the behaviors associated 
with the leader role. It contains items like: “In 
meetings of groups to which I belong I often find 
myself trying to get things organized”; “In group 
situations I am quick to take the lead when an 
opportunity presents itself”; “If I disagree with 
someone, I rarely let them know that I disagree.” 
Both positive and negative items are included. 
Agreement with the first two examples indicates high 
IP tendencies, whereas agreement with the third 
example indicates low IP tendencies, Split-half reli- 
ability was reported to be .79, and test-retest re 
ability was found to be .85. Validity was examined 
in a number of ways. First, scores on the IP scale 
predicted which person in a group would volunteer 
for a task that would make him stand out from 
other group members, namely, volunteering to record 
the group’s decisions on a chalkboard. In 14 groups 
the volunteer had the highest IP score in the group, 
in 4 groups he had the second highest IP score, 
and in two groups he had the third highest IP 
score. In no group did the person having the 
lowest IP score volunteer for the recording tas 
Second, the IP scale discriminated among under- 
graduates, graduate students in management, and 
young executives in the expected direction (Ms = 
76.7, 81.6, and 84.1, respectively, p < .01). Finally, 
the scale did not correlate with measures that it 
theoretically should not correlate with, thus provid- 
ing a small degree of construct validity. For exam- 
ple, it correlated .02 with the verbal scale of the 
Scholastic Aptitude Test, .15 with the Least Pre- 
ferred Co-Worker Scale, and .22 with the Social 
Acquiescence Scale (Bass, 1956). 

Persons who score high on the IP scale therefore 
reveal behavioral tendencies that are more in accord 
with the requirements of a leadership position than 
do persons who score low on the IP scale, Thus a 
congruent condition was created by assigning a 
high IP person to the leadership position; a non- 
congruent condition resulted from assigning a rela- 
tively low IP person to the leadership position, In 
the congruent condition the mean score of leaders 
was 85.7 and of followers 73.4, (30) =5.72, p< 
.001; in the noncongruent condition, the mean score 
of leaders was 71.6 and of followers 78.8, ¢(30 
2.95, p<.01. Mean IP scores did not differ signifi- 
cantly for either congruency or sex composition (t < 
1.00 in each case). 
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Subjects 


The subjects for this experiment were 48 male and 
48 female students who agreed to participate in this 
experiment as one means of satisfying a course re- 
quirement. Subjects were recruited in groups of three 
(same sex), and each group was randomly assigned 
to one of four experimental conditions. 


Tasks 


The four tasks used in this experiment were as 
follows: 
Horse Trading 


A man bought a horse for $60 and sold it for 
$70. Then he bought it back again for $80 and 
sold it for $90. How much money did the man 
make in the horse trading business? 


Strange Mountain 


A man reported that in his travels over the 
United States he found a mountain with such 
strange characteristics that a glass would hold 
less water on top of the mountain than in the 
valley below. What mountain has this strange 
characteristic? 

Mother’s Son 


A lady was once seated in the family room 
watching television when her son entered the room. 
Upon receiving the parental command, “Go away, 
my son, and do not disturb me!”, the son replied, 
“T am your son but you are not my mother, and 
until you have shown me how this can be, I shall 
not leave.” How do you explain the son’s state- 
ment? 

Probabilities 


If in your bureau drawer are ten blue socks and 
sixteen grey socks, and you reach into it in the 
dark, how many socks must you take out to be 
sure of getting a pair that match? 


The horse-trading problem is one used by Maier 
and Solem (1952); the remaining three are adapted 
from old parlor games. Each task required that the 
group not only arrive at the correct answer but also 
that the answer be explained. A group could earn a 
score ranging from O (no correct answer, no expla- 
nation) to 2 (both the correct answer and the cor- 
rect explanation). For example, to earn the full 2 
points on the “strange mountain” task, the group 
would have stated that this was true of any moun- 
tain (correct answer=1 point) and that this was 
the result of the fact that the curvature of the 
earth, to which the surface of water conforms, is less 
convex (flatter) at higher than at a lower altitude, 


and t results in the glass holding less water on 
top of a mountain than in the valley below (correct 
explanation = 1 point). 
Procedure 

Upon the laboratory, each subject was 
seated in a tbicle and given the IP scale. 
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As soon as these were completed, the experimenter 
quickly scored them and made the appropriate posi- 
tion assignments. The subjects were then assembled 
in a group and seated at a round work table. They 
were told that they would be attempting a series of 


tasks working together as a group. It was explained _ 


that each decision would be made after group dis- 
cussion but that the actual decision would be made 
by the leader. After it was determined that all group 
members understood the general instructions, the first 
task was presented. Each task was typed on a separate 
sheet, and spaces for the decision (answer) and an 
explanation of the decision were provided. The tasks 
were presented in random order, with the restriction 
that each task appear in each ordinal position an 
equal number of times and that orders be the same 
in congruent and noncongruent conditions. A maxi- 
mum of 5 minutes was allowed for each task. 

During the group discussion, the experimenter op- 
erated an event recorder that kept a record of the 
time each group member talked on each task. Since 
the records for all subjects were on a common tape, 
it was possible to determine the total time each 
person spent talking, the number of times each 
member talked (number of speeches), the average 
duration of each speech, and the number of times 
each person interrupted another group member. 

When all tasks had been completed, each subject 
completed a questionnaire designed to measure each 
person’s evaluation of the other members of the 
group (ratings of intelligence, personal adjustment, 
liking, liking to work with), confidence in the lead- 
er’s decision, preference for groups with a leader, and 
judgment of whether groups make better decisions 
than individuals, All responses were on a 7-point 
scale, with higher scores reflecting more positive re- 
actions. 


RESULTS 


The results of this experiment fall into 
three broad categories: group effectiveness 
scores, communication process scores, and 
questionnaire responses. 


Group Effectiveness 


The basic performance score for each group 
was the sum of points earned on each of the 
four tasks and therefore could range from 0 
to 8. Mean performance scores as a function 
of congruency and sex composition are given 
in Table 1. Analysis of variance revealed that 
congruent groups (M = 4.4) were more effec- 
tive than noncongruent groups (M = 3.1), 
F(1, 28) =9.91, p< .001, accounting for 
23% of the variance. (Percentage of variance 
accounted for was estimated by the method 
proposed by Hays, 1973, which yields a con- 
servative estimate.) 


| 


CONGRUENCY AND GROUP BEHAVIOR 


TABLE 1 
MEAN PERFORMANCE AND TIME SCORES 


Congruent Noncongruent 
Measures Males Females Maļes Females 
Performance 
score 4.6 4.3 3.0 3.3 
Total time 
to solve 2.5 3.1 2.5 2.9 
Mean time 
per correct 
2.9 14 24 


àl 1.9 
ae. 
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Communication Process 


Means for the various measures of the com- 
munication process are given in Table 2. 
Analysis of variance of total speaking time 
revealed that leaders talked more (M = 41.4 
sec/task) than followers (M = 28.2 sec/ 
task), F(1,28) = 17.12, p < .001, accounting 
for 31% of the variance. No other significant 
difference was found. Similarly, leaders made 
more speeches (M = 16.6) than did followers 


fs recorded in minutes. 
y 


Table 1 also gives means for total time 
spent on each task and for the average time 
per correct trial. There was no significant 
difference for total time; however, this out- 
come could be an artifact resulting from dif- 
ferences in performance levels. For example, 
if time spent attempting a given task is 
greater for incorrect than for correct decisions, 
or vice versa, the total time measure could be 
misleading. Therefore, the average time re- 
quired for reaching a correct decision was 
computed and analyzed. The only significant 
finding was that male groups required less 
time (M = 1.7 minutes) than female groups 
(M =2.7 minutes), F(1,28) = 8.48, p< 
.01, accounting for 19% of the variance. 
There was a nonsignificant tendency for both 
male and female groups to spend less time 
on correct trials in the noncongruent than in 
the congruent conditions. Apparently, if 
groups in the noncongruent condition did not 
arrive at the correct answer quickly, they 
were unable to work it out, or perhaps groups 
in the noncongruent condition conceded de- 
feat more readily than groups in the congru- 
ent condition. 


(M = 13.4), F(1,28) = 8.51, p< .01, ac- 
counting for 18% of the variance, with no 
other significant difference. Length of speeches 
by leaders (M= 2.6) was significantly 
greater than speeches by followers (M = 2.1), 
F(1,28) = 15.88, p < .001, accounting for 
32% of the variance. 

The measure of interruptions revealed a 
different pattern in that the only significant 
source of variance was the Congruency X 
Position interaction, F(1,28) = 4.98, p< 
.05, accounting for 10% of the variance. This 
interaction effect is depicted in Figure 1. 
Leaders interrupted others more frequently 
in the noncongruent condition than in the 
congruent condition, whereas followers did the 
opposite. 


Questionnaire Responses 


Table 3 presents the questionnaire data as 
a function of the various experimental condi- 
tions. The ratings of intelligence, liking, liking 
to work with, and personal adjustment were 
highly correlated; therefore, they were com- 
bined into a single measure of evaluation of 
others. Analysis of variance revealed a signifi- 
cant Sex Composition X Position interaction, 
F(1,28) = 5.29, p < .05, accounting for 10% 
of the variance. Male leaders evaluated others 


TABLE 2 
MEASURES OF THE COMMUNICATION PROCESS 


3 e? N t 
Conates Cfemnales Normales 

Measure* L F L F. L fou 
i i 5.5 39.0 28.5 

Total speaking time per task 39.0 31.5 510. 25: 

No. AE per task 14.0 13.0 aos ae wI ne 
Average length of speech 2.8 24 25 7 ie 2i 

Number of interruptions per task 38 1.3 1.0 i i : 


Note. L = leadership position; F = followership position. 
a Time is recorded in seconds, 
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Ficure 1. Mean number of interruptions as a func- 
tion of position and behavior-role congruency. 


higher (M = 5.6) than did male followers 
(M = 5.0), whereas female leaders did not 
significantly differ (M = 5.2) from female 
followers (M = 5.3). This finding was quali- 
fied by a significant Sex Composition X Con- 
gruency X Position interaction, F(1,28) = 
5.04, p < .05, accounting for 10% of the vari- 
ance. This interaction is shown in Figure 2, 
which indicates that male leaders evaluated 
others higher than did male followers in both 
congruency conditions, but this difference was 
greater in the noncongruent than in the con- 
gruent condition; whereas female leaders and 
followers did not differ in the congruent con- 
dition, but female followers evaluated others 
more highly than did female leaders in the 
noncongruent condition. 

Analysis of ratings of confidence in the 
leader’s decision resulted in a significant Sex 
Composition X Position interaction, F(1, 28) 
= 5.89, p < .025, accounting for 13% of the 
variance, Interaction means are shown in Fig- 
ure 3. Male leaders had more confidence in 
their decisions than did female leaders in 
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their decisions, but male and female followers 
did not differ significantly in their confidence 
in the leader’s decision. Stated differently, 
male leaders had more confidence in their de- 
cisions than their followers had, but female 
leaders had less confidence in their decisions 
than did their followers. 

Analysis of ratings of preference for groups 
with leaders resulted in a significant Sex 
Composition X Congruency interaction, F(1, 
28) = 14.86, p < .001, accounting for 30% 
of the variance. This interaction effect is 
shown in Figure 4. Males indicated a greater 
preference for groups with a leader in the 
congruent condition than in the noncongruent 
condition; females showed a greater prefer- 
ence for groups with a leader in the noncon- 
gruent than in the congruent condition. 

Finally, males’ ratings of “groups better 
than individuals” was significantly higher (M 
= 5.7) than ratings by females (M = 5.1), 
F(1, 28) = 5.25, p < .01, accounting for 12% 
of the variance. 


Discussion 


The results of this study clearly support 
the major hypothesis concerning the relation 
of behavior—role congruency and group effec- 
tiveness. Groups in which the behavioral tend- 
encies of the leader were in accord with the 
behaviors anticipated on the basis of the 
leader role performed markedly better than 
groups in which behavioral tendencies and 
role requirements were not congruent. Pre- 
sumably, behavior—role congruency facilitates 
coordination of member actions (and hence 
group effectiveness), whereas lack of such 
congruency interferes with coordination (and 
hence reduces group effectiveness). 


TABLE 3 
QUESTIONNAIRE DATA 


Congruent Congruent Noncongruent Noncongruent 
leader follower leader follower 
Measure M F M F M F M F 
Evaluation of others 5 5.4 5.4 5.0 5.4 5.7 5.0 5.0 5.2 
Confidence in leader’s decision 6.1 4.0 47 4.8 5.0 4.6 43 4.9 
Preference for group with leader 4.6 3.5 44 3.4 3.8 44 3.1 AA 
Groups better than individuals 5.5 5.0 5.5 5.3 6.0 5.1 5.8 5.2 
Note. M = male group; F = female group. All responses are on a 7-point scale with the higher score reflecting a more positive 
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Ficure 2. Mean evaluation of other group mem- 
bers as a function of position, sex composition, and 
behavior-role congruency. 


One might also have expected that the con- 
Zruency variable would affect the communica- 
i 7. 
tion process. For example, P might be ex- 
pected that the leader would spend less time 
talking, make fewer and/or shorter speeches, 
etc., in the noncongruent than in the congru- 
ent condition. If so, one might interpret the 
congruency effect as the consequence of differ- 
ences in talking; that is, if the leader in the 
congruent groups talked more than the leader 
in the noncongruent groups, he or she might 
get accepted more quickly, thus permitting 
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Ficure 3. Mean confidence in the leader’s decision 
as a function of position and sex composition. 
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MEAN PREFERENCE FOR LEADER 


CONGRUENT 
RELATION OF BEHAVIORAL TENDENCY TO POSITION 


NONCONGRUENT 


Ficure 4. Mean preference for groups with lead- 
ers as a function of sex composition and behavior- 
role congruency. 


the group to devote more time and effort to 
working on the problems. This was not found; 
instead, leaders made more and longer 
speeches than followers in both congruency 
conditions. The only measure of communica- 
tion that was influenced by congruency was 
number of interruptions: In congruent groups 
the leader interrupted others less frequently 
than followers, whereas in the noncongruent 
condition leaders interrupted more frequently 
than followers. This finding probably reflects 
interpersonal tension resulting from the diffi- 
culty of coordinating member actions in the 
noncongruent groups. When the occupants of 
the various positions in the group behave in 
accord with expectations, as in the congruent 
groups, group members may be expected to 
be more relaxed (at least, relatively so), and 
the followers may not feel as restrained from 
interrupting the leader to make a point, to 
facilitate task completion, etc. At the same 
time, the leader may not feel a need to im- 
pose his/her will upon the group. On the 
other hand, when the occupant of the leader- 
ship position does not typically enact leader- 
ship behaviors, as ju the noncongruent condi- 
tion, some tension is likely to be aroused; the 
leader may feel a need to exert some influence 
upon the group (hence tending to interrupt 
others), whereas the followers may not feel 
free to interrupt the leader. Thus, there is 
minimal evidence that congruency influences 
the structure of the communication process; 
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however, it seems likely that the greatest 
differences occur in the quality of communica- 
tions. 

The impact of noncongruence on group 
members is also revealed in members’ evalu- 
ations of each other and their relative pref- 
erence for groups with leaders, but the impact 
is different for male and female groups. Be- 
cause of the differential expectations deriving 
from traditional cultural sex roles, it is of 
interest to consider the findings with respect 
to male groups. Male leaders evaluated fol- 
lowers more highly than followers evaluated 
others in both congruency conditions, but the 
difference was greater in the noncongruent 
than in the congruent condition. This finding 
further supports the interpretation that the 
leader experienced some tension in the non- 
congruent condition, which he tried to allevi- 
ate by rating his followers relatively high. The 
male leader also indicated a greater preference 
for groups with a leader in the congruent than 
in the noncongruent condtion: By assuming 
that groups are relatively more effective with- 
out a leader in the noncongruent condition, 
the leader is relieved of some responsibility 
for the group’s performance. The evaluations 
of others and the ratings of preference for a 
group with a leader by females were generally 
opposite to those of males. This difference 
could be due to differential expectations of the 
appropriate behavior of females in different 
group positions. 

This latter conclusion is bolstered by other 
significant differences attributable to the sex 
composition of the group. For example, male 
groups spent less time per correct trial than 
did female groups, male leaders reported rela- 
tively greater confidence in their decisions 
than did female leaders, and males generally 
believed that groups are relatively more effec- 
tive than individuals to a greater extent than 
did females. Thus, there is ample reason to 
believe that the perceptions and expectations 

of females may be different from those of 
males with respect to groups and behavior in 
groups, and these differences probably ac- 


count for their differential responses to behav- 
ior—role congruency. 

In summary, the data from this experi- 
ment provide evidence that behavior-role con- 
gruency is related to group effectiveness and 
some indirect evidence that this effect may 
be related to problems of coordination and 
related interpersonal tension. Further research 
is necessary to provide more definitive data 
concerning the processes mediating the con- 
gruency-effectiveness relationship. 
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We examined the effects on delay time of two different ways of cognitively 
representing reward objects in a delay of gratification paradigm. Previous work 
had demonstrated that when children focused on the rewards for which they 
were waiting, their ability to delay gratification was inhibited; in contrast, 
attending to symbolic representations of the rewards enhanced their ability to 
delay. Children were taught to transform cognitively the stimuli present during 
the delay (real rewards or pictures of them), turning real rewards into pictures 


` epresents the rewards cognitively is'a much more potent determiner of 
m or her delay behavior than is the actual reward stimulus in front of the 


Fae into real rewards. Results show that the manner in which the 


_ ild. Theoretical implications of the role of cognitive appraisal in self-control 


are discussed. 


Although it is widely asserted that how we 
construe a potentially rewarding or “reinforc- 
ing” stimulus determines our response to it, 
there have been surprisingly few empirical 
demonstrations of this basic proposition, and 
even less is known about the specific mecha- 
nisms that may underly the phenomenon. 
Given the importance of both rewards and 
cognitions in contemporary behavior theories, 
this is a curious neglect. To help illyminate 
the relevant mechanisms, a series of experi- 
ments has begun to explore the role of the 
cognitive representations of rewards in self- 
control and, specifically, in voluntary delay 
of gratification (Mischel, 1974; Mischel & 
Baker, 1975; Mischel & Ebbesen, 1970; 
Mischel, Ebbesen, & Zeiss, 1972; Mischel & 
Moore, 1973). The first results showed that 
young children could not delay gratification 
for long when confronted with the contingent 
rewards for which they were waiting. When 
they were provided with distractions from the 
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rewards, they were much better able to delay 
gratification, suggesting that, contrary to ini- 
tial theoretical predictions, voluntary delay is 
facilitated when attention and ideation are 
directed to pleasant distractions rather than 


"to the goals (Mischel, 1974). Data from the 


later studies indicated that voluntary delay of 
gratification was enhanced by any covert or 
overt activities that distract the individual 
from attending to the expected rewards in the 
delay contingency; conversely, delay appeared 
to be diminished by attention to the rewards 
during the delay period. 

Although this interpretation seemed reason- 
able, and results which had led to it were 
replicated (e.g., Schack & Massari, 1973), it 
proved incomplete. Indeed, it was found that 
exposure to symbolically presented rewards 
(pictures) during the delay period substan- 
tially enhanced, rather than decreased, the 
length of time that children were able to de- 


y lay gratification (Mischel & Moore, 1973). 
” Thus, exposure to pictures of the rewards had 


an effect opposite to that of exposure to the 
actual rewards. The findings showed that dif- 
ferent modes of presenting rewards (i.e., real 
vs. symbolic) may either enhance or inhibit 
self-control and suggested that the specific 
ways in which rewards are construed may 
have crucially different effects on behavior 
(e.g, Mischel & Baker, 1975). 
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The present study was designed to test 
more precisely the way in which cognitive 
representation of the rewards in the delay 
paradigm determines their impact on the abil- 
ity to sustain delay of gratification. Preschool 
children were taught “cognitive transforma- 
tions” which they were to employ during the 
waiting period. These transformations were 
intended to help them cognitively represent 
the rewards in the delay contingency as either 
“real” or as “pictures,” regardless of the ob- 
jective reward stimuli facing them during the 
delay period. The design also varied system- 
atically the real or pictorial nature of the 
reward stimuli actually confronting the child 
during the delay period. In this way it is pos- 
sible to assess the relative importance of the 
actual reward stimulus versus its cognitive 
representation. 

The general predictions were that the cog- 
nitive representation of the rewards would be 
a more potent determiner of behavior than the 
actual objects which confronted the child and 
that the effects of the reward objects could be 
changed predictably by cognitive transforma- 
tions. Specifically, we predicted that regard- 
less of the actual objects facing the child, if 
the rewards are cognitively represented as 
real, delay will be short; if they are imagined 
as pictures, delay will be longer. The design 
also allowed us to replicate the previously ob- 
served effects on delay of gratification of 
attending to actual rewards as opposed to 
pictures of them during the waiting period. 


METHOD 
Design 


The basic delay of gratification paradigm was used 
(Mischel, Ebbesen, & Zeiss, 1972), in which all sub- 
jects are asked to choose between two rewards. They 
are then told that in order to obtain the preferred 
reward, they must remain seated in a chair alone in 
the room until the experimenter returns. If they do 
not wish to wait, they may terminate the waiting 
situation by signaling for the experimenter to return 
to the room. But if they signal, they cannot have 
the preferred reward and instead get the less pre- 
ferred one immediately. This procedure was em- 
ployed in a 3X2 design that varied the stimuli in 
the child’s attentional field during the delay interval 
(slide of the reward objects, the actual rewards, or 
nothing) and the instructions about how to construe 
the available stimuli (ie., as pictures or as real) 

during the waiting period. 
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Subjects 


Subjects were 48 children (24 boys and 24 girls) 
from the Bing Nursery School of Stanford Univer- 
sity. They ranged in age from 4 years 0 months to 
5 years 7 months, with a mean age of 4 years 8 
months. Subjects equated for mean age were assigned 
randomly to one of the six cells of the design, and 
the sex ratio across groups was also equated. There 
were three female and one male experimenters, each 
of whom served equally in each group. 


Apparatus 


The experimental room and setting were basically 
similar to that described previously (Mischel & 
Moore, 1973). Briefly, against one wall was a chair 
facing a table on which there was a desk bell. A 
rear-projection screen (measuring 18 X 20 in., or 
45.72 X 50.80 cm) was placed on the table and 
faced the seated subject at a distance of approxi- 
mately 18 inches. The screen was at the end of a 
38-inch- (96.52-cm-) long box containing a concealed 
carousel 35-mm slide projector. The subject’s chair 
was placed in a 2ł}X2ł-ft. (30.48 X 30.48-cm) 
square on the floor. A partition shielded a corner 
area of the room in which a box of toys was kept. 


Procedure 


When the experimenter escorted a child to the 
experimental room, he or she first showed the child 
the box of toys, demonstrated one or two of them, 
and explained that they would play with the toys 
later. The child was then escorted to the table and 
asked to sit in the chair. Next, the experimenter in- 
troduced and practiced the child’s use of the bell 
signal. Specifically, they played a “game” during 
which the experimenter would step out of the room 
and close the door, but return the moment the child 
signaled. Every time the child rang the bell, thereby 
bringing the experimenter back into the room, the 
experimenter said, “See, you made me come back 
just by ringing the bell.” After making certain that 
the child clearly understood the function of the bell 
(generally after three repetitions of the signaling 
procedure), the rewards were introduced. 

The experimenter removed an opaque 
from under the table and, opening the 
reveal the contents, said the following: 


cake tin 
cover to 


Let’s see what’s under here. I bet it’s a surprise. 
Look, two marshmallows and a pretzel. Which 
would you like to have? The two marshmallows 
or the pretzel? (The child then chose the object 
he wanted.) Say, do you remember I told you 
that sometimes I have to go out of the room? 
Well, I have to go out of the room now, but I'll 
tell you what. Do you see these lines (pointing to 
the square on the floor around the child’s chair) ? 
Well, this is the waiting square. If you sit in your 
chair in the waiting square all the time until I 
come back, then you can have (pointing to the 
chosen object) . But if you don’t want to 
wait, you can always ring the bell and make me 
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come back. But if you ring the bell or get out of 
your chair out of the waiting square, then you 
can’t have (chosen object), but you can have 
(unchosen object). 


The contingency was repeated twice. The experi- 
menter then assessed the child’s comprehension by 
asking the following three questions: “If you sit in 
your chair in the waiting square all the time until I 
come back, which one do you get?” “But if you 
don’t want to wait, what can you do to make me 
come back?” “But if you ring the bell or get out of 
your chair out of the waiting square, then which 
one do you get?” Only children who were able to 
answer all three criterion questions were permitted 
to continue the experimental procedure. 

After the subjects answered the contingency ques- 
tions they were shown the “picture box” (projection 
screen). The experimenter removed the cover of the 
picture box and said, “Here is a picture of the 
marshmallows and pretzel; it looks just like those 
(pointing to the real rewards), but they’re not 
real.” Thus, all subjects were exposed to both the 
actual rewards and a picture of the rewards during 
the preexperimental period. The experimenter then 
consulted a slip of paper that stated what was to 
be left in front of the child during the waiting 
period (the actual rewards, the picture of the re- 
wards, or nothing) and what the child would be 
instructed to “see” during the waiting period (the 
real rewards or pictures of the rewards). 


Experimental Manipulations 


In two conditions the children were instructed to 
see what was actually in front of them, in two other 
conditions they were asked to transform what was in 
front of them (real to picture or picture to real), 
and in two additional conditions the children had 
nothing in front of them but were to “pretend to 
see” either a pitcure or the real rewards in front of 
them. 

Specifically, the instructions to “see real” when 
facing the reward picture were as follows: 


Do you know what a marshmallow and a pretzel 
are? Vou can see them and touch them and 
they’re real. Here is a picture of two marshmallows 
and a pretzel. You can see it. Now close your 
eyes. In your head, take the picture of the two 
marshmallows and the pretzel and make it real. 
It’s not just a picture anymore, it’s real. From 
now on you can see two real marshmallows and 
a real pretzel. Now open your eyes and do the 
same thing. Take the picture of the marshmallows 
and pretzel that you can see and touch, From now 
on you can see two real marshmallows and a real 
pretzel right here in front of you (points at table). 
Can you see them? [If the answer was “no,” the 
experimenter said, “Pretend like you can see them. 
Imagine that you can see them right here in front 
of you (pointing to table). Can you pretend to see 
them? Act like you have two real marshmallows 
and a real pretzel right here in front of you; just 
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reach out and touch them. Pick up the real marsh- 
mallows. Can you show me the pretzel? That's 
right, they're right there.) While I'm gone, re- 
member to see the real marshmallows and pretzel 
right here in front of you. 


The instructions to “see picture” when facing the 
real rewards were as follows: 


Do you know what a picture is? It shows some- 

thing and looks just like it, but it isn’t real, Here 

are two real marshmallows and a real pretzel. You 

can see them. Now close your eyes. In your head, 

take the two marshmallows and the pretzel and 

make them into a picture. Make a color picture 

of marshmallows and a pretzel; put a frame 

around them. From now on you can see a picture 

of two marshmallows and a pretzel right here in 
front of you (pointing to screen), The marsh- 
mallows and pretzel aren’t real—they’re just a 
picture. Can you see the picture? [If the answer 
was “no,” the experimenter said, “Pretend like 
you see a picture, Imagine that you see a picture 
right here in front of you; just reach out and 
touch it. Show me the frame around the picture. 
Where are the marshmallows? Can you show me 
the pretzel? That’s right, the picture is right 
there.”] While I’m gone, remember to see the 
picture right here in front of you, 


Instructions in the nontransformation conditions were 
slightly modified to be appropriate but otherwise 
identical. In all conditions, the experimenter assessed 
the subject's understanding of the instructions and 
then left the room. 


Measuring Delay Time 


Time was recorded from the moment the experi- 
menter left the room until the subject rang the bell 
or waited for a maximum period of 20 minutes, The 
experimenter also returned to the room if the sub- 
ject left the waiting square for more than 20 sec- 
onds. The subject was under continuous observation 
by means of a one-way mirror, Upon returning to 
the room, the experimenter asked all subjects which 
reward they thought they were to get. Only subjects 
who correctly identified the proper reward (96%) 
were included in the data. 


RESULTS 


Table 1 shows the mean delay times for all 
groups. A two-way analysis of variance for 
equal ms was performed to test for the effects 
of cognitive representation and stimulus con- 
ditions. As expected, a highly significant main 
effect, F(1,42) = 19.74, p < .001, was ob- 
tained for cognitive representation; there was 
no effect (F < 1) for stimulus conditions and 
no interaction. 

Data for several of the groups were skewed; 
in particular, in some conditions almost every- 
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TABLE 1 


EFFECTS OF COGNITIVE REPRESENTATIONS AND ACTUAL 
REWARDS PRESENT ON VOLUNTARY DELAY 
TIME (IN MINUTES) 


Cognitive representation of rewards 
(“pretend”’ instructions) 


Rewards Pictures Real 

present in 

situation M sD M sD 
Picture 17.75 3.41 5.95 6.86 
Real 17.70 6.08 7.91 6.34 
None 15.61 4.91 12.24 7.83 


Note. Maximum possible delay is 20 minutes. 


one waited the full 20 minutes. Consequently, 
a supplemental nonparametric analysis test- 
ing the number of subjects waiting to cri- 


terion (20 minutes) was also conducted. For ` 


this analysis, data were combined across stim- 
ulus conditions, and the effect of the two 
ideation conditions was tested. A significant 
effect for cognitive representation, y7(1) = 
8.39, p < .01, was obtained; children who 
pretended to see the picture were more likely 
to wait to criterion than those who pretended 
to see the real objects. 
To compare several specific cells that were 
of particular theoretical interest, individual 
contrasts were computed. The first compari- 
son examined the difference between the mean 
of the group that viewed a slide of the re- 
wards and was instructed to see them as a 
picture and the group that viewed the real 
rewards and was instructed to see them as 
real. These conditions were intended to dem- 
onstrate the “natural” effects of stimulus 
conditions (without cognitive transforma- 
tions) and, in fact, closely replicated the 
earlier findings. The contrast yielded a highly 
significant effect with the picture group wait- 
ing longer, as expected, F(1,42) = 9.19, p < 
.005. The next contrast compared the two 
conditions in which the stimulus objects were 
cognitively transformed into their opposites 
(i.e., the conditions in which the picture was 
construed as real and the real rewards as 
pictures). Again, a highly significant effect 
was found, F(1,42) = 13.12, p < .001; the 
group that faced the real rewards but con- 
strued them as symbolic representations 
waited longer. A final comparison, not or- 
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thogonal to the first two, tested the main . 
effect of the cognitive transformation instruc- 
tions. In this contrast the groups that faced 
either nothing or the real rewards, but that 
had been asked to “see” pictures of the re- 
wards, were combined and compared with 
those groups that faced the same stimuli but 
that had been asked to construe them as real. 
As expected, the former delayed longer than 
the latter, F(1,42) = 8.23, p <.01. 


Discussion 


As predicted, the results show that children 
can cognitively transform the reward objects 
facing them during the delay period in ways 
that either permit or prevent effective delay 
of gratification. The findings further indicate 
that the reward objects have opposite effects 
on children’s self-control depending on 
whether they are construed as “real” or as 
“pictures” (symbolic representations). The 
absence of an effect of what the child is look- 
ing at objectively and the potent effect of 
how the child construes the reward objects 
demonstrate the powerful role of cognitive 
representation in self-regulation. 

The data are particularly impressive given 
the strength of previous findings regarding 
the debilitating effects of facing the real re- 


wards during the delay -period (Mischel & ~ 


Ebbesen, 1970; Mischel, Ebbesen, & Zeiss, 
1972; Schack & Massari, 1973). Although 
the objective stimulus conditions were iden- 
tical to those in the earlier research (in which 
the children tended to delay for only short 
periods), the addition of simple instructions 
which led the child to construe the actual re- 
wards as “pictures” resulted in an average 
waiting time of over 17 minutes. 


In sum, the overall results offered strong » 


support for the contention that the effects of 
a particular reward stimulus on children’s 
waiting behavior may be dramatically and 
predictably altered by the manner in which 
the child construes that stimulus. Consis- 
tently, the cognitive representations outweigh 
the effects of the actual stimuli facing the 
child. 

In addition to demonstrating the powerful 
role of cognitive appraisal in children’s self- 
control, the data also replicate the findings 


| 
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of the previous studies that, in the absence of 
cognitive transformations, attending to the 
real rewards in the contingency produces 
short delay times, whereas attending to sym- 
bolic representations of the rewards (pictures) 
helps the child to delay gratification. We 
have theorized elsewhere about the possible 
reasons underlying these effects (Mischel, 
1974). Briefly, a stimulus may have a moti- 
vational (consummatory, arousal) function 
and an informational (cue) function, The ac- 
tual reward stimuli probably tend to motivate 
or instigate a consummatory response, whereas 
a picture of the rewards simply reminds the 
subject of the contingent goal without being 
so real as to arouse a consummatory response. 
Yet these effects of the reward stimuli (real 
or pictorial) can easily be wiped out or even 
reversed by simple cognitive transformations, 
as the present findings show.’ Interestingly, 
the total patterns of findings from our re- 
search program (summarized in Mischel, 
1974) seem to fit nicely predictions that may 
be made independently by the Atkinson- 
Birch model of the “dynamics of action,” 
which analyzes how the content of thought 
may function to instigate, to consummate, or 
to inhibit an activity (Birch & Atkinson, 
1974, pp. 93 ff).. 

Previous research has. investigated how the 
cognitive appraisal of threatening stimuli (a 
film of cde, primitive genital operations) 
influences emotional responses to those stim- 
uli (Speisman, Lazarus, Mordkoff, & Davison, 
1964). The findings showed that emotional 
responses were higher when the film was 
accompanied by a sound track that empha- 
sized the dangers than they were when the 
sound track denied or “intellectualized” the 
dangers in a detached manner. The authors 
of this widely cited, interesting study in- 
terpreted their results as due to differences in 
“cognitive appraisal” generated by the dif- 
ferent sound tracks. The present study pro- 
vides a more direct demonstration of the 
importance of cognitive appraisal because the 
stimulus remained identical across situations. 
Different behavioral effects occurred as a re- 
sult of specific ideation instructions given 
before exposure. (In the Speisman et al. study, 
in contrast, different sound tracks accompa- 
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nied the film across groups, thus creating 
different stimulus conditions.) The present 
study provides a method of manipulating cog- 
nitive appraisal more directly by means of 
instructions through which subjects may 
transform the identical stimuli into diverse 
cognitive representations, The results demon- 
strate the powerful role of such cognitive 
transformations and support the theoretical 
view that the cognitive encoding of stimuli 
significantly and predictably influences their 
impact on behavior (Mischel, 1973), Our 
results are also consistent with recent work 
showing the value of redefining stressful sit- 
uations for the subject’s ability to cope with 
them (e.g., Holmes & Houston, 1974). 

These data, in addition to demonstrating 
the powerful role that cognitive representa- 
tions may play in self-regulation and offering a 
new methodology for investigating those 
processes, may also suggest one way in which 
the development of self-regulation occurs. As 
children move from representing things in con- 
crete action-bound ways to representing con- 
sequences in more abstract, symbolic, and 
less real and arousing ways, the ability to per- 
sist in the face of frustration is enhanced. 
Studies of cognitive growth indicate that 
during the course of development the child’s 


1Qne possible (although unlikely) alternative ex- 
planation of the results is that it may be more en- 
joyable to transform real objects into pictures than 
to transform pictures into real objects. If so, the 
data might merely reflect the effects of providing 
some children with a more enjoyable distraction, In 
order to test this alternative hypothesis, we tested 
four additional groups of eight subjects each in 
similar conditions except that the children had to 
wait facing either actual or symbolic representations 
of reward objects that were comparable but irrele- 
vant to the contingency. For example, these children 
would choose between marshmallows and a pretzel 
but would wait while facing either two mints and a 
graham cracker or a picture of two mints and a 
graham cracker. They were also given instructions 
to either make the rewards in front of them “real” 
or to make them into “pictures,” as in the main 
experiment. Under these conditions (i.e. irrelevant 
rewards) we found no effects approaching signifi- 
cance. Thus cognitive transformations of reward 
stimuli from real into pictorial representations facili- 
tate delay only when the rewards are the ones in 
the delay contingency and not when they are irrele- 
vant to the contingency. 
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imagery becomes progressively more abstract 
on a dimension of concreteness—abstractness 
(Paivio, 1971). In particular, Piaget and In- 
helder (1966) suggest that while imagery and 
symbolic functions begin to appear at the 
age of 14 to 2 years, a crucial change emerges 
at 7 to 8 years of age. At that time the child 
begins to establish “anticipatory images” 
which permit him to foresee new conse- 
quences and to engage in more abstract, less 
static mental transformations. In a some- 
what similar vein, Werner and Kaplan (1963) 
consider developmental changes in terms of 
progressive “distancing” between the (inner) 
symbolic vehicle and the referential object. 
Thus, the effects obtained here may be par- 
tially age specific, with the progress of self- 
regulation partially resting on the develop- 
ment of the ability to transform concrete, 
arousing, or “hot” rewards and events to less 
real, ‘cooler,’ more symbolic or abstract 
representations. These more “distant” repre- 
sentations may help to guide goal-directed 
behavior without being so arousing and frus- 
trating as to impair effective performance. 


Conclusions 


Tn sum, our results are important in two 
ways: (a) They demonstrate the critical role 
of cognitive operations in delay of gratifica- 
tion, and (b) they show specifically that the 
effects of the objective stimuli in the delay 
paradigm (e.g., having relevant real rewards 
or pictures of the rewards facing the child 
during delay) can be systematically reversed 
by specific cognitive transformations. When 
the real objects are imagined as pictorial rep- 
resentations, and vice versa, delay may be 
predictably enhanced or impaired. The gen- 
erality of these findings to other age groups 
and types of rewards remains to be examined 
to test possible developmental changes and 


interactions with stimulus variables (e.g. food 
vs. nonfood rewards). 
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Actors and observers were exposed to behavioral manipulations of distinctive- 
ness and consensus information, Both actors’ and observers’ physiological re- 
actions to musical presentations were manipulated using a modification of the 
false feedback technique in a 2 X 2X 2 X 2 (Sex X Distinctiveness X Consensus 
X Role) factorial design. As predicted, high distinctiveness produced stronger 
environmental and weaker personal attributions than did low distinctiveness. 
Low consensus produced stronger personal and weaker environmental attribu- 
tions than did high consensus. Actor-observer differences were evidenced in an 
actor preference for basing causal ascriptions on distinctiveness and observer ` 
preferences for basing attributions on consensus information. These data indi- 
cated that actor-observer attributional tendencies may derive, in part, from 
information preference as well as information availability differences. The data 
also suggested that causal ambiguity theoretically obtained by conflicting dis- 
tinctiveness and consensus information may not be phenomenologically repre- 
sented, Discrepancies between the present and previous findings were discussed 


in terms of the distinction’ between passive and coacting observers. 


Heider (1958) initially suggested that per- 
son perception involves a process of causal 
attribution. Simply, any behavior can be seen 
as resulting from personal causes (within the 
actor) or from environmental causes (exter-+ 
nal to the actor). The central task of the per- 
ceiver is to determine the relative contribution 
of these two causal classes in producing an 
observed behavior. Two recent attribution the- 
ories of particular interest draw heavily on 
Heider’s work and focus on those factors that 
influence attribution to person or environ- 
ment, 

Kelley (1967) specified three types of in- 
formation that determine causal attributions: 
consistency, distinctiveness, and consensus. 
Stable attribution to either a personal or 
environmental cause requires that the ob- 
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served behavior be highly consistent (i.e., does 
not vary across time or modality). Attribu- 
tion to environment or person is determined 
by the levels of distinctiveness and consensus. 
Environmental attribution occurs when the 
observed beħavior is highly distinctive (i.e. 
when an individual’s behavior is different for 
different stimulus objects) and highly con- 
sensual (i.e., when an individual’s behavior 
toward a particular stimulus object is similar 
to that of other individuals). Personal attri- 
bution results when both distinctiveness and 
consensus are low. Finally, Kelley suggested 
that when distinctiveness and consensus infor- 
mation are mixed, a causally ambiguous situ- 
ation is produced that leads to an information 
search and/or results in attributions that are 
susceptible to- the particular biases of the 
perceiver. 

Jones and Nisbett (1971) 
overriding tendency for actors to 
havior as environmentally caused and for ob- 
servers to view behavior as personally caused. 
Three different explanations may account for 
this actor-observer divergence. Perceptual 
perspective explanations suggest that actors 
see themselves as constant and the environ- 
ment as varied, whereas observers view the 
environment as constant and the actor as 
varied. Thus, in comparison with observers, 


proposed an 
view be- 
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actors should favor environmental causes to 
explain their own behavior (cf. Storms, 1973). 
Motivational explanations generally assume 
that actors defend or enhance self-esteem by 
assigning personal causation for desirable be- 
havior and environmental causation of non- 
desirable behavior (Lowe & Hansen, in press). 

The present concern, however, is with in- 
formational differences. Several theorists have 
suggested that actors typically have historical 
knowledge about their own emotional states 
and intentions as well as about their past 
performances to which observers are not privy 
(Bem, 1972; Jones & Nisbett, 1971; Kelley, 
1971), That is, actors but mot observers have 
both consistency and distinctiveness informa- 
tion and may therefore rely heavily on the 
latter information to determine their attribu- 
tion to personal or environmental factors. In 
contrast, consensus information should play a 
major role in determining observers’ attribu- 
tion to personal or environmental factors. 

While no study has attempted to assess the 
effects of varying informational input per se 
on the attributions of both actors and observ- 
ers, evidence reported by researchers employ- 
ing actor-only and observer-only designs is 
available. Data gathered from actors and 
observers have generally supported Kelley’s 
(1967) distinctiveness and consistency pre- 
dictions. Observers made more personal and 
less environmental attributions in low-dis- 
tinctiveness than high-distinctiveness condi- 
tions (McArthur, 1972; Orvis, Cunningham, 
& Kelley, 1975), and these attributions be- 
came stable only under conditions of high 
consistency (Himmelfarb, 1972), A reexami- 
nation of several self-perception studies 
(Barefoot & Staub, 1971; Jones & Sigall, 
1971; Snyder, Schulz, & Jones, 1974; Valins, 
1966; Valins & Ray, 1967) also followed dis- 
tinctiveness and consistency predictions. These 
researchers typically provided actors with in- 
formation suggesting that their behavior had 
been both highly distinctive and consistent. 
While none of these studies report measures 
of attributions per se, the results do suggest 
that highly distinctive, highly consistent be- 
havior led actors to make stable inferences of 
environmental causation. 

Research conducted to assess consensus 
effects, however, has provided only limited 
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support for Kelley’s (1967) model. A se 
of studies conducted by Nisbett, Borg: 
Crandall, and Reed (in press) failed to sh: 
any consensus effects on actors’ attribution: 
Attribution data gathered from observers was 
also discouraging. A number of researchers 
(McArthur, 1972; Miller, Gillen, Schenker, & 
Radlove, 1973; Nisbett & Borgida, 1975) 
have presented data suggesting that observ- 
ers ignore consensus information. However, 
other researchers (Goethals, 1972; Harvey, 
Arkin, Gleason, & Johnston, 1974; Orvis et 
al., 1975) have reported consensus effects on 
observers’ attributions. The failure of some 
researchers to obtain consensus effects on o! 
servers’ attributions was surprising in view 
Jones and Nisbett’s (1971) contention ' at 
observers should be particularly sensit\\. to 
variations in consensus. 

One source of these conflicting findi: 4s may 
be the ability of observers to generate im- 


mediate consensus information by \'rtue of 
knowing or surmising their own bel vior in 
the situation. In this light, it is inter ig to 
note that many researchers who faile: find 
consensus effects (Miller et al., 1973 ett 
& Borgida, 1975) employed undesira! im- 
ulus behaviors, conditions under w! ob- 
servers would surmise that their own vior 
would differ from that of the acto hose 
researchers reporting consensus effec ypi- 
cally presented subjects with behavi that 
either made immediate consensus dif lt to 
generate (Harvey et al., 1974) or w con- 
trolled for desirability (Orvis et al., 1°). In 
short, the reported failure to obtain sen- 
sus effects may have resulted from th- con- 
flicting implications of abstract nor «tive 


eri- 


consensus information provided by 
and 


menters and observers’ more concre! 
immediate self-generated consensus date 


The present study was conducted to assess 
the effects of varying informational input 
(ie., consensus and distinctiveness), inde- 


pendent of perspective and motivationa! in- 
fluences, on the personal and environmental 
attributions of both actors and observers. A 
modification of the false physiological feed- 
back paradigm (Jones & Sigall, 1971) was 
employed to ensure equally vivid distinctive- 
ness and consensus manipulations as well as 
an equivalence of normative and immediate 


«é consensus information. All subjects were ex- 
posed to high-consistency information, 

One important feature of the current pro- 
cedure was the use of neutral behaviors (ie, 
preferences for musical selections) to reduce 
the effects of motivational factors on causal 
attributions. In addition, all subjects viewed 
the behaviors on television monitors, which 
minimized attributional differences due to 
perspective. Most importantly, distinctiveness 
and consistency information, in the form of 
an actor’s behaviors, and equally vivid con- 
; sensus information in the form of observers’ 
behaviors toward the same stimuli, were si- 
| 
| 


multaneously manipulated for both actors and 
observers. 
It was expected that more personal (and 
less environmental) attribution would be pro- 
= duced by low- than by high-distinctiveness 
information and by low- than by high-con- 
sensus information. The proposed (Jones & 
Nisbett, 1971) actor preference for environ- 
mental and observer preference for personal 
attributions were not expected to be evidenced 
| as attributional “main effects” in that an 
] attempt was made to eliminate differences in 
| perceptual perspective and motivation and to 
| provide all subjects with information about 
| each of Kelley’s (1967) factors. As a result 
of a differential actor—observer weighting of 
distinctiveness and consensus information, 
however, actor—observer differences could oc- 
| cur when the causal information provided was 
; ambiguous (Kelley, 1967). Jones and Nisbett 
| suggested that actors’ access to distinctiveness 
and observers’ access to only consensus may 
produce a processing difference such that ac- 
| tors overweigh distinctiveness and observers 
overweigh consensus information. Such pro- 
“ cessing differences should obtain only when 
distinctiveness and consensus information sug- 
gest divergent causes for behavior (i.e., causal 
ambiguity). We would, therefore, expect ac- 
tors’ attributions to show distinctiveness 
effects (i.e., personal attributions for low-dis- 
tinctiveness-high-consensus behavior and en- 
vironmental attributions for high-distinctive- 
ness-low-consensus behavior) and observers’ 
attributions to show consensus effects (i.e., 
personal attributions for low-consenus-high- 
distinctiveness behavior and environmental 
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attributions for high-consensus—low-distinc- 
tiveness behavior). 


METHOD 
Subjects and Design 


Subjects were 208 undergraduates who participated 
to partially fulfill an introductory psychology course 
requirement. Previously unacquainted subjects were 
recruited in same-sex groups of four, The experi- 
ment was designed as a four-way factorial with two 
levels each of consensus (low vs. high), distinctive- 
ness (low vs. high), role (actor vs. observer), and 
sex, 


Procedure 


Each of four subjects was seated at an enclosed 
desk before one of two television monitors, Subjects 
were asked to remove all jewelry. Electrodes were 
attached to each subject’s wrist and palm and 
plugged into consoles at each desk. To reduce the 
affective import of the behaviors, subjects were told 
that they would be assigned person numbers by com- 
puter and that this assignment would be anony- 
mous, random, and unrelated to their seating posi- 
tion. Actually all subjects in each session were 
assigned the same person number, 

Each subject was then fitted with earphones and 
listened to audiotaped instructions. These instruc- 
tions described the study as an attempt to measure 
reactions to various musical selections. A computer 
system was established as a “bogus pipeline” (Jones 
& Sigall, 1971) to subjects’ evaluative reactions, Sub- 
jects were led to believe that the computer would 
measure physiological concomitants (galvanic skin 
responses and electromyograms) of their reactions to 
the stimulus material, obtaining input from the 
electrodes attached to their wrists and palms and 
feeding back the “results” via the television moni- 
tors. Each monitor displayed four scales, supposedly 
one for each numbered subject, with lights indicat- 
ing reactions from —5 (highly negative reaction) 
through O (neutral reaction) to +5 (highly positive 
reaction). Subjects were told that as they listened to 
each selection, the lights on the scale would flash 
randomly. At the end of each selection, a tone would 
sound, and one light on each scale would indicate 
each person’s reaction to the previous musical selec- 
tion, This response supposedly represented a person's 
reaction averaged over the entire listening period. 

Subjects were given an opportunity to familiarize 
themselves with and to test the validity of the 
computerized system in a series of three practice 
trials. A standard stimulus (Concerto for Four Key- 
boards by J. S. Bach) was employed on all practice 
trials, After the first listening period, all subjects 
were given feedback to indicate that they had had a 
positive reaction to the selection. To strengthen this 
manipulation, perceived time was manipulated such 
that subjects would perceive the first listening period 
as passing rapidly. Subjects were told that the 
listening period would be 75 sec in length, but the 
actual time was 60 sec. Thus, subjects should have 
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perceived the practice trial as passing rapidly, and 
this perception should have enhanced the believabil- 
ity of the computer feedback (cf. London & Mo- 
nello, 1974). Over the next two practice trials, real 
time was employed to establish a base rate of time 
passage (Snyder et al., 1974). Feedback patterns 
given after the second trial were randomly deter- 
mined. After the last practice trial, all subjects were 
given feedback on the monitor to indicate that their 
reactions had been neutral. The experimenter then 
informed subjects that the monitoring system had 
been standardized (as indicated by each person show- 
ing a neutral reaction to the final stimulus). Subjects 
were then given an opportunity to ask questions 
about the procedure. 

After the experimenter and the assistant had 
answered all questions, recording sheets for the 
experimental trials were distributed. All subjects 
listened to two different musical selections—Branden- 
burg Concerto Number 3 and Brandenburg Concerto 
Number 4 by J. S. Bach—each of which was pre- 
sented in two different modalities (i.e., an orches- 
trated version and a computer synthesized version), 
for a total of four presentations. The two selections 
including both versions of each selection were se- 
lected from a pool of pretested musical presenta- 
tions and had been rated by an independent sample 
of undergraduates as equivalent and neutral on 
bipolar adjective scales assessing the dimensions of 
familiarity, interest, goodness, likability, and per- 
formance quality. Eight different orders of selec- 
tion-version presentation were randomly employed, 
but in all cases subjects heard both versions of one 
selection before hearing either version of the second 
Selection. 

Prior to each presentation, pretaped instructions 
informed subjects of the listening duration as well 
as the version and selection numbers of the music 
to be presented, Subjects were required to record on 
their log sheets the reactions of the actor to each 
musical presentation and the reactions of three ob- 
servers to the final presentation. Following all four 
presentations, subjects were allowed to review their 
completed log sheets. These log sheets were then 
collected, and the manipulation checks and dependent 
measures were administered. 


Feedback Manipulations 


All subjects heard four musical presentations and 
were given one subject’s (the actor’s) reactions to 
all four presentations as well as three additional sub- 
jects’ (observers’) reactions to the final presentation. 
The consistency of the actor’s responses was held 
constant. Thus, all subjects saw the actor consis- 
tently respond to both versions of a given selection, 
In addition, one half of the subjects viewed the 
actor favorably responding and half, unfavorably 
responding to the final selection. 

Role manipulation. Prior to the presentation of 
the experimental selections, all subjects in each ses- 
sion were randomly assigned to the observer or the 
actor role. Actors were informed that they were 
Person Number 21, and would be viewing their own 
reactions to each of the four presentations. Observ- 
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ers were told that they were Person Number 35 
would be viewing the reactions of another pers 
(Number 21) to each of the four presentations. All 
subjects were informed that they would be iven 
feedback for all four persons (including thems: es) 
to the final musical selection, It should be p: ied 
out that within the positive and negative final ac- 
tion manipulations, both actors and observe e- 
ceived identical patterns of feedback. Thu: ir 
perceptual perspective was the same, 
Distinctiveness manipulation. Subjects in 

distinctiveness conditions received feedback 

ing that the actor had reacted differently to ) 
selections but consistently to both versions h 
selection. Subjects in low-distinctiveness co 


received feedback indicating that the ac id 
reacted similarly to both selections and con ly 
to both versions of each selection, To avoid po- 
tential contamination of observers’ distinc iess 
information regarding their own (similar or di at) 
reactions to the two versions, observers we: (old 
that the order of versions and selections th uld 
hear would not necessarily be the same as | rder 
for the actor. The one exception would be final 
musical presentation, when all persons wou sten 
to the same musical piece. 

Consensus manipulation. Consensus was ipu- 
lated by varying the feedback given to ects 
regarding observers’ reactions to the final sical 
presentation. Subjects in high-consensus c: ons 
received feedback to indicate that all perso roth 
actor and observers) had responded simila iC. 
either four positive reactions or four negati ac- 
tions). Subjects in low-consensus conditions ved 
feedback to indicate that the actor had ted 
differently than the three other persons. Thu tors 
believed that their reaction was different th ree 
other subjects whose reactions were similar rV- 
ers believed that their reaction was simila: two 
other observers but different from that of t tor. 

Variation in perceived time. To maximiz« be- 
lievability of the computer feedback, perceiv. time 
was varied according to whether reaction i iback 
was positive or negative. Although four 2 nute 
presentations were employed in all conditic the 
introduction for each presentation was appro) riate 
to the feedback given. When subjects wer: given 
positive reaction feedback, they were told t the 


presentation length would be 135 sec, encouraging 
the perception of time passing rapidly. When sub- 
jects were to be given negative reaction feedback, 
they were told that the presentation length would be 
105 sec, thus encouraging the perception of slowly 
passing time. Because observers were not given 
feedback regarding their own reactions to the first 
three presentations, they were told that each selec- 
tion would last 120 sec. However, for the last presen- 
tation, observers were told the appropriate presenta- 
tion time according to their feedback condition. 


Measures 


The measures included both direct and indirect 
manipulation checks as well as questions assessing 
both general and specific causal attributions. All 
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Á questions were appropriately worded for actors and 


observers, employing “You” for actors and “Person 
21” for observers. 

Direct manipulation checks. After allowing sub- 
jects time to review their log sheets on which they 
had recorded all reaction feedback, subjects re- 
sponded to three statements on 7-point scales labeled 
from O (very similar) to 6 (very dissimilar). The 
statements asked how similar the actor’s reactions 
had been to: (a) the two different selections (dis- 
tinctiveness check), (b) others’ reactions to the final 
selection version (consensus check), and (c) both 
versions of the second selection (consistency check). 

Indirect manipulation checks. On the same ques- 
tionnaire, subjects then responded to two hypotheti- 
cal situations regarding their confidence in the ac- 
tor’s ability to judge music. In the first situation, 
the actor was described as being able to win a 
monetary reward for correctly identifying the musi- 
cal selection that was most different from others in 
a series of musical selections (distinctiveness check). 
In the second situation, subjects were asked to 
imagine that the actor could earn a monetary re- 
ward for correctly predicting popularity (consensus 
check), that is, which of several musical selections 
would be the most appealing. Subjects responded on 
7-point scales labeled from O (not at all confident of 
the actor’s ability) to 6 (very confident of the 
actor’s ability). 

Attribution to general factors. Following the ma- 
nipulation checks, subjects were asked on a different 
questionnaire to indicate how much they thought 
the actor’s reaction to the final musical presentation 
was influenced by “something about that person as 
a person” (personal cause) and by “something about 
the environment or stimulus material” (environ- 
mental cause). Subjects were asked to read both 
items carefully before responding on 7-point scales 
labeled from 0 (had a very weak effect) to 6 (had 
a very strong effect). The order in which these al- 
ternatives were presented was randomly determined. 

Attributions to specific factors. Four additional 
questions on a separate page introduced possible 
reasons for the actor’s reactions to the musical 
presentations. Two reasons, “personal preference in 
music” and “mood,” measured specific personal 
causes; two other reasons, “the particular qualities 
of the performance” and “the particular qualities of 
the selection,” measured specific environmental 
causes. Again, subjects were asked to read the entire 
questionnaire before responding on scales identical to 
those employed for the more general measures. 


ResuLts 


Data from eight males and eight females 
were eliminated from the analyses on the 
basis of suspicion expressed during the de- 
briefing session. Each measure thus included 
192 observations and was analyzed using a 
four-factor between-subjects analysis of vari- 
ance (Consensus X Distinctiveness X Role X 


Sex). No significant main effects or interac- 
tions due to sex were obtained, 


Consistency, Distinctiveness, and Consensus 
Manipulation Checks 


As expected, the analysis conducted on 
subjects’ judgments of the similarity of the 
actor’s reactions to the two versions of the 
second selection produced no significant main 
effects or interactions, indicating that the 
actor’s behavior was seen as highly consistent 
across modalities, 

The analysis conducted on responses to 
the distinctiveness item produced only a sig- 
nificant main effect for distinctiveness (F = 
177.12, p < .001),1 The actor’s responses to 
the two selections were perceived as more 
similar under low- (M = 4.97) than under 
high-distinctiveness (M = 1.63) conditions. 
In addition, subjects were more confident of 
the actor’s ability to discriminate between 
musical selections under high- (M = 3.86) 
than under low-distinctiveness (M = 3.29) 
conditions, producing a main effect for dis- 
tinctiveness (F = 10.30, p < .01 on the indi- 
rect manipulation check measure, No other 
significant effects were obtained. 

The analyses conducted on the consensus 
manipulation checks indicated that this varia- 
ble was also successfully manipulated. On the 
direct measure, the perceived similarity be- 
tween the actor’s and the observers’ reactions 
was greater under high- (M = 5.11) than 
under low-consensus (M = .99) conditions, 
producing a significant main effect for con- 
sensus (F = 816.75, p < .001). A significant 
main effect for consensus was also obtained on 
the indirect measure (F = 103.52, p < .001). 
Subjects rated the actor’s ability to judge 
musical popularity higher in high- (M = 
4.06) than in low-consensus (M = 1.64) 
conditions. A significant Consensus X Distinc- 
tiveness interaction (F = 6.19, p < .05) was 
also obtained for this measure. A Duncan 
multiple-range statistic applied to the means 
underlying this interaction indicated that sub- 
jects rated the actor’s ability to predict musi- 
cal popularity lower ($ < .05) when both 


1 All F statistics reported were based on 1 and 176 
degrees of freedom. 


consensus and distinctiveness were low than 
in any other experimental condition. 


Causal Attributions 


The results of the four-way analyses con- 
ducted on environmental and personal attri- 
butions indicated considerable support for 
Kelley’s (1967) model. As predicted, environ- 
mental attributions were greater for high- 
(M = 4.29) than for low-consensus (M = 
3.08) conditions (F = 57.92, p< .001) and 
for high- (M = 3.99) than for low-distinc- 
tiveness (M = 3.39) conditions (F = 14.48, 
$ < .001). Similarly, personal attributions 
were greater for low- (M = 4.41) than for 
high-distinctiveness (M = 3.66) conditions 
(F = 13.24, p < .001) and for low- (M = 
3.94) than for high-consensus (M = 3.65) 
conditions, although the latter difference did 
not reach an acceptable level of significance 
(F = 1.24, ns). In addition, planned compari- 
sons conducted to test specific predictions de- 
rived from Kelley’s (1967) model confirmed 
that low-distinctiveness-low-consensus behav- 
ior produced stronger personal attribution (M 
= 4.46 vs. M = 3.48; F = 12.26, p < .001) 
and weaker environmental attribution (M = 
4.54 vs. M = 2.73; F = 64.98, p< .001) 
than did high-distinctiveness-high-consensus 
behavior, 

Overall, actors and observers did not differ 
in their causal attributions. However, signifi- 
cant Perspective X Distinctiveness interac- 
tions were obtained on both personal (F = 
8.69, p < .01) and environmental (F = 11.65, 
$ < .001) attributions. The means underlying 
each interaction together with the results of 


TABLE 1 


EFFECTS OF PERSPECTIVE AND DISTINCTIVENESS ON 
PERSONAL AND ENVIRONMENTAL 
ATTRIBUTIONS 


Perspective/ Personal Environmental 
distinctiveness attribution attribution 
Actor 
High 3.23, 4.27, 
Low 4.56, 3.13, 
Observer 
High 4.08), 3.705 
Low 4.25, 3.64, 


Note. For each mean, n = 48. Means are based on 0-6 scales, 
with higher numbers indicating stronger attributions, As 
indicated by Duncan multiple-range statistics, column means 
not sharing the same subscript differ significantly (p < .05). 
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TABLE 2 


EFFECTS OF PERSPECTIVE AND CONSENSUS ON 
PERSONAL AND ENVIRONMENTAL 


ATTRIBUTIONS 
Perspective/ Personal Enviro: ental 

consensus attribution attr on 
Actor 

High 3.98... 4 

Low 3.81b 3 
Observer fi 

High 3.85, 4 

Low 4.48, 2 


Note. For each mean, » = 48. Means are based on 
with higher numbers indicating stronger attributic in~ 
dicated by a Duncan multiple-range statistic applied t 
attributions, means not sharing the same subs 
significantly (p < 05). 


Duncan multiple-range tests applied to hese 


means are presented in Table 1. The + sults 
for personal attributions indicated that tors 
but not observers made less personal 2 ribu- 
tion (p < .05) for high- than for low- tinc- 
tiveness behavior. Analogously, actors |. not 
observers made more environmental ibu- 
tion (p< .05) for high- than for | dis- 
tinctiveness behavior. Although both -tors 
and observers had yeridical perceptions the 
variations in distinctiveness, apparent! only 
actors employed this information in { ing 
their causal attributions for behavior. 

A significant Perspective X Consen in- 
teraction was obtained on personal bu- 
tions (F = 4.00, p < .05) but not on er ~on- 


mental attributions (F = 1.40, ns). T ie 2 
presents the means underlying these ii rac- 
tions and the results of a Duncan mt iple- 
range test applied to the personal attri) ‘tion 
means. The results indicated that ob: vers 
but not actors gave stronger personal ai. ibu- 
tions (p < .05) for low- than for high con- 


sensus behavior. Again both role groups had - 


correctly perceived variations in consensus, 
but apparently only observers employed this 
information in attributing personal causation 
for behavior. None of the remaining main ef- 
fects or interactions on either personal or en- 
vironmental attributions approached signifi- 
cance, 


It will be recalled that actor-observer dif- ~ 


ferences were expected when the causal in- 
formation was ambiguous; specifically, actors 
were expected to show distinctiveness effects, 
and observers were expected to show con- 
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sensus effects. To test these predictions, 
specific comparisons between actors’ and ob- 
servers’ attributions in the two ambiguous in- 
formation conditions were conducted. As 
expected, in the low-distinctiveness-high-con- 
sensus condition, actors gave stronger personal 
attributions (M = 4.83 vs. M = 3.88; F = 
5.21, < .05) and weaker environmental 
attributions (M = 3.71 vs. M = 4.38; F = 
4.45, p < .05) than did observers. In the 
high-distinctiveness-low-consensus condition, 
actors gave weaker personal attributions (M 
= 3.33 vs. M = 4.33; F= 5.88, p< .05) 
and stronger environmental attributions (M 
= 3.92 vs. M = 3.04; F= 7.68, p< .01) 
than did observers. When the causal informa- 
tion provided was ambiguous, actors’ attribu- 
tions showed distinctiveness effects and ob- 
servers’ attributions showed consensus effects. 

Results of the four-way analyses conducted 
on responses to three of the four specific 
causal items produced no significant effects. 
Subjects did attribute an actor’s reaction to 
“personal preference” more in low- (M = 
4.59) than in high-distinctiveness (M = 
4.25) conditions (F = 4.89, p < .05). A sig- 
nificant Distinctiveness X Consensus interac- 
tion (F = 4.30, p < .05) was also obtained. 
The results of a Duncan multiple-range test 
applied to the means underlying this interac- 
tion revealed that high-distinctiveness—high- 
consensus behavior produced less attribution 
to personal preference (p < .05) than did any 
of the three remaining information conditions. 


Discusston 
Evidence Bearing on Kelley's (1967) Model 


The results of the global attribution mea- 
sures largely supported Kelley’s (1967) model 
of the attribution process. In agreement with 
earlier data from both observers (e.g., Mc- 
Arthur, 1972) and self-perceivers (e.g., Bare- 
foot & Staub, 1971; Valins & Ray, 1967), 
variations in distinctiveness had the predicted 
effect on both environmental and personal 
attributions, Contrary to data reported by 
Nisbett et al. (in press) and Nisbett and 
Borgida (1975), consensus effects were ob- 
tained, In agreement with previously reported 
results (e.g., Goethals, 1972; Lowe, 1971), 
variations in consensus did have the pre- 
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dicted effect on environmental attributions. 
Moreover, as reported by Orvis et al. (1975), 
high consensus in combination with high dis- 
tinctiveness produced the expected strong at- 
tribution to environmental causes, and low 
consensus in combination with low distinctive- 
ness produced the expected strong personal 
attributions. The failure to obtain a signifi- 
cant consensus effect on personal attributions 
may have been due to the hesitancy of actors 
to attribute the cause of their deviant (ies 
low-consensus) behavior to personal factors. 

The general failure of the more molecular 
attribution measures to produce equivalent 
results is discouraging. However, as Bem 
(1972) has suggested, such molecular assess- 
ments will be valid measures of attribution 
only when the experimenter’s phenomenology 
of the plausible causes matches that of the 
subjects. There is no automatically effective 
way to guarantee that this will be the case. 


Evidence Bearing on Actor-Observer 
Differences 


The failure to obtain evidence for Jones 
and Nisbett’s (1971) proposed actor-observer 
attributional divergence, although predicted, 
was striking in view of the results of several 
recent studies. These studies were conducted 
to assess manipulations of information and 
have produced evidence supporting the hy- 
pothesized preference of actors for environ- 
mental and observers for personal attributions 
(McArthur, 1972; Nisbett & Borgida, 1975). 
Thus, the failure of the actor-observer manip- 
ulation to produce significant main effects on 
personal or environmental attributions dem- 
onstrates the success of the paradigm em- 
ployed in the present study to effectively 
control for actor-observer differences in 
information, perspective, and motivation, The 
paradigm enabled a direct and effective 
manipulation of the relevant causal informa- 
tion that was uncontaminated by strong 
motivational and/or perspective differences, 

The present data underscore important dif- 
ferences between the present and previous 
paradigms. Specifically, the relative vividness 
of the actor’s behavior as compared with con- 
sensus data presented to Nisbett and Bor- 
gida’s observers may have produced percep- 
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tual perspective biasing. As discussed by 
Jones and Nisbett (1971), the overwhelming 
perceptual salience of the actor’s behavior 
may result in a tendency for observers to 
ignore more pallid sources of inference infor- 
mation. Clearly, perceptual perspective effects 
cannot account for the predominance of per- 
sonal attributions made by McArthur’s 
(1972) observers exposed to verbally de- 
scribed behaviors. However, as mentioned by 
Orvis et al. (1975), the extremity of the dis- 
tinctiveness (e.g., “John laughs at almost any 
other comedian”) and consensus information 
(e.g., “Hardly anyone who hears the come- 
dian laughs at him”) employed by McArthur 
may have augmented the effects of attribu- 
tional biasing. Thus, the results of the current 
study highlight the complexity of behavioral 
information as well as the necessity of speci- 
fying both the perceptual and informational 
qualities of distinctiveness, consensus, and 
consistency manipulations. 

Although no overall (“main effect”) actor- 
observer attributional divergence was found, 
there were important differences in the man- 
ner in which actors and observers processed 
the available information. Both actors and 
observers correctly perceived variations in 
consensus and distinctiveness. However, as 
indicated by the significant Perspective x 
Distinctiveness interactions on both personal 
and environmental attributions, by the sig- 
nificant Perspective X Consensus interaction 
on personal attribution, and by the actor- 
observer differences obtained in ambiguous 
information conditions, actors’ attributions 
were strongly influenced by distinctiveness, 
whereas observers’ attributions were strongly 
influenced by consensus. Thus, the present 
data supported Jones and Nisbett’s (1971) 
suggestion that actors and observers will 
differentially weight available information. 
While this suggestion does not preclude the 
possibility that differences in available infor- 
mation may contribute to actor-observer dif- 
ferences, it does suggest that even when in 
possession of the ie information, actors 
and observers process this information dif- 


ferently. 
This difference between actors and obsery- 


ers was most apparent when the information 
provided was theoretically specified as am- 
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biguous. In high-distinctiveness—low-consen: 
conditions, actors’ attributions did not 
from their attributions for high-distinc!; 
ness—high consensus behavior, whereas obse-v- 
ers’ attributions did not differ from thir 
attributions for low-distinctiveness—low- 
sensus behavior. Similarly, for low-distinc: 
ness—high-consensus behavior, actors’ atti 
tions did not differ from their attribut 

for low-distinctiveness—low-consensus be 
ior, whereas observers’ attributions did 
differ from their attributions for high-disi - 
tiveness—high-consensus behavior. When 
formation was ambiguous, actors ign 
consensus information that conflicted 
distinctiveness information; observers ign 
distinctiveness information that confi 
with consensus information. 

The evidence indicating observers’ re! 
on consensus information contradicts the 
ings of Nisbett and Borgida (1975) and 
Arthur (1972). As previously menti 
Nisbett and Borgida’s use of typically unc: 
ble behavior may have resulted in discre 
cies between self-generated consensus an 
more abstracts data provided by the ex 
menter for high-consensus conditions. 
present data suggest that researchers em; 
ing normative consensus to study informa 
effects should either ensure an equivalen 
immediate and abstract consensus or en 
behaviors that make the generation of 
mediate consensus difficult. Contrasting 
Arthur’s finding of distinctiveness effects 
the present evidence of consensus effects hi 
lights potential differences between “passiy 
and “coacting” (Jones & Nisbett, 1971) 
servers. The observer, who is interacting wi!) 
the same environment as an actor and ; 
tributing his or her own behavior to the cu 
vironment (as in the present case) will ascribe 
any discrepant behavior performed by the co- 
actor to personal factors. Simply stated, the 
discrepancy between the minimal consensus 
effects obtained by McArthur and the strong 
consensus effects currently reported may rep- 
resent the relative potency of knowing and 
surmising one’s own behavior. 

It is important to note that subjects in the 
present experiment made strong attributions 
in causally ambiguous situations. The sug- 
gestion (Kelley, 1967) that causally ambigu- 
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ous situations should produce rather weak 


attributions and that subjects would engage 
in an information search while holding their 
attributions in abeyance was not supported. 
Although this tendency to make strong attri- 
butions in ambiguous situations may have 
been forced by the experimental procedures, 
previous work has suggested that coacting 
perceivers do make strong causal attributions 
to a sufficient cause even in the face of causal 
ambiguity (Enzle, Hansen, & Lowe, 1975). 

The data support Jones and Nisbett’s 
(1971) contention, then, that the observer 
who is also an actor will be “attracted to 
convenient simplifying assumptions” (p. 87). 
However, the suggestion that these simplifica- 
tions will take the form of an overall bias 
toward personal attributions was not sub- 
stantiated by the data. Rather, actors and 
observers appear to simplify complex causal 
situations by differentially relying on either 
distinctiveness or consensus information. 
Thus, those conditions theoretically described 
as ambiguous were, in fact, not phenomeno- 
logically ambiguous. 
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Effects of Social Density in University Residential Environments 


Reuben M. Baron, David R. Mandel, Claire A. Adams, 
and Lynne M. Griffen 
University of Connecticut 


In order to evaluate the impact of an unanticipated partial tripling of male 
dormitory rooms normally used for double occupancy, data were collected 
concerning residents’ cognitive and affective reactions to their present living 
situation (including perceptions of self, roommate, and room properties). Resi- 
dents of triples expressed greater feelings of crowding, perceived less control 
over room activities, expressed more negative interpersonal attitudes, and ex- 
perienced a more negative room ambience. Factor analyses carried out sepa- 
rately for doubles and triples revealed dramatically different implicit theories 
of crowding. Triples located privacy, control, interpersonal attitudes, and per- 
ceptions of crowding within the same phenomenal space; whereas doubles ap- 
peared to perceive a number of separate realms of experience involving the 
loading of crampedness, privacy, interpersonal compatibility, etc, on separate 
factors, Although there were no overall differences in academic performance 
between doubles and triples, triples but not doubles manifested a negative re- 
lationship between indices of interpersonal adjustment and academic perform- 


ance (average grade point and incompletes). 


Due to a miscalculation of the rate of ac- 
ceptances, an unanticipated shortage of dor- 
mitory space occurred at a state university 
in New England. In order to deal with this 
shortage of space, it became necessary for a 
number of dorm rooms originally designed 
for double occupancy to be tripled up. This 
situation of partial tripling presented us with 
a relatively unique opportunity to study vari- 
ations in social density in a field setting with 
a nondeprived subject population. 

The categories of antecedents and conse- 
quences we chose as providing the necessary 
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context for understanding and evalu ng 
tripling were based upon two major class of 
objectives. First, given the types of ney ve 
findings concerning the impact of de ty 


found by Freedman and his associates cf. 
Freedman, Klevansky, & Ehrlich, 1971. it 
was important to establish the differentia’ n- 
pact of the present variations in social n- 
sity, in terms of both short- and long-t -m 
effects and subjective and objective indi s. 
Second, we were concerned with establis! g 
whether certain proximal and distal fac!~-s 
were related to perceived crowding along ie 
lines suggested by current conceptualizatic 1s 
of crowding. 

In this theoretical endeavor we were ba- 
sically concerned with three potential corrs- 
lates of perceived crowding: (a) the role of 
freedom of choice and outcome control; (b) 
the role of proxemic factors such as privacy, 
personal space, and territoriality; and (c) the 
role of interpersonal processes in the sense 
of adequacy of social adjustment. 


General Models of Cognitive Control 
and Crowding 


The theorizing of Proshansky, Ittelson, and 
Rivlin (1970), Stokols (1972), and Zlutnick 
and Altman (1972) all points to a direct 
causal relationship between loss of freedom 
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lý- of choice and/or outcome control and per- 


ceptions of crowding. Thus, it might be hy- 
pothesized that increases in social density 
lower feelings of choice and control and that, 
in turn, perceptions of volitional constraint 
produce feelings of crowding. We might also 
question the relationships among differences 
in the personality factor of locus of control, 
crampedness, personal space, and other prox- 
emic variables. Although no firm hypothesis 
was made, Stokols speculated that internals 
might experience more stress than externals 
under prolonged, noncontrollable conditions 
of high-density living. 


Privacy Satisfaction, Proxemics, and 
Crowding 

While it creates an artificial dichotomy that 
in some way distorts Altman’s (1975) holis- 
tic conception of privacy, Altman’s process 
treatment of privacy as control over social 
contact may be distinguished from his adapta- 
tion level treatment of privacy satisfaction. 
Specifically, Altman proposes that crowding 
perceptions result when the achieved level of 
privacy falls below the desired level of pri- 
vacy. Thus, in the present context it might 
be predicted that dissatisfaction with privacy 
provides an antecedent condition for percep- 
tions of crowding. Similarly, since Altman 
views personal space (e.g., an index of pre- 
ferred interaction distance) and territoriality 
as buffers for privacy, it might be expected 
that variations in personal space and/or ter- 
ritoriality will be implicated in the crowding 
process (see also Edney, 1975; Proshansky, 
Ittelson, & Rivlin, 1970). For example, it 
might be hypothesized that successful estab- 
lishment of territorial boundaries is inversely 
related to perceptions of crowding. 


Interpersonal Adjustment and Crowding 


The theorizing and research of Altman and 
his associates (cf. Sundstrom & Altman, 
1974; Zlutnick & Altman, 1972) as well as 
Baum and his associates (cf. Baum, Harpin, 
& Valins, 1975; Valins & Baum, 1973) have 
stressed the role of interpersonal adjustment 
factors as both antecedents and consequences 
of crowding. For example, Baum et al. (1975) 
found that dorm residents who were well in- 
tegrated into groups experienced less severe 
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crowding effects. Thus, we are concerned with 
the relationships among density variations, in- 
terpersonal adjustment, and perceived crowd- 
ing. 

It should be noted that choosing students 
to be in triples and doubles by both random 
and nonrandom procedures (see below) 
created a situation that although statistically 
complicated, allowed us to examine the im- 
pact of two important classes of variables: 
(a) self-selection of roommates versus random 
allocation and (b) class ranking, that is, the 
relative distribution of upper and lower class- 
men. Incorporating these potentially con- 
founding factors directly into our major room 
density comparisons as ancillary blocking 
variables allowed us to increase the external 
validity of any differential social density ef- 
fects that might be obtained. For example, 
density differences could be generalized across 
class rank, 

Finally, special attention was paid to col- 
lecting data concerning two areas of poten- 
tially negative long-term consequences of 
high-density living: academic performance 
and health. Specifically, information was 
gathered concerning (a) grade point average, 
(b) number of incomplete courses, and (c) 
visits to the school health clinic. 


METHOD 
Subjects 


Subjects were 144 male students from five high- 
rise dormitories at the same university? They were 
chosen to be in tripled rooms by both random and 
nonrandom procedures, In nonrandom situations, 
two possibilities could occur: (a) Two previously 
acquainted roommates could have a stranger as- 
signed to the room; (b) all roommates could have 
been previously acquainted. 

Subjects were contacted by telephone and asked to 
volunteer their time, All data were collected at the 
dorm, Of the subjects who volunteered, 102 were 
from triples and 42 were from double rooms in 
which all room members participated. 


Procedure 

Seven measures were administered to the subjects 
over a period of 1 hour. The following five were 
presented in the hall study lounges: 


1 Because almost all of the female dormitory popu- 
lation, who had been in tripled rooms, had been 
“detripled” by the time the study was initiated, it 
was impossible for us to obtain this data, 
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1. A present living situation questionnaire per- related room features (eg. ordered vs. chaotic, clut- _ | 
tained to the subject’s satisfaction, level of coopera- tered vs. uncluttered, colorful vs. subdued). E 
tion, and amount of personal control within his 7. A series of territorial maps were drawn encom- 
room. Questions also dealt with the amount of space passing furniture and large objects of the room as 
needed and used, the degree of comfort, and aspects well as the occupants’ personal belongings.? 
of interpersonal relations among roommates. Certain 
items referred to their future need for space and RESULTS 
paesteried enion eaa Since preliminary analyses of varian.: re- 


2. A comfortable interpersonal distance measure Ż A 3 
(adapted from Duke & Nowicki, 1972), a pencil-and- vealed that the evaluative ratings of nc -her 


paper instrument, was designed to determine the per- doubles nor triples were significantly af ted 
ceived distance at which the subject feels most com- by dormitory location, all subsequent an> ‘ses j 
fortable when interacting with roommates, friends, were performed with subjects collapsed ‘ver , 


and strangers. ! 
3. A privacy scale (adapted from Marshall, 1972) dorms: for doubles, (4, 36) = 2.13, #. for ¢ 

7 

i 


was designed to measure individual differences in triples, F(4, 36) = 2.16, ns. 


need and preference for privacy in various situations F ‘ 
and encounters. Effects of Social Density 
4, A past spatial history survey focused on past 


living experiences in one’s home environment (eg., Differences on evaluation of dorm e nga 
home vs. apartment, had a private bedroom vs. did For expository purposes, the major t ole- 
pol) And also contained demographic data relating double differences have been categorized ong 
e Roy three dimensions: (a) evaluations of ing « 


5. An internal-external locus of control scale (Rot- 


ter, 1966) was used. l d 
control, and (c) inte: nal adjust: nts. 
The following measures were obtained in the in- 3 (c) EpeIsODe AGUS S 


dividual dorm rooms: 


6. A room evaluation scale (items from Hersh- 2 This measure was created by Barbara Gra? and 
berger, 1972, and Kasmar, 1970) consisted of 24 7- Carol Grenon, students in the Department of ign 
point semantic differential scales focusing on design- at the University of Connecticut. 


| 
| 
| 
space, (b) self-perceptions of privacy and 
l 
i 


TABLE 1 
MEAN DIFFERENCE: THREE Versus Two PERSONS IN A Room < 


Room density X 


Item 2 Persons 3 Persons t df 


Present living situation 


Rate living space (cramped-roomy) 3:5 2.0 8.549 141 00 
Satisfied with privacy 44 2.7 6.949 138 ‘00 
Satisfied with roommates 49 37 5.397 139 00 
Satisfied with dorm room 44 3.3 4.771 139 ‘00! 
Satisfied with furnishings 4.4 3.0 5.585 139 ‘001 
Cooperation with sharing 47 3.9 3.346 140 ‘001 
Cooperation with privacy 48 3.5 5.818 140 ‘001 
Control over dorm room 44 3.7 5.387 139 ‘001 
Control over studying 41 3.1 3.080 139 ‘001 
Control over sleep 4.1 3.3 3.064 141 003 
Control over sharing 48 4.0 3.701 141 ‘001 
Hours spent in room 6.3 5.3 2.362 141 (020 
Room descriptors (1-7) 
Crowded-uncrowded 4.6 24 7.937 7 yi 
Cluttered-uncluttered 4.1 2.8 aed ae cot 
Empty-full 52 5.7 —2.447 137 016 
Uncomfortable-comfortable 6.0 44 5.329 137 ‘001 
Public-private 43 3.5 2.674 137 ‘009 
Small-large 4.6 3.6 3.186 137 002 p 
Dirty-clean 5.0 35 4.846 137 ‘001 
Chaotic-Ordered 5.1 3.6 4.323 137 001 


7 ares ea x 

Note. The questions dealing with satisfaction, cooperation, and control were rated on a 5-point scale, with higher s indicatin; 
ee The Aution cooperation. and control. The room descriptors were rated on a 7-point scale with higher scores indicating More 
lincrowded, uncluttered, etc., conditions. Missing data are due to incorrectly completed questionnaires, | 
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ke Within each category, the significant results 
of room density variations on items taken 
from the present living situation question- 
naire and room evaluation scale are presented. 
Under the subheading of interpersonal adjust- 
ments, data are also reported from a measure 
of comfortable interpersonal distance. Unless 
otherwise noted, differences cited are signifi- 
cant at .05 or less (see Table 1). 
b-t Evaluation of living space. Triples as op- 
Ps posed to doubles rated their living space as 
» more cramped and saw their room as more 
crowded, cluttered, smaller, more full, more 
public, more uncomfortable, and dirtier. They 
also indicated less satisfaction with their dorm 
room in general and with its furnishings in 
particular. 
Self-perceptions of privacy and control. In 
» line with current theories of crowding (cf. 
Altman, 1975; Proshansky et al., 1970; 
Stokols, 1972) triples differed from doubles 
in terms of perceptions of privacy and control, 
Specifically, they were less satisfied with their 
privacy and with the cooperation they re- 
ceived from their roommates in establishing 
privacy. In terms of perceived control over 
room activities, triples felt less control in re- 
gard to use of study space, room decor, sleep, 
and sharing of personal belongings. 
Interpersonal adjustments. Triples, as com- 
pared with doubles, perceived that they re- 
ceived less cooperation from their roommates 
and in general were less satisfied with their 
roommates. They also perceived that they 
i spent less time in their room. Moreover, on 
| the comfortable interpersonal difference mea- 
| sure (Duke & Nowicki, 1972), triples pre- 
ferred closer interaction distances for friends 
than roommates, whereas the reverse was true 
for doubles, who assigned roommates closer 
interaction distances (see Table 2). 


Methodological Issues in the Rating Data 


Before we report the results of (a) be- 
havioral comparisons and (b) distal and 
proximal correlates of perceived crowding, the 
major methodological problems we encoun- 
tered in the study will be elaborated, along 
with the solutions we adopted. 

Confounding due to roommate selection. 
Since there was a far greater opportunity for 


TABLE 2 
MEANS or COMFORTABLE 


TERPERSONAL DISTANCES 


FOR Dous 
Relationship Doubles Triples 
Stranger 43.7 39.2 
Roommate(s) 15.7 21.6 
Friend 21.0 15.8 


Note. Figures are based upon a range of 0-80 Ji 
ENAA Or pon a range ol mm (cf, Duke 


free choice of roommates in the doubles than 
in the triples, an attempt was made to evalu- 
ate the impact of this factor. 

First, a series of 20 2 X 2 (Triple vs. 
Double X Random vs. Acquainted) analyses 
of variance were performed on all of the de- 
pendent variables yielding significant mean 
differences. In all 20 analyses of variance 
there were significant main effects for social 
density (ps ranging from .02 to .001), while 
none of the main effects for roommate selec- 
tion and none of the interactions were shown 
to be significant, 

Second, a subgroup sample of 36 subjects 
(18 from doubles, 18 from triples), chosen 
because they had all been randomly assigned 
to their rooms, was tested on the dependent 
variables that had yielded significant mean 
differences. All 20 ¢s for this selected sub- 
sample were significant, thereby further 
strengthening our belief that the variation in 
social density was the primary factor mediat- 
ing the differential reactions of the room oc- 
cupants. 

Class rank as a possible confounding factor. 
A second potentially confounding factor in all 
of the between-room comparisons pertains to 
the fact that there was a disproportionate 
number of freshmen in the triples (75%). 
This suggests the possibility that effects at- 
tributed in the above analysis to differences 
in social density could have been all, or at 
least in part, due to differences in experiences 
with university living, that is, upper versus 
lower classmen. In order to rule out class rank 
as an influencing factor, a number of sub- 
analyses were performed. 

Two-way analyses of variance, Twenty 2 X 
2 (Triple vs. Double X Freshmen vs. Upper- 
classmen) analyses of variance were per- 
formed on all the dependent variables that 
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yielded significant mean differences. All 20 
yielded significant main effects for social den- 
sity (ps between .017 and .001), while none 
of the main effect comparisons for class rank 
were significant. Two significant 2 2 inter- 
actions were obtained for: (a) satisfaction 
with physical dorm features and (b) satis- 
faction with privacy. In both cases the form 
of the interaction indicated that freshmen 
were more satisfied than upperclassmen in 
triples, while upperclassmen were more satis- 
fied than freshmen in doubles, F(1, 137) = 
8.11, p= .006; F(1, 136) = 6.58, p = .012, 
respectively. 

In addition to this blocking procedure, 
analyses of covariance (double-triple with 
effect of class rank covaried out) were run 
on all variables that were found to yield sig- 
nificant differences between doubles and 
triples. The results of these analyses indi- 
cated that no major change occurred in the 
significance levels achieved by the earlier 
analyses. (All results previously found at $ < 
001 remained at p< .001, and all results 
within p < .05 remained as such.) 

Unit of analysis. It is recognized that for 
certain purposes, the room rather than the 
individual subject is a more appropriate unit 
of analysis. For such analyses, scores from 
individuals can either be averaged within 
rooms to yield a single room score and/or an 
individual can be randomly chosen to repre- 
sent the room. Using the random selection 
procedure, we found that 16 of 20 original 
comparisons yielded żs significant at the .05 
level or less, while 4 other items fell into the 
.06-.09 range. Specifically, those items, now 
at a marginal significant level, were (a) co- 
operation with sharing, (b) hours spent in 
room, (c) public vs. private, and (d) control 
over sleep. 

Reducing redundancy effects. Because they 
had high intercorrelations (.40 or above), all 
items specified as “satisfaction with . . . ,” 

“cooperation with . . . ,” and “control over 

_ 2? were averaged to form three composites 
in their respective categories. ¢ tests run with 
each composite on the double versus triple 
factor yielded highly significant results: for 
relative satisfaction, Ł(139) = 7.414, p< 
.001; for relative cooperation, #(139) = 5.257, 
p < .001; for relative control, (139) = 5.036, 
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p<.001. A further attempt to reduce the 


number of items employed through factor 
analysis is discussed below. 
Behavioral Data 

Two major types of behavioral diffe:ences 
were examined. First, a comparison was made 
of the use of the student health service. ‘hile 
the average use was almost identica' for 
doubles and triples, it was found that 1. stu- 
dents in triples and none in doubles rej) ited , 


using the health service two or more ' nes, 


While in the expected direction, this ~ fer- 
ence was not statistically reliable. 

A second class of behavior examine: was 
academic performance, both in terms ot rade 
point average and number of incom etes. 
There were no significant overall difi ences 
between doubles and triples on either adex « 
of academic performance; however, « tain 
comparisons within triples suggest th. ade- 
quacy of interpersonal adjustment a aca- 
demic performance may be inversely ited 
for triples. Thus, grade point average sig- 
nificantly negatively correlated with s: fac- 
tion with cooperation and control over » ring 


(r= —.22, p<.05, for cooperation *= 


—.30, p < .01, for sharing). Moreove stu- 
dents who were dissatisfied with their om- 
mates had a higher grade point averag: t= 
1.80, p < .10). 

Similarly for incompletes, students w! ad 
incompletes tended to spend more tim: vith 
their roommates than students without iv om- 
pletes (p < .10). Also, within triples, stu ents 
having incompletes allowed strangers and 
roommates closer interaction distances | 1an 
those without incompletes (p< .05, for 
strangers;  < .10, for roommates). 

These data suggest that so much time and 4 


effort go into making good interpersona! ad- 
justments in triples, that less time is left for 
academic pursuits, 

Finally, it should be noted that there were 
no significant correlations between indices of 
interpersonal adjustment and academic per- 
formance for doubles, 


Proximal and Distal Correlates of 
Perceived Crampedness 


In order to examine the possibility that in 
addition to main effect differences in negativ- 
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ity of perception, there were differences in 
how students organized their perceptions 
about themselves, their roommates, and the 
properties of the dorm room, a number of 
correlational analyses were performed, These 
analyses were primarily designed to reveal 
something about the structure of the students’ 
perceptions of crowding, privacy, and room 
ambience. 

A preliminary correlation analysis, First, in 
order to determine whether there were im- 
portant differences in structure of perceptions, 
a simple 3 X 3 correlational matrix was con- 
structed with zero-order and first-order par- 
tial rs for doubles and triples using correlates 
of crowding that seemed most related to cur- 
rent theories of crowding. Specifically, the in- 
terrelations of perceptions of control over 
room activities, satisfaction with privacy, and 
perceived crampedness were examined sepa- 
rately for doubles and triples. As can be seen 
from Table 3, there were indeed differential 
patterns of relationships. Specifically, while 
perceptions of control and satisfaction with 
privacy were strongly associated for both 
triples and doubles, only in triples were strong 
relationships obtained between perceptions of 
control and perceptions of crampedness and 
between satisfaction with privacy and percep- 
tions of crampedness (with the strongest link 
after partialing being crampedness and con- 
trol). 

Factor analyses. These differential patterns 
of correlations suggested the utility of full 
scale factor analyses based upon intercorre- 
lations among all of the scale items contained 
in the present living situation questionnaire 
and the room evaluation scales. Separate or- 
thogonal factor analyses, using a varimax 
rotation, were carried out for doubles and 
triples. 

As anticipated, dramatic differences in fac- 
tor structure appeared for doubles and triples. 
(It may be noted that the varimax rotations 
yielded five major factors for doubles and 
triples, which accounted for 43.2% of the 
variance in triples and 50.7% of the variance 
in doubles; for all reported factors, the eigen- 
values were greater than one.) 

Triples. Triples appeared to have a multi- 
dimensional view of crowding analogous to 
recent conceptions of theorists such as Alt- 
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TABLE 3 
CORRELATIONS AND PARTIAL CORRELATIONS AMONG 
PERCEPTIONS OF CRAMPEDNESS, PRIVACY, 
AND CONTROL 
Environmental perception 1 2 3 
Doubles 
1. Crampedness of room 1,00 .049 — O17 
2. Satisfaction with privacy (049) 1,00 619" 
3. Perceived control over 
aspects of the room (000) (619) 1.00 
Triples 
1. Crampedness of room 1.00 440% sagte 
2. Satisfaction with privacy (.234)* 1.00 493 
3. Perceived control over 
aspects of the room (.451)** (336) 1,00 


Note. The entries above the principle diagonal are zero-order 
correlations, while those below the diagonal, in parenthe 
first-order partial correlations, Each spatial variable i 
preted in terms of the larger score indicating a greater feeling of 
noncrampedness, privacy, and control. 

*p < 05. 

++p <.001, 


man (1975), Stokols (1972), and Prohansky 
et al. (1970). Specifically, perceptions of 
crowding and crampedness share the same 
phenomenal space as satisfaction with pri- 
vacy, perceptions of control over room actiy- 
ities, and positivity of interpersonal attitudes.* 
Secondary factors were found involving fas- 
tidiousness (cleanliness, orderliness, clutter), 
age-related variables, physical features of the 
environment (satisfaction with furnishings, 
dorm room clutter), and aesthetic ambience 
(unique ys. common, colorful vs. subdued, 
plain vs. ornate). 

Doubles, Doubles, on the other hand, dealt 
with the correlates of crowding as being part 
of separate realms of experience. Thus, there 
were separate factors for privacy, physical 
crampedness, compatibility (with regard to 
tidiness), and orderliness (orderly vs. chaotic, 
rigid ys. flexible). Also of interest is the fact 
that the privacy factor emphasizes an avoid- 


3 This factor had strong loadings for 18 of the 48 
items. The variables having factor loadings of .50 or 
above were as follows: rate living space from roomy 
to cramped, satisfaction with privacy, satisfaction 
with roommates, cooperation with noise, cooperation 
with privacy, control over furniture, control over 
study space, control over inviting people into the 
room, control over room decor, control over closet 
space, control over sharing personal belongings, con- 
trol over noise, control over study, control over sleep, 
crowdedness of room, comfortableness of room, use- 
fulness of room, and friendliness of room. 
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ance of people, a concern with solitary activ- 
ities (sleep, noise, and study). Further, the 
crampedness factor defines restriction in space 
as tied to physical room characteristics (e.g., 
large vs. small, light vs. dark, cornered vs. 
rounded, and dirty vs. clean) as opposed to 
the triples, where crampedness loads with 
physical, interpersonal, and control properties. 

An aesthetic ambience factor emerged for 
both doubles and triples. For doubles only, 
this factor included friendly, warm, and com- 
fortable in addition to ornate, unique, and 
colorful dimensions. 

The implications for current crowding the- 
ories of these data, which suggest that dorm 
experiences are differentially structured for 
doubles and triples, are examined in the Dis- 
cussion section, 


Proximal Correlates of Crampedness 


In the next two sections we deal with how 
perceptions of crampedness affect and were 
affected by two key proxemic variables, ter- 
ritoriality and personal space (comfortable 
interpersonal distance). 

Territoriality findings. First, it should be 
noted that students in triples who felt roomy 
tended to claim more room territory (as mea- 
sured through territorial maps) than students 
who felt cramped, t(95) = 1.73, p< .08. 
Moreover, if doors and/or windows were in- 
cluded in one’s territory, persons were more 
likely to view their situation as roomy as op- 
posed to cramped: for door, x*(1) = 2.919, 
$ < .088; for window, x°(1) = 5.533, p< 
.019. Other correlates of territoriality were as 
follows: (a) Roommates who spent a lot of 
free time with each other had greater per- 
ceived room territory than those who did not 


TABLE 4 


MEANS or COMPORTABLE INTERPERSONAL DISTANCES 
ror STRANGERS AMONG TRIPLES 


Perceived living situation 


Locus c Cramped Roomy 
Internal 45.7 33.2 
(28) (13) 
Externa 33.8 43.6 
(32) (9) 


0-80 mm scale. Numbers in 


Note, Means are $ 
fee ubjects in each cell. 


Parentheses represent 
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(r= 307, p < .01); (b) there was also a 
trend (p < .078) for students in triples who 
indicated that they frequently dropped in on 
others unexpectedly to have a larger territory 
than those who did not drop in on others. 

There were no significant differences be- 
tween doubles and triples in size of terri 
tories, nor was territory related to satisfaction 
or cooperation with others or to satisfactir 
with the room. 

Crampedness as a moderator of intern 
external control effects. Stokols (1972) sp 
lated that internals might be expected > 
suffer more frustration than externals if « - 
posed to relatively nonmodifiable, long-te 1 
conditions of crowding. Aspects of the pres: | 
data on comfortable interaction distance | 
vide a partial confirmation of this deriva 
Specifically, since internal-external coi 
and perceptions of crampedness are s 
tically independent (7 = .02), it was de 
to explore the possibility that preferred ir 
action distance might be an adaptive resp 
that was sensitive to the interaction of 
ternal-external control and perceived cra 
edness. 

In order to examine this hypothesis 
the different comfortable interaction dista 
for stranger, friend, and roommate, sep 
2X 2 analyses of variance were perfor: d 
for each referent. While no significant i: 
actions were found for friends or roomma °, 
a perceived Crampedness X Locus of Con il 
interaction was significant for strangers, / |, 
78) = 6.06, p < .02. In looking at the sin ‘e 
effects (see Table 4) internals deman 4d 
more interaction distance than externals t 
strangers only when their perceptions vf 
crampedness were high. For students who 't 
roomy, there was a trend for externals to « °- 
mand more distance. Similarly, internals inci- 
cated a preference for more distance when 
feeling cramped than when feeling roowy, 
whereas the reverse was true for externals. 
The simple effects described above for inter- 
nals were reliable; those for externals were 
only trends. 


Home Density: A Distal Correlate 


One distal antecedent emerged as a sig- 
nificant predictor of crampedness perceptions, 


a 
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de home room density (a ratio of people to num- 


ber of rooms index). Specifically, home room 
density and perceived crampedness were cor- 
related (r = —.289, p < .01).* That is, the 
greater one’s home room density, the greater 
is one’s perception of crampedness within the 
triples, a finding that contradicts the adapta- 
tion level model of perceived crowding but 
fits the drive model; that is, past high den- 
sity sensitizes persons to future crowding ex- 
periences (see below and Discussion section). 
Similarly, high home room density was sig- 
nificantly negatively correlated with indices 
of control (r = —.26, p < .05) and coopera- 
tion (r = —.185, p < .10). 


Using Path Analysis to Test a Simple 
Causal Model of Crampedness 


The technique of path analysis was used to 
examine the causal efficacy of certain current 
models of crowding (cf. Altman, 1975; Sto- 
kols, in press) that use concepts such as social 
density, control, privacy, and the influence 
of past environmental history, “Path analysis 
is a procedure whereby the relative impor- 
tance of the effects of several independent 
variables on one or more dependent variables 
may be evaluated”; it differs from multiple 
regression in that it “permits one to deal with 
intervening variables within a framework of 
a causal model” (Hadden & DeWalt, 1974, 
p. 105). 

The path coefficient represents the direct 
effect of a variable as to its causal influence 
on another variable while controlling for all 
potential alternative paths (cf. Kerlinger & 
Pedhazur, 1973). In this section a brief de- 
scription of the proposed model is reported 
along with an explanation of certain relation- 
ships, while the theoretical implications of the 
model are mentioned in the Discussion section. 

The path analysis is a completely recursive 
(unidirectional) model containing two exoge- 
nous variables (i.e., variables influenced by 
causes outside of the model): (a) a home 
density index based upon the number of per- 
sons per room and (b) the type of dorm room 
to which the students were assigned: either 
double or triple. These variables extend paths 
to (c) perceived relative control over aspects 
of the room and (d) perceived control to 
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satisfaction with privacy, which implies that 
perceived control is a potential cause of 
privacy satisfaction. All of the variables lead 
to (e) perceived crampedness of living space, 
The last three variables (c, d, and e) are 
endogenous, and so may be caused by either 
other endogenous and/or exogenous variables. 

In the course of investigating the above 
variables within the context of a path ana- 
lytic framework, we moved to a more simpli- 
fied causal model that eliminated the home 
density index (see Figure 1). In the present 
description of the path analyses findings, we 
concentrate on the simplified causal model 
because (a) the weights for the other signifi- 
cant paths do not substantially change when 
the home density index is eliminated, (b) the 
home density index did not lead to significant 
direct or indirect causal paths to perceived 
crampedness, the only significant effect of 
home density being on perceived control, 
t(139) = —2.33, p< 02. 

Table 5 was constructed in order to show 
the decomposition of the zero-order correla- 
tions into direct, indirect, and joint effects.® 
As revealed by Figure 1 and Table 5, room 
type has a substantial direct effect on both 
satisfaction with privacy and crampedness; 
that is, triples are both less satisfied with 
privacy and feel more cramped in their 
rooms.® The indirect effect of room type 
through perceived control, though minor, is 


4 The Marshall (1972) scale of privacy did not cor- 
relate significantly with either the perceived cramped- 
ness measure or the composite measure of control 
over room activities. It was, however, significantly 
negatively correlated with the satisfaction with pri- 
vacy item (r=—.32, p < .01), a direction of effect 
that indicates support for a drive model (see Discus- 
sion section). 

5 According to Hadden and DeWalt (1974): 


“A joint effect is any effect not classifiable as direct 
or indirect. Joint effects are basically of two types: 
(1) correlational joint effects which involve one 
or more unanalyzed correlations, and (2) con- 
comitant joint effects in which a single causally 
prior variable exerts direct or indirect effects on 
two subsequent variables, thereby causing the sub- 
sequent variables to vary concomitantly (i.e, to 
be correlated)” (p. 112). 
6 Any minor discrepancies between the correlations 
listed in this table as compared with Figure 1 are due 
to rounding errors. 
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I. Roomtype: 
Double / Triple 709 
-392 3. Satisfaction 152 4. Perceive 
(-392) with Privacy (548) Crampeć! “ss 
oA 
Z 333 
T514) 
2. Perceived Control 
/920 
Ra 
Ficure 1. A four-variable path analysis: The values in parentheses represent zero-order corr 
tions. (The other numbers are path coefficients, rs and ps > .175 are significant at p< .05. Ra, 
Re, represent residual causes, or latent values, which are not accounted for in the path model 
each causal element, where Ra, R», Re = V1 — R*.) 
not inconsequential compared with the mini- compare the lack of effect found by ! & 
mal impact of satisfaction with privacy. Adams (Note 1) with the strong evid of 
Perceived control also has a large direct differential physiological arousal fou by 
effect with both satisfaction with privacy and Epstein & Karlin, 1975), human stu of 
perceived crampedness: Specifically, the naturalistic variations in residential s ‘ngs 
greater the perceived control that students appear to produce uniformly different ef- 
felt over aspects of their room, the more satis- fects across a wide array of response dons. 
fied they were with their privacy, and the Thus, the present findings indicatii g- 
roomier the room appeared. Finally, the di- nitive, affective, and behavioral conseq ces 
rect effect of satisfaction with privacy on per- of variations in social density fit nicely vith 
ceived crampedness is weak, especially when those of Aiello, Epstein, and Karlin (No. 2), 
compared with the other two direct effects. Valins and Baum (1973) and Baum, E »ʻin, 
In summary, perceived crampedness is best and Valins (1975). Moreover, differenti- so- 
viewed as caused by (a) the direct effects of cial density effects in field studies are not 
room type or perceived control and (b) the limited to the university residential soting 
indirect effect of room density as mediated used in the above studies. Rodin (in press) 
by control, found stress effects based on social density 


Discussion 
Beginnin with the oft-cited and unex- 
pectedly nv, ative findings of Freedman et al. 
(1971), i ; become incumbent upon in- 
vestigato! human crowding to have as 
their prel goal the demonstration that 
variations nsity do have differential 


> laboratory studies of hu- 
»roduce mixed results (e.g., 


stress effects. 
man crowding + 


variations in lower-class apartment dwe! ‘ngs, 
while Paulus, Cox, McCain, and Chandler 
(1975) obtained differential overcrowding ef- 
fects in a prison setting. 

There are, however, two aspects of the 
present study that represent unique contribu- 
tions. At the empirical level, the present study 
probably has investigated a broader range of 
effects than any investigation thus far re- 
ported. Thus, the present study has brought 


Me 


4 
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TABLE 5 
DECOMPOSITION OF EFFECTS OF THE PATH ANALYSIS 


Effects on satisfaction with privacy of 


Effects 
Room type 
Direct effect (psi) —.321 
Indirect effect through perceived con- 
trol (ps2 par) —.190 
Correlation (r13) —5i1 
Perceived control 
Direct effect (ps2) 484 
Joint effect due to room type 126 
Correlation (r23) -610 
Effects on perceived crampedness of 
Room type 
Direct effect (Ppa) —.378 
Indirect effect through perceived con- 
trol (pas paz) —.131 
Indirect effect through satisfaction with 
privacy (Pas Paz) —.049 
Indirect effect through privacy and con- 
trol (Pas pse p21) —.029 
Correlation (r14) —.587 
Perceived control 
Direct effect (paz) 333 
Indirect effect through satisfaction with 
privacy (pas paz) 074 
Joint effect due to room type 
(Pat Per + Pas Pat Pan) 167 
Correlation (r24) S74 
Satisfaction with privacy 
Direct effect (p4s) 152 
Joint effects due to room type and per- 
ceived control (p42 pae + Paz Pat Pat 
+ pa pai + pa Par Paz) 396 
Correlation 548 


forth subjective as well as objective data rele- 
vant to both proximal and distal consequences 
of social density. Moreover, the present study 
has paid closer attention than any comparable 
investigation to room design features ranging 
from the inclusion or exclusion of windows 
and doors in territorial maps to the use of 
room ambience scales (Hershberger, 1972). 
Similarly, the present study has investigated 
the impact of locus of control variations in 
greater depth than has previous crowding 
research, 

It is, however, at the theoretical level that 
the present study is most unique. The pres- 
ent analyses, more than any other reported 
field experiment (or for that matter labora- 
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tory study), are organized in such a manner 
that (a) the cogency of a number of existing 
theoretical formulations of crowding may be 
investigated, and (b) possible new theoreti- 
cal leads may be picked up. Point b is par- 
ticularly important in the context of a field 
study, where it is possible to cast a net of 
sufficiently broad gauge to capture a wide 
range of potentially explanatory variables. 
Such a setting provides a particularly rich 
“context of discovery” without any necessary 
compromise in one’s standards for the “con- 
text of justification” (see McGuire, 1973, for 
a similar point). 


Some Distal Correlates of Crampedness 


Home density effects. In line with Paulus 
et al. (1975), we found that tolerance for 
crowding was less for persons who came from 
settings that were high in density. Specifi- 
cally, as the ratio of persons to rooms in one’s 
home increased, so did feelings of cramped- 
ness in the dormitory settings. These findings 
along with Paulus et al.’s data disconfirm the 
adaptation level conception (see Marshall, 
1972) that tolerance for crowding should in- 
crease, that is, “adapt out,” with recurrent 
exposure to high-density living. In contrast, 
the present data are consistent with what 
Paulus et al. described as a “drive model”: 
People with a previous history of high-density 
living appear to value low-density living more 
than persons who have had minimal experi- 
ences with “crowded” living conditions. 

It is also of theoretical interest to note 
that a high home density ratio is significantly 
negatively associated with perceptions of con- 
trol over room activities. This finding is par- 
ticularly interesting in light of Rodin’s (in 
press) finding that high room density living 
creates a proneness to learned helplessness ef- 
fects in task performance (see Seligman, 
1975). 

Finally, it is of methodological interest that 
the present path analysis data suggest that 
both simple and multiple regression models 
may overestimate the impact of exogenous 
variables like home density that are impli- 
cated in a number of causal paths. 

Locus of control. Unlike Schopler and Wal- 
ton (Note 3), we did not find students high 


4 444 


in internal control reporting less crowding 
than students high in external control. Within 
the present study, perceived crowding and 
internal-external control were essentially in- 
dependent. When we combined internal-exter- 
nal control with perceived crampedness, how- 
ever, we found rather interesting support for 
Stokols’ (1972) hypothesis that “if conditions 
of crowding are prolonged and unrelievable, in- 
ternals may experience more frustration with 
being confined and unable to exert their usual 
control over the situation” (p. 235). Specifi- 
cally, in the present chronic high social den- 
sity setting, internals who felt cramped at- 
tempted to keep strangers further away in 
terms of the Duke and Nowicki (1972) mea- 
sure of comfortable interpersonal distance 
than internals who felt roomy. Further, ex- 
ternals tended to show the reverse pattern, 
although here the small sample size of the 
comparison groups prevented satistical sig- 
nificance. 

These data are also of interest because they 
qualify Duke and Nowicki’s (1972) assertion 
that internals are comfortable with smaller 
interaction distances with strangers than are 
externals. Moreover, the present comfortable 
interpersonal distance results suggest that pre- 
ferred interaction distance may index modes 
of coping with stress; that is, persons with 
different loci of control appear to seek dif- 
ferent points of equilibrium in regard to in- 
terpersonal distance under different levels of 
perceived situational stress (e.g., cramped- 
ness). Such data indicate the importance of 
incorporating individual difference parameters 
into general homeostatic models of interper- 
sonal distancing such as Argyle and Dean 
(1965) and Altman (1975). 


Proximal Antecedents of Crampedness 


the most important findings of the 
stigation concern a specification 
ure and the relative impdreance 
2 proximal antecedents of perceived 
ess. Both a preliminary correlational 

an in-depth factor analysis 
ate that triples and doubles do 
more than in the negativity of their 
perceptions it self, roommates, and room; 
they also differ in how they organize their 


Perhaps 
present i 
of the 


and 
strongly inc 
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perceptions in ways suggesting that cramped- ,, 
ness, privacy, and beliefs about control may 
have qualitatively different meanings as func- 
tions of group size. Thus, it appears that 
triples cognize the correlates of cramp <dness 


in a manner that fits recent multidime»:sional 
crowding models such as Altman 75), 
Zlutnick and Altman (1972), and S‘okols 
(1972). Specifically, for triples, concerss over 
crampedness share the same phen menal * 
space as satisfaction with privacy, perc: ‘tions s 
of control over room activities, and po: \ivity “ 
of interpersonal attitudes. This type © find- 
ing is consistent, for example, with A) -nan’s 
proposal that crowding occurs wl the 
achieved level of privacy is less than | «e de- 
sired level, as well as Zlutnick and nan’s 
view that the central factor in the pe -ption 
of crowding is people’s ability or bility % 
to control interaction with others. deed, 
one could christen this an “Altman ctor, 
so well does it mirror Altman’s wr s in 
this area. 

Doubles, on the other hand, whe cr in 
terms of the preliminary correlations’ © »aly- 
sis or in terms of the factor analytic i, do 
not at all fit Altman’s model. First, < ped- 
ness does not share the same phe menal 
space as control and privacy. Second, | "ivacy | 
does not appear to be perceived by  »ubles l 
as dialectical in character. On the © ‘rary, 
privacy loads with those aspects of « ontrol 
that focused on a withdrawal from thers 
(e.g., study, control over noise). ally, 
crampedness appears to be seen in p} alis- 
tic terms that are closer to Stokols’ (1972, 
in press) conception of perceived crow« ing as 
involving a negative discrepancy between the 
supply and the demand for space as con- 
trasted with Altman’s focus on interpersonal 


regulation processes, 

These interpretations receive further clari- 
fication when the path analysis data are con- 
sidered. Specifically, the causal path between 
satisfaction with privacy and crampedness is 
greatly weakened when the joint paths in- 
volving privacy are taken into account. Thus, 
the -path analysis suggests that perceived ; 
crampedness is best viewed as caused by (a) 
the direct effects of room density or perceived 
control over activities and (b) the indirect 
effect of room density as mediated by control. 


i 
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Support for a control over activities con- 
ception of crampedness fits the general link 
vetween control over activity choice and 
rowding, which was first suggested by Pro- 
shansky et al. (1970). Moreover, these data 
are consistent with Rodin’s (in press) findings 
linking high room density with perceived 
crowding and deficits in desire to exercise 
control analogous to learned helplessness (e.g., 
a preference for experimenter- as opposed to 
self-regulation of reward and a heightened 
susceptibility to transferring from an initial 
experience where effort and outcomes are non- 
contingent to a situation where contingent 
performance is possible). 


Two Recent Typologies of Crowding 


It may also be noted that the data regard- 
ing qualitative differences between doubles 
and triples fit two recent general typologies 
of crowding processes. Specifically, in terms 
of Stokols’ (1975) typology, one could argue 
that doubles represent a case of neutral 
thwarting in a primary environment whereas 
triples represent a situation involving personal 
thwarting in a primary environment, In proc- 
ess terms, Stokols would view the personal 
thwarting as involving a higher intensity 
crowding experience, since frustration is likely 
to arise from both a discrepancy between the 
actual and anticipated quality of the physi- 
cal environment as well as disappointment in 
the actions of specific others (a disappoint- 
ment that Stokols likens to Aronson & Lin- 
der’s, 1965, gain-loss model of interpersonal 
attraction). 

Baron and Rodin (Note 4) suggest that 
the key factor differentiating stress in dou- 
bles and triples is that triples have to deal 
with social overstimulation problems in addi- 
tion to the privacy stress problems existing 
in doubles. Specifically, Baron and Rodin 
suggest that in addition to boundary regula- 
tion problems, triples have to deal with po- 
tential overload due to the greater complexity 
of the social environment, for example, greater 
unpredictability of social contacts, diversity 
of personal characteristics, and increased dif- 
ficulty in negotiating amount and use of space. 

In this regard, the findings in the present 
study that interpersonal adjustment and aca- 
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demic performance are inversely related in 
triples and unrelated in doubles may be seen 
as supporting the social overstimulation in- 
terpretation of what Baron and Rodin (Note 
4) refer to as “true crowding stress” (as 
opposed to privacy stress, a problem of inter- 
personal boundary regulation, and cramped- 
ness stress, a purely physical-spatial restric- 
tion analogous to Stokols’ neutral thwarting 
in a secondary environment). Specifically, the 
academic performance data may be inter- 
preted as indicating that the time and effort 
required to establish good interpersonal rela- 
tionships in triples exacted an adaptation cost 
in terms of residual effects on academic func- 
tioning that are analogous to the type of 
delayed environmental overload effects found 
by Glass and Singer (1972) for unpredictable 
noise and by Sherrod (1974) for noncon- 
trollable, high-density interactions (see, espe- 
cially, S. Cohen’s, in press, overload model 
of reduced residual channel capacity). 
Finally, the fact that the present study 
was only able to demonstrate the differential 
effects of social density on task performance 
when the mediating effects of interpersonal 
adjustment were considered points to certain 
dangers not previously noted in the type of 
oversimplified crowding paradigm used by 
Freedman and his associates (cf. Freedman 
et al., 1971; Freedman, Levy, Buchanan, & 
Price, 1972). That is, Freedman investigated 
task performance and interpersonal function- 
ing in discrete, nonoverlapping sets of stud- 
ies and hence was denied even the possibility 
of seeing the types of complex mediation proc- 
ess observed in the present study. It is also 
debatable whether the present type of adapta- 
tion cost interpersonal dynamics could be 
captured in a short-term laboratory setting. 
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College students and psychiatric rehabilitees performed a manual dexterity 
task in which consistent success or failure was manipulated over four con- 
secutive task trials, Contrasting predictions for the use of causal attributions 
(luck, task difficulty, effort, and ability) following Trial 1 and Trial 4 for the 
two populations were derived from Heider’s balance theory and “naive theory 
of action.” Consistent with naive theory, students who succeeded and rehabil- 
itees who failed used stable attributions to account for these first trial out- 
comes, whereas students who failed and rehabilitees who succeeded used un- 
stable attributions. Only college students who failed made significant changes 
in their attributional patterns from Trial 1 to Trial 4. Results are discussed 
regarding intervention with psychiatric rehabilitees and other groups with 


severe achievement difficulties. 


Based on the seminal work of Heider 
(1958), numerous authors have applied an 
attributional analysis to the study of achieve- 
ment behavior (Weiner, Frieze, Kukla, Reed, 
Rest, & Rosenbaum, 1971; Weiner & Kukla, 
1970). Basically, this approach assumes that 
achievement behavior is cognitively mediated 
by four attributional categories: ability, ef- 
fort, task difficulty, and luck. Following any 
event with a discernible outcome (e.g., suc- 
cess or failure), the degree to which indi- 
viduals vary in their assignment of causality 
to these factors will differentially affect subse- 
quent achievement behavior. Thus, much of 
the recent work dealing with an attributional 
analysis of achievement behavior has centered 
on the variables (self-esteem, need for achieve- 
ment, success expectancy, etc.) that affect 
the choice of attributional categories following 
success and failure. 

Early work by Feather (1969) focused on 
the use of ability and luck to explain achieve- 
ment outcomes. Feather viewed these two 
factors as extreme poles on a locus of control 
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dimension (personal-impersonal). Predictions 
concerning the use of these two attributional 
factors were based on two analyses proposed 
by Heider (1958)—one in terms of balance 
theory and the other in terms of a “naive 
theory of action.” 

Balance theory conceives of a structure in- 
volying the following three entities: the per- 
son, the self, and an outcome (success or 
failure). These entities exist simultaneously in 
two types of relations—attitudinal (e.g. likes 
and dislikes) and unit or belonging (e8, 
owns and disowns). Specifically, on the basis 
of balance theory, Feather predicted that 
subjects who have positive self-evaluations 
would attribute success to the internal factor 
of ability and would attribute failure to the 
external factor of bad luck. Conversely, those 
subjects who have negative self-evaluations 
would attribute success to the external attri- 
bute of good luck and failure to the internal 
attribute of ability. Feather’s study (1969), 
using a semantic differential format to derive 
self-esteem scores and competence scores, 
failed to confirm these hypotheses based on 
balance theory. 

Within the same experiment Feather also 
derived predictions based on Heider’s “naive 
theory of action.” Rather than focusing on 
the general level of self-evaluation, this analy- 
sis is concerned with expectations for success 
for a specific task presented to the individual. 
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The theory assumes that success expectancies 
are estimated by the individual comparing 
his or her ability with the difficulty of the 
task at hand. When expectancies are con- 
firmed, Feather hypothesized that, ability 
(i.e., a stable factor) should be seen as the 
cause of this outcome, Conversely, when ex- 
pectancies are disconfirmed, luck (i.e., an un- 
stable factor) should be seen as the cause of 
the outcome. Feather (1969) and Feather 
and Simon (1971a) confirmed these predic- 
tions based on “naive” theory. 

However, Frieze and Weiner (1971) have 
criticized Feather’s study on the basis of a 
possible confounding of the internal—external 
dimension with the stability dimension. These 
authors point out that ability is both a stable 
and an internal attribute. Further, luck is not 
only an external attribute but also an unstable 
attribute, Thus, Feather’s findings based on 
dependent measures of ability and luck leave 
unclear whether subjects were responding to 
the internal-external dimension or the stable- 
unstable dimension to account for success and 
failure. Thus, an adequate test of balance 
theory and naive theory should include the 
four attributional categories. Table 1 illus- 
trates the two-dimensional breakdown sug- 
gested by Frieze and Weiner. 

This elaborated model using four factors 
suggests that a balance theory interpretation 
would predict difference in attributions along 
the internal-external dimension. On the other 
hand, predictions derived from a “naive analy- 
sis of action” would specify differences on 
the stable-unstable dimension. In a direct test 
of balance theory and naive theory using the 
four attributional factors, McMahon (1973) 

found partial confirmation for the naive pre- 


ROBERT H. MENAPACE AND CAROL DOBY 


the high-expectancy subjects were more likely u i 
to make attributions for failure to the un- 
stable factors of luck and effort. Subjects 


with high expectancies for success were also 
more likely to attribute the causes for success 
to the stable factor of ability, but they were 
no more likely to attribute the cause of suc- | 
cess to task ease. Most significant the 
comparison of balance theory with the naive 
theory was the finding that effort, an in'`ernal 
unstable factor, was not used differer ially / 
by subjects in the success condition. How ver, 
Feather and Simon (1971b), ina recen! -‘udy 
using the four causal factors, have gain 
found partial support for the balance | cory 
predictions. Thus, it remains unclear ther 
balance theory predictions involving t cus 
of control dimension or naive theory oly- 
ing the stability dimension can best Junt 
for these equivocal findings. One pur; ose of 
the present study was to allow a ther 


examination of these two contrasting t eti- 
cal perspectives. 

The present study was also condi as 
part of a program of research by the «hors 
and their associates (Menapace & Av | ony, 


Note 1; Anthony & Menapace, N 2 


Menapace, Note 3) investigating the ibu- 
tion process in psychiatric rehabilitee- Our 2 
clinical experience and the findings nu- 
merous researchers strongly support tb- con- 
tention that psychiatric patients have | wer 
generalized expectancies for success and wer 
self-evaluations (Bellak, 1958; Hart & 


Hale, 1968) than the general populatio... Ob- 
viously, there are many other difference: be- 


tween the ex-psychiatric patient and the gen- 
eral population. However, one of the most 
consistent observations, and the focus of clin- 


ical attention, is the pervasive lack of confi- 
dence psychiatric rehabilitees manifest in 
dealing with achievement-related activities. 
These observations provided a rationale for 


rehabilitees’ causal attributions for success 


1The psychiatric rehabilitees in this study were 
from Horizon House, a community-based psycho- 
social rehabilitation agency located in Philadelphia, 


dictions cifically, when subjects with high 
expecta for success were compared with 
subjects low expectancies for success, 
TABLE 1 
PERCEIVE! NTS OF SUCCESS AND FAILURE 
a ~ and failure, 
Locus of control 

Stability nal External 

Fixed Task difficulty 

Variable Luck 


Pennsylvania, 


generating predictions about the psychiatric 
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Me On the basis of Heider’s naive theory of 
action outlined above, we expected psychi- 
atric rehabilitees, having extremely low ex- 
sectancies for success, to make stable causal 
‘tributions for failure (i.e., ability and task 
difficulty) and unstable causal attributions 
for success (i.e. effort and luck). On the 
other hand, college students with higher ex- 
pectancies for success would be more likely 
to attribute success to the stable factors of 
č ability and task ease, whereas failure would 
” be attributed to the unstable factors of effort 
and luck. 

One additional aim of the present study 
was to examine attributional patterns follow- 
ing repeated success or failure trials. Where 
individuals experience repeated confirmation 
of their expectancies one would expect little 
change in attributional patterns. However, 
when expected outcomes are consistently dis- 
confirmed there should be a gradual change 
in outcome expectancies (c.f. Feather, 1966; 
Feather & Saville, 1967) and a concomitant 
change in attributional patterns. That is, 
when an individual expects failure or success 
but consistently experiences the opposite out- 
come over a number of task trials, there 
should be a change in the individuals’ causal 
explanation for these outcomes. 

This reasoning seems more compelling for 
college students than for psychiatric rehabili- 
tees. Clinical experience and research (Arieti, 
1970; Cancro, 1970) suggest that outcomes 
that disconfirm prior beliefs and expectancies 
are most problematic for psychiatric rehabili- 
tees. Clearly, one of the most difficult aspects 
of therapeutic intervention with psychiatric 
patients is the intractability of their cognitive 
schemas, particularly with regard to improv- 
ing negative self-image. Thus, we hypothe- 
sized that psychiatric rehabilitees would be 
less likely to change their causal attributions 
over a number of trials that disconfirmed 
their prior expectancies than would a com- 
parison group of college students. Specifically, 
in accord with naive theory, college students 
were expected to change their attributions 
over repeated disconfirming trials (i.e., fail- 
ure) from unstable factors (effort and luck) 
to stable factors (ability and task). College 
students experiencing repeated success and 
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psychiatric rehabilitees experiencing repeated 
success and failure were not expected to alter 
their attributional patterns. 


SUMMARY OF THE CONCEPTUAL AND 
CONCRETE HYPOTHESES 


Trial 1 Attributions for Success and Failure 


1. Individuals with low success expectancies 
will attribute task failure to stable attributes. 
Specifically, psychiatric rehabilitees will be 
more likely than a comparison sample of col- 
lege students to attribute failure to the stable 
factors of ability and task. 

2, Individuals with low success expectancies 
will attribute task success to unstable attri- 
butes. Specifically, psychiatric rehabilitees will 
be more likely than a comparison sample of 
college students to attribute success to the 
unstable factors of effort and luck, 

3. Individuals with high success expectan- 
cies will attribute task success to stable at- 
tributes. Specifically, college students will be 
more likely than psychiatric rehabilitees to 
attribute success to the stable factors of abil- 
ity and task. 

4. Individuals with high success expectan- 
cies will attribute task failure to unstable 
attributes. Specifically, college students will 
be more likely than psychiatric rehabilitees 
to attribute failure to the unstable factors of 
effort and luck. 


Changes in Attributional Patterns from Trial 
1 to Trial 4 


5. Consistent confirmation of prior expec- 
tancies will result in no change in attribu- 
tional patterns. That is, college students suc- 
ceeding over consecutive trials and psychi- 
atric rehabilitees failing over consecutive 
trials will not change their attributional pat- 
terns. i 

6. Consistent disconfirmation of prior ex- 
pectancies will result in no significant changes 
jn attributional patterns from Trial 1 to Trial 
4 for psychiatric rehabilitees. Consistent dis- 
confirmation of prior expectancies with col- 
lege students will result in changes of attribu- 
tional patterns from Trial 1 to Trial 4. Spe- 
cifically, college students will increase attri- 
butions for disconfirmed expectancies to the 
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stable factors of ability and task difficulty and 
decrease attributions to the unstable factors 
of effort and luck. 


METHOD 
Subjects 


Subjects were recruited from a psychiatric reha- 
þilitation center (m= 40) and a nearby university 
(n=40). Participation was on a volunteer basis. 
Twenty males and 20 females from each sample were 
distributed equally across experimental conditions. 
Psychiatric rehabilitees had had total previous psy- 
chiatric hospitalization ranging from 1 month to 9 
years with the majority having hospital diagnoses of 
schizophrenia. None of the college sample had ever 
been hospitalized for psychiatric reasons. The col- 
lege sample was also significantly younger than the 
rehabilitees (mean ages=23 and 28, respectively) 
and slightly more educated (mean years of schooling 
= 13.5 and 11.5, respectively). Living arrangements 
were similar for the two samples; most students and 
rehabilitees lived either in dormitories or with their 
families. 


Apparatus 


The experimental task was the Purdue Pegboard, 
a test of manual dexterity requiring that cylindrical 
metal pegs be placed in holes in a perforated board. 
This task was chosen because of the ease with which 
the experimenter could assign the rating of success 
or failure without the subject’s realization that it 
had been manipulated, 


Procedure 


Each subject entered the research laboratory and 
was seated at a table across from the experimenter. 
A standard set of instructions was then read to the 
subject explaining the nature of the task and ap- 
paratus, the number of trials to be performed, and 
that following each trial a set of questions would be 
asked. 

The experimenter then demonstrated the pegboard 
and allowed the subjects a few minutes to try the 
procedure and ask any questions. Subjects were then 
allowed 45 seconds for each of the first four trials. 
Following each trial subjects were informed whether 
they had succeeded or failed, and the dependent 
measure was completed. Trials 5 and 6 (nonexperi- 
mental trials) were performed to allow the failure 
subjects to finish the experiment with a success ex- 
perience. Unknown to the subjects, the experimenter 
allowed an additional 15 seconds on these trials to 

ensure increased performance. Following completion 
of the sixth trial, all subjects were thanked for 
their participation and asked not to reveal the pro- 
cedure for a week. 


Independent Variables 


The experimental design was a 2X2X2 with 
repeated measures on the last factor. The two sub- 
ject populations defined the first independent varia- 
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ble. The second independent variable was task 
come (i.e. success or failure). This variabie 
manipulated by the experimenter following 
experimental task trial. After each of four t 
subjects were informed that they had “failed ( 
ceeded) in reaching the performance standar 
their age group.” The third independent ver: 
was task trials, Although there were four test trials, 
only Trials 1 and 4 were included in the ansiysis in 
order to maximize the possibility of change. 


Dependent Measure 


After completion of each trial, subjecis were 
asked to complete a questionnaire based Mc- 
Mahan’s (1973) paired-comparison format, which 
allows for all possible combinations of the at- 
tributional factors. For example, in the faiu - con- 
dition subjects were asked to choose the ap- 
propriate of two responses, as follows: 

I failed because: I do not have mu bility 

OR___the task was hard; The t was 

hard OR____I was unlucky; The was 

hard OR____I do not have much abili 
Attribution scores were collected by sum: ox the 
checks for each attributional category. the 
range of scores for any attributional cat: was 
0-3. 
RESULTS 

Results for each attributional catego ~ were 
analyzed by four separate 2 X 2 X 2 'opu- 
lation X Outcome X Trial) analyses vari- 
ance with repeated measures on the l= fac- 
tor (Winer, 1962). Because the hy, heses 
outlined above specify a complex rela’ ship 
between these factors in accounting the 
assignment of causality, the analyses «ould 
yield a significant three-way interaction for 
each attribution category. The resu! pre- 
sented in Table 2 confirm this predicted 
higher order interaction for three of four 
categories (luck being the exception) and 


necessitate a number of simple effects analy- 
ses (cf. Kirk, 1968) to test the specific hy- 
potheses. Results of these simple effects are 
presented below according to Trial 1 and 
Trials 1-4 change hypotheses (see Table 3 
for cell means). 


Analysis of Trial 1 Attributions for 
Success and Failure 


To examine the hypotheses derived from 
naive theory the Population x Outcome inter- 
action at Trial 1 was subjected to a number 
of simple effects analyses for each attribu- 
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TABLE 2 
ANALYSES OF VARIANCE SUMMARY OF ATTRIBUTIONS FOR Task OUTCOMES 
Ability Effort Task Luck 
Source a Mms F MS F MS F MS. PF 
Population (A) 1 2.50 1.47 76 <1 1,23 1.03 F 
Outcome (B) 1 7.23 42t 3.31 2.24 4000 33.61% 2976 2221 
Trial (C) 1 63 242 181 3.07 03 <1 160 <1 
AXB 1 -63 <1 951  643* 7.23 6.08* 141 1.05 
AXC 1 1.23 4.74" 4.56  7.73* 1.60 4,32* 06 <1 
BX Ca aes 1 40 <1 4.56 7.73** 2.03 5.39 46 <1 
Subjects within 
groups 76 1.75 1.48 1,19 1.34 
AXBXC 1 6.40  24.75*** 10.51 17.81*** 1.60  4.32* 31 1.29 
C X Subjects 
within groups 76 26 59 37 24 
*p <.05. 
wD < 01. 
trey < 2001. 


tional category. Hypothesis 1 was confirmed. 
Psychiatric rehabilitees were more likely than 
the college students to attribute failure to 
the stable factors of task difficulty, F(1, 76) 
= 4,06, p < .05, and poor ability, F(1, 76) 
= 4,81, p < .05. Hypothesis 2 was partially 
confirmed. As predicted, rehabilitees were 
more likely than the college students to at- 
tribute success to the unstable factor of effort, 
F(1,76) = 9.11, p < .01. However, contrary 
to our hypothesis, there were no differences 
between populations in the use of luck to ac- 
count for success, F(1, 76) = 1.51, p > .20. 
Hypothesis 3 was also only partially con- 
firmed. In accord with naive theory, college 
students were more likely than psychiatric 
rehabilitees to attribute success to task ease, 
F(1, 76) = 8.55, p < .01. However, students 


were no more likely to attribute success to 
ability than were the rehabilitees, F (1, 76) < 
1. Partial support was also found for Hy- 
pothesis 4. College students were more likely 
than the rehabilitees to attribute failure to 
lack of effort, F(1, 76) = 15.55, p< .005, 
but no more likely to attribute failure to bad 
luck, F(1, 76) < 1. 

In general, the results of Trial 1 confirm 
the hypotheses derived from Heider’s naive 
theory of action. Table 4 summarizes the pre- 
dictions based on naive theory and balance 
theory and compares them with the actual 
findings summarized above. Chi-square analy- 
sis of those attributional factors confirmed 
and disconfirmed by the two theories strongly 
supports the naive theory interpretation, x? (1) 
= 4.27, p < .05. (See Table 5). 


TABLE 3 
Mean ATTRIBUTION SCORES FOR TASK OUTCOMES 
Outcome 
Success Failure 
Population Ability Effort Task Luck Ability Effort Task Luck 
Trial 1 
5 
Rehabilitees 1.10 2.25 2.10 .55 e AN lor in 
Students 95 1.45 2.70 90 1.05 i ‘ 
Trial 4 PAOS 
35 1,70 
Rehabilitees 1.40 2.20 1.90 -50 n ne rey 1160 
Students -80 1.75 2.50 95 R ` 
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TABLE 4 
‘THEORETICAL PREDICTIONS AND ACTUAL RESULTS OF CAUSAL ATTRIBUTIONS FOR STUDENTS AND 
REHABILITEES FOLLOWING TRIAL 1 SUCCESS AND FAILURE 
Outcome 
Students Rehabilitees 

Predictions - 5 

and results Success Failure Success Failur 
Naive theory task and ability effort and luck effort and luck task and a 
Balance theory effort and ability task and luck task and luck effort and 
Actual results task effort effort task and 


Changes in Attributional Patterns from 
Trial 1 to Trial 4 


Hypotheses concerning changes in attribu- 
tional patterns from Trial 1 to Trial 4 were 
examined by a number of simple effects anal- 
yses of the Population X Trials interaction 
under success and failure conditions (i.e., 
levels of task outcome). Consistent with Hy- 
pothesis 5, college students who succeeded 
on consecutive task trials, F(3,72) < 1, and 
rehabilitees who failed on consecutive task 
trials, F(3,72) <1, did not significantly 
change their attributional patterns. That is, 
when prior expectancies were confirmed, 
neither group altered their causal attribu- 
tions. 

Consistent with Hypothesis 6, college stu- 
dents who failed on consecutive trials did 
change their attributional patterns, F (3, 72) 
= 5.75, p < .01. Thus, when prior expectan- 
cies were consistently disconfirmed, students 
changed their causal attributions. Specifically, 
in strong support of naive theory, students 
significantly increased their attributions to 
the stable factors of ability and task difficulty, 
F(1, 76) = 8.53, p <.005; F(1,76) = 6.41, 
p <.01, and significantly reduced their at- 
tribution to the unstable factor of effort, F (1, 
76) = 19.60, p< .001. Attribution to luck, 

however, remained unchanged over task trials, 


TABLE 5 
NUMBER OF PREDICTED AND NONPREDICTED RESULTS 


FOR BAL THEORY AND NAIVE THEORY 
Theory Predicted Nonpredicted 
Naive 5 3 
i 


Balance 1 


Note. x2(1) = 4.27, p < 05 


F(1,76) = 1.36, p > .25. In accord witb Hy- 
pothesis 6, rehabilitees who succeeds on 
consecutive task trials did not significantly 
change attributional patterns, F(3, 72 1. 
Thus, consistent expectancy disconfir: 
did not change causal attributions for | 
atric rehabilitees. 


Discussion 
The results of this study lend strong up- 
port to a naive theory interpretation of sal 
attributions for achievement-related outcomes. 
Although college students and psychiv‘ric 


rehabilitees differ on dimensions other | \an 
success expectancy (some caution shoul be 
exercised in generalizing about between 
ulation differences), it is clear that the 
phasis of naive theory is most salient in ihe 
interpretation of these results. Further. as 
predicted by naive theory, repeated discon- 
firmation of student expectancies led to al- 
tered attributional patterns along the sta 
unstable dimension. As expected, students De- 
gan to make more stable causal attributions 
when exposed to repeated disconfirmation of 
prior expectancies. 

The findings that psychiatric rehabilitees 
did not change their attributional patterns 
following disconfirmed expectancies suggests 
that individuals who have had severe psychi- 
atric disabilities have difficulty integrating 
information that is discrepant with prior 
beliefs and expectancies. However, this con- 
clusion must be made quite tentatively be- 
cause the present design did not include a 
“Jow success expectancy” group of college 


iea 


students. That is, it remains unclear whether | 


the attributional patterns of nonrehabilitees 
with low success expectancies would aiso re- 
main unchanged when exposed to information 
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zant with their prior expectancies. Ka- 

nd Hanson (1971) suggest that nega- 

»rmation (failure, negative traits, etc.) 

more “meaning” to the individual, 

high-expectancy subjects who repeat- 

edly fail may change their causal attributions 

more readily than a group of low-expectancy 

su>jects who experience repeated success. This 

“norativity bias” could account for our re- 

with the psychiatric rehabilitees but 

also predict similar results for college 

ts who have low expectancies for suc- 

‘ture research using additional control 

f s is necessary to more clearly delineate 

tl le of psychiatric status in relation to 
c s in attributional patterns. 

rally, however, the results with psy- 

cl c rehabilitees conformed to our predic- 

r this population, When exposed to 

tas uccess and task success over trials, 

attributed these outcome(s) to 

ieir Ffort. Although this finding was in 

cor predictions based on naive theory, 

tive of a common difficulty for 

intervention. Clearly, if following 

uccesses psychiatric rehabilitees 

view success as based on an 

siable personal characteristic, there is little 

permanent gain in a reassessment of their own 

ca] Fhe issue of how much success 

»ssary to change expectancies 

il patterns in psychiatric re- 

| be the subject of our future 


ilities, 


the other hand, failure and failure over 
viewed by rehabilitees to be a 
heir poor ability and the extreme 
v of the task. The situation is equally 
roblematic. Failure is a realistic possibil- 
‘ity for any achievement activity and can be 
expected to occur with some frequency for all 
individuals, However, the tendency for reha- 
bilitees to view failure as reflecting their own 
poor ability and the difficulty of the task 
leaves little incentive to try again at these and 
similar tasks, 
Two rehabilitation strategies are suggested 
^ by this study. First, it would appear that 
cognitive appraisals of both success and fail- 
ure experiences should be focused upon as 
part of the rehabilitation process. Tradition- 
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ally, intervention has focused upon the utili- 
zation of success experiences with little atten- 
tion paid to reinterpreting past and present 
failures (Hart & Risley, 1968; Meacham & 
Wiessen, 1969). Recently Dweck (1975) and 
Dweck and Reppucci (1973) have demon- 
strated the beneñt of focusing upon the in- 
terpretation of failure experiences in their 
work with underachieving children. Similarly, 
the work of Gioe (1975) and Beck and his 
associates (Rush, Khatami, & Beck, 1975), 
with depressed patients stressing cognitive re- 
structuring, though not explicitly stated in 
attribution terminology also indicates the po- 
tential benefit of utilizing failure experiences 
in clinical intervention, Second, it may be 
quite beneficial to identify individuals who, 
across a number of achievement situations, 
attribute failure to stable factors and success 
to unstable factors and deal more directly 
with interventions aimed at changing their 
expectancies and attributional patterns. 

In summary, this study has confirmed a 
number of expectations for psychiatric reha- 
bilitees and will be the basis for further study 
with this population. Additionally, the study 
has afforded an examination of naive theory 
and balance theory predictions for achieve- 
ment situations and indicates that Heider’s 
naive theory is a viable way of interpreting 
attributional patterns for achievement out- 
comes. 
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The Development of Causal Attributions in Social Perception 


_ Rachel Karniol and Michael Ross 
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Two studies were conducted to examine discounting of a plausible internal 
cause for a behavior in the presence of an extrinsic cause. The subjects were 
kindergarten, first- and second-grade children and college students. Subjects 
listened to tape-recorded stories that contrasted target persons who engaged in 
toy play of their own accord with those who were commanded or bribed to 
play with the same toy. The subjects were asked to indicate which story 
character really wanted to play with the target toy. Experiment 1 indicated 
that the majority of kindergarten children used an additive principle such 
that tangible rewards and parental commands were seen as increasing the 
target person’s desire to play with the toy. Use of the additive principle de- 
creased and use of the discounting principle increased with age. Similar age 
trends were obtained in Experiment 2. The possible reasons for these findings 
and their implications for the study of self-perceptions of intrinsic motivation 


were discussed, 


discussion of the attribution process, 

K 1973) attempted to describe how 

people make causal inferences for a behavior 

I mber of plausible alternative expla- 

cist. One of the most interesting 

volves what Kelley has termed a 

ufficient causal schema. In the sim- 

plest instance, this schema occurs when there 

yo vossible causes for a behavior and 

either cause, by itself, would be sufficient to 
ehavior. 

to Kelley (1973), the multiple 

chema is ordinarily employed for 

inferences with respect to motiva- 

ctors, For example, suppose that a 

s promised a reward for performing 

ing activity. There are, then, at 

i , reasonable explanations for the 

’s willingness to perform the behavior: 

intrinsic interest in the activity and his 


his 
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desire to obtain the extrinsic reward, How- — 
ever, Kelley suggested that when there is an . 
extrinsic reason for undertaking a behavior, 
an observer may discount the possible contri- 
bution of any intrinsic factors. Discounting 
occurs because the extrinsic cause is often 
more salient and more readily verifiable than 
the intrinsic cause. A great deal of research 
within the framework of attribution theory 
provides support for Kelley’s analysis (€g 
Jones, Davis, & Gergen, 1961; Thibaut & 
Riecken, 1955). 

More recent research on intrinsic motiva- 
tion suggests that the dicounting principle is 
employed in self-perception as well as in the 
perception of others (Calder & Staw, 1975b; 
Deci, 1971; Kruglanski, Freedman, & Zeevi, 
1971). Even young children appear to dis- 
count the internal cause for their own be- 
havior when an extrinsic cause is present 
(e.g., Green & Lepper, 1974; Lepper, Green, 
& Nisbett, 1973; Ross, 1975; Karniol & Ross, 
Note 1). In these studies, preschool and 
kindergarten children who were promised and 
given a reward for undertaking an enjoyable 
activity subsequently evidenced less intrinsic 
interest in that activity than did children 
who had not been previously rewarded for 
undertaking the activity. Pe inib 

research conducted on intri 
Laan n has assumed that the 


interest in childre : at t 
discounting principle posited by attribution 
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theory underlies the obtained effects. How- 
ever, there is no direct evidence for this hy- 
pothesized attributional process, and a con- 
troversy has developed concerning the most 
appropriate interpretation of the data. Sev- 
eral investigators (e.g., Reiss & Sushinsky, 
1975; Ross & Karniol, Note 2) have sug- 
gested that the studies are perhaps best un- 
derstood in nonattribution terms, in part, 
because the young children employed as sub- 
jects may not be capable of the causal rea- 
soning required by the discounting principle. 

The present research examines the tendency 

of subjects of different ages to employ the 
discounting principle in social perception. A 
developmental approach to this question is of 
interest for a number of reasons. First, it 
would indicate the age at which children be- 
gin to employ the discounting principle in 
social perception. Second, it would reveal 
what attributional rules, if any, children 
systematically use prior to this development. 
Finally, it would contribute to a theoretical 
understanding of the effects of rewards on 
intrinsic motivation in children by indicating 
whether young children are capable of per- 
forming the causal analyses associated with 
the discounting principle. 

One recent experiment that was designed 
to examine children’s inferences in the pres- 
ence of multiple sufficient causes is directly 
relevant to these issues. Smith (1975) re- 
ported that young children do not employ 
the discounting formulation or any other sys- 
tematic mode of attribution in social percep- 
tion. In particular, kindergarten children pre- 
sented with stories about one child who was 
rewarded for playing (or was commanded to 
play) with toy X and one child who played 
with toy X of his own accord were found to 
choose randomly when asked to indicate 
which child wanted to play with toy X and 
why he wanted to play with it. Older chil- 
dren and college students, on the other hand, 
tended to employ the discounting principle in 
a reliable fashion, as evidenced by their 
choice of the unconstrained target person as 
the one who really wanted to play with the 
toy. 

The results from Smith’s (1975) experi- 
ment seem to argue against an attribution 
interpretation of the intrinsic motivation re- 
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search conducted with young children. ‘The 
implication of Smith’s data is that children in 
the age range employed in the intrinsic moti- 
vation studies are incapable of the causal 
reasoning required by the discounting princi- 
ple. However, Smith’s results are rendered 
problematic by his failure to assess story 
recall, Conceivably, one of the reasons the 
kindergarten children may have responded 
differently from the rest of the sample is 
because of inaccurate recall of the more com- © 
plex constrained story.* 

This possibility is supported by studies 
indicating that when accurate recall is en- 
sured, young children are capable of reason- 
ing that is supposedly beyond their develop- 
mental level. For example, while Piaget, 
Inhelder, and Szeminska (1960) reported 
that children below the ages of 7-8 years 
could not infer A > C, given A > B, B > C, 
Bryant and Trabasso (1971) demonstrated 
that children of ages 4—6 years were able to 
draw valid inferences after they had thor- 
oughly learned the individual elements of 
the problem. McLaughlin (1963) similarly 
showed that limitations of reasoning ability 
correspond to limitations of immediate mem- 
ory or attention. 

Consequently, Experiment 1 was undertaken 
to examine the use of the discounting prin- 
ciple in social perception when accurate recall 
of the experimental passages is ensured. The 
subjects were presented with eight stories. 
Four of the stories included a plausible 
external cause for the target person’s behav- 
ior, in the form of either a parental command 
or a promised reward. Each of these external 
constraint stories was contrasted with a story 
in which the target person performed the = 
same behavior of his own accord, As in 


1 The constraint was embedded in the middle of 
the story and was presented in the form of a com- 
plex sentence: “Judy’s mother was there, and she 
said that Judy could have some cake if she played 
with the ball.” There is evidence that young chil- 
dren engage in simplification when they hear such 
complex passages (Harris, 1975; Menyuk, 19635% 
Smith, 1970). Thus, given an embedded phrase, the’ 
young child may drop the grammatically complex) 
phrase but maintain an accurate representation of 
the remaining parts of the passage (Blank & Frank, 
+1971). 
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Smith’s (1975) study, the major dependent 
measure was the subject’s choice of either 
the constrained or unconstrained target per- 
son as the one who most preferred the target 
activity. In contrast with Smith’s study, the 
current experiment employed two parallel 
story sets per type of constraint in order to 
determine whether subjects consistently used 
the discounting principle in the presence of 
either rewards or commands, While a devel- 
opmental trend was expected, such that con- 
sistent use of discounting would increase with 
age, it was also anticipated that unlike those 
in Smith’s study, many of the younger chil- 
dren in the present study would systematically 
employ this principle. 


EXPERIMENT 1 
Method 


Subjects. Subjects were 64 male and female chil- 
dren from kindergarten, first grade, and second grade 
and.8 college students who were attending summer 
session. 

Procedure. The school children individually par- 
ticipated in the experiment, which was conducted in 
a mobile trailer parked on the school grounds. On 
entering the trailer, the subject was told that he or 
she would hear some tape-recorded stories. The child 
was told to listen carefully, since he or she would be 
required to answer questions about each story. 

The eight stories were adapted from Smith (1975). 
All of the story characters were male, and the agent 
of constraint was always the mother. There were 
two forms of constraint.2 In two of the stories, the 
mother commanded her child to play with a par- 
ticular toy. In two other stories, the child was 
promised a reward for playing with the desired toy. 
The story character was always described as playing 
with the toy selected by his mother. For all subjects, 
the reward stories preceded the command stories. 

Each of the four constraint stories was contrasted 
with a story in which a different child played with 
an identical toy in the absence of any external de- 
mands. The stories were tape-recorded in a male 
voice. A sample constraint story is as follows: 


Mike was home and there were two of his toys 
there, a ball and a puzzle. Mike’s mother said that 
Mike could have some cake if he played with the 
ball. Remember, Mike’s mother said that Mike 
could have some cake if he played with the ball. 
And Mike played with the ball. 


This story was contrasted with the following 
account; 


Tony was home and there were two of his toys 
there, a ball and a puzzle. And Tony played with 
the ball. 
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After the child heard one of the stories in the set, 
he was asked to recall the story content. If he was 
unable to do so, the story was replayed until correct 
recall was obtained (up to three repetitions). Fol- 
lowing correct recall of the second story in the set, 
the subject was asked (a) which boy really wanted 
to play with toy X and (b) why did want 
to play with toy X. The subject was also asked 
why the boy in the other story played with the 
same toy. 

The adult subjects listened to the taped stories and 
recorded their answers to these questions. 


Results 


All of the children were able to repeat the 
stories accurately once the tapes had been 
replayed. It should be noted, though, that 
many children (and especially those in kin- 
dergarten) required several presentations of 
the stories. 

Subjects’ answers to the question concern- 
ing which child really wanted to play with 
the toys were coded as 0 or 1, depending on 
whether a constrained or an unconstrained 
choice had been indicated. The responses to 
the reward and command story pairs were 
separately coded. Thus, there were four pos- 
sible patterns of response associated with each 
of the constraints. For example, on the two- 
reward story pairs, subjects could choose the 
unconstrained story character on both stories, 
the constrained story character on both 
stories, or respond inconsistently, selecting 
the constrained character on one story and 
the unconstrained character on the other. For 
purposes of analysis, then, each subject re- 
ceived a score ranging from 0 to 2 for re- 
sponses to reward stories and a second score 
ranging from 0 to 2 for his or her responses 
to the command stories. Consistent selection 
of the unconstrained story character (a score 
of 2) indicated that the subject was employ- 
ing the discounting principle in the presence 
of a particular constraint. A score of 0 indi- 


2The constraint was mentioned twice in each 
story. The repetition occurred in the middle of one 
reward and one command story and at the end of 
the other reward and command stories. It was 
thought that perhaps children would remember the 
constraint better when it appeared at the end of the 
passage. However, preliminary analyses revealed no 
significant differences attributable to this variation, 
and it does not appear as a factor in the statistical 


analyses presented. 
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cated that the subject consistently chose the 
constrained target person, and a score of 1 
showed that he or she responded inconsist- 
ently. In this manner, all subjects were 
included in the analysis. 

The subjects’ explanations of their choices 
were coded by two independent raters who 
were unaware of the subjects’ treatment 
conditions. The answers were coded as being 
extrinsic causal reasons (e.g., he wanted to 
have cake), intrinsic ones (e.g., he liked to 
play with the ball), or irrelevant (including 
“don’t know”). The interrater agreement was 
90%. The raters were subsequently asked to 
discuss the items on which they disagreed in 
an attempt to resolve the differences. Items on 
which no agreement was reached even after 
discussion between the raters (m = 20/1120) 
were randomly assigned to an extrinsic or 
intrinsic classification.* 

Preliminary analyses indicated no main ef- 
fects or interactions involving sex of subject, 
and the analyses reported here are collapsed 
across this factor. An analysis of variance 
was conducted on the subjects’ responses to 
the question about which child really wanted 
to play with the toy.° Grade was a between- 
subjects factor, and story type (reward vs. 
command) was a within-subjects factor. This 
analysis revealed a significant grade effect, 


TABLE 1 


CHOICE PATTERNS ON REWARD AND 
COMMAND STORIES 
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F(3,69) = 6.89, p < .001, and a story type- 
effect, F(1, 69) = 8.37, p < 005. The Grade 
X Story Type interaction was nonsignificant 
(F <1). Table 1 shows the percentage of 
subjects at each grade level who consistently 
chose the unconstrained target pers 


sistently selected the constrained targei per- 
son, or responded inconsistently. As Table 

reveals, the statistically significant effects in- 
dicate increased use of the discounting prin- 
ciple with the older subjects and mo:e use 
of discounting on command stories toòsn on 


reward stories. 

Of greater interest is the finding t on 
the reward stories, children in kin‘ 
and Grade 1 tended to choose the cv 
story character as the one who really vanted 


Consistent Consistent 
constrained unconstrained 
choices Incon- choices 
(additive sistent (discounting 
Grade level model) choices model) 
Reward Stories 
Kindergarten 68.18 22.73 9.09 
Grade 1 62.50 8.33 29.17 
Grade 2 50.00 16.67 33.33 
College 0 25.00 75.00 
Command stories 
Kindergarten 59.09 18.18 22.73 
Grade 1 41.67 4.16 54.17 
Grade 2 44.44 16.67 38.87 
College 0 0 100.00 


Note. Figures represent the percentage of subjects who con- 
sistently or inconsistently reported that the constrained or 
unconstrained story character most wanted to play with the toy, 


to play with toy X. On the comma: ories, 
the same trend appeared in kindergay «n, but 
it was considerably attenuated by de 1. 
An examination of subjects’ explana! ons for 
their choices revealed that kindergarten and 
Grade 1 children chose the constrain | child 
as the one who wanted to play wit ioy X 
because of the external constrain‘ it is, 
because his mother offered him a rd or 
because his mother told him to ņ with 
toy X. Thus, on the reward stori: f the 
children who consistently chose con- 
strained target person, 93.3% in kin: garten 
indicated the extrinsic factor as the re :son for 
the constrained child’s wanting to | with 
the toy and 100% in Grade 1 indic: ed this 

8The coding was conducted after cor ion of 
both Studies 1 and 2. Hence, the same ra ors were 
employed for both studies, and the coders e blind 
as to which experiment and what grade ihe sub- 
jects were in. 

4These items were all of the same type: “He 
didn’t want to play with the puzzle” (i.c, “that’s 
why he played with the ball”). One of the raters 
had coded this item as being intrinsic, while the 
other had coded it as extrinsic. Since the coders 


ssign- 


were unable to resolve this conflict, random 
ment was used. 

5 Although nominal data are not usually sub- 
mitted to an analysis of variance, recent empirical 
studies have indicated that an analysis of variance 
is an appropriate means of analyzing such data, aS 
it provides comparable results to more complex 
procedures such as a partitioned chi-square analysis 
(cf. Glass, Peckham, & Sanders, 1972; Lunney; 
1970). 
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type of rationale. On the two command 
stories, the percentages were 84.6% and 
100%, respectively, for kindergarten and 
Grade 1 children who consistently chose the 
constrained character. It would seem, there- 
ore, that the younger children were remark- 
‘bly consistent in rationalizing their choice 
of the constrained child as the one who really 
wanted to play with the toy. Furthermore, 
the few kindergarten and Grade 1 children 
who chose the unconstrained target person 
gave intrinsic reasons for their choices. 
Discussion 

While the results of Experiment 1 do not 

ipport Smith’s (1975) conclusion that kin- 

‘garten children fail to employ any system- 

mode of attribution, the data also con- 
dict the hypothesis guiding the current 
riment. It was assumed that young chil- 
would tend to employ the discounting 
iple when recall is ensured. In fact, the 
ms to have accomplished the exact 
f what had been intended. While 
‘he kindergarten children were not 
iding randomly, fewer than 10% were 
(ently employing the discounting prin- 
ciple in the presence of a reward, and fewer 
than 25% did so in the presence of a com- 
ind. In contrast, the majority seemed to be 
additive schema. The younger 
| parently viewed the mother’s com- 
| or offer of a reward as an added incen- 
tive for playing with the toy. This pattern 
of responding was less prevalent by Grade 2 
and was virtually absent for the college stu- 
dent e finding that rewards were more 
likely to result in the use of an additive 
schema than were parental commands sug- 
gests that rewards are perceived as more 
likely to enhance the value of an activity. 
Smith also obtained this result on a rating 
scale assessment of how much the target 
persons wanted to play with the toy. 

The results of the present study may not 
be as contradictory to those of Smith as they 
first appear to be. Since Smith did not employ 
two story sets per type of constraint, it is 
impossible to determine whether a child would 
reliably use either a discounting or additive 
schema in the presence of a particular con- 
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straint, However, on the parallel story in 
Smith’s experiment, 58% of the kindergarten 
children chose the constrained child in the 
presence of a reward, and 38% did so in the 
presence of a command. While presumably 
only some of these children would consistently 
employ an additive schema, the percentages 
are reasonably consonant with those obtained 
in the present study. Though Smith concluded 
that the kindergarten children were respond- 
ing randomly, the present study indicates 
that many of them reliably responded in a 
manner different from adults. 

The current results are consistent with a 
suggestion by Costanzo, Grumet, and Brehm 
(1974) that young children infer liking in 
keeping with adult sanctions. Thus, adult 
approval of toy play would yield attributions 
of greater wanting, while adult disapproval 
would lead to attributions of lesser wanting. 
Similarly, Piaget (1948) and Kohlberg (1971) 
have proposed that young children evaluate 
actions in terms of adult sanctions such that 
a rewarded act is good, while a punished act 
is bad. 

On the other hand, it may be that the 
explanation for young children’s choice of the 
constrained story character is, in part, seman- 
tic in that the word want was used in the — 
present study as well as in Smith’s study, 
That is, the story characters can be inter- — 
preted as wanting to play with the toy be- — 
cause they know that they will get a reward j 
or because they want to please their mothers. 
This wanting need not reflect the story char- 
acter’s liking for the toy. Furthermore, re- 
search with children of ages 5 to 7 years has 
indicated that the word want is much less 
prevalent in their vocabulary than is the 
word like (Edwards, Gibbon, & Mason, 1964). 
Perhaps, then, the word want is improperly 
understood by the younger children. These 
considerations imply that a more adequate 
test of children’s use of the discounting prin- 
ciple would involve assessing their estimate 
of which story character really liked, as 
opposed to really wanted, to play with the 
toy. Additionally, the story type main effect 
obtained in Experiment 1 is difficult to inter- 
ory type was confounded with 


ret, since sti n 
ie in that command stories always fol- 


| 
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lowed reward stories. It is possible, then, that 
the differences are due to the sequence of 
presentation rather than the type of constraint 
employed. 

EXPERIMENT 2 


Experiment 2 was conducted in order to 
test whether the pattern of attributions found 
in Experiment 1 would be maintained when 
the word like was employed and whether the 
story type main effect would occur when the 
sequence of stories was varied. Thus, in 
Experiment 2, sequence of presentation was 
included as a between-subjects factor, with 
story type as a within-subjects factor. 


Method 


Subjects. Subjects were 81 children in kindergarten 
through second grade and 12 college students attend- 
ing summer session. The data from 6 children were 
eliminated from the study because these children 
refused to listen to the stories and/or answer the 
recall questions. Of these, 4 were in kindergarten 
and 2 in first grade. 

Procedure. The procedure was, for the most part, 
similar to that employed in the first study. The 
taped stories were changed to include female story 
characters, and there was one male and one female 
story character for each type of constraint. The 
sequence of stories was varied such that half of the 
subjects heard the reward stories first and half heard 
the command stories first. The questions asked of 
the subjects were the same as in Experiment 1, ex- 
cept for the substitution of the word like. The first 
question was the following: “Which boy [girl] really 
liked playing with the ?” The second question 
was as follows: “Why did like playing with 
the id 


Results 


Preliminary analyses revealed no significant 
effects for sex of subject or sex of story 
character. The data were collapsed across 
these factors for all of the analyses reported 
below. Once again, subjects received a score 
of O with respect to a particular constraint 
if they chose the constrained character twice 
in response to the first question, 2 if they 
chose the unconstrained character twice, and 
1 if they selected one constrained and one 
unconstrained alternative. Coding of the ex- 
planatory reasons was done by the same two 
raters who had done the original classification. 

An analysis of variance was conducted on 
subjects’ responses to the question asking 
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which story character really liked to play with 
the toy. The analysis included grade and se- 
quence of presentation (reward stor 
lowed by command stories vs. con 
followed by reward stories) as bety 


subjects factors. Type of constraint (reward 
vs. command) was a within-subjects factor. 
The analysis revealed main effects for g1-de, 


F(3, 75) = 4.35, p < .007, sequence, F (1, /5 
= 14.14, p < .001, and type of consi 
F(1, 75) = 4.30, p < .05. None of the 


actions approached significance. Table 2 ore- 
sents the percentage of subjects at each « ade 
level who systematically employed the lis- 
counting principle, consistently used the «\di- 
tive principle, or responded inconsistent! As 
Table 2 indicates, use of the discounting iin- 
ciple increased with age, was more frec ent 
in the command-followed-by-reward sequ nce 
than in the reward—command sequence, nd 


was more frequent on command stories t-an 
on reward stories. Table 2 also suggests |. at 
the number of subjects systematically employ- 
ing the additive principle was lower than in 
the first study and that this reduction w is 
particularly pronounced for command stories. 
In addition, the number of subjects consis'- 
ently using the discounting principle increased. 
Nevertheless, 39% of the kindergarten end 
26% of the Grade 1 children consister ly 


TABLE 2 


CHOICE PATTERNS ON REWARD AND 
ComMAND STORTES 


Consistent Con: 
constrained uncon 
choices Incon- cho 
{additive sistent (disc 
Grade level model) choices 


Reward stories 


Kindergarten 39.28 28.57 

Grade 1 25.93 11.11 y 
Grade 2 20.00 10.00 70.00 
College 10.00 0 90.00 


Command stories 


Kindergarten 10.71 39.29 50.00 
Grade 1 22.22 22:22 55.55 
Grade 2 15.00 5.00 80.00 
College 0 0 100.00 


_ Note. Figures represent the percentage of subjects who con- 
sistently or inconsistently report that the constrained or uncon- 
strained story character most liked to play with the toy. 


CAUSAL ATTRIBUTIONS IN SOCIAL PERCEPTION 


employed the additive principle when the 
constraint was a reward, 

The sequence effect obtained in Study 2 
suggests that only a proportion of the increase 
in discounting from Study 1 to Study 2 was 
attributable to the change in wording from 
want to like. Specifically, in Experiment 1, only 
the reward—command sequence was employed. 
In Experiment 2, this sequence was shown 
to produce less discounting than the com- 
mand-reward sequence. It is highly likely, 
therefore, that at least some of the increment 
in discounting from Study 1 to Study 2 is 
due to the increase in discounting associated 
with command stories when they precede 
reward stories.° 

An analysis of the explanatory reasons for 
choosing the unconstrained choice shows that 
even the younger children gave intrinsic rea- 
sons in explaining their choices. Thus, on the 
reward stories, 89% of the kindergarten and 
88% of the Grade 1 children who chose the 
unconstrained story gave an intrinsic reason 
for this choice (e.g., “he played with it be- 
cause he liked it”). The corresponding per- 
centages on the command stories were 93% 
and 100%. Of the subjects who chose the 
constrained choice, 63% in kindergarten and 
86% in Grade 1 gave an extrinsic explana- 
tion for the target person’s toy choice. The 
percentages on the command stories were 
67% and 50%, respectively, for kindergarten 
and Grade 1 children, These percentages may 
be deceptively low because of the small 
number of subjects falling into these latter 
categories. 

One additional analysis was conducted on 
the subjects’ explanatory reasons for their 
choices. This analysis focused on subjects who 
either consistently discounted or added and 
examined whether they tended to give con- 
junctive intrinsic and extrinsic explanations 
for the story characters’ toy play. For ex- 
ample, if a subject were consistently additive, 
did he also explain the constrained charac- 
ter’s toy play with an extrinsic rationale 
while giving an intrinsic rationale for the un- 
constrained character’s toy play? The pro- 
portion of children who consistently employed 
the additive or discounting principle in the 
presence of either rewards or commands and, 
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TABLE 3 
PERCENTAGE Or CHILDREN Usine Conjuncrive 
INTRINSIC-EXTRINSIC EXPLANATIONS 


POR THEIR CHOICE 


Reward stories Command stories 


Experi» Experi- Experi- i 
Grade level i meld E TE 
Kindergarten 529 70,0 64.7 “688 
Grade 1 68.2 83.3 87.0 85.7 
Grade 2 80.0 94.4 80.7 89.5 


Note. The figures only include children who consistent: 
liber the additive or the discounting principle, Se? Se 


in addition, engaged in conjunctive use of 
intrinsic-extrinsic explanations is shown in 
Table 3. In both experiments, the conjunctive 
use of intrinsic and extrinsic explanations was 
greater than would be expected by chance 
for all age groups (p < .005)." 


Discussion 


The data from the two experiments reveal 
that use of the discounting principle increases 
with age and that it is more likely to be em- 
ployed when the extrinsic cause is a command 
as opposed to a reward. The data also suggest 
that most kindergarten children do not sys- 


We have recently collected additional data 
(Karniol & Ross, Note 3) from kindergarten chil- 
dren for whom only the reward stories were played, 
and the dependent measure was phrased in terms of 
which child really liked to play with the toy. Of 
the children tested, 559 responded in a consistently 
additive fashion, while 18% discounted, ‘This pro- 
vides further evidence that the increase in discount- 
ing in Experiment 2 is attributable to the presence 
of the command-reward sequence rather than to 
the change in the wording of the dependent mea- 
sure from want to like. 

7 An estimate of the probability of obtaining two 
consecutive patterns of conjunctive intrinsic-extrin~ 
sic choices was calculated by assuming that only 
three patterns can occur: intrinsic-extrinsic, €X- 
trinsic-intrinsic, and intrinsic-intrinsic. An extrinsic- 
extrinsic pattern could not occur, since it would indi- 
cate erroneous recall and lead to replaying of the 
two stories. Since on each type of constraint two 
such patterns must occur, the probability of any 
subject responding with consistent conjunctive €x- 
planations is approximately .10, A probability test 
based on the advice of Sam Himmelfarb (personal 
communication, August, 1975) showed that each of 
the obtained percentages was highly significant when 
compared with this value. 
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tematically engage in discounting. All of these 
findings are consistent with the results of 
Smith’s (1975) research. 

In contrast with Smith’s data, however, the 
present studies, which ensured recall of the 
experimental stories, indicate that the major- 
ity of kindergarten children do make reliable 
causal inferences for a behavior that occurs 
in the presence of an external constraint. The 
current data suggest that two quite distinct 
heuristics are used to infer causality in this 
instance. Some of the kindergarten children 
appear to employ systematically an additive 
model in the presence of a particular con- 
straint: More liking or wanting is inferred 
when both extrinsic and intrinsic factors can 
plausibly explain‘the occurrence of a behav- 
ior. Other kindergarten children seem to em- 
ploy the multiplicative model that is implied 
by the discounting principle (cf. Calder & 
Staw, 1975a); that is, more liking or wanting 
is inferred in the absence of extrinsic causes, 
These different heuristics are implied both by 
the children’s selection of a particular story 
character as really wanting (liking) to play 
with the toy and the children’s explanation 
for their selections. 

The proportion of children employing either 
heuristic (or responding inconsistently) can- 
not be exactly specified at any age level, for 
it appears to depend on the particular extrin- 
sic constraint and the sequence in which the 
stories are presented to the children, Whivs 
significant, though, is that even at the kin- 
dergarten level, many of the children appear 
to employ systematic modes of attribution in 
social perception. 

One of the reasons for undertaking this 
study of the discounting principle was to 
discover the implications of these findings for 
the research on children’s self-perceptions 

of intrinsic motivation. Smith’s research 
suggested that young children were incapable 
of discounting, a finding that was incon- 
sistent with an attribution interpretation of 
the data from studies of intrinsic motiva- 
tion. The results from Expermient 2 of the 
current research, however, indicate that ap- 
proximately one third of the kindergarten 
children and over 60% of the Grade 1 chil- 
dren consistently employed the discounting 
principle when inferring the causes of an- 
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other’s behavior. Piaget (1948) has specu- 
lated that children can employ more sophisti- 
cated processes in judging their own behavior 
than in judging the behavior of other people. | 
Tf this is the case, it is not unreasonable to 4 
assume that a substantial number of children 
in the intrinsic motivation studies are capable 
of employing the discounting principle with j 
respect to their own behavior. Thus, the ; 
attribution analyses of the intrinsic motiva- or 
tion findings do not appear to be ruled out | 
on the basis of young children’s cognitive | 
capacities. 

The data from the current experiments | 


suggest that use of the additive rule decreases E 


while use of the multiplicative rule increases f. 
with age. It is interesting to note that simi- pt 
lar developmental trends have been reported) cg 
by Baldwin and Baldwin (1970) on moral q 
judgments. In this study, more than one jy 
quarter of the kindergarten children reported y¢ 
that a child who was ordered by his mother * 
to be helpful was kinder than a child who h 
spontaneously engaged in the same act. By 45 
the fourth grade, almost all of the children ` 
judged the spontaneous act as kinder than 
the commanded act. In addition, Kohlberg | 
(1963) described a stage in moral evaluation 
where acts are defined as good or bad accord- 
ing to reward or punishment consequences. | 
Thus, if a forbidden act is rewarded, the ` 
young child will evaluate it as a good act, In 
later stages of moral development, Kohlberg 
suggested that the consequences of acts be- 
e less important in determining children’s 
evaltiations, 

TheNconcept of decentration provides a 
further parallel between the area of moral 
evaluation and the current discussion of at- ‘ 
tributional schema. Specifically, it has been 
suggested by Piaget (1948) and others (e.g., 
Flavell, Botkin, & Fry, 1968; Whiteman, 
1970) that young children’s failure to take 
intentions into account when making moral 
evaluations is due to their inability to con- 
sider two aspects of a situation at one time. 
Similarly, Smith (1975) and Schultz, But- 
kowsky, Pearce, and Shanfield (1975) have 
speculated that young children’s failure to 
employ the discounting principle is due to, 
their inability to decenter, that is, to focus 
on both intrinsic and extrinsic aspects of the 


~*~ 


situation at the same time. Smi 


his experiment, 

The present studies, however, would seem 
to raise some doubt as to the value of the 
concept of decentration in explaining the 


questions, children who consistently employed 
either the discounting or the additive prin- 
ciple were highly likely to use intrinsic and 
extrinsic factors in conjunction with each 
other. That is, a high proportion of children 
who inferred preference according to the addi- 
tive principle explained the constrained story 
character’s behavior in terms of extrinsic rea- 
sons and used intrinsic explanations for the 
unconstrained story character’s toy play. 
_ Since additive children seemed to be able to 
take into account both intrinsic and extrinsic 
) causes of behavior, it seems highly unlikely 
that use of additive principle merely reflects 
an inability to decenter, 
In summary, it does not appear that in the 
present experiments, at least, young chil- 
| dren’s failure to employ the discounting prin- 
ciple is due to their inability to decenter, 
Rather, some of the children appear to differ 
from adults in their view of the role of ex- 
trinsic factors, and of rewards in particular. 
, It is interesting to speculate as to the 
psychological processes underlying this devel- 
opmental trend. Conceivably, while adults 
and older children tend to view rewards as 
“bribes,” some of the younger children tend 
to view them simply as “bonuses” for under- 
] ' taking the activity. Differential perceptions of 
à | the reward could result from the failure of 
K Be young children to ascribe ulterior motives to 
J s= thers spontaneously (Flapan, 1968; Piaget, 
í A 1948). In fact, Piaget suggested that young 
' by “ children first pass through a stage of “objec- 
l Ce tive realism” during which they exclusively 
4 Ti. focus on the consequences of actions rather 
t than on the actors’ intentions. 
situ From this perspective, it would appear that 
Conj some young children may not question why 
Swg a reward is being proffered and may simply 
3 


le 
ogi 


Y 
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focus on the improved consequence, that is, 
getting to play and- getting a reward, On the 
other hand, when the cognitively more ma- 
ture child is offered a reward in a situation 
that is not ordinarily associated with rewards 
(e.g, toy play), he or she may question the 
motive for the offer, The subject may reason- 
ably infer that the adult in the story must 
assume that the child would find the target 
toy less enjoyable than the alternate toy. 
Why else would the adult offer the child a 
reward for playing with the toy? This line of 
reasoning could lead to the inference that the 
constrained child would like the target toy 
less than the unconstrained child and hence 
to the discounting of the constrained child’s 
intrinsic motivation. Consequently, the differ- 
ence between children who consistently en- 
gage in discounting as opposed to those who 
employ an additive model may lie in a differ- 
ential predilection for imputing motivation to 
the agent who promises rewards. 

This line of reasoning can be employed to 
interpret the apparent advantage of command 
Stories over the reward stories as well as the 


Sequence effect obtained in Experiment 2, 7 


Commands offer unambiguous evidence of the 
agents’ manipulative intent, while rewards are 
more subtle means of obtaining the same ma- 
nipulative goals. It is likely, therefore, that 
young children would be more inclined to 
discount intrinsic interest in the presence of 
a command than in the presence of a reward, 
The sequence effect may indicate, moreover, 
that command stories prime the child to ques- 


tion the ulterior motivation of the adults in 


the stories, and hence the command-reward 
sequence leads to greater use of the discount- 
ing principle than does the reward-command 
sequence. , 
The possibility that inferences of ulterior 
motives underlie the use of the additive and 
discounting principles can be directly exam 
ined. This analysis implies that only children 
who employ the discounting principle in a 
reliable fashion should tend to interpret a 
reward offer as a “bribe,” or as an attempt 
to manipulate the target person’s behavior, 
Those who employ the additive schema, on 
the other hand, should tend to view the 
reward simply as a “bonus,” or added incen- 
tive for engaging in the rewarded activity, 


ical processes underlying children’s inferential 
patterns is currently in progress. 
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situations and behaviors in situations. 


Cronbach (1957, 1975) has brought atten- 
ion to the problem of the two disciplines of 
scientific psychology—the emphasis on varia- 
tion due to situation (treatment) differences 
as opposed to the emphasis on variation due to 
person (individual) differences. Over the years 
there has been an increase in studies involving 
oth person and situation variables (Sarason, 
Smith, & Diener, 1975), though such studies 
have remained a small proportion of the total 
sersondlity and social literature. Within the 
vast few years there has been a noteworthy 
increase in emphasis on interaction research 
in personality (Bem & Allen, 1974; Bowers, 
1973; Carlson, 1975; Endler, 1973; Mischel, 
1973; Pervin, 1968) to the extent that one 
psychologist reviewing the literature concludes 


This paper was prepared while I was a Visiting Re- 
search Fellow at Educational Testing Service. The 
support of ETS and my colleagues in the Division 
of Psychological Studies is gratefully acknowledged, 
The research reported was also supported in part 
by a grant from the Rutgers University Research 
Council. My appreciation is expressed to Floyd 
Turner, who assisted in the analysis of the data. 
Portions of this paper were presented in a paper 
entitled “The Representative Design of Person- 
Situation Research” presented at the International 
Conference on Interactional Psychology, Stockholm, 
Sweden, June 1975. 

Requests for reprints should be sent to Lawrence A. 
Pervin, Department of Psychology, Livingston Col- 
lege, Rutgers University, New Brunswick, New Jersey 
08903, 


A Free-Response Description Approach to the Analysis 
of Person-Situation Interaction 


J Lawrence A. Pervin 
Livingston College, Rutgers—The State University 


This study is a report on the utilization of a free-response description approach 
to address the following question: In what ways does an individual remain 
stable and vary as a function of situational characteristics, and what are those 
characteristics? Each of four subjects generated a list of situations in his or 
her current life which he or she then rated on lists of situation traits, feclings, 
and behaviors, which were also generated by the individual subject, For each 
subject the data were factor analyzed to determine groups of situations which 
were distinctive in terms of the feelings and behaviors associated with them. 
It was suggested that the personality of each individual could be understood 
in terms of the pattern of stability and change in feelings and behaviors in 
relation to defined groups of situations. It was also suggested that the approach 
utilized might serve as the basis for the future development of a taxonomy of 


that “if interactionism is not the Zeitgeist of 
today’s personality psychology, it will prob- 
ably be that of tomorrow’s” (Ekehammar, 
1974, p. 1045). 

Despite the increased attention being given 
to framing personality questions in interaction 
terms, serious methodological and conceptual 
problems remain, In particular, we know little 
about the dimensions people use to perceive 
and organize situations or about the process 
of person-situation interaction, The Endler 
studies (Endler & Hunt, 1968) have been im- 
portant in calling attention to person~situae 
tion interaction effects. In these studies the 
subject is symbolically presented with a va- 
riety of potentially anxiety-arousing or hos- 
tility-arousing situations and indicates on & 
scale the likelihood of his or her responding in 
each of a number of different ways to each 
situation. The data are then analyzed across 
subjects to determine the proportion of the 
variance that can be accounted for by person 
differences, situation differences, and person- 
situation interactions, While emphasizing per- 
son-situation interactions, however, such 
studies have done little to help us understand 
the process of such interactions, Also, a prob- 
lem with these and other efforts to assess per- 
son-situation interactions through the use of 
questionnaires (Ekehammar & Magnusson, 
1973; Magnussion & Ekehammar, 1975) has 
been that the situations described and re- 
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sponses provided may have limited applica- 
bility from the standpoint of the individual 
responding to them. An analogy may be drawn 
here to the person perception literature where 
responses of subjects have been found to vary 
according to whether the persons judged are 
known to them and whether the traits used 
are familiar to them (Koltuv, 1962). If we 
are going to consider persons in their interac- 
tions with situations, then we need to look at 
situations and responses that are relevant to 
the subjects being studied. 

The aim of the present research is the de- 
velopment of a conceptual scheme and meth- 
odology to answer the following question: In 
what ways, and why, do people remain stable 
(consistent) in their behavior and feelings, 
and in what ways do they vary according to 
which situational characteristics? In terms of 
conceptual approach, the goal is a more dif- 
ferentiated understanding of the interactions 
and transactions between people and situa- 
tions as opposed to an emphasis purely on 
persons, purely on situations, or on a global 
Person X Situation interaction effect. The ef- 
fort is to understand personality as the indi- 
vidual’s pattern of stability and change in 
relation to defined situational characteristics. 
In terms of methodological approach, the ef- 
fort is to adapt some of the techniques of 
personality research and person perception 
(implicit personality theory) research toward 
the specific goal of understanding person—situ- 
ation interactions, In particular, emphasis is 
placed on the development of free-response 
methods, as opposed to checklists or question- 
naires, to study an individual’s perception of 
situations and his or her perceived pattern of 
behavior in relation to these situations. Such 
techniques permit one to study situations and 
responses that are meaningful for the indi- 
vidual and still provide for systematic data 
analysis. 


MetHop 


Procedure 


The approach to data collection used in this study 
was an adaptation of Rosenberg’s (Note 1) method 
of obtaining free-response descriptions of persons or 
“naturalistic personality descriptions.” Each of four 
subjects was asked to perform the following five 


tasks: 
List the situations in his or her current life. A 
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situation was defined as involving a specific | 
specific time, and one or more individuals i: 
in specific activities. Current life was defin 
cluding situations occurring during the pa 
The subjects reviewed their daily lives a 
situations that occurred and were of so 
tance. For example, situations noted inc 
items as presenting one’s ideas before a cias 
ing with one’s mother, studying alone, being 
party, and being alone with a good fri 
situation represented a concrete situation 
subject. Because the subjects generated their 
lists of situations, they were free to list situ: 
were relevant to them and to survey a br 
of situations over which their feelings and 
might remain stable or vary. 

Describe (adjectives, traits, phrases) edci 


This generated a list of situation traits for ou sub- 
ject. 

Describe how he or she feels in each sit This $ 
generated a list of feelings which, again rele- $ 
vant for the individual. 

Describe his/her behavior in each situat This | 


generated a list of behaviors in situatio 
Indicate the applicability of each situat! 
feeling, and behavior to each situation. The 
the first four steps was the generation by 
ject of four lists: situations, situation cha 
feelings in situations, and behaviors in 
the fifth step, the subjects scored the ay 
(2 = very applicable, 1 = somewhat applica 
not at all applicable) of the above iten 
situation. In other words, data were o 
all subjects on the situations in their live: } 
perceived these situations, and how they |) 
feeling and behaving in the situations. 


As noted, the data gathering procedure 1 was 
similar to that used by Rosenberg (Note in his: 
research on implicit personality theory, “owever, 
while Rosenberg asked subjects to list p traits 
of people, and feelings elicited by people, the current 


research asked people to list situations, charas 
of situations, and feelings and behaviors in 
Whereas Rosenberg asks questions conce 
co-occurrences of traits and feelings with 
current research asks questions concernin. 
occurrences of feelings and behaviors in T 
situations with defined perceived character 
procedure used also bears some similarity to 
cedure used by Block and Bennett (19: 
study of an individual’s perception of bi 
interactions in varying behavioral contexts. However 
in the Block and Bennett study the subject was 
asked to sort prepared statements into a fixed dis- 
tribution, and the behavioral contexts were limited 
to personally relevant interpersonal situations. 


Subjects 


The subjects in this study were four (two male, 
two female) Rutgers undergraduates. All subjects 
were told that the research involved the study of the 
ways in which they were the same and different at- 
cording to different situations in their current life. 
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TABLE 1 
ILLUSTRATIVE RESULTS FROM THE FACTOR ANALYSIS y 
S OF JENNIFER'S RATINGS OF 
SITUATION CHARACTERISTICS (n = 64), FEELINGS (n = 62), AND 
BEHAVIORS (n = 59) ror 23 SITUATIONS 


F 


(% var e) Illustrative situations Situation traits Feelings Behaviors 
Home-Volatile mother blows up at me emotional, angry, angry, pressured, sensitive, ETA 4, 
G5% honest with parents about volatile, excitable involved, insecure, caring, suppressed, 
leaving unhappy confused, not compulsive 
mother refuses gift 
¢ someone else comes home 
upset 
- have to participate in class demanding, threatening, self-conscious, self-conscious, controlled, 
have to perform at work pressuring, awkward, challenged, vulnerable, ambitious, determined, 
do the job wrong at work challenging, embarrass- awkward, pressured, compulsive, cool, 
in a strange place ing, unconcerned anxious responsible, diligent, 
nonrebellious 
Friend: se with friend—no problem emotional, gentle, caring, concerned, concerned, caring, 
(6%) with friend—problem friendly, generous comfortable, melan- emotional, involved, 
alone choly, sad insightful, responsive 
Uncert 6) come home from ambiguous, nondefined bottled-up, melancholy, preoccupied, detached, 
Philadelphia uncertain, un- sad, lonely, frustrated, quiet, self-conscious, 
4 in a crowd concerned, ignoring confused controlled, cool, 
taking the bus to class introverted 
want to leave to go to 
Philadelphia 


in a strange place 


sr analyses were run on the data for each 1-4 contain, for each of the four subjects, the 
factor analyses of (a) situations based on results of the factor analysis of situations 


it ratings, (b) situations based on feel- all ratin iven by the subject. 
s, (c) situations based on behavior ratings, across all r; 8S 8 y J 
ituations based on all three sets of ratings.* 
for each subject the data were factor 


Interindividual Comparisons 


. e. . p p 
¿ analyzed to determine those feelings and behaviors It is possible to compare the subjects in 
y h we i pacana to T porani with specie terms of the number and content of situations, 
situations, These data were taken to be expressive © Š . A : i . 
the v BN one perceives oneself to be stable traits (situation tats, feelings, bare 
and in feelings and behaviors in situations in and factors. In relation to situations, the num- 
| 


panis ber of situations listed by the subjects was 
not inordinately large, ranging from 23 to 29, 
RESULTS AND DISCUSSION In the Block and Bennett (1955) study the 

Because this study does not involve the test- subject listed 23 individuals with whom she 
ing of hypotheses, but rather involves an ef- had personally relevant relationships. Sa 
fort to develop a conceptual scheme and it would have been possible to be in more than 
methodology for the study of personality, 
there will be a simultaneous presentation Of 1 The BMD 08M factor analysis package was used, 
the results and discussion of them in terms of consisting of a varimax rotation, squared multiple 
the following four areas: interindividual com- correlations in the diagonal, and an eigenvalue greater 


f 
i i indivi the cutoff for selecting the number o 
parisons, ine ant ee rate Sotto The intercorrelations among 


terns of stability and change across situations, situations were factored and Dieslien ge hate 
relation of affects (feelings) to the perception tained for the traits rated. ia Rint wie ar 
of situations and behavior in situations, and by situaht™ pee ai i 


situation characteristics, feelings, and behaviors asso- 


implications for future research. ciated with these situations were identified from 


Since space does not permit the presenta- suet et a ape of situation. chat= 
> z S c + 
tion of the four factor analyses for each sub- Dee UIs Gnas, feelings ratings, and behaviors rat- 


ject, only the factor analyses of situations ings for the individual subjects are available on re- 
based on all ratings are presented.” Tables quest to the author. 


468 


ILLUSTRATIVE RES! 


ULTS FROM THE Factor AN. 


TABLE 2 


LAWRENCE A. PERVIN 


SITUATION CHARACTERISTICS (n = 89), FEELINGS (n = 63), AND 


BEHAVIORS (nm = 


58) FoR 25 SITUATIONS 


ALYSIS OF HARRY’S RATINGS OF 


Factor 
(% variance) 


Illustrative situations 


Situation traits 


Feelings 


Behaviors 


Home-Family 
(35%) 


Work-School (18%) 


Private Recreation 
(8%) 


Tension (5%) 


Friends and Public 


talking with parents 
eating dinner with wife 
visiting my in-laws 
talking with wife 


talking with patients at 
the clinic 

having lunch with 
patients and staff 

tutoring kids with 
problems 

participating in staff 
seminar 

seminar at school 


reading for pleasure 

going to museum in 
New York 

shopping in New York 


driving alone in bad 
weather 

arguing with my wife 

with patients who are 
acting out 


drinking in Mike's Bar 


warm, caring, friendly, 
relaxed, inviting 


thought-provoking, 
interesting, challenging, 
stimulating 


diverse, stimulating, 
enjoyable, spontaneous, 
fun 


frustrating, threatening, 
intense, tense, 
anxiety-producing 


innocuous, tolerant, 
permissive, pleasant 


affectionate, warm, 
secure, caring 


challenged, curious, 
actualizing, interested, 
fascinated 


excited, pleasure- 
seeking, without a care 
in the world, stimu- 
lated, uninhibited 


nervous, tense, anxious, 
under pressure, 
displeased, depressed 


riendly, easygoing, 
exhibitionistic, bored 


caring, warm, 
affectionate, easygoir 
emotional, pleasant, 
warm, open 


curious, extroverted, 
warm, friendly, 
questioning 


relaxed, intelligent 
elated, awed, cu 


frustrated, anxiou 
angry, emotion 


flashy, extrover 
friendly, so 


joking with friend Pete 
relaxing in Jack's 
apartment 
talking with friends 
in the snack bar 
at school 
drinking in a 
New York bar 


Reaction (3%) 


exhibition 


one type of situation with each person, there 
appears to be some agreement between the 
two studies in relation to the number of situ- 
ations individuals find necessary to list to 
capture the essence of their current lives. 
There appears to be considerable diversity 
among subjects in the types of situations listed 
and in the number and content of the char- 
acteristics associated with these situations. 
Obviously, the effort here needs to be directed 
toward a useful taxonomy of situations (Fred- 
eriksen, 1972). It should be clear that the 
study of many more subjects will be required 
before it is possible to develop such a tax- 
onomy. However, the factor analyses do sug- 
gest some groupings of situations, such as 
Home-Family, Friends-Peers, Relaxation-Rec- 
reation-Play, Work, School, and Alone. As 
Krause (1970) points out, there are in every 
culture certain traditional classes of social 
situations, The categories of situations listed 
would appear to have considerable generality 
across subjects and to involve distinguishable 


differences in effects on behavior. In 
words, it seems likely that most of the 
tions encountered by individuals could 
veniently classified into one of a rel 
small number of categories, with in 
differences found in the behaviors ass 
with the situations in the various cat: 
While it is possible to categorize si 
objectively, it is also clear that situa 
classified in the same category will b , 
ceived, experienced, and responded to quite 
differently by different subjects. In other 
words, the data from two subjects may each 
involve a Home factor and a Work factor, but 
the perceived characteristics associated with 
the situations grouped in these factors may 
differ (e.g., the Home and School-Work fac- 
tors for Jennifer and Harry in Tables 1 and 
2). This involves the traditional distinction 
made between the alpha press, or environment 
as it is, and the beta press, or environment as 
perceived by the individual (Murray, 1938; 
Pervin, Note 2). In terms of the latter, one 


ions so 
per- 
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ILLUSTRATIVE RESULTS FROM THE F, 


TABLE 3 


SITUATION CHARACTERISTICS (7 = 66), FEELINGS (n = 37), AND 


BEHAVIORS (n = 59) ror 2 


8 SITUATIONS 
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ACTOR ANALYSIS OF BEN’s RATINGS oF 


Feelings 


Behaviors 


humorous, participating, 
not guilty, happy 


eager to leave, aloof, 
bored, withdrawn 


emotional, affectionate, 
guilty, not free, not 
relaxed 


nervous, powerless, 
scared, learning 


together, interested, 
participating, not calm, 


uninhibited, receptive, 
friendly, masculine, 
humorous, tolerant, 
attentive 


loner, aloof, masculine, 
intolerant, adamant 


affectionate, loving, 
emotional, masculine, 
bright, sensitive, loyal, 
not uninhibited, 
compromising 


nervous, learning, 
growing, masculine, 
eager, not 
uninhibited, sleepy 


responsible, contributings 
serious, decisive, 


(% va Illustrative situations Situation traits 
Social-Friends drinking with Joe uninhibited, comfortable, 
talking with Paul open, flexible, receptive, 
harmony in the band masculine, talkative, 
with girl friend not demanding 
Y 
Work-Nonsti:: in a nonstimulating boring, unstimulating, 
Situations classroom masculine, independent, 
living with group in traditional, affected 
on by mood 
loading cartons at 
work 
at a boring party 
Family (8% visit my parents affectionate, loving, 
talk with my mother traditional, caring 
visit sister and 
her family 
i 
# Army (5%) working at Army humorous, authoritarian, 
headquarters new, serious, tense, 
in Army drills militaristic, traditional, 
short timer in Army structured 
Band-Confi onflict in the band challenging, demanding, 
new band musically tight, 
performance exclusive, frustrating 
_ may ask her there are common dimen- 
sions tha ividuals use to perceive situa- 
tions; tha ubjects may differ in the way 
«they perc a particular situation but be 
Similar in dimensions used to perceive 
Situations generally, Analysis of the factor 
score data the trait ratings suggests that 
there is a limited range of dimensions subjects 
Use to perceive situations and that there may 
be considerable similarity in the nature of 
these dimensions, In the data presented, four 


demanding, intolerant, 


not secure 
controlling, loyal 


or five factors were found in each case. If 
One considers all of the factor analyses run 
(ie, four for each of the five subjects), the 
Tange of number of factors in any one analy- 
Sis was from four to nine. The finding of a 
limited number of dimensions along which 
Situations are perceived is of interest in rela- 
tion to Miller’s (1956) work on the limits of 
Our capacity for processing information. What, 
then, might be the nature of the dimensions 
used by individuals to construe situations? 
Four dimensions appear to be most salient 
m the ratings of situations: friendly-un- 
friendly, tense-calm, interesting-dull, and 
constrained-free, These dimensions are simi- 
ar to the dimensions of semantic meaning 


identified by Osgood, Suci, and Tannenbaum 
(1957) and to the emotion-eliciting qualities 
of environments defined by Mehrabian and 
Russell (1974). The dimensions also are simi- 
lar to the three categories noted by Insel and 
Moos (1974) to characterize environments: 
relationship, personal development, and sys- 
tem maintenance and system change. 


Intraindividual Analyses and Patterns of 
Stability and Change Across Situations 


The data presented indicate considerable 
diversity across subjects in the situations 
listed and the feelings and behaviors asso- 
ciated with these situations, One important 
statement about the personality of the indi- 
vidual would appear to be the kinds of situa- 
tions he or she encounters in daily life. We 
could undoubtedly gain some insight into a 
person from an analysis of the range of s 
tions encountered, the frequency with whi 
various kinds of situations are encountered, 
and the time spent in each, A second inter- 
esting aspect of the analysis of the data from 
any one subject is the linkage among ae 
tions, For example, in the case of Jennifer 
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TABLE 4 


ILLUSTRATIVE RESULTS FROM THE FACTOR ANALYSIS OF JAN’s RATINGS OF 
SITUATION CHARACTERISTICS (% = 139), FEELINGS (n = 104), AND 


BEHAVIORS (n = 99) FoR 29 SITUATIONS 


Factor 


(% variance) Mlustrative situations 


Situation traits 


Feelings Behaviors 


sharing with a male friend 
on a date 

talk with a student 

at an old friend's party 

with older friend like a 
brother 


Peer-Male (33%) 
intellectual 


at work in Washington difficult, tiring, 
doing research in Boston 
at a male's party 

in a large, new group 


Work (10%) 


in personal counseling unique, special, 
session 

in an encounter group 

with a male therapist 

in a therapy group 

talking with mother 


‘Therapy-Support 
(1%) 


at home—general 
with my relatives 
fighting with mother 
with my brother 
drinking alone 


Family (4%) 
familiar 


easygoing, sociable, 
light, friendly, 


demanding, interest- 
ing, intimidating 


personal, important 


defensive, unaware, 
closed, lonely, 


mature, enjoying, laughing, 
interested, respectfu 
honest, healthy, 
extraverted 


fun, okay, mature, 
appreciation 


listening, not demand: 
fearful, shy, polite, í 
aloof, introverted 


shy, inadequate, 
overwhelmed, intro- 
verted, quiet 


loving, hopeful, 
questioning, feeling, 
grateful 


love, sadness, affection- 
ate, gratitude, tender- 
ness, closeness 


demanding, exploding., 
feeling, sharing, 
questioning, exper 
too much 


want attention, 
defensive, frustrated 


(see Table 1) the situation “mother blows up 
at me” loads on the same factor as “honest 
with parents about leaving,” suggesting prob- 
lems in the area of separation. In the case of 
Jan (see Table 2) “drinking alone” and 
“fighting with mother” load on the same fac- 
tor, suggesting a possible relationship between 
the two. 

What is particularly instructive about the 
data concerning personality and person-situa- 
tion interaction is that each individual has a 
reported pattern of feeling and behavior which 
in some ways is constant across all situations, 
in some ways is generally present except for 
some situations, and in some ways is almost 
never present except for a few specific situa- 
tions, For example, according to her reports, 
Jennifer is almost always sensitive, vulner- 
able, and ightful. She also is friendly, 
warm -pting most of the time except 


when ome volatile home situations, 
at wl is uniquely irritable, angry, 
upset, depressed, uncontrolled, and rebellious. 
She also tend involved and caring, ex- 
cept when certain situations where 
she is det ceupied, introverted, 
controlled, all subjects one can 


ney reportedly are 


define 
cir | havior throughout 


stable in sor 


ways 


all situations, varying in some of thei 
havior in relation to different groups of 
tions, and either never or very rarely e> ibit 
other behaviors. 


While some individuals may be introv ted 
in all situations and some individuals <- ira- 
verted in all situations, many individuais are 
introverted in some situations and extray ted 
in others. Individuals may differ, then, in the 
behaviors on which they are stable or varying 
(Bem & Allen, 1974) or in the kinds of situa- 
tions in relation to which their behavior 
varies. An illustration of the former would be 
an individual who shows cross-situational 
variability in being friendly but stability in 
being exhibitionistic as compared with an- 


other individual who is always friendly and 
sometimes exhibitionistic. An illustration of 
differences in the kinds of situations in rela- 
tion to which behavior varies would be an 
individual who is exhibitionistic in public 
settings and nonexhibitionistic in private set- 
tings as compared with an individual who fol- 
lows the reverse pattern. It would appear to 
make some sense to include both stability and 
change in our definition of the person, as well 
as the relationship of stable and varying be- 
haviors to specific kinds of situations. In sum, 
the data suggest that all subjects are stable in 
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some of their behavior across all situations 
and variable in some of their behavior in rela- 
tion t fferent situations, Whether a person 
exhibi's a specific behavior in a situation will 
depen’ on whether that behavior is part of 
one’ vertoire, if so, whether it is stable or 
varies according to the situation and, if vari- 
able, relationship to the particular situa- 
tion er consideration, One’s personality, 
then ) be defined as one’s pattern of sta- 
bility ond change in relation to defined situa- 
tions’ characteristics. 

Ac ing the view that all behavior reflects 
both son and situation, we may also ac- 
cept view that some behaviors are more 
perso’ ‘etermined for some people and other 
beha more situation determined for other 
peop Instead of deciding whether behavior 
is m erson determined or more situation 
deter ned, the task of research becomes that 
of un‘crstanding the person and situation 
force at account for the pattern of sta- 
bilit d change in behavior. The Moos 
(19¢ tudy has often been cited as evidence 
of the -ontribution of both person and situa- 
tion rs to behavior; however, what has 
been often cited is the finding that the 
relati roportions of variance accounted for 
by per-ons and situations varied according to 
the | ior considered, While relevant data 
were presented, one would expect to find 
that clative contributions varied not only 
acros aviors but also across individuals on 
the sa ehavior. What appears to be needed 
is the d of analysis suggested by Tucker’s 
(196 hree-mode (person, situation, re- 
sponse) tor analysis—that is, analyses that 
indicate how groups of individuals show dif- 
ferent patterns of behaviors in relation to 


&roups of situations. 


Relation of A fects to the Perception of 
Situations and to Behavior in Situations 


Within the context of this research, affects 
have emerged as an important basis for the 
Organization of situations, What is striking is 
the extent to which situations are described in 
terms of affects (e.g., threatening, warm, in- 
teresting, dull, tense, calm, rejecting) and 
Organized in terms of the similarity of affects 
aroused in them. In other words, we may 
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Organize situations not so much in terms of 
cognitively perceived similar attributes but in 
terms of bodily experiences associated with 
them. A somewhat similar view is expressed 
by Mehrabian and Russell (1974) who, as 
noted, emphasize the emotion-eliciting quali- 
ties of environments. The use of affects as the 
basis for categorization of phenomena has also 
been noted by Plutchik (Note 3) in relation 
to the perception of people and interpersonal 
behavior. 

A second important role of affects that 
emerges from this research is that of a medi- 
ator between the initial contact with a situa- 
tion and the expression of behavior in it. The 
data, particularly some of the separate factor 
analyses, indicated a very complex relation- 
ship between feelings and behaviors, This 
complex relationship can be illustrated in the 
results from the factor analysis of Jennifer's 
behavior ratings (see Table 5). In the first 
factor, Jennifer reports behaving the same 
way regardless of whether she is criticized for 
doing the job wrong or praised for doing the 
job well. In both situations she behaves in a 
determined, responsible, ambitious way, Yet, 
the feelings in the two situations, while simi- 
lar in some ways, are different in important 
ways. Ratings on the two situations correlate 
.49 in the factor analysis of behavior ratings, 
but they correlate —,05 in the factor analysis 
of feelings ratings. 

Whereas in the above case different feelings 
were associated with similar behaviors, we 
also have evidence of similar feelings being 
associated with different behaviors, For ex- 
ample, similar feelings are experienced in the 
situations described in the Home-Volatile fac- 
tor (see Table 1). In fact, in the factor analy- 
sis of feelings, all of the following situations 
load high on the first factor (Upset-Tension) : 
“home from school visibly upset,” “mother 
blows up at me,” “want to leave home to go 
to Philadelphia,” “mother refuses father’s 
gift,” and “honest with parents about leav- 
ing.” Although there are some common aspects 
to the behaviors in these situations, there are 
important differences as well. As the per in 
Table 5 indicate, in some of these situations 
Jennifer is sharp-witted and jocular, in some 
she is emotional and uncontrolled, and in some 
she is peaceable, suppressed, and loyal. 


472 


ILLUSTRATIVE RESULTS FROM THE FACTOR ANALYSIS OF JENNIFER'S RATINGS OF 
BEHAVIORS (n = 59) ror 23 SITUATIONS 


LAWRENCE A. PERVIN 


TABLE 5 


Factor (% variance) 


Illustrative situations 


Illustrative behaviors 


Pressure to Perform (32%) 


Uncertain—Potentially 
Moving (19%) 


Uncertain (9%) 


Involved (7%) 


Upset (4%) 


have to perform at work 
have to participate in class 
do job wrong at work 

praised for doing well at work 
In a strange place 

with friend with a problem 


return to my old neighborhood 
come home from Philadelphia 


want to leave home to go to Phila. 


talking with Mel 


in a crowd 

taking the bus to class 

caught not working 

with friend, no problem* 

come home from school pleased 
alone* 


honest with parents about leaving 
with friend with a problem 

come home from school upset 
mother blows up at me 


mother refuses father’s gift 
someone else comes home upset 


Self-conscious, ambitious, 
determined, responsible, 
involved 


sharp-witted, jocular, 
sensitive, moody, 
nostalgic, not diligent, 
not assertive 


detached, cool, controlled, 
introverted, self-conscious 


emotional, uncontrolled, 
involved, upset, confused, 
not jocular, not sharp- 
witted, not detached 


peaceable, ees 
insightful, loyal, not 
honest, not aggressive, 


mother blows up at me 
come home from Philadelphia 


not outgoing 


^ Negative loading on the factor. 


Thus, we have illustrations of the complex 
relationship between feelings and behaviors, 
wherein behaviors may be expressive of or 
mask feelings, depending on the meaning of 
the feelings for the person and the context 
within which they are experienced, This em- 
phasis on the complex relationship between 
situation, affect, and behavior is similar to 
Ekman’s (1971) emphasis on the situation as 
capable of eliciting different emotions in differ- 
ent people, with the behavioral consequences 
being governed by learned display rules. Situa- 
tions may also trigger the same facial affect 
program for two individuals, indicating that 
the same emotions have been elicited, but be- 
cause they have learned different display rules, 
the behavioral consequences will differ in the 
two cases. An understanding of the linkages 
between feelings and behaviors, as governed 
by perceptions of the situation and learned 
display rules, becomes another component 

of understanding person-situation interaction 
and another dimension of intraindividual anal- 


ysis. 


Implications for Future Research 


The research presented expresses an effort 
to develop an approach to the study of per- 
son-situation interaction that allows for repre- 
sentativeness of design (Brunswik, 1956; Per- 
vin, Note 4) and that goes beyond description 
of person-situation interactions toward an 
understanding of the processes involved. At 
least three major avenues of future research 
emerge. First, there is the need for more free- 
response description studies with many indi- 
viduals so that we can move toward a mean- 
ingful taxonomy of situations. This taxonomy 
should be allowed to emerge from considerable 
observation of the ways in which individuals 
perceive and respond to situations they en- 
counter in their daily lives. It is suggested 
that a priori taxonomies and taxonomies based 
on laboratory research will not be useful in 
advancing understanding and prediction; 
rather, observations related to individuals in 
situations in the natural habitat are needed. 

Second, there is the need for observations of 
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actual behavior in situations to accompany 


the reports of behavior presented in the cur- 
rent research, To the extent that the reports 
presented are matched by independent obser- 
vations, we are basing our work on a solid 
foun ation, However, we know that this may 
not *) vays be the case, In the Block and Ben- 


nett (1955) study it was found that the sub- 


ject descriptions of herself interacting with 
other individuals were matched by the de- 
scrij)i.ons of some individuals and not by the 
desc: tions of others, In neither case, how- 
ever was there the objective analysis of be- 
havi- in the situation, Naturalistic observa- 
tion ©’ behavior in a representative sampling 
of siivations is a difficult task, but the work 
of B -ker and his associates, the ethologists, 
and -havior modificationists are suggestive of 
what an be done (Barker & Wright, 1955; 
Bijou, Peterson, Harris, Allen, & Johnston, 
1969, ‘slurton-Jones, 1972). 


Fi: ly, the research presented suggests that 


it wo d be useful to consider the perception 
of situations and behavior in situations from 
a dev ‘opmental perspective, The conceptual- 
izatio: of the flood of detail of experience 
at th, more general level of situations, en- 
abliny ‘he individual to integrate his or her 
own rience and to anticipate the behavior 
of ot). vs, represents an important though lit- 
tle ur cerstood developmental achievement 
(Freese, 1975; Livesley & Bromley, 1973). 
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Testifying in One’s Own Defense: The Ingratiator’s Dilemma 


Arthur Frankel and William N. Morris 
Dartmouth College 


Testifying in one’s own defense poses a dilemma for defendants, That is, al- 
though they may have favorable information about themselves to offer such 
as the mitigating circumstances surrounding their offense, jurors may react 
negatively when a defendant offers testimony of this sort because they suspect 
that the defendant is either lying or attempting to manipulate them, Two 
studies showed that college students who role played as jurors in a case of 
plagiarism penalized a defendant more harshly when he offered testimony citing 
extenuating circumstances. When the same testimony was offered by impartial 
witnesses, the defendant received lesser penalties, The results of Study 2 showed 


that the “boomerang” effect, which occurred when the defendant cited mitigat- 
ing circumstances, was not mediated by impressions of the defendant's cred- 
ibility, Two alternative explanations of the data were discussed, 


‘| effects of communicator credibility on 


mess <e acceptance is an old issue in social 
psy: ology. In early research (e.g., Hovland 
& V ss, 1951), credibility, as manipulated, 
ofte: included elements of both trustworthi- 
nes: l expertise of the source. Such studies 
tend: | to show greater acceptance of the mes- 
sage nen the source was highly credible, al- 
thoi this was not always the case (Hoy- 
land © Mandell, 1952). 

M recently, interest in credibility has 
beer wed, Operationally, this research has 
exar l the effects of the presence versus 
abse of “ulterior” motives on believability 
and persuasion, For example, Walster, Aron- 


son, «id Abrahams (1966) found that com- 


munic* ‘ors arguing against their own best in- 
terest: were seen as more honest and were 
more }:-rsuasive than communicators advocat- 
ing a position that would benefit themselves. 


One of the most obvious instances in which 
the s; terest of a communicator threatens 
his credibility is the case of a defendant 
testifying in his own defense. In spite of the 
Tecent surge of interest in jury decisions, there 
has been little attention shown in juridical 
Teactions to the testimony of defendants. One 
exception is the classic work of Kalven and 
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Zeisel (1966) in which it is noted that the 
credibility of defendants in criminal proceed- 
ings is an important determinant of the ver- 
dict in close cases. Unfortunately, Kalven and 
Zeisel were unable to present much systematic 
evidence as to the variables affecting credi- 
bility. 

Spurred by the paucity of data on the im- 
pact of defendants’ testimony on verdicts and 
penalties, Morris and Frankel (Note 1) pre- 
sented college students with a case history in- 
volving a student caught plagiarizing a term 
paper. This type of case was selected, as op- 
posed to a case involving criminal proceedings, 
in an attempt to maximize the relevance, and 
therefore the impact, for their particular sub- 
ject sample. 

In a factorial design, Morris and Frankel 
(Note 1) manipulated various aspects of the 
defendant’s testimony, With respect to how 
the term paper was acquired, the transcript 
showed that the plagiarizer testified that he 
either asked another student for his term 
paper or had been offered it. With respect to 
his rationale, he stated either that he had used 
another student’s paper because he needed 
good grades to be admitted to graduate school 
or that he feared the embarrassment of a low 
grade. Finally, with respect to his past de- 
meanor as a college student, the plagiarizer 
either described himself as a loner who was 
malcontent and had not engaged in any civic 
or campus activities or as socially aor 
happy at school, and engaged in useful col- 
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fied to by the defendant or by impartial wit- | 
nesses. The circumstances are either mitigat- ` 
ing, that is, suggestive of lesser responsi lity 
on the part of the defendant, or not. it is 
predicted that the defendant who spor'ane- 
ously offers testimony of an exculpatory na- 
ture will be penalized more severely thax the 
defendant whose testimony does not (rep? cat- 


It had been anticipated that aspects of the ing Morris and Frankel), whereas this pa‘ ‘ern 
defendant’s own testimony that tended to should be reversed when the testimony is 
place him in “a good light” might be dis- provided by witnesses. 
counted by subjects (insofar as the testimony 
was clearly self-serving) with the result that EXPERIMENT 1 
assigned penalties would not be much af- Method 
fected. The actual results painted a quite dif- Subjects, The subject population for this ¢ peri- 
ferent picture. It was found that such testi- ment consisted of 192 undergraduate studeni> en- 


mony actually operated to the defendant’s rolled in an introductory psychology course. The 
detriment. Thus, a series of main effects and subjects were of both sexes, although the large ajor- 
ity were male. Subjects were solicited over the course 


interactions showed that the defendant who prea oH oin volunteer basis 
testified that he had been offered the paper, Procedure. The experiment was conducted two 
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lege activities. (Other variables manipulated 
in the study did not involve the defendant’s 
testimony and thus do not concern us here.) 
Subjects were assigned the task of recom- 
mending how much punishment (e.g., proba- 
tion, one-term suspension, expulsion, etc.) the 
defendant, who had admitted his guilt, would 
receive, 


that his plagiarism was influenced by his fear large, roughly equal-sized, group sessions. jects 
of not getting into graduate school, and that were first informed that the investigator ws in- 
he was a “good citizen” received the harshest terested in how students felt about “honor ode” 
penalties. violations. The investigator noted that he ught 
AIS A X that the best way to do this was to give stu s an 

Such results are in line with Jones’s discus- actual case that had been brought before ap- 
offer 


sions of the risk entailed when using the tactic propriate college committee and to have th 


of ingratiation (Jones, 1964; Jones & Wort- their opinions about it. A case history conce g an 
man, 1973). According to Jones, although alleged incident of academic plagiarism was ti disi 
favorable self-presentations may be effective tributed. Subsequent to reading the case hist sub- 
x a a A jects were asked to suggest an appropriate jalty, 

in enhancing the actor’s attractiveness to and in addition, they were queried about rim- 7 
alter, such a tactic may be dangerous, espe- pressions of the credibility of the testimony sud the 
cially when it is apparent to the target per- pasar i aaa of Teika i 
. + . iy esign and stimulus materials. The desi f the 
o Ea the AE ee to gain study was a 2 X 2 X 2 factorial. The first tv vari- 
y impressing him. In such a situation, the ables were intended to manipulate the appeal cir- 
target person may conclude that the actor is cumstances surrounding the incident of p'ssiarism. 
being deceitful and manipulative, attributions Thus according to the transcript, the paper ultimately 
ant or 


that are likely to have negative consequences Plagiarized was either solicited by the dei 
y 8 sa offered to the defendant by the author of the | 


r th ‘ aa z 
for the Eeee - and the motivation for the plagiarism was 
The situation of our defendant, then, com- avoid the embarrassment of a bad grade 


prises what Jones refers to as the “ingrati- crease the chances of getting into graduate $ 
ator’s dilemma.” To the extent that he has Finally, these circumstances were testified to < 
the defendant or unidentified witnesses. Th 


in i i i p = 

peas favorable to his case, it behooves nipulations were folded into a story which, regard- 
1m to make it known so s to enhance his less of condition, conveyed the facts that the de- 

attractiveness and reduce his penalty. In of- fendant admitted his guilt, had been doing C+/B~ 


fering it himself, though, he risks being per- Work in the course, was generally satisfied with 
school, got along well with other students, and pat- 


ceived as manipulative. According to Jones, |. . : de 
luti to this dil Si t ticipated in some extracurricular activities. 

gne See p PERS peal Aur Soule Manipulation checks and dependent measures. In 

third party to provide the testimony. order to be assured that the manipulations of ap- 


In order to test these notions, the present parent circumstances were successful, a separate sam- 
ple of 80 subjects were randomly assigned one of 


study replicates and extends Morris and , 

F A r PN s d- G t i the eight possible case histories and were asked two 
eS s ( ote 1) study. IECUINS, ances SUF- questions: “How much did coincidental circum- 

rounding the incident of plagiarism are testi- stances contribute to J.’s plagiarism?” and “How 
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much cid environmental pressures contribute to J 
plagiarism?” Both questions used 9-point scales an- 
chored at the low end by the phrase “not at all” and 
at the bigh end by “very much.” These questions 


were ned to assess the attribution of diminished 
respo ity inherent in assigning cause to the en- 
vironment, The former question was meant to be 
princip«!ly responsive to the conditions under which 
J. ob'sined the paper, whereas the latter question 
was (irccted toward the manipulation of motive or 
reas or plagiarizing. 

T} incipal dependent measure was the penalty 
subj: hose to assign to the defendant. Six differ- 
ent | lies were available to subjects. In order of 
incree og severity they are as follows: probation, 
tem; suspension for one term, temporary sus- 
pens or two terms, temporary suspension for 
three ns, indefinite suspension (the student is al- 
lowe reapply for admission after 2 years), and 
perm separation from the college. These penal- 


ties chosen, as they are the ones most fre- 


quent mployed by the college in actual cases, For 
purp: of analysis, these penalties were assigned the 
num! 1 through 6. 

In ition to assigning penalties, subjects were 
aske rate how “straightforward and candid” the 
testi was and how much personal responsibility 
the idant appeared to accept for the incident, 
For ialf of the subjects, these questions preceded 
the po «ity question, and for half they followed. The 
meas of perceived credibility and acceptance of 
respo ibility were included in order to determine 


mpressions on these dimensions were medi- 
expected effects in terms of the penalties 
selected. More specifically, a defendant who 
nuating circumstances may be suspected of 
ing in order to create a favorable impres- 
‘natively, he may be believed, but sub- 
v see his excuse making as an inappropriate 
to escape responsibility for his misdeed. 


Either »‘tribution could conceivably mediate penalty 
assignznont, Clearly, such effects are not expected to 
occur witnesses other than the defendant pre- 
sent the testimony. y 
Resul. 


tiveness of the manipulations. Re- 
; to the two manipulation check ques- 
tions indicate that the mitigating evidence was 
appropriately decoded by subjects. Analysis 
of variance revealed that responses to the first 
question, concerning the contribution of coin- 
cidental circumstances to the plagiarism, de- 
pended upon the way in which the defendant 
obtained the paper; when a student offered 
him the opportunity to read his paper, coin- 
cidenta] circumstances were said to contribute 
More (M = 5.80) than when he had asked for 
the paper (M = 5.00), F(1, 72) = 2.87, P< 
-10. Responses to the second question, con- 
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TABLE 1 
MEAN PENALTIES ASSIGNED TO THE DEFENDANT, 
EXPERIMENT | 


Means of acquiring 
the paper 


Source of testimony /rationale Solicited Offered 


Defendant 
Grad school 225 2.83 
Embarrassment 1.6 2.33 
Witnesses 
Grad School 2.08 1.79 
Embarrassment 2.92 2.54 


Nole, 
seh ml hte be 
cerning the contribution of environmental 
pressures, depended upon the apparent ra- 
tionale behind the plagiarism, Competition for 
graduate school was perceived by subjects as 
more influential (M = 7,10) in terms of induc- 
ing plagiarism than potential embarrassment 
(M = 6.08), F(1, 72) = 5.56, p < .025, 
Neither the main effect of source of testimony 
nor its interactions with either of the circum- 
stance variables approached significance, 

Penalty assignment, The means for penalty 
assignment as a function of condition are 
presented in Table 1. Analysis of variance of 
these data indicated that both types of testi- 
mony about the circumstances surrounding 
the plagiarism interacted significantly with the 
source varaible; for the means of acquiring 
the paper, F(1, 176) = 8.47, p < .01, and for 
the motive for plagiarism, F(1, 176) = 16.40, 
p < 001.1 There were no significant main ef- 
fects. 

Planned comparisons rather clearly support 


1In order to assure ourselves that the equal in- 
terval treatment of the data did no serious violence to 
reality, we did a scaling study. Fifty-four undergradu- 
ates were given the six penalties and asked to pèr- 
form a magnitude estimation task, The most lenient 
penalty, probation, was assigned the value 100, and 
subjects were asked to assign numbers to the other 
penalties based on how much more severe they 
thought them to be; that is, a rating of 250 would 
be used if the subject thought a particular penalty 
to be two and one half times as severe as probation, 
The median values for one-, two-, and three-term 
suspensions; indefinite suspension; and permanent 
separation were 250, 363, 492, 783, and 977, re 
tively. Analyses of variance and planned compan ons 
using these scaled values resulted in levels o sig- 
nificance virtually identical to those reported. 


F 
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the hypotheses of the study. When witnesses 
provided the testimony, the presence of com- 
pelling reasons and coincidental circumstances 
led to lower penalties; in the case of the 
means of acquiring the paper, the result was 
marginal, F(1,176) = 2.01, p < .20, but the 
result for the motive variable was significant, 
F(1,176) = 12.11, p < .001. However, when 
the plagiarizer was the source of the testi- 
mony, the opposite pattern obtained. Testify- 
ing that he was offered the paper and giving 
as his reason the desire to get into graduate 
school resulted in harsher penalties, F (1,176) 
=7.10, p<.01 and F(1, 176) = 5.38, p< 
.025, respectively. The striking nature of these 
data is perhaps best indicated by two sets of 
equivalences in Table 1: Penalties were mild- 
est and virtually identical when witnesses 
cited both mitigating circumstances (M = 
1.79) and when the defendant cited neither 
(M = 1.67); penalties were harshest and vir- 
tually identical when witnesses cited neither 
mitigating circumstance (M = 2.92) and the 
defendant cited both (M = 2.83). 
Impressions of credibility and acceptance of 
responsibility. These data were analyzed by 
means of 2 X 2 X 2 analyses of variance.” 
There were no significant main effects or in- 
teractions in either analysis. Planned compari- 
sons were carried out within the condition in 
which the plagiarizer offered the testimony. 
Here, again, there were no differences ap- 
proaching significance; that is, subjects did 
not find the plagiarizer less credible or less 
accepting of responsibility when he spontane- 
ously referred to two possible mitigating cir- 
cumstances as compared to when he offered 
none, Finally, correlations between perceived 
credibility and penalty assignment and be- 
tween perceived acceptance of responsibility 
and penalty assignment were calculated both 
for the entire sample and within cells. None 
of these correlations were significant. 
Unfortunately, there is reason to doubt the 
sensitivity of the measure designed to assess 
impressions of the credibility of the testimony. 
The problem is that the question referred to 
the testimony per se with no specific refer- 
ence to the plagiarizer’s testimony. Since all 
case histories, regardless of condition, con- 
cluded with testimony by the plagiarizer’s in- 
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structor and a review of his school records, 
these aspects of the transcript may ha 
erally inflated credibility ratings, mai 
less likely that differences in the credibil: 
the plagiarizer would be detected. 


EXPERIMENT 2 


Accordingly, a second study was u»der- 
taken in an attempt to remedy this posible 
deficiency. A stro% statement vouching for , 
the accuracy of the testimony was ins: ted 
into the case history just prior to the e ‘ual 
testimony. If the penalty effect in the ‘rst 
study had been mediated by impressio: s of 
credibility, then the presence of the credibility 
assurance should wash out the effect. A tepli- 
cation of the effects of the first study would 


suggest that perceptions of credibility were 
not mediating penalties. 
Method 

Subjects. Subjects were 47 students who were 
Solicited on a voluntary basis from a large soc’! sci- 
ence course. 

Procedure. The procedure was identical to (he one 
employed in the first study with two exce ‘ions, 

Sec- 


First, the only dependent measure was pena! 
ond, the following paragraph was inserted in case 


history just prior to the beginning of the transcript: 
The following testimony was heard be the 
Committee. It was corroborated and ve d by 

several other witnesses, so there is no d that g 
these were the actual circumstances sur: vnding 
the incident. In other words, the following in- 
formation provides an accurate picture of \\:e cir- 
hind, 


cumstances leading up to, and the reason 
this individual’s plagiarism. 
Design. The design of the study was a 2 `> 2 fac- 


torial. The circumstance variables from the firs! study 
were replicated, while the source of testim vari- 
able was dropped; that is, all manipulated ‘»stimony 


in Experiment 2 came from the defendant. 
Results 


The penalty data are presented in Table 2. 
As may be seen, they provide a fairly close 


2 It will be recalled that half of the subjects re- 
sponded to these questions after suggesting a penalty, 
with the other half responding prior to the penalty 
question. The analyses are collapsed across this vari- 
able since it did not interact with any other vari- 
ables. However, there was a tendency to perceive the 
defendant as less credible (p < .11) and less accepting 
of responsibility (p < .025) when these questions 
were asked subsequent to penalty assignment, These 
findings may reflect the tendency of subjects to justify 
their choice of penalty. 
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replication to their counterpart data from Ex- 
periment 1, A 2 X 2 factorial analysis of vari- 
ance revealed main effects of marginal sig- 
nificance for both types of mitigating circum- 
sisnces, F(1, 43) = 3.12, p < .10, and F(1, 
43) = 3.03, p < .10, for the means of acquir- 
ing the paper and the reason for plagiarism 
vo ables, respectively. The interaction did 
noi approach significance. 

addition, the penalty data in this study 
were combined with their counterpart data 
from the first study so that a 2 (Presence vs. 
Absence of a Credibility Assurance) x 2 
(deans of Acquiring the Paper) x 2(Motive 
fo: Plagiarizing) analysis of variance could be 
per‘ormed, The main effects for both types of 


mi. (ating circumstances were significant, F(1, 
13°) = 9.13, p < .01, for the means of acquir- 
ing ‘he paper, and F(1, 135) = 7.79, p < .01, 
fov ‘he reason for plagiarism, However, the 
an« ysis revealed that neither the presence 
vi absence of a credibility assurance main 
effc- nor its interaction with the circumstance 
vari bles approached significance. 
Discussion 

data from these, studies show that 
test mony about mitigating circumstances in- 
duces subjects to assign greater causal signif- 
ica to the environment for a misdeed. 
Win such testimony comes from a witness, 
it ics the expected effect of leading to a re- 
duciion in the penalty assigned the defendant. 
However, when the testimony comes from 
the defendant, it results in harsher penalties. 
This latter effect occurs without any apparent 
mediation by way of impressions of credibility 
or responsibility acceptance. 


s lack of mediation by perceived credi- 
is somewhat surprising both intuitively 
and from the point of view of Jones’s the- 
orizing about ingratiation attempts. According 
to Jones and Wortman (1973), the primary 
danger to the ingratiator making self-presen- 
tational claims is that the target may decide 
that he is making false or exaggerated state- 
ments. However, Jones and Wortman go on 
to suggest that even when the ingratiator’s 
self-enhancing statements are regarded as 
credible, they still may boomerang if the tar- 
get perceives the presentation as an attempt 
at impression management. In such cases, one 
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TABLE 2 


MEAN PENALTIES ASSIGNED TO THE DEFENDANT, 
EXPERIMENT 2 


Means of acquiring the paper 


Rationale Solicited Offered 
Grad school 2.69 2.85 
Embarrassment 1.73 2.70 


could conceivably invoke reactance motivation 
Seles, 1966) to explain the boomerang ef- 
ect, 

Such an explanation of our data seems un- 
likely given that the defendant’s citation of 
extenuating factors was meant to influence 
the college committee rather than the subject, 
A more likely argument centers about what 
sort of person subjects believe the self-exon- 
erating defendant to be, Either he (she) is 
seen as attempting to justify inappropriately 
his (her) action or, perhaps, he (she) is the 
sort of person who does not feel responsible 
for misdeeds when environmental forces con- 
tribute.’ Subjects may recommend harsh pen- 
alties because the defendant has one or both 
of these qualities or because one or both of 
these qualities are seen as increasing the likeli- 
hood that the defendant will commit future 
misdeeds. n 

Alternatively, one could focus on the de- 
fendant who presents little by way of a de- 
fense, that is, cites circumstances not thought 
to be conducive to his plagiarism. When as- 
signed light penalties, such a defendant is, 
perhaps, being rewarded for his forthrightness, 
There are some data suggesting that such ef- 
fects may occur, Jones and Gordon (1972) 
performed a study in which a stimulus per- 
son who volunteers the information that he 
was expelled from school for cheating is liked 
better than a stimulus person who withholds 
that same information until asked, Jones and 


8 We do not mean to imply that the presentation 
of justifications by a defendant will always backfire. 
Whether or not they will seems likely to depend upon 
how compelling the justification is. For example, a 
thief who can (credibly) claim that he stole in order 
to feed his starving children may well be able to 
engage the sympathies of a jury. However, even ki 
this case, we suspect that it would be to the ad- 
vantage of a defendant to have such testimony come 


from impartial sources. 
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Gordon suggested that by volunteering such 
information early in the interview, the stimu- 
lus person “advertises that he does not want 
to establish a relationship under false pre- 
tenses, and, by the very act of early dis- 
closure, he moves part of the way toward 
neutralizing the effects of the past indiscre- 
tion” (p. 364). 

The findings we have obtained augment the 
growing list of extralegal variables that have 
been shown to affect the decisions of simu- 
lated jurors (e.g., DeJong, Morris, & Hastorf, 
1976; Kaplan & Kemmerick, 1974; Landy & 

ronson, 1969; Mitchell & Byrne, 1973; Si- 
gall & Ostrove, 1975). Our particular results 
seem especially relevant to the decision a de- 
fendant must make as to whether or not to 
testify on his own behalf. Data reported by 
Kalven and Zeisel (1966) suggest that it is 
dangerous for the defendant to refuse to take 
the stand, especially if he has no prior record 
and the crime is serious, However, taking the 
stand is no guarantee of success. In close 
cases where a jury convicts while a judge would 
acquit, Kalven and Zeisel pointed out that 
sometimes “the defendant, in an apparent ef- 
fort to make a good case better, tells a story 
which neither the judge nor the jury believes” 
(p. 387). Although the judge, recognizing the 
defendant’s motivation, may be inclined to dis- 
miss this, the jury, according to Kalven and 
Zeisel, may decide to misbelieve all of the de- 
fendant’s testimony and vote for conviction. 

Note that Kalven and Zeisel assumed that 
differences in perceived credibility account for 
the differences in sentencing between judge 
and jury. Our data would appear to render 
that assumption less tenable. Whatever the 
reason, however, defendants seemingly must 
walk a tightrope, avoiding both silence and 
self-serving testimony, in order to achieve the 


most favorable outcome. 
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Informational Factors Underlying Inferences About 
Hypothetical Persons 


Robert S. Wyer, Jr., and Ronald L. Hinkle 
University of Illinois at Urbana-Champaign 


Procedures Suggested by Gollob’s S-V-O model of social cognition were used 
to investigate the manner in which information about two persons combines to 
affect both judges’ own evaluations of these persons and their predictions of 
these persons’ liking for one another, In one experiment, judges inferred A's 
liking for B on the basis of two adjectives describing A and two adjectives 
describing B; they also estimated their own liking for B, In a second experi- 
ment, judges estimated their own liking for A and B on the basis of adjectives 
describing A and B and A’s liking for B. Information was varied over several 
levels of favorableness. Results provided insight into the assumptions on which 
judges based their inferences. Judges weighted adjectives with extreme implica- 
tions relatively more heavily both when estimating their own liking for persons 
described by them and when predicting others’ liking for these persons; how- 
ever, they weighted unfavorable adjectives more heavily than favorable ones 
only when estimating their own liking for persons, Although the evaluative 
implications of the information presented often had interactive effects on in- 
ferences, these effects were not always of the sort implied by cognitive balance 
theory. The contributions of several factors other than balance, and of the de- 
scriptive implications of the information presented, were identified and in- 


terpreted. 

Re arch on the manner in which informa- gated are interrelated. An integration of the 

: tion ut persons affects liking for these per- two conceptual and empirical approaches is 
sons traditionally been of two types. One suggested by the Subject-Verb-Object (S-V- 
type, -cmplified by the research on impres- ©) formulation of social cognition developed 
sion nation (for reviews, see Anderson, by Gollob (1974b). The present paper re- 
1971 yer, 1974), has been concerned with ports the results of two investigations, each 
how evaluative implications of different performed within the conceptual framework 
person: ‘ty trait adjectives combine to affect suggested by this formulation, One study 
judge: -stimates of how well they would like concerns the manner in which information 
a per described by the adjectives. A sec- that affects a judge's liking for each of two 
ond, ulated by cognitive balance theory persons affects the judge’s belief that these 
(Feat! 1967; Heider, 1958; Newcomb, persons will like one another. The second 
1968), has focused on the interrelations investigates the manner in which information 
among a judge’s own expectancies to like each describing two persons and their liking for 
of two versons and his or her belief that one another affects a judge’s own expect- 


these persons like one another, The theoreti- 
cal bases for these two areas of research dif- 
fer, However, the empirical issues investi- 
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ancies to like these persons. 

Since S-V-O methodology is presented in 
detail elsewhere (Gollob, 1974a, 1974b; 
Wyer, 1975), it is described here only 
briefly. Gollob notes that the essential fea- 
tures of a social interaction situation may 
often be conveyed in a single sentence of the 
form “adja A verbs adj,B” (egy “warm A 
likes unfriendly B”), where adj, is an adjec- 
tive representing a value along an attribute 
dimension (S) used to describe the subject of 
the sentence, adja denotes a value along an 
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attribute dimension (O) used to describe the 
object, and the verb denotes a value along a 
dimension (V) pertaining to A’s behavior or 
subjective reactions to B.* S, V, and O may 
be either stimulus dimensions along which 
information is presented or response dimen- 
sions along which judgments are made. Judg- 
ments may be expressed as a linear function 
of the contributions of a series of informa- 
tional cues, each cue being defined in terms 
of values along one or more stimulus dimen- 
sions and also the dimension of judgment. 
Which cues contribute to judgments, and the 
nature of these contributions, often provide 
insight into the assumptions that judges make 
in arriving at their inferences (Rossman & 
Gollob, in press; Wyer, 1975). 

An important feature of S-V-O methodol- 
ogy lies in the fact that the contribution of 
each cue to judgments can be extracted using 
standard analysis of variance procedures. 
When only two dichotomous values are 
available along each dimension, the contribu- 
tion of each cue may be inferred from one of 
the seven orthogonal contrasts tested in a 
standard analysis of variance of judgments as 
a function of values along the three dimen- 
sions. Moreover, when more than two values 
are manipulated along each dimension, the 
interactions involved in such an analysis may 
often be partitioned into several components, 
each corresponding to the contribution of a 
different cue (cf. Gollob, 1968a, 1968b). 
These cues are most likely to appear when 
the information presented varies along two or 
more underlying psychological dimensions, 
each of which contributes to judgments in a 
different way. 

The S-V-O formulation and the procedures 
of analysis involved in its application have 
been of considerable value in analyzing the 
informational determinants of social judg- 
ments (cf. Gollob, 1968a, 1974a; Wyer, 1974, 
1975; Wyer, Henninger, & Wolfson, 1975). 

Although this formulation was not applied 
rigorously to the series of studies reported in 
this paper, aspects of this formulation and 
the methodology suggested by it were of con- 
siderable conceptual value in predicting and 
interpreting the informational determinants of 


judgments. 
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EXPERIMENT 1 


Wyer (1975) found that estimates of one 
person’s (A’s) liking for another (B) were 
affected not only by the adjectives describing 
B but also by the adjectives describing A ənd 
by the evaluative similarity of the adjectives 
describing A and B. At a gross level, tese 
results are not very surprising. The favora’le- 
ness of the adjectives describing B may a! ct 
a judge’s perception that B is popular, vad 
therefore is apt to be liked by persons in on- 
eral (including A), whereas the favoralle::<ss 
of the adjectives describing A may affect he 
judge’s perception that A is generally frien- iy, 
and thus is apt to like persons in gev val 
(including B). Moreover, to the extent | at 
the evaluative implications of the adject ves 
describing A and B establish a sentiment v ia- 
tion between each of these persons and he 
judge, the effect of evaluative similarity is 
consistent with the predictions of cogni’ ve 
balance theory. That is, the judge should «x- 
pect A to like B if the judge personally ei!’ er 
likes both A and B or dislikes both, but ne 
judge should expect A to dislike B if |ne 
judge’s own sentiments toward A and B cif- 
fer in valence. 

The general effect predicted by cognit ve 
balance theory seems intuitively reasonal ‘ce. 
Judges may assume that if persons are ic- 
scribed by evaluatively similar adjectives, 
they are similar in general values and iti- 
tudes and, therefore, are apt to get along well 
together. In other words, perceived similarity 
may lead to perceived attraction (Byrne 
1971). However, the precise manner in 
information about two persons affects percep- 
tions of their similarity is unclear. On one 
hand, judges may perceive the similarity of 
persons to be an inverse function of the abso- 
lute difference between the implications of 
the adjectives describing these persons. On 
the other hand, the perceived similarity of 
persons may be a positive multiplicative func- 
tion of the implications of the adjectives de- 
scribing them. The second hypothesis, unlike 


1Note that a triad of relations of the sort to 
which balance theory applies may be described in 
such a sentence. For example, a situation in which 
P likes A, A dislikes B, and P dislikes B may, from 
P’s point of view, be conveyed by the sentence 
“likable A dislikes dislikable B.” 
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TABLE 1 
List or STIMULUS ADJECTIVES USED IN EXPERIMENTS 1 AND 2 


Adjectives describing A 


Adjectives describing B 


Xı Xs 


Yı Ya 


Replication 1 


Note, Normative : 
in parentheses. 


the first, imp! 
judged more si 
by extreme vai 
‘sion than if th 
‘trate values alo 
_ The present 
the above tw 
ance” effect. It 
eral issues, First 


about B (object 
to affect these | 


Diychology students 


A stimulus items. 


The subjects were 


ectives, describing 


as for B and B's l 
= estimates, participa 


Personally would expect to like a person de- 


grateful (4.82) 


excited (3.52) 
argumentative 


informal (4.16) 


unintelligent (1,68) 


prejudiced (1.06) 


good-tempered 
tiscreet. (4:16) 


respectful (4.83, 
studious (4.18) BOS) 


sensitive (3,58 
(2.27) silent Gs à 
angry (1.69) 
hard-hearted (1,07) 


(4.82) bright (4.83) 
self-confident (4.21) 


talkative (3.51) proud (3.58) 


troubled (2.35) 
unfair (1.07) 


istless (1.69) 


% frivolous (2.37) 
noisy (1.73) 


superficial (1,09) 


\lues reported by Anderson (1968), along a scale from 0 (least favorable) to 6 (most favorable), are given 


that two persons will be 
if they both are described 
ilong an attribute dimen- 
th are described by mod- 
‘nis dimension. 
y explores the merits of 
rpretations of the “bal- 
explores two more gen- 
ompares (a) the manner 
formation about A (per- 
are combined to affect 
ng for B with (b) the 
jar pieces of information 
mation) are combined 
tions. Second, it com- 
mation to predict an- 
erson with the use of 
to estimate one’s own 


and female introductory 
rticipated in the study to 


lla course requi t. The experiment took 
proximately 30 mir to complete. Participants 
Me told that the stuc was concerned with the 
nner in which persons form impressions of others 
ki basis of smal ints of information they 
Ve about them, Pa pants received a series of 


I 


a 


item consisted of two 
erson (A) and two adjec- 
mn (B). Based on the 
ges estimated both A’s 
for A. After making 
estimated how much 


scribed by each of 16 pairs of adjectives, These 
pairs were the same as those used in the first part 
of the study to describe A and B. 

Preparation of materials. Two stimulus replica- 
tions were constructed using the normative data 
compiled by Anderson (1968). The adjectives com- 
prising these replications, along with their normative 
scale values, are shown in Table 1. Adjectives in 
two sets, Xs and Ys, were at four levels of norma- 
tive favorableness: H (very favorable), M+ (mod- 
erately favorable), M— (moderately unfavorable), 
and L (very unfavorable). Adjectives in the other 
two sets, X, and Yı, were selected such that the 
scale value of one adjective in each set (H—) was 
midway between the scale values of the H and M+ 
adjectives in the X: and Ye sets, and the scale value 
of the other (L+) was midway between those of 
the M— and L adjectives in these sets. 

Within each replication, all eight possible pairs 
consisting of one X, and one Xe adjective, and all 
eight possible pairs consisting of one Yı and one Ya 
adjective, were constructed. Then, each XX, pair 
(used to describe A) was combined with each YiYs 
pair (used to describe B) to form a total of 64 
different sets of descriptions. These stimuli com- 
prised a 2X 4X24 factorial design consisting of 
two levels of favorableness of one adjective describ- 
ing A (X;), four levels of favorableness of a second 
adjective describing A (X+), two levels of favorable- 
ness of one adjective describing B (¥:), and four 
levels of favorableness of a second adjective describ- 
ing B (¥:). 

Presentation of materials. The 64 test stimuli com- 
prising each replication, preceded by five practice 
stimuli that covered the full range of normative 
likableness, were presented in a questionnaire. Stimu- 
lus information was presented in the following form: 
“A is and . Bis and a 
After reading each set of descriptions, judges first 
estimated A’s liking for B along a scale from —5 
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(dislike very much) to 5 (like very much) and then 
estimated B’s liking for A along a similar scale. 

There were four forms of the questionnaire. In 
one form, the 64 test stimuli were presented in 
random order, with X: adjectives preceding 
adjectives in each description of A, and Yı adjec- 
tives preceding Ys adjectives in each description of 
B. In a second form, stimulus sets were arranged 
similarly, but Xs and Y adjectives preceded X: and 
Y, adjectives, respectively. The third and fourth 
forms were similar to the first and second, respec- 
tively, except that test stimuli were presented in 
the reverse order. 

‘An equal number of judges completed each form. 
Thus, averaged over questionnaire forms, the order 
of presentation of the test stimuli and the order of 
presentation of adjectives within each stimulus set 
were both counterbalanced. In addition, having 
judges estimate both A’s liking for B and B’s liking 
for A eliminated a confound that might otherwise 
have existed if certain adjectives always described 
the evaluator (the person whose judgments were 
being predicted) and certain others always described 
the object of evaluation. 

‘After completing one of the questionnaires de- 
scribed above, all judges received a second question- 
naire, containing the 16 two-adjective sets used to 
describe A and B in the first form they had com- 
pleted, and estimated their own liking for a hypo- 
thetical person described by each set. These judg- 
ments were made along the same scale used in re- 
porting estimates of A’s and B’s liking for one an- 
other. The order of presentation and the order of 
adjectives in each description were again counter- 
balanced. 


Results and Discussion 


In terms of S-V-O methodology, judges’ 
predictions of A’s liking for B and B’s liking 
for A may each be conceptualized as infer- 
ences of a sentence subject’s liking for a sen- 
tence object, based on information varying 
along two subject dimensions, Sı and Sz (the 
favorableness of each adjective in the pair 
describing the evaluator), and two object di- 
mensions, O, and Oz (the favorableness of 
each adjective in the pair describing the ob- 
ject of evaluation). Data were reorganized to 
allow an analysis of these estimates to be per- 
formed as a function of values along each of 
the four informational dimensions, stimulus 
replications, the group of adjectives used to 
define Sı and Sz (X or Y), and the order of 
presentation of adjectives within each stimu- 
lus pair. 

On the basis of previous research (Wyer, 
1975) and the considerations raised earlier, it 
seemed reasonable to expect on a priori 
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grounds that predictions of how mech thei 
subject would like the object would increase 
(a) with the favorableness of each of the 
adjectives describing the object, (b) ith the 
favorableness of each of the adjectives de- 
scribing the subject, and (c) with tte simi 

tions, 


larity in favorableness of these de 
Moreover, if the adjectives in eac! scrip- 


tion are differentially weighted (Birnbaum; 
1974), the two adjectives describing ‘€ sub: 
ject, and also the two adjectives d ribing 
the object, should have interactive effects) 
Results support these general exp ations. 
Speciñcally, all four main effects of t:> infor 

1, and ) and 


mational variables (Si, S2, 
all six two-way interactions invol\ 
variables were significant ($ < 02 
ever, only one higher order inte 
volving informational variables (si 
Sı, Se, and 01) reached signific , FG 


210) = 4.18, P< 01.8 The specifi ture 
these results and their implicatic or th 
issues of concern in this paper 4 scribe 
below. 
Effect of object information. es’ pre 
dictions of how much the subje« ld lik 
the object increased with the fav ness í 
each of the adjectives describin objet 
(p < 01). However, the interaction 9! valut 
along O; and Oz was also sig nt, F 
210) = 3.45, p < .025. This iction | 
described in Figure 1a. It is att: table pi 
the t 


marily to a slight convergenc: 
curves when Os is extreme. That is, Ú 
effect of O, (as inferred from ths en 
between the two curves) is less Y hen 
or L (mean difference = .40) than when 
is M+ or M— (Maite = -62) 
8.45, p< 01. Residual cor 
overall interaction were nons 
140) < 1.0. This convergence 


might be í 


2In light of the number of design effects off 
terest, a more conservative criterion of signife 
(p <.025) was used in this experiment. 4 

8Here and in other analyses to be reporte 
this paper, certain interactions involving stimulus i 
lications and the particular set of adjectives “fc 
describe stimulus persons were also sign 
These interactions suggest that evaluations 
based in part on the descriptive implications j 
information presented as well as its eva: uat 
plications per se. Certain of these effects W 
considered presently. 


of 
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Prediction of Subject 's Liking 
for Object 
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B. Subject Information 


10 
S,=H- 
tLe 
5 S, 
o 
-5| 
10 
L M- M ÈH 


FIGURE 1. Judges’ predictions of the subject’s liking for the object as a function of the favor- 
‘bleness of adjectives describing the object and the favorableness of adjectives describing the 
subject (Experiment 1). (H= very favorable; M+ = moderately favorable; M— = moderately 


nfavorable; L = very unfavorable.) 


tr (ed to a floor or ceiling artifact of the 
bo: ded response scale used in this study. 
Ho «ver, such a possibility seems unlikely, 
becuse the mean judgments contributing to 
thi: interaction (between —1 and 1; see Fig- 
ure la) are substantially less extreme than 
the scale endpoints (—5 and 5), It is conceiv- 
able that a transformation of the response 


sca’ would eliminate much of the extremity 
effect (Birnbaum, 1974), However, the de- 
creased effect of O, when O; is extreme seems 
more likely to indicate an averaging process 
of information integration in which the rela- 
tive weight attached to each adjective in- 
creases with the extremity of its evaluative 
implications, This interpretation, which seems 
plausible in light of the relatively low am- 
biguity of adjectives with extreme implica- 
tions (Wyer, 1973), is consistent with theory 
and research reported elsewhere (Osgood, 
Suci, & Tannenbaum, 1957; Wyer & Watson, 
1969, Experiment 1). 

Effects of subject information. The effects 
of information about the subject roughly 
paralleled the effects of information about 
the object. That is, although predictions of 
the subject’s liking for the object increased 
with the favorableness of each adjective de- 
Scribing him or her (Sı and S2), the interac- 
tive effect of this information was also sig- 
nificant, F (3,210) = 6.79, p < .01. This in- 
teraction, shown in Figure 1b, is not quite as 


regular as the interaction involving informa- 
tion about the object. However, the two 
curves again converge at the extremes; the 
difference between them (and thus the influ- 
ence of S;) is less when Ss is H or L (Maire = 
18) than when it is M+ or M— (Matt = 
45), F(1, 70) = 16.28, p < .01. Moreover, 
the residual interaction, after eliminating the 
effects of differences in the extremity of So, 
is nonsignificant, F(2, 140) = 1.56, This also 
suggests an averaging process in which the 
relative influence of each adjective used to de- 
scribe the subject increases with the extremity 
of its evaluative implications. 

Combined effects of subject and object 
information. Predictions of A’s and B’s liking 
for one another were expected to increase with 
the similarity of the evaluative implications 
of the adjectives describing them. However, 
it was unclear whether judges’ perceptions of 
this similarity, and thus their predictions, 
would be (a) a function of the absolute dif- 
ference in favorableness of the adjectives 
describing A and B or (b) a positive multi- 
plicative function of these favorableness lev- 
els. The four significant two-way interactions 
involving adjectives describing the subject 
and adjectives describing the object bear upon 
these alternative possibilities (specifically, the 
interactions of S, and O,, F(1,70) = 6.98, 
p< Ol; of Sı and Oz, F(3,210) = 3.73, p 
< .025; of S2 and 01, F(3, 210) = 3.24, p< 
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A. S x0, B. 


Ficure 2. Residual interactive effects of subject (S) and object (O) information on 
ences of subjects’ liking for objects (Experiment 1). (H = very favorable; M+ = moderate 
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vorable; M— = moderately unfavorable; L= very unfavorable.) 


025; and of Sy and Oz, F(9, 630) = 17.97, 
p< .01). The nature of these interactions, 
uncontaminated by the independent contribu- 
tions of subject and object information, can 
be seen by eliminating the contributions of 
the two relevant main effects from each cell 
of the interaction and plotting the residuals 
as a function of values along the subject and 
object dimensions involved.‘ 

The S, X O, interaction is undiagnostic 
because only two levels along each dimension 
were manipulated. It simply shows that per- 
sons were predicted to like one another more 
when they were described by similarly polar- 
ized adjectives (mean residual [Mres] = 
.065) than when they were described by dis- 
similarly polarized ones (Mres = —.065). The 
Sı X Oz and Sə X O; interactions, which are 
plotted in Figure 2a and 2b, are somewhat 
more informative. The normative scale value 
of the favorable (H—) adjective along S, is 
midway between the scale values of the two 
favorable (H and M+) adjectives along O2 
(see Table 1). Therefore, if judges’ estimates 
of persons’ liking for one aonther were based 
on the absolute difference in the evaluative 
implications of the adjectives describing these 
persons, their estimates should be the same 


a 

S, x0, 

1,=M+ 

iH 

SM- 

Spek 

i An 

s, 0, 

r- 

a- 
when the values of Sı and Oz are H— and Hy f 
respectively, as when these values +e He 
and M+, respectively. For similar reasons, ji 
judges’ estimates should be the save when 
the values along Sı and Oz are L+ nd L=; : 
respectively, as when these values are Lt 
and M—, respectively. Predictions nvolving 
Sz and O; are analogous. The data d scribed 
in Figures 2a and 2b provide little support 
for these predictions, particularly when the 
adjectives involved are favorable. Except for 


one point (in Figure 2a, when Oz is mod- » 
erately unfavorable), these data are consis- 
tent with the hypothesis that the favorable- 
ness of subject and object information com 
bines multiplicatively to affect judgments. $ 
(The deviant point is probably due to the | 


sstimated 
4In general, the residual (Rij) may be estimate! f 
RA 


from the equation 


Ry = My — Ma — Mj = M., } 
where M: is the mean of the cell in row ? aa 
column j, M:. is the marginal mean of row 4 i 
is the marginal mean of column j, and M. is wail 
grand mean. In this particular case, the row 
column variables correspond to values alone 
subject and object dimensions involved in the ¥ Í 
action. ! 
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unique descriptive implications of the M— 
adje ves used to describe O; we will return 
to th’. matter presently.) 

Į esidual interaction of S and O% is of 
grex interest because it involves four val- 


ues ig each dimension. Data relevant to 
thi raction, shown in Figure 2c, also do 


not ort the “absolute difference” hypothe- 
sis, his hypothesis were correct, judges 
shou expect persons to like one another 
mor ven these persons are described by ad- 
jec with the same evaluative implications 
tha ien they are described by adjectives 
with {ferent implications, This was not al- 
way. ‘he case. Controlling for the effects of 
eac! scription considered alone, persons 
wel ged to like one another more when 
bot ve described by extremely favorable 
(H extremely unfavorable (L) adjectives 
than shen either one or the other was de- 
scri vy a less extreme adjective. However, 
pers. were judged to like one another less 
whe sth were described by M+ adjectives 
(M .07) than when one was described 
by (+ and the other by an H adjective 
(M,, 21). They were also expected to like 
one another less when both were described by 
M- zdjectives (Mres = —.16) than when one 
was described by an M— and the other by 
an L adjective (Mres = 14). 

On the other hand, the combined effects of 
subjects and object information cannot be 
completely accounted for by a simple multi- 
plicative function of the evaluative implica- 
tions of this information. For one thing, the 
effect of the adjective describing the object 


depends much less on the extremity of the 
adjective describing the subject when this 
latter adjective is favorable (H or M+) than 
when it is unfavorable (M— or L). In addi- 
tion, the residual evaluation when Sz and Oz 
are both moderately unfavorable (M-—) is 
much more negative than would be expected 
if the combined effects of values along these 
dimensions were simply a multiplicative func- 
tion of their evaluative implications alone. 
This latter effect may be a result of the idio- 
syncratic descriptive implications of the M— 
adjectives used in this study. The two adjec- 
tives representing M— values along Sz and 
Oz were argumentative and silent in one rep- 
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TABLE 2 


Summary OF SUBJECT X OBJECT INTERACTION 
Factors Extracten Usine FANOVA 


Leveliof Factor 1 Factor 2 
stimulus Subject Object Subject Object 
information loadings loadings loadings loadings 

Replication 1 
H-H 36 44 32 26 
H— M+ 31 -36 23 22 
H—M-— 06. —.13 -57 67 
H-L —41 —38 20 —.03 
L+H 22 30 02 —.54 
L+M+ 37 23 —07 . —.30 
L+M— —.36 =—.28 -A —.17 
L+L —54 4 S mM 

Replication 2 
H-H -50 -60 40  —0l 
H— M+ .23 16 — 26 39 
H-—M— A3 19 39 24 
H-1 —48 =—.53 —.19 123 
L+H .23 27 04 =A 
L+M+ —.01 —.06 -73 =—04 
L+M— 02 —.21 34 — ót 
L+L -6 —43 2B 08 


Note, FANOVA = factor analysis of variance. 


lication, and troubled and frivolous in the 
other. In each case, it seems intuitively likely 
that the behavior of a person described by 
one adjective would be particularly abrasive 
to a person with the attribute implied by the 
other. 

A more general understanding of the con- 
tributions of descriptive and evaluative fac- 
tors to the interactions shown in Figure 2 
can be obtained by applying the factor analy- 
sis of variance (FANOVA) model developed 
by Gollob (1968b). This model enables two- 
way interactions to be decomposed into sev- 
eral interaction factors, each reflecting a 
multiplicative relation between the subject 
and object dimensions defined on the basis of 
these factors. In applying the model to the 
data collected in this study, the eight pairs of 
adjectives describing the subject were used as 
levels of a single subject variable (S), and 
the eight pairs describing the object were used 
as levels of a single object variable (O). The 
FANOVA model was applied to data for each 
stimulus replication separately. Two signifi- 
cant interaction factors emerged in each case. 
The loadings of each subject and object de- 
scription on each factor are shown in Table 
2. In Replication 1, Factors 1 and 2 ac- 
counted for 75% and 12% of the total inter- 


) 
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action variance, respectively; in Replication 
2, the two factors accounted for 57% and 
22%, respectively. These factors are most 
easily interpreted by multiplying each of the 
loadings of descriptions defining S by each of 
the loadings of descriptions defining O and 
noting the products that are of the highest 
magnitude (most positive or most negative). 
The first factor in each analysis, which was 
highly significant, F (13,35) = 3.27 and 3.47, 
p < .01, respectively), is clearly evaluative. 
That is, the loadings of both S and O de- 
scriptions increase with the normative favor- 
ableness of the adjectives contained in them. 
Thus, 75% of the variance of the S Xx O 
interaction in Replication 1 and 57% of the 
variance of the interaction in Replication 2 
can be accounted for by a multiplicative func- 
tion of the evaluative implications of S and 
O, consistent with implications of cognitive 
balance theory. However, by the same token, 
over 25% of the variance in each interaction 
cannot be accounted for by purely evaluative 
aspects of the information presented. 
Certain nonevaluative aspects of the infor- 
mation about persons that affect predictions 
of their liking for one another are suggested 
by the second interaction factor emerging in 
each analysis. These factors may be inter- 
preted with reference to Table 1, which lists 
the particular adjectives involved in each 
analysis. The second factor extracted in analy- 
sis of Replication 1 data, F(11,35) = 1.76, 
p <.05, appears due to the fact that an in- 
formal argumentative (H—M—) person was 
expected to like someone who was angry and 
respectful (L+H) and to dislike someone 
who was studious and silent (H-M—); how- 
ever, an informal grateful (H—H) person and 
an unintelligent prejudiced (L+L) were both 
expected to have the opposite feelings toward 
these others. The second factor extracted in 
Replication 2, F(11,35) = 2.92, p < .05, ap- 
pears due largely to the fact that a listless 
talkative (L+M-+) person was expected to 
like both noisy bright (L+H) and noisy 
frivolous (L+M—) others and to dislike 
self-confident proud (H-M+) others, while 
discreet troubled (H—M—) and listless 
troubled (L+M—) persons were both ex- 
pected to have the opposite reactions. These 
factors make some intuitive sense. However, 
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FIGURE 3. Judges’ own evaluations of obje- s as 
a function of the favorableness of the adj: tives 
describing these objects (Experiment 1). (H ery 
favorable; M+ = moderately favorable; M— od- 
erately unfavorable; L= very unfavorable.) 
they point out that a general understar ‘ing 
of the descriptive factors that underlic the 
prediction of persons’ liking for one another 


may be hard to attain. 

Judges’ own liking for stimulus persons. 
Judges’ own evaluations of persons described 
by each adjective pair used to characterize A 
and B in the first part of the experiment are 
shown in Figure 3 as a function of the favor- 
ableness levels of the two adjectives compris- 
ing the pair. The interaction suggested in 
this figure, which is significant, F(3, 204) = 
7.70, p< .01, appears attributable to two 
factors. First, the effect of Adjective 1 is less 
when the implications of Adjective 2 are un- 
favorable (L or M—) than when they are 
favorable (H or M+); the mean difference 
between the two curves is less in the former 
case (M = 1.28) than in the latter (M = 
1.78), F(1, 68) = 15.40, p < .01. Second, the 
effect of Adjective 1 is less when the implica- 
tions of Adjective 2 are extreme (H or L) 


5 Details of the procedures underlying these and 
other statistical tests involved in FANOVA are 
described in Gollob (1968b). (The F ratio given 
ae equivalent to that designated by Gollob as 


— 
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th hen they are moderate (M+ or M—); 
thai is, the difference between the two curves 
is ; in the first case (M = 1.36) than in 
the second (M = 1.70), F(1, 68) = 7.56, p 
< oi, In combination, these data are con- 
sis'ont with a weighted averaging process of 
ir vation integration in which the relative 
in!'.ence of information increases with both 
its -<tremity and its unfavorableness. 

his conclusion is valid, it suggests an 
in ting difference between the use of in- 


fo ion to estimate one’s own liking for a 
pe and the use of this information to 
pr t others’ estimates (see Figure la). 
Th is, judges weight information with ex- 
tre mplications more heavily than infor- 
m ı with moderate implications both when 
est ing their own liking for a person and 
wheo predicting others’ liking for this person, 
Ho judges weight unfavorable attri- 
bu f a person more heavily than favorable 


on when estimating their own liking 


for person; they do not do so in predict- 
ing rs’ liking for them. A direct statisti- 
cal parison of judges’ own estimates and 
pre ons of others’ estimates is difficult 
becauco the overall effect of information on 
the ormer judgments (Figure 3) is much 
greate: in magnitude than the effect of this 
infor. .tion on the latter (Figure 1a). How- 
ever the variance accounted for by the 
interactive effects of adjectives on predic- 
tions of others’ liking for B, the contrast used 
to indicate the effect of extremity contributed 
81%, whereas the contrast used to indicate 
the effect of negativity contributed only 14% 
and was nonsignificant. On the other hand, of 


variance accounted for by the interactive 
effects of these same adjectives on judges’ 
own estimates of their liking for stimulus per- 
sons, the contrasts used to assess extremity 
and negativity effects contributed 32% and 
66%, respectively. These data are consistent 
with the interpretation suggested above. 


EXPERIMENT 2 


The first study was primarily concerned 
With judges’ predictions of other persons’ lik- 
ing for one another. The second experiment 
focused on the determinants of judges’ own 
evaluations of persons, based both on adjec- 
tives describing them and on explicit informa- 
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tion about these persons’ liking for one an- 
other. Suppose a judge receives information 
consisting of an adjective describing Person 
A, an adjective describing Person B, and a 
description of A’s liking for B. Then, based 
on this information, the judge is asked to es- 
timate how well he or she personally would 
expect to like both A and B. Let us consider 
each judgment in turn from the perspective 
suggested by S-V-O methodology. 
Liking for A 

It seems reasonable to suppose that a 
judge’s evaluation of A is a positive function 
of the favorableness of the attributes he or 
she believes A to have. One attribute may be 
conveyed by the adjectives used to describe A 
(subject information). A second may be con- 
veyed by A’s liking for B (verb information) ; 
that is, A’s reaction to B may be used as an 
indication of A’s friendliness, This reasoning 
predicts that a judge’s liking for A will in- 
crease with the favorableness of both subject 
and verb information. A third source of in- 
formation about A may be the company he or 
she keeps (object information). Gollob 
(1968a) found that evaluations of A were 
based in part on attributes of B, indepen- 
dently of A’s reactions to this person; this 
suggests a slight halo effect, or a tendency to 
assume that birds of a feather flock together. 

An additional attribute of A may be in- 
ferred from values along the verb and object 
dimensions in combination. That is, liking a 
good person and disliking a bad person may 
both be considered desirable attributes and, 
therefore, may contribute positively to a 
judge’s liking for A. In contrast, liking a bad 
person and disliking a good person may both 
be considered undesirable attributes and, 
therefore, may contribute negatively to liking 
for A. Similar contributions would also be 
predicted by cognitive balance theory. Assume 
that a judge’s liking for B is a function of 
the favorableness of the adjective describing 
him or her. Then, if the adjective is favor- 
able (if the judge likes B), the judge’s liking 
for A should increase with A’s liking for B. 
However, if the adjective describing B is un- 
favorable (if the judge dislikes B), the judge’s 
liking for A should decrease as A’s liking for B 
increases. Both lines of reasoning suggest that 
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the evaluative implications of verb and ob- 
i ject information may combine multiplica- 
tively to affect the judge’s liking for A. 
Liking for B 

A judge’s liking for B may increase with 
the favorableness of the attributes that B is 
believed to have on the basis of the informa- 
tion presented. One attribute is conveyed by 
the adjective describing B. A second may be 
inferred from A’s liking for B, which may be 
assumed to reflect B’s general popularity. A 
third may be inferred from the character- 
istics of A alone, independently of A’s reac- 
tions to B; this would reflect a halo effect 
similar to that expected to contribute to lik- 
ing for A. Thus, liking for B should also 
increase with the evaluative implications of 
subject, verb, and object information, each 
considered independently. 

In addition, being liked by a likable person 
or disliked by a dislikable person may both be 
regarded as favorable attributes, whereas 
being liked by a dislikable person or disliked 
by a likable person may be regarded as un- 
favorable attributes. The contributions of 
such attributes should be reflected in a multi- 
plicative effect of subject and verb informa- 
tion. This effect, like the combined effect of 
verb and object information upon liking for 
A, is also predicted by cognitive balance 
theory. 

The above considerations suggest that an 
analysis of variance of judges’ evaluations of 
A as a function of informational variables 
should yield significant main effects of sub- 
ject, verb, and object information and a sig- 
nificant Verb X Object interaction. A corre- 
sponding analysis of evaluations of B should 
also yield three significant main effects and a 
significant Subject X Verb interaction. The 
other three interactions among informational 

variables in each analysis do not correspond 
to any obvious assumption a judge might 
make in arriving at his evaluations, and thus 
would not be predicted to affect his evalua- 
tions. In this regard, Gollob (1968) found 
that in a judgment situation somewhat simi- 
lar to that outlined above, 96% of the pre- 
dictable variance in judgments of A was ac- 
counted for by the four informational cues 
postulated to affect these judgments. How- 


hi 
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ever, evaluations of B were not obt'»ed in 
Gollob’s study. The present study expiored in 
more detail the nature of the contrib- ‘on of 
information to evaluations of both A ond B 
in a design that allowed the effects of corre- 
sponding sources of information on jt vents 
of A and B to be directly compared. 
Method 

Subjects were 48 male and female int tory 
psychology students participating in the to 
fulfill a course requirement. The informati iven 
to judges about the general purpose of the st was 
the same as that provided in Experiment 1 idges 
received a series of stimulus items consisting of the 
following: an adjective describing a hypo «ctical 
person (A), an adjective describing a second crson 


(B), and a verbal description of A’s likin 


Tn each case, they were told to form an im sion 
of the persons described and then to estim how 
much they would expect to like both A and The 
favorableness of the adjective descending A ject 
information) and A’s liking for B (verb i ma- 
tion) were each varied over four levels, 
Preparation of materials. Descriptions of A .nd B 
were constructed from the four sets of four sdjec- 
tives denoted X, and Ve in the two repli: itions 
shown in Table 1, In addition, four descrip! of 
A’s liking for B were selected that were : ex- 
pected to vary over four levels: very muci likes 
(H), likes (M+), dislikes (M—) and very much dis- 


likes (L). 

Two groups of 128 test stimuli were cons! 
on the basis of these adjective and verb descri 
tions. In one group, the Xs adjectives used in 
cation 1 of Experiment 1 were used to d 
Person A, and the Ye adjectives in this replicati 
were used to describe Person B. Each of ti 
Possible pairs of descriptions formed from X- 
Ya adjectives was then combined with each of the 
four levels of A’s liking for B to form 64 stimulus 
sets. A second 64 stimuli were constructed similarly 
using adjectives from Replication 2 of Experiment 1. 
This procedure generated two stimulus replications 
of a three-factorial design involving four levels of 
subject information (the description of A), four 
levels of verb information (A’s liking for B), and 
S levels of object information (the description of 

The second group of 128 test stimuli was con- 
structed by interchanging the adjective describing A 
and the adjective describing B in each of the test 
stimuli noted above. (That is, adjectives denoted Y2 
were used to describe A, and those denoted X, were 
used to describe B.) This procedure enabled possible 
idiosyncratic effects of the sets of adjectives used to 
describe A and B to be controlled. In addition to 
the test stimuli, four Practice stimuli were con- 
structed using adjectives that varied over the en- 
tire evaluative continuum. 

Each group of 128 test stimuli was presented in a 
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Evaluation of A (residual) 
o 


5 


ró; M- Me H 
A's liking for B (v) 


*IGURE 4. Main effects and residual interactive effects of subject (S), verb (V), and object (O) 
ormation on evaluations of A, the subject (Experiment 2). (H = very much likes; M+ = likes; 


= dislikes; L = very much dislikes.) 


diffe ~t questionnaire, preceded by the four prac- 
tice muli, Each stimulus item was presented in 
the owing form: “A is iB as 

A - B.” An equal number of judges completed 


each cuestionnaire, which took about 30 minutes. 


Jud were instructed to (a) read each stimulus 
item, (b) form an impression of A and B on the 
basis of the information given, (c) estimate how 


muci ihey would like A by placing a check along a 
scale ‘rom —§ (dislike very much) to 5 (like very 
much), and (d) estimate how much they would 
like 3 along a similar scale. With the exception of 
the practice stimuli (which always occurred first in 
the stionnaire), stimuli were presented in counter- 


bala order; thus, averaged over the judges 
exposed to each set, the serial position of each stim- 
ulus item was the same. 


Results and Discussion 


Evaluations of A and B were each analyzed 
separately as a function of four within- 


subjects variables (the favorableness of the - 


adjective describing A, the favorableness of 
the adjective describing B, A’s liking for B, 
and stimulus replications) and one between- 
subjects variable (the group of test stimuli, 
X or Y, used to describe A). Results per- 
taining to each set of judgments will be 
discussed in turn. 

Liking for A. Four informational cues were 
expected to contribute to judges’ estimates 
of their liking for A: the favorableness of the 
adjective describing A (subject information), 
A’s liking for B (verb information), the 


favorableness of the adjectives describing B 
(object information), and the social desir- 
ability of A’s reactions to particular types of 
persons (the interaction of verb and object 
information), The contributions of subject, 
verb, and object information are reflected by 
the main effects of these variables. These ef- 
fects are shown in Figure 4a. The contribu- 
tions of the first two cases were substantial 
as expected, in each case, F (3,138) = 55.23, 
p< .001. Object information also had a small 
but significantly positive effect, /(3, 138) 
= 3.29, p < .05, suggesting the existence of a 
halo effect. 

The contribution of the fourth cue to 
judgments is reflected in the interactive ef- 
fects of A’s liking for B and the adjective 
describing B. This interaction was significant, 
F(9,414) = 44.90, p< .001, [Although the 
interaction was contingent on the particular 
set of adjectives used to describe B, F(9, 414) 
= 4.63, p < .01, suggesting that the descrip- 
tive implications of the information presented 
played a role in judgments, this contingency 
was small in magnitude.] The nature of this 
contribution, uncontaminated by the indepen- 
dent contributions of verb and object infor- 
mation, can be seen most easily by subtracting 
the contributions of the two relevant main 
effects (of A’s liking for B and the adjective 
describing B) from judgments at each combi- 
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nation of these variables (see Footnote 3) 
and plotting the residuals as shown in Fig- 
ure 4b. The pattern of these data suggests 
that verb and object information combines 
multiplicatively to affect judgments. 

To supplement the above analyses, the 
FANOVA model proposed by Gollob (1968b) 
was used to decompose the Verb x Object 
interaction. Only one significant interaction 
factor emerged, regardless of whether data 
were collapsed over stimulus replications and 
the group of adjectives used to describe A, 
F (5,47) = 24.28, p < .01, or whether data 
pertaining to each combination of the two 
replication variables were analyzed separately 
(p< .01). This factor accounted for 99.7% 
of the variance in the Verb x Object inter- 
action in the overall analysis and over 96% 
of the variance in each of the individual 
analyses. The factor is clearly evaluative and 
is consistent with the interpretation described 
above. In the overall analysis, the loadings of 
the four levels of verb information (H, M+, 
M-, and L) were .55, .44, —.46, and —.54, 
respectively, whereas the loadings of the same 
four levels of object information were .51 RIAs 
-08, and —.82, respectively. The slight posi- 
tive loading of M— object information re- 
flects the positive slope of the effect of 
verb information when object descriptions 
were moderately unfavorable, as portrayed in 
Figure 4b, 

The four effects described above were the 
only ones expected a priori to contribute to 
evaluations of A. In fact, none of the re- 
maining three interactions (Subject x Verb, 
Subject X Object, and Subject x Verb x Ob- 
ject) approached significance (p > +10). 

Liking for B. Four informational factors 
were expected on a priori grounds to contrib- 
ute independently to judges’ liking for B: 
the favorableness of the adjective describing 
B (object information), A’s liking for B (verb 
information), the favorableness of the adjec- 
tive describing A (subject information), and 
the interaction of subject and verb informa- 
tion. In fact, analyses of these evaluations 

indicated that the seven main effects and 
interactions involving the three informational 
variables, only those corresponding to these 
contributions were reliable. The main effects 
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corresponding to the contributions of subject, 
verb and object information are wn in 
Figure 5a. The effects of verb and o` -ct in- 
formation are highly significant, / 138) 
> 36.18, p<.001, and of the naire ex- 
pected. That is, judges’ liking for B in veased 
both with the favorableness of the « ective 
describing B and with A’s liking for B. valu- 
ations of B also increased with the fa. able- 
ness of the adjective describing A, F (<, 138) 
= 16.13, p< .01, again suggesting mall 
but highly reliable halo effect, or tend- 
ency to generalize characteristics o° A to 
characteristics of B. 

The interaction of A’s liking for B «>d the 
favorableness of the adjective describing \ was 
also significant, 7(9,414) = 14.59, p — .O1. 
This interaction was not contingent the 
particular set of adjectives used to desc be A, 
F(9,414) = 1.11, p> .25. The nature f this 
interaction, after eliminating the contri! tions 
of subject and verb information, is shown in 
Figure 5b, This pattern, which was sin: ar in 
each stimulus replication, is somewha! sur- 
prising. The directional effects expe: on 


the basis of a balance hypothesis are quite 
evident; that is, judges expected to |ie B 


more when B was liked by someone with a 
favorable characteristic or was dislik« d by 
someone with an unfavorable characteristic 
than when B was disliked by someone with a 
favorable characteristic or was liked by some- 


one with an unfavorable characteristic, How- 
ever, in seven of eight cases, residual evalua- 
tions of B were slightly less extreme when A 
was described as liking or disliking B very 
much than when A’s apparent reactions to B 
were not as intense. Perhaps when a person’s 
feelings toward someone are described as ex- 
treme, they tend to be attributed relatively 
more to characteristics of that person and less 
to characteristics of the object toward whom 
these feelings are directed. Thus, as results 
indicate, they may have less effect on judg- 
ments of this object. 

To explore further the nature of the factors 
underlying the combined effects of subject 
and verb information, the FANOVA model 
was used to decompose the Subject x Verb 
interaction. When data were collapsed over 
the two replication Variables, a single factor 
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IGURE 5. Main effects and residual interactive effects of subject (S), verb (V), and object (0) 
ormation on evaluations of B, the object (Experiment 2). (H = very much likes; M+ = likes; 


= dislikes; L = very much dislikes.) 


acc. sited for over 98% of the variance, F 
(5,°°) = 7.16, p<.01. This factor was 
cle: iy evaluative; the loadings of the four 
lev’. of subject information/(H, M+, M—, 
and L) on this factor were .59, .28, —.11, 
and —.75, respectively, whereas the loadings 
of four levels of verb information were 
46, 54, —.48, and —.51, respectively. The 
contribution of the evaluative factor was rela- 
tively less (89%-94%) in analyses of data 
pertaining to each combination of replication 
variables separately. In each of these latter 
two analyses, and also in the overall analysis, 
a second factor appeared, the nature of which 


was consistent over analyses. In the overall 
analysis, the loadings of subject information 
on this factor were .60, —.78, —.01, and .18 
at levels, H, M+, M—, and L, respectively, 
whereas loadings of verb information at these 
levels were .48, —.42, —.58, and .51, respec- 
tively. The psychological implications of this 
factor are somewhat unclear. It may be most 
reasonable to interpret the factor as simply 
reflecting the nature of the deviations from a 
purely multiplicative effect of the evaluative 
implications of subject and verb information. 
To this extent, the similar pattern of load- 
ings on this factor over stimulus replications 
is noteworthy, because it suggests that though 


the attributional tendency underlying the ef- 
fects described in Figure 5b is small, it is 
generalizable over different sets of adjectives. 

Comparison of contributions. Of the total 
variance in evaluations of A accounted for 
by the three primary informational vari- 
ables, stimulus replications, and experimental 
groups, the four effects corresponding to the 
contributions of subject, verb, and object in- 
formation and the Verb X Object interaction 
accounted for 62%, 12%, .2%, and 12%, 
respectively. Thus, 86% of the predictable 
variance in judgments of A was accounted for 
by the 18 degrees of freedom corresponding 
to informational cues expected on a priori 
grounds to contribute most to these judg- 
ments, and the remaining 237 degrees of free- 
dom contributed only 14% of the variance. 
Of the total systematic variance in evalua- 
tions of B, subject, verb, and object infor- 
mation and the Subject X Verb interaction 
accounted for .5%, 2.5%, 85%, and 1.5%, 
respectively, or a total of 89.5%, with the 
remaining 237 degrees of freedom accounting 
for only 10.5%. In both cases, one informa- 
tional cue (specifically, the favorableness of 
the adjective describing the person being 
rated) contributed substantially more than 
any other. These results are consistent with 
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the conclusion drawn by Wyer (1974), 
the judges tend to base their inferences on 
a very few assumptions at any given time, 
and therefore use only a small number of 
informational cues. 

Collapsed over experimental conditions, 
adjectives were used the same number of 
times in descriptions of A and B, Thus the 
contributions of informational cues to these 
different judgments can be meaningfully 
compared. The favorableness of the ad- 
jectives describing A contributed less to 
judgments of A than the favorableness of 
the same adjective describing B contributed 
to judgments of B. On the other hand, cues 
involving A’s liking for B contributed more 
to judgments of A than to judgments of B. 
This suggests that judges attached relatively 
more weight to behavioral (verb) information 
when judging the actor than when judging the 
object. This could indicate that A’s reactions 
to B were attributed more to a character- 
istic of A (e.g., friendliness) than to a char- 
acteristic of B (e.g., popularity). Alterna- 
tively, judges may simply believe that a 
person’s friendliness is a better indication 
than popularity of how much they will like 
this person, 


GENERAL Discussion 


Several important results emerged from the 
set of studies reported here, the nature of 
which may be summarized briefly. 

1. Inferences of one person’s (A’s) liking 
another (B) were based on the characteristics 
of both the object (B) and the actor (A). 
The effects of information describing each 
could be accounted for by an averaging model 
of information integration in which pieces of 
information with more extreme implications 
receive greater weight. However, the effects 
of this information on predictions of others? 
liking for B differed from its effects on 

judges’ own evaluations of B; in the latter 
case, the relative influence of each piece of 
information increased with the unfavorable- 
ness of its implications as well as with the 
extremity of these implications. 

2. Judges’ inferences of A’s liking for B 
were greater when the adjectives describing A 
and B were both favorable or were both un- 
favorable than when one was favorable and 
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the other was not. This suggests that judges 
based their inferences of persons’ lit og for 
one another on the evaluative simil:ty of 
the adjectives describing these person. More- 
over, their perception of this similar.'y ap- 
peared to be a multiplicative function of the 
evaluative implications of the adjectives de- 
scribing A and B, rather than a func on of 
the absolute difference between these i plica- 
tions. In addition, idiosyncratic imp!i: tions 
of the particular adjectives describing A and 
B for how they are apt to get along wiih one 
another appear to have affected ini ‘ences 


over and above the general favorable: ss of 
these adjectives. 


3. Judges’ own liking for a person was a 
positive function of (a) the favorable: «ss of 
the adjectives describing this person, { this 
person’s tendency to like others, ai (c) 


others’ evaluative reactions to him or |r. In 


addition, a slight halo effect occurred; tat is, 
evaluations of one person in a situation in- 
creased with the favorableness of the djec- 


tive describing a second, independer iy of 
these persons’ reactions to one another. 


4. Judges used information about A 
for B to a greater extent when they ~ valu- 
ated A than when they evaluated B. Tiat is, 
they based their own liking for a perso: more 
on this person’s reactions to others tha: other 
persons’ reactions to him, 

5. A person’s evaluative reactions to certain 
types of others appeared to function as 2 gen- 
eral attribute of this person and, therefore, 


affected evaluations of the person in much the 
same way as did his or her other attributes. 

6. Information that a person was liked or 
disliked by someone with favorable or unfa- 
vorable attributes affected judges’ own liking 
for this person in a manner similar to that 
predicted by cognitive balance theory. How- 
ever, these effects were less pronounced when 
the reactions to this person were described 
as intense. This suggests that extreme feel- 
ings about a person may often be attributed 
to characteristics of the perceiver rather than 
to characteristics of the person, and conse- 
quently may have less effect on the judge’s 
own evaluation of the person. 

Cognitive balance principles often provided 
a gross description of the manner in which 
pieces of information combined to affect in- 


king 
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ferences in the studies reported here. How- 


ever, (hese principles were not very useful in 
gaining a general understanding of the nature 
of these inferences and the assumptions under- 
lying ‘hem. Inferences were generally based 


on several informational factors in addition 


to those implied by the theory. In contrast, 
altho Gollob’s S-V-O formulation was not 
applic: rigorously in this paper, the concep- 
tualization and methodology suggested by this 
forn ion proved to be a valuable tool for 
isolat and interpreting the independent 
contri>utions of informational cues to judg- 
men ontinued use of this approach may 
provi» further insight into the complexities 
of so. ol inference phenomena. 


C: n should, of course, be taken in gen- 


eraliz the specific results of this study. As 
bot! lob (1974a) and Wyer (1974, 1975) 
have wn, the relative contributions of in- 
form val cues often depend on the par- 
ticul mensions along which information is 
prese and judgments are made. For 
exam; +, Gollob (1974a) found that the rela- 
tive ributions of informational cues to 
judg: along the evaluative dimension 
likes skes differed substantially from the 
relati ontributions of these cues to judg- 
ments ong a second evaluative dimension, 
admit espises. A consideration of the 
evalua implications of the information 
preseni-d and the judgment to be made is 
hot sufcient to understand the dynamics of 
social i: ference phenomena. Not only the use 
of different cues, but also the assumptions 
under! their use, may vary with the de- 
scriptive implications of the information pre- 
sented, the dimension of judgment, and the 


type of object being judged (Wyer, 1974). 
Moreover, the use of these cues may depend 
on assumptions that judges make about un- 
specified characteristics of the persons de- 
scribed and the nature of their relationship 
(eg., whether they are roommates, members 
of the same or opposite sex, employer and 
employee, or parent and child). It is con- 
ceivable that different sets of descriptive in- 
formation have different effects in part be- 
cause they elicit different general impressions 
of the role relations of the persons described. 
Although the identification of such contingen- 
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cies may be an arduous task, future research 
should be directed toward this objective. 
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Judges indicated their acceptance of generalizations of the form “As verb Bs” 
on the basis of information about whether the relation held between individual 
instances of A and B (inductive evidence) or whether it held between more 
general classes of elements to which A and B belonged (deductive evidence). 
Evidence was specific to either the subject or the object of the proposition to 
be judged. Generalizations were more readily accepted on the basis of inductive 
than deductive evidence. The acceptance of generalization based on inductive 
evidence appeared to depend in large part on judges’ assumptions about the 
implicit quantifiers (“all,” “some,” etc.) of the categories described in these propo- 
sitions, whereas the acceptance of generalizations based on deductive evidence 
depended on whether the definitions of these categories were interpreted as 
disjunctive or conjunctive, In some instances, the acceptance of generalizations 
about affective relations appeared to be based on different criteria than was 
the acceptance of generalizations about overt behavior. Specifically, these 
criteria appeared similar to those used as a basis for making evaluations of ob- 
jects in an impression formation task. A general positivity bias was also de- 
tected, Results were generally consistent with those obtained in an earlier study 
by Abelson and Kanouse; however, they clarified the reasons for these earlier 


findings and, in some instances, suggested contingencies in their occurrence, 


Generalizations are often made about the 
relations between persons and objects (e.g., 
“college students smoke marijuana,” “teen- 
agers like rock music”), even though certain 
pieces of relevant evidence do not support 
them. An understanding of the conditions 
under which this occurs has implications not 
only for cognitive processes in general but 
also for a variety of social inference phe- 
nomena ranging from stereotyping to attribu- 
tion (for a discussion of these latter implica- 
tions, see Kanouse, 1971). Research by 
Abelson, Kanouse and their colleagues (for a 
summary, see Kanouse, 1971) suggests that 
a person's willingness to make a generaliza- 
tion is affected by several factors, including 
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the type of evidence presented and tl» type 


of relation to which the generalizati: per- 
tains. The present paper extends en- 
larges on this earlier work. First, we »:ovide 
a framework for conceptualizing the «fiects 


of several different factors on persons’ will- 


ingness to accept a generalization. © cond, 
we report the results of a conceptual rcplica- 
tion and extension of the original study by 
Abelson and Kanouse (1966) that enables 


the independent and combined effects of these 
factors to be more systematically ideutified 
and statistically evaluated. Finally, we will re- 
port two additional experiments that validate 
certain assumptions underlying the interpreta- 
tion of the first study and that haye other 
heuristic implications. 


THEORETICAL CONSIDERATIONS 


In this section we provide a general con- 
ceptual framework for interpreting the factors 
that affect persons? willingness to accept 
generalizations. Many of the ideas to be 
Presented are not original (see Abelson & 
Kanouse, 1966; Gilson & Abelson, 1965; 
Kanouse, 1971). Our objective is to organize 
these ideas in a manner that allows their 
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ACCEPTANCE OF GENERALIZATIONS 


implications for a general understanding of 


general 2ation processes to be more fully 
realized. 

The eneralizations to be considered have 
the fc “As verb Bs,” where A-and B are 
two € s of objects and the verb describes 
a rel between them. This relation may 
pertain to either a manifest or observable 
beha (e.g, “As buy Bs,” “As kill Bs”) 
or a jective or attitudinal reaction (“As 
like ‘As fear Bs”), The acceptance of 
such neralization may be based on two 
types »vidence. One type, inductive, per- 
tains specific instances of A and/or B 
that or are not related in the manner de- 
scribe 9 the generalization. The second type, 
dedu pertains to more general categories 
of w! \ and/or B are instances. 


Thu io borrow a concrete illustration from 


Abel nd Kanouse (1966), consider the 
gener ion “tribes buy bees.” Inductive 
eviden specific to instances of the subject 
of thi position (subject-specific evidence) 
woul: icate whether different tribes buy 
bees. e such pieces of evidence are pro- 
vided Figure 1, Case 1. Inductive evi- 
dence » cific to the object of the proposition 
(objec cific evidence) would indicate the 
extent vhich tribes buy different types of 
bees (mble bees, honey bees, etc.). Ex- 
ample this type of evidence are provided 
in Fig , Case 2. Deductive evidence bear- 
ing on ‘he generalization would indicate 
either |). extent to which bees are bought 
by more seneral groups (e.g., communal soci- 
eties) io which tribes belong or the extent to 


which tiibes buy more general classes of 
objects (e.g., flying insects) to which bees 
belong. !xamples of these types of evidence 
are given in Figure 1, Cases 3 and 4, respec- 
tively, Similar sets of evidence could, of 
course, be constructed for different subject 
and object classes and for different verbs. 
Two pieces of evidence in each example 
shown in Figure 1 are consistent with the 
generalization, whereas a third piece is incon- 
sistent with it. Since the effects of varying 
amounts of confirming and disconfirming evi- 
dence on judgments of a conclusion have been 
analyzed in detail elsewhere (Abelson and 
Kanouse, 1966; Gilson & Abelson, 1965; 
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Gollob, Rossman, & Abelson, 1973), they are 
not considered here, Rather, we focus on 
factors that affect the acceptance of a gen- 
eralization, holding constant the amount of 
confirming and disconfirming evidence. The 
nature of these factors and their effects may 
depend on whether the evidence is inductive 
or deductive. Let us consider each type of 
evidence in turn, 


Inductive Evidence 


To determine whether or not to accept a 
generalization of the form “As verb Bs,” 
based on inductive evidence, a judge must 
first infer from this evidence the proportion 
of As and Bs that are related to one another 
in the manner described. Then, the judge 
must decide whether these proportions are 
sufficiently large to warrant the generaliza- 
tion, As Abelson and Kanouse (1966) sug- 
gest, this latter decision may depend in part 
on the quantifiers (“all,” “some,” “a few,” 
et al.) that the judge assumes are implicitly 
attached to A and B in the generalization to 
be evaluated. 

Let us consider this process in more detail 
with reference to the examples in Figure 1, 
The implications of subject-specific and object- 
specific inductive evidence for the general- 
ization “tribes buy bees” are summarized in 
the Venn diagrams accompanying Cases 1 
and 2, In Case 1, the evidence pertains to 
each of three mutually exclusive subclasses 
of the more general subject category “tribe,” 
Two types of tribes buy bees, and one type — 
does not. In other words, the evidence implies 
that some but not all tribes buy bees. A 
judge's willingness to accept the generaliza- 
tion “tribes buy bees” on the basis of this 
evidence clearly depends on the individual's 
assumption about the implicit quantifier 
(“all,” “some,” etc.) of tribes. For example, 
judges are less apt to accept this generaliza- 
tion if they interpret it as “all tribes buy 
bees” then if they interpret it as “some 
tribes buy bees.” More generally, the accept- 
ance of a generalization on the basis of in- 
ductive evidence about members of a given 
category should be relatively greater when 
the implicit quantifier of this category is 
assumed to be less universal. 3 

The above observation helps to interpret 
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Case 1: 


Inductive Subject-Specific Evidence 
There are three types of tribes: 
Punjabi buy bees. 


Swahili buy bees. 
Bengalese do not buy bees. 


Punjabi Swahili Bengali 
+ + - 
Case 3: 


Deductive Subject-Specific Evidence 
‘Tribes are primitive, communal, agricultural societies. 


Primitive societies buy bees. 
Communal societies buy bees. 
Agricultural societies do not buy bees. 


Not 
Communal | communal 
Primitive + + Nonagri- 
cultural 
-| Agricultural 
Not + Nonagri- 
primitive cultural 


Ficure 1, Examples of evidence bearing on the generalization “tribes buy bees.” 
companying each set of evidence indicate implications of information for the generalization, (Plus 
denote positive instances of the general proposition, and minus signs denote negative 


apparent contingencies in the acceptance of 
generalizations based on inductive evidence. 
Abelson and Kanouse (1966) found that gen- 
eralizations were accepted more readily on 
the basis of inductive evidence about in- 
Stances of object categories (Case 2) than on 
the basis of inductive evidence about in- 
stances of subject categories (Case 1). This 
would be explained if implicit quantifiers of 
object categories are typically assumed to be 
less universal than implicit quantifiers of sub- 
ject categories (or, alternatively, if the gen- 
eralization “As verb Bs” is assumed to hold 
for a greater proportion of As than Bs), A 
possible reason for this is suggested by 
Gilson and Abelson’s (1965) observation that 
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Case 2: 
Inductive Object-Specific Evidence 
There are three types of bees: 
bumble, honey, and sweat. 


Tribes buy bumble bees. 
Tribes buy honey bees. 
Tribes do not buy sweat bees. 


bumble honey sweat 
H + = 
Case 4: 


Deductive Object-Specific Evidence 


Bees are flying, furry, buzzing insects. 


Tribes buy flying insects. 
Tribes buy furry insects. 
Tribes do not buy buzzing insects. 


Not 
Furry furry 
Flying fe + N 
buzz 
Buz 
Not + No 
flying buzzing 
Venn diagrams ac- 


Signs 
instances.) 


the subject categories used in their study 
(and also in the experiments reported here) 
pertain to persons, whereas object categories 
pertain to nonpersons. They speculate that 
groups of persons are considered to be more 
heterogeneous than groups of nonpersons. 
Judges may believe that a greater proportion 
of confirming evidence is necessary to justify 
making a generalization about members of a 
heterogeneous category, and therefore they 
may assign the category a more universal 
implicit quantifier when they consider such a 
generalization. Additional implications of this 
are considered later in this paper. 

: A second determinant of the quantifier a 
judge implicitly assigns to categories of a 
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generalization may be the type of relation 
between them. If a judge believes that the 
type of relation described in a generalization 
of the form “As verb Bs” is unlikely to occur 
on 2 priori grounds, the judge may assume 
that bis generalization is meant to apply to 
relati few members of A and B and, 
thereiore, may assume that less universal 

$ quant crs are implicitly assigned to these 
categcriss, 

Th’ line of reasoning suggests a possible 
differs between a judge’s interpretation of 
statem nts that describe subjective affective 
relati: between A and B (eg., “likes” or 
“dislik .) and his interpretation of state- 
ments at describe manifest relations (e.g., 
“buys or “kills”). Positive and negative feel- 
ings a an object may often be considered 
prereq. ‘tes for favorable and unfavorable 
overt iviors toward the object, respec- 
tively wever, these feelings may not be 
consid sufficient to produce this behavior. 
(Thus example, tribes may be expected 
to buy ver bees than they like, whereas 
fewer s may be expected to buy bees 

` than to ke them.) To this extent, general- 


izations 
interpr 
describe 
bees”) 
one (“t 
case, a 
confirmi 
greater a 


the form “As verb Bs” may be 
as less uniyersal when the verb 
manifest relation (“tribes buy 
n when it describes a subjective 
s like bees”). And if this is the 
n amount of confirming and dis- 
nductive evidence should lead to 
ptance of a generalization about a 


manifest »clation than one about a subjective 
relation, 
Two hypotheses are suggested by the above 


ns, each of which is consistent 


with results obtained by Abelson and Kanouse. 
Hypothesis 1, A generalization of the form 
As verb Bs” will be accepted more readily 


on the basis of inductive evidence about in- 
stances of the object category (B) than on 
the basis of the same amount of inductive 
fvidence about instances of the subject 
Category (A). 

Hypothesis 2, A generalization about a 
Manifest relation will be accepted more read- 
ly on the basis of inductive evidence than 
Will a generalization about a subjective, or 
Affective, relation, 
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Deductive Evidence 


The considerations that underlie the ac- 
ceptance of a generalization on the basis of 
deductive evidence differ from those that 
underlie its acceptance on the basis of induc- 
tive evidence. Consider again the examples 
described in Figure 1. Deductive evidence 
specific to the subject category (Case 3) per- 
tains to whether bees are bought by members 
of more general categories to which tribes 
belong. Analogously, deductive evidence 
specific to the object (Case 4) pertains to 
whether tribes buy members of more general 
categories to which bees belong. The classes 
of elements to which the generalization 
(“tribes buy bees”) pertains are presumably 
defined in terms of their membership in these 
more general categories. However, the impli- 
cations of the evidence presented for this 
generalization depend on the precise nature 
of these definitions. For example, suppose in 
Case 3 that the definition of tribes is inter- 
preted as conjunctive (that is, suppose tribes 
are defined as societies that are primitive and 
communal and agricultural). Then, the gen- 
eralization “tribes buy trees” would necessar- 
ily be invalid because agricultural societies 
(of which tribes are necessarily an instance) 
do not buy bees. On the other hand, suppose 
the definition of tribes is interpreted as 
disjunctive (that is, tribes are defined as soci- 
eties that are either primitive or communal 
or agricultural). Then, the generalization 
“tribes buy bees” might be more readily ac- 
cepted on the basis of the evidence presented 
because some tribes (those that are primitive 
or communal but not agricultural) could in 
fact buy bees. pit: 

The acceptance of a generalization based 
on deductive evidence, therefore, depends 
largely on whether the definition of the cate- 
gory to which the evidence pertains is inter- 
preted as conjunctive or disjunctive. If the 
definition is interpreted as conjunctive, the 
acceptance of the generalization is less likely. 

The fact that judges accept generalizations 
with some probability on the basis of evidence 
of the sort described in Cases 3 and 4 
(Abelson & Kanouse, 1966) suggests that 
they do often interpret the category defini- 
tions as disjunctive. However, the extent to 
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TABLE 1 
DESCRIPTION or STIMULUS MATERIALS USED IN EXPERIMENT 1 


Relation (verb) 


Manifest Manifest Subjective Subjective 
Stimulus item Subject Object favorable unfavorable favorabl: favorable 
Replication 1 
Category tribes bees 
Subclasses (used in Ate, 
inductive evidence) Punjabi bumble n : a 
Swahili honey buy kill like Jislike 
Bengali sweat 
Attributes (used in Ti 3 
deductive evidence) primitive flying 
communal furry 
agricultural buzzing 
Replication 2 
Category Foreigners American cities 
Subclasses Italians Chicago 3 
Germans New York vacation in bomb ike lislike 
Frenchmen Los Angeles 
Attributes (persons with) A 
accents industrial 
(persons with) 
passports busy 
(persons with) 
travelers’ checks populous 
Replication 3 
Category urbanites magazines 
Subclasses residents of 
New York City Newsweek 
residents of 
Chicago Readers’ Digest buy destory like islike 
residents of 
Los Angeles Saturday Evening Post 
Attributes: business-oriented readable 
active Paperbound 
independent lightweight 
Replication 4 
Category Student Class 
Subclasses enan history 
sophomores mathematics attend disrı i slike 
Juniors English isrupt like li 
Attributes carry books informative 
sit in classrooms formal 
do homework challenging 


which this is true may depend on the type 
of categories involved. If categories of per- 
sons (subject categories) are apt to be seen 
as more heterogeneous than categories of 
nonpersons (object categories), the former 
categories should be more apt to be inter- 
preted as having disjunctive definitions. This 
reasoning suggests a third hypothesis which, 
like the first two hypotheses, is consistent 
with previous findings. 

Hypothesis 3. Generalizations will be ac- 
cepted more readily on the basis of deduc- 
tive evidence specific to the subject category 
than on the basis of deductive evidence 
specific to the object category. 

Note that the difference between the effects 
of subject-specific and object-specific evi- 
dence predicted above is opposite in direction 
to that predicted when the evidence presented 
is inductive (Hypothesis 1). To recapitulate, 


when a category is heterogeneous, a greater 
proportion of confirming instances may be 
required to justify a general statement about 
members of the category as a whole. On the 1 
other hand, because the definition of such 4 
category is apt to be disjunctive, a discon- 
firming piece of deductive evidence should 
have less effect on the acceptance of such a 
statement. In combination, this line of rea- 
soning suggests that the more heterogeneous 
a category, the less likely it is that a general- 
ization pertaining to it will be accepted on 
the basis of inductive evidence, but the more ` 
likely it is that such a generalization will be 
accepted on the basis of deductive evidence: 
Comparison of evidence forms. If the gen- 
eralization to be drawn is not assumed to be 
universal (i.e., if it is not assumed to pertain 
to all members of the subject and object cate- 
gories to which it pertains), then a discon- 
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a firming e of deductive evidence will have 
greater ct on the acceptance of this gen- 
eralizatio= than will a disconfirming piece of 
inductiv- evidence. This can be seen with 
referenc the Venn diagrams in Figure 1. 
That is ne piece of disconfirming evidence 
and tw es of confirming evidence are pre- 
sented, neralization would hold 66% of 

. the time when the evidence is inductive. 
Howev: would hold no more than 43% of 
the tim en the evidence is deductive, even 
if the « ition of the category involved is 
interpre as disjunctive. This suggests a 
fourth hvoothesis. 

Hype is 4, Generalizations are less likely 
to be a » ted on the basis of deductive evi- 

- dence | on the basis of inductive evidence. 
Additic ‘onsiderations 

Two tional factors not taken into ac- 
count | velson and Kanouse may also need 
to be co idered. First, many studies (for a 
summar ee Zajonc, 1968) suggest that 
judges a general positivity bias when 
inferrin ations among persons and objects; 

> that is, are more apt to infer relations 
to be favorable than unfavorable, indepen- 
dently « > implications of the information 
presented. Tf this is the case, judges may tend 
to accep \eralizations about favorable rela- 
tions (“b2y,” “like,” etc.) more readily than 
generaliz ions about unfavorable ones (“kill,” 
“dislike,” cic.), regardless of the type of evi- 
dence pres-nted and whether the evidence is 
Specific to the subject category or specific to 
the object ; itegory. 

Second, in our analysis so far we have ap- 
peared to #ssume that similar inference proc- 
esses underlie the acceptance of generaliza- 


tions, regardless of the type of relation to 
Which the generalization pertains. This as- 
Sumption may not be valid when subjective 
telations are to be inferred on the basis of 
deductive evidence about the object. Suppose 
4 judge is told that bees are flying, furry, 
uzzing insects, and that tribes like flying 
and furry insects but not buzzing ones. It 
‘tems reasonable to suppose that in this case, 
the judge's inference that tribes like bees may 
be the result of a process similar to that 
“lggested by research on impression forma- 
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tion. This research suggests that evaluations 
of an object are on algebraic function (sum 
or average) of the favorableness of the indi- 
vidual attributes of the object (cf. Anderson, 
1971; Fishbein & Hunter, 1964; for a review, 
see Wyer, 1974). It is conceivable that when 
asked to infer whether tribes like bees in the 
above example, judges base their judgments 
on some composite of tribes’ apparent liking 
for each of the individual attributes of bees, 
two of which (“flying and “furry”) are fa- 
vorable and one of which (“buzzing”) is un- 
favorable. To this extent, the judge should 
expect tribes to have a moderately favorable 
opinion of bees and, therefore, should accept 
the generalization that tribes like bees. 

The process described above, which was not 
considered by Abelson and Kanouse (1966; 
see also Kanouse, 1971), would not be used 
to infer manifest relations between elements. 
Overt behavior must typically be directed 
toward an object as a whole, whereas subjec- 
tive reactions to an object may be a compos- 
ite of the reactions to individual aspects of 
the object. For example, if all bees are flying 
and furry and buzzing, it is impossible to buy 
bees without buying buzzing objects. How- 
ever, it is clearly possible to like bees without 
liking buzzing objects in general, provided 
other attributes of bees are desirable. 

The preceding analysis has implications for 
Abelson and Kanouse’s conclusion that gen- 
eralizations about manifest relations are ac- 
cepted less readily on the basis of deductive 
evidence than are generalization about subjec- 
tive relations. The above reasoning suggests 
that this may indeed be the case when the 
evidence presented is specific to attributes of 
the object. However, there is no reason for 
this to be true when the evidence is specific 
to the subject category. The three experi- 
ments to follow bear on this and other 
consideratidfoutlined above. 


EXPERIMENT 1 


In the original study by Abelson and 
Kanouse (1966), only the proportion of sub- 
jects who reported agreement under each con- 
dition was determined. These data did not 
allow a systematic investigation of the inde- 
pendent and interactive effects of the several 
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TABLE 2 


MEAN ACCEPTANCE OF CONCLUSIONS BASED ON 
INDUCTIVE AND DEDUCTIVE EVIDENCE AS A 
FUNCTION OF THE TYPE AND EVALUATIVE 
IMPLICATIONS OF THE RELATION 
DESCRIBED 


Manifest relation Subjective relation 


Fayor- Unfavor- Favor- Unfavor- 

Evidence able able able able 
Inductive 

Subject oriented 4.1 2.2 24 5 

Object oriented 5.5 2.9 2.9 0 
Deductive 

Subject oriented -3 -2.0 -4 —2.0 

Object oriented 2.9 —3.9 1.2 sai 


variables identified above on the acceptance 
of generalizations. In the present study, these 
variables were manipulated independently. 
Moreover, judges reported the magnitude of 
their agreement with each generalization, thus 
enabling analysis of variance procedures to 
be used to identify the combined effects of 
factors postulated to affect these judgments. 


Method 


Subjects were 31 introductory psychology stu- 
dents, who participated in the study to fulfill a 
course requirement. Participants completed a 64-item 
questionnaire, the nature of which is described below. 

Construction of questionnaire items. Stimulus 
items were patterned after those used by Abelson 
and Kanouse (1966). Four stimulus replications were 
constructed. Each replication pertained to a dif- 
ferent pair of subject and object categories. For 
each pair of categories, three subclasses were selected 
for use in constructing inductive evidence, and three 
attributes (superordinate classes) were selected for 
use in constructing deductive evidence. In addition, 
four verbs were selected; one verb described a 
favorable manifest relation, a second described an 
unfavorable manifest relation, and the third and 
fourth described a favorable and an unfavorable 
subjective relation, respectively. The specific ma- 
terials used in each stimulus replication are shown in 
Table 1. Note that the materials in Replication 1 
are the same as those used to construct the example 

described in Figure 1. 

Presentation of stimuli. Stimulus items were con- 
structed for each replication in the manner analogous 
to that described in Figure 1. Within each replica- 
tion, all 16 combinations of evidence type (inductive 
vs. deductive), the orientation of the evidence (sub- 
ject-specific vs. object-specific), the type of relation 
(manifest vs. subjective), and the evaluative implica- 
tions of the relation (favorable vs. unfavorable) 
were constructed. In each item, the relation de- 
scribed in two pieces of evidence supported the 
generalization to be inferred, but the third piece of 
evidence did not. 
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The 64 items were arranged randomly 
sented in a questionnaire, In responding to 
judges were told to read the informatio: 
and then, based on this information, to es! 
extent to which they would agree or disa 
the generalization following it (e.g., in t 
described in Figure 1, with the generalizat 
buy bees”). 

Judges’ estimates were recorded along 
gory semantic differential, the end cat 
which were labeled “strongly disagree” and 
agree.” Responses along this scale were 2 itrarily 
assigned numerical values from —3 (stro: iy dis- 
agree) to 3 (strongly agree). Abelson and _anouse 
(1966) found that agreement with gene: zations 
was unrelated to the particular subject a object 
categories to which they pertained. Ther ore, to 
simplify analyses, these differences were noi consid- 
ered in the present study. Rather, each judge’s 
responses to corresponding stimulus itero- were 
summed over stimulus replications to pi vide a 
single score, ranging from —12 to 12, for eac of the” 
16 combinations of informational variables 


Results 


The mean agreement with genera! 
based on inductive and deductive evi 
shown in Table 2 as a function of th- three 
variables expected to affect it. Analy es of 
these data yielded seven significant m-in ef- 
fects and interactions, which are sum:iarized 
in Table 3. These effects will be described as 
they bear on the various issues of concern in 
the study. 

Data provided support for two gencral ef- 
fects expected on the basis of considerations 
raised in the introduction. First, generaliza- 
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TABLE 3 
ANALYSIS OF VARIANCE SUMMARY OF DATA 
IN TABLE 2 
Source MS MS. F(1,30) l 
Evidence type (A) 2,210.01 54.60 40.50 | 
Evidence orientation (B) 9.05 12.73 aa 
Relation type (C) 35.66 9.56 3.72 
Evaluation implications (D) 340.12 39.12  8.70** 
xB 73.55 831 8.84" 
AXC 36.76 10.23 35.93 < 
AXD 60.34 11.01 5.47 
BXC 16.70 7.13 2.34 
BXD 219 547 40 
AXBXC 96.70 8.33 11.61% 
AXBXD 37.84 9.30 4.06" 
AXCXD 8.00 6.51 1.23 
BXCXD 3.06 3.09 99 
AXBXCXD 5.24 7.20 -173 


A 


S 


ACCEPTANCE OF GENERALIZATIONS 


tions wsre accepted less readily on the basis 
of dedu-tive evidence (M = —1.7) than on 
the t of inductive evidence (M = 2.6), 
consis with Hypothesis 4. This difference 
was in ihe same direction at each combina- 
tion o' he other three independent variables. 

Sec. «|, generalizations about favorable re- 
latior ‘tween the subject and object cate- 
gorie re more readily accepted (M = 1.3) 
than generalizations about unfavorable 
relati (M = —.4), suggesting a general 
posit bias. The significant triple inter- 
action i relation favorableness, type of evi- 
dene | the element to which evidence was 
speci! indicates that this difference was less 
when ductive object-specific evidence was 
preser (—2.1 vs. —2.3) than under other 
condi ons. Reasons for this contingency are 
consi d later, However, it is important to 
note | ıt in this study, in contrast to conclu- 
sions wn by Abelson and Kanouse (1966), 
gener -ations about favorable relations were 
gene: accepted more readily than were 
Zener ations about unfavorable relations, 
even . en the evidence presented was de- 
ducti 

Tho effects of evidence orientation and 
type elation were both contingent on the 
type evidence form. These contingencies 
are sin lar in most respects to those identified 
by AL son and Kanouse and are consistent 
with potheses 1-3. That is, generaliza- 
tions were accepted more readily on the basis 
of inductive evidence about instances of the 
object category (M = 2.8) than on the basis 
of inductive evidence about instances of the 
subject category (M = 2.3). However, they 
were accepted less readily on the basis of de- 
ductive evidence when it was object specific 
(M = ) than when it was subject spe- 
cific (4 —1.2). Also, generalizations about 


manifest relations were accepted more readily 
than generalizations about subjective rela- 
tions on the basis of inductive evidence (M = 
3.8 and 1.4, respectively), but were accepted 
less readily than generalizations about sub- 
jective relations on the basis of deductive 
evidence (M = —2.3 and —1.1, respectively). 

Certain of the above effects must be quali- 
fed by the highly significant triple interac- 
tion of type of evidence, the category to 
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which the evidence was specific, and the type 
of relation. To clarify the nature of these 
contingencies, separate analyses were per- 
formed on inferences based on inductive and 
deductive evidence forms. Analyses of judg- 
ments based on inductive evidence revealed 
significant main effects of type of relation, 
F(1,30) = 32.33, p < .01, and of the favor- 
ableness of the relation, F(1,30) = 14.99, p 
< .01, and a marginally significant main ef- 
fect of evidence orientation, F(1,30) = 3.37, 
b < .08. However, no interactions among these 
variables approached significance (p > .10). 
The predicted effects of these three factors 
under inductive evidence conditions were 
therefore confirmed. Moreover, these effects 
appeared to be additive. 

In contrast, whereas analysis of inferences 
based on deductive evidence also yielded sig- 
nificant effects of evidence orientation, F(1, 
30) = 4.08, p < .05, and of type of relation, 
F(1,30) = 8.70, p < .01, these variables in- 
teracted significantly, F(1,30) = 10.30, p< 
.01. Data in Table 2 reveal that the type of 
relation had no effect on acceptance of gen- 
eralizations based on deductive evidence spe- 
cific to the subject category. However, the 
acceptance of generalizations based on de- 
ductive evidence specific to the object cate- 
gory was much greater when the relation 
described was subjective (M = —.9) than 
when it was manifest (M = —3.4). In the 
introduction, we speculated that the process 
of drawing inferences about subjective rela- 
tions based on deductive evidence about ob- 
jects may differ from that employed under 
other conditions. Specifically, judges may in- 
fer the subject’s (A’s) feelings toward mem- 
bers of the object category (B) by either 
summing or averaging A’s subjective reactions 
to each of B’s three attributes, as described 
in the evidence presented. Because two of 
these reactions are similar in direction to the 
subjective reaction described in the generali- 
zation, this should lead judges to agree more 
with generalizations in this condition than 
under other conditions in which deductive 
evidence is presented. Data in Table 2 are 
consistent with this hypothesis. 

If this hypothesis is correct, a second con- 
sideration arises that is also suggested by im- 


nes fe 
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pression formation research. Specifically, in- 
formation with unfavorable implications tends 
to have more influence on evaluative judg- 
ments than does information with favorable 
implications (Hamilton & Zanna, 1972). If 
this is so, a judge should be more apt to infer 
that the subject (A) dislikes the object (B) 
when told that A dislikes two of B’s three 
attributes than to infer that A likes B when 
told that A likes two of B’s three attributes. 
This tendency, which may counteract the 
positivity bias described earlier, would, of 
course, not occur under conditions in which 
impression formation processes are not in- 
volved. Results described in Table 2 are 
consistent with this interpretation; that is, 
generalizations about unfavorable relations 
were accepted more readily (M = —.7) than 
generalizations about favorable ones (M = 
—1.2) when deductive, object-specific evi- 
dence was presented, but not in any other 
case. 

Summary. Results reported here are con- 
sistent with the general conceptual analysis 
of generalization processes outlined in the 
introduction. That is, generalizations were 
accepted more readily on the basis of induc- 
tive than of deductive evidence. Second, gen- 
eralizations were accepted more readily on 
the basis of inductive evidence about in- 
stances of the object category than on the 
basis of inductive evidence about instances of 
the subject category; however, they were 
accepted less readily on the basis of deductive 
evidence when this evidence was specific to 
the object category. Third, generalizations 
about manifest relations were more readily 
accepted on the basis of inductive evidence 
than were generalizations about subjective 
relations, This latter relation was reversed in 
the case of deductive evidence, consistent with 
findings reported by Abelson and Kanouse 
(1966). This reversal appears due primar- 


ily to the greater acceptance of generali- 
zations about subjective relations based on 
object-specific deductive evidence. Results 
obtained under this condition support the 


his condition, unlike other 
an inference process 
‘sed in an impression 

te objects described 


conjecture that in t 
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by sets of attributes. Finally, a genera! posi- 
tivity bias similar to that detected by Za- 
jonc (1968) and others appears to wu» ‘erlie 
inferences based on both inductive a» ` de- 
ductive evidence. 


EXPERIMENT 2 


Two primary assumptions underlie t- in- 
terpretation of results obtained under i luc- 
tive evidence conditions of Experime H 
First, judges were assumed to assign ore 


universal implicit quantifiers to subject ate- 
gories than to object categories. Second, hey 
were assumed to assign more universe im- 
plicit quantifiers to categories involve’ in 
subjective relations (“like,”’ “dislike”) han 
to those involved in manifest rels ‘ons 
(“buy,” “kill”). Evidence supporting | ‘ese 
assumptions was obtained by Abelson ənd 
Kanouse (1966). In their study, judge: re- 
ported the minimal evidence of a rel ‘ion 
between instances of two categories, A ind 
B, that would be necessary to accept 2 en- 
eralization of the form “As verb Bs.” In on- 
trast, judges in the present study made \ de- 
pendent estimates of the likelihood ‘hat 
propositions containing different combina‘ ons 
of explicit quantifiers (e.g., “all As ver! all 
Bs,” “all As verb some Bs,” et al.) would 
follow from a stimulus proposition in which 
quantifiers were only implicit (“As verb lis”). 
This procedure enabled us to compare more 


directly the extent to which judges were apt 
to assume different implicit quantifiers in 
making generalizations of the type considered 
in Experiment 1 and to identify contingen- 
cies in these assumptions on the type of rela- 
tion to which generalizations pertained. 

Two additional variables were considered. 
First, the propositions presented were either 
affirmations (“As buy Bs,” “As dislike Bs”) 
or negations (“As do not buy Bs,” “As do not 
dislike Bs,” et al.) Second, the subject and 
object categories were either concrete, as in 
Experiment 1, or abstract, being designated 
only by letters of the alphabet (X or Y). 
Although these variables were not of direct 
relevance to the results of Experiment 1, they 
were expected to have heuristic interest if our 
interpretation of these results is valid. 


“a 
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Method 

Two sols of stimulus materials were constructed. 
One st tained to relations between concrete cate- 
gories i the other pertained to relations between 
abstract categories. A different group of 25 intro- 
ductor psychology students was exposed to each 
set. I: -ach case, judges received a series of stimu- 
lus it of the form “As verb Bs.” On the basis of 
each they inferred the validity of each of 
four inse propositions of the form “all As verb 
all B. all As verb some Bs,” “some As verb all 
Bs,” ‘some As verb some Bs.” 


Prep ciion of materials. As noted previously, 


Abels and Kanouse found no evidence that in- 
feren iepended on the particular subject and ob- 
ject ories involved. The replication of most of 
their lings in Experiment 1 also suggests that 
these wits are stable and independent of the par- 
ticula imulus categories to which generalizations 
pertain. Thus, to simplify the design of the present 
study ily one pair of concrete categories, tribes 
and was used. Five affirmative propositions 
were structed using these categories. Two propo- 
sitior tribes buy bees” and “tribes have bees”) 
descr: | favorable manifest relations one (“tribes 
kill ) described an unfavorable manifest rela- 
tion, the remaining two (“tribes like bees” and 
“tril slike bees”) described favorable and un- 
favo subjective relations. Five negations were 
also iructed by replacing the verbs used in the 
first : propositions by their negations (“do not 
buy, lo not have,” et al.). 

Th ond set of materials, pertaining to abstract 
items, was identical to that described above except 
that and Ys were substituted for tribes and bees, 
respe el; 

The 10 stimulus propositions in each set were or- 
dered sandomly in a questionnaire. In responding 
to exci proposition, judges were told to assume 
that someone had made the statement (e.g, “Sup- 
pose -omeone told you that tribes buy bees”) and 
then io indicate how likely it was that each of the 
four response items (“all tribes buy all bees” et al.) 


was true. These responses were recorded along a scale 
from 0 (not at all likely) to 10 (extremely likely). 


Results 


Because no hypotheses were made in this 
study concerning differences in the effects of 
different types of manifest relations (see Foot- 
note 1), corresponding responses to stimulus 
propositions containing “buy” and “have” 
were averaged to provide a single set of scores 
pertaining to favorable manifest relations. 
Responses were then analyzed as a function 
of one between-subject variable (the ab- 
stractness of the subject and object cate- 
gories) and five within-subject variables: 
the explicit subject quantifier of the response 
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item (all vs. some), the explicit object quanti- 
fier of the response item (all vs. some), the 
type of relation (manifest vs. subjective), 
the evaluative implications of the relation 
(favorable vs. unfavorable), and proposition 
form (affirmation vs. negation). Two ques- 
tions were of primary interest. The first was 
whether implicit subject quantifiers are re- 
garded as more universal than implicit object 
quantifiers. The second was whether implicit 
quantifiers are assumed to be more universal 
when the relation described is subjective than 
when it is manifest. Let us consider data 
relevant to each question in turn. 

Comparison of subject and object quanti- 
fiers. Differences in the implicit quantifiers 
assigned to subject and object categories can 
be inferred from the combined effects of ex- 
plicit subject and object quantifiers on esti- 
mates of the likelihood that response items 
would follow from the stimulus item. Three 
interactions bearing on these effects were sig- 
nificant, The first two interactions involve 
(a) the subject and object quantifiers and the 
abstractness of the categories contained in 
the generalization, F(1, 48) = 6.57, p < 01, 
and (b) these three variables and the evalu- 
ative implications of the relation described, 
F(1,48) = 6.57, p <.05. Data relevant to 
these interactions are shown in Table 4. The 
first interaction appears attributable primarily 
to differences in judgments of response state- 
ments of the form “all As verb some Bs,” 
When A and B were concrete, these state- 
ments were judged as likely to follow from 
the stimulus proposition as were statements 
of the form “some As verb some Bs” and 
more likely to follow than statements of the 
form “some As verb all Bs.” Although the 
latter difference is not large, it is consistent 
with the assumption that object categories are 
assigned less universal implicit quantifiers 
than subject categories. Note that the differ- 
ence between judgments of “all As verb some 
Bs” and “some As verb all Bs” occurred pri- 


1 Because the implications of favorable manifest 
relations appeared to be more heterogeneous than 
those of unfavorable relations (‘“‘kill,” “destroy,” 
et al.), it was decided to consider two such relations 
although only one instance of other relation types 


was considered. 
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TABLE 4 


INFERENCES OF EXPLICITLY QUANTIFIED PROPOSITIONS 
AS A FUNCTION OF CATEGORY ABSTRACTNESS AND 
FAVORABLENESS OF THE RELATION DESCRIBED 


Concrete categories Abstract categories 


Subject 
quantifier- Favor- Unfavor- Favor- Unfavor- 
Object able able able able 
quantifier relation relation relation relation 
all-all 6.26 6.52 6,06 6.50 
all-some 7.73 8.12 6.65 6.18 
some-all TAS 7,97 7.22 7.39 
some-some 7.78 7.87 7.79 7.93 


marily when the relation described was fa- 
vorable. This implies that the differences be- 
tween the influence of inductive evidence 
about subjects and the influence of inductive 
evidence about objects should also be greater 
when the relation involved is favorable. Data 
obtained in Experiment 1 (see Table 1) sug- 
gest that this may, in fact, be the case. The 
assumptions underlying the interpretation of 
me first experiment, therefore, appear justi- 
ed, 

However, the pattern of judgments made 

when the generalizations pertained to abstract 
categories is somewhat different. Here, state- 
ments of the form “all As verb some Bs” were 
judged no more likely to follow from the 
stimulus proposition than were statements of 
the form “all As verb all Bs,” and they were 
judged much less likely to follow than were 
statements of the form “some As verb all 
Bs.” The latter difference suggests that judges 
assign less universal quantifiers to abstract 
subject categories than to abstract object 
categories. To this extent, the results ob- 
tained in Experiment 1 would not generalize 
to conditions in which subject and object 
categories are abstract. Rather, propositions 
describing the relations among abstract cate- 
gories will be accepted more readily on the 
basis of inductive evidence about the subject 
than on the basis of inductive evidence about 
the object. 

The second interaction, Proposition Type 
(affirmation vs. negation) X Subject Quanti- 
fier X Object Quantifier, F(1,48) = 17.10, p 
< .01, is also worth noting in this context. 
Data relevant to interaction are shown 
in Table 5. These 1 suggest that judges 
were less apt to infer that the subject and 
object quantifiers were both “all” when the 
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stimulus proposition was an affirm 
= 6.0) than when it was a negati: 
6.7). Correspondingly, they were 
more apt to infer that either or boi 


atively 
yuanti- 


fiers were “some” in the former (¢° = 7,9) 
than in the latter case (M = 7.1). 1f gen- 
eralizations are accepted more readi! on the 
basis of inductive evidence when t «ir im- 
plicit quantifiers are less universal, is im- 
plies that affirmations will be accep!‘ more 
readily than negations on the basi such 
evidence. 

Effects of manifest vs. subjective ro ations. 
In Experiment 1, generalizations abou. mani- 
fest relations were accepted more re’ ily on 
the basis of inductive evidence t} were 
generalizations about subjective tions. 
This finding was attributed to the that 
judges assume manifest relations to >e less 
universal than subjective relations an there- 
fore, assign. relatively less universal — plicit 
quantifiers to the subject and objec) cate- 
gories involved in them. Results of t+ pres- 
ent experiment are consistent with th- inter- 
pretation. Response statements were «idged 
less likely to follow from stimulus ; oposi- 
tions describing manifest relations (M = 
6.91) than from propositions describi=s sub- 
jective relations (M = 7.41), regardi»ss of 
whether the subject and object quantifiers 
were “all” or “some,” F(1,48) = 25.50, p < 


01. This suggests a strong tendency for 
judges to believe that manifest relations are 
less universal than subjective relations—a 
tendency that is apparent even when the 
quantifier is “some.” 

Although the difference between inferences 


TABLE 5 


INFERENCES OF EXPLICITLY QUANTIFIED AFFIRMATIONS 
AND NEGATIONS 


Object quantifier 


Subject 
quantifier All Some 
Affirmations 
All 5.99 7.63 
Some 7.82 8.23 
Negations 
All 6.68 6.70 
Some 7.04 7.45 


ACCEPTANCE OF GENERALIZATIONS 


% of mani/es! and subjective relations was simi- 
Jar in di:ection at each combination of other 
experi? il variables, its magnitude varied. 
Three tiple interactions were significant, 
each of which involved one but not both ex- 
plicit quantiñers; specifically, the interactions 
of Type of Relation X Type of Proposition X 
Explicit Object Quantifier, F(1,48) = 13.51, 

4 p<. lype of Relation X Evaluative Im- 

` plications of the Relation X Object Quantifier, 
F(1,45) = 8.05, p<.01, and Type of Re- 
lation < Evaluative Implications of the Rela- 
tion » Subject Quantifier, F (1, 48) = 4.10, $ 
<.05. Because these contingencies are com- 
plex «id are not germane to the issues of con- 
cern © this paper, they are not discussed.* 

EXPERIMENT 3 


Our interpretation of the effects of deduc- 


tive evidence was based in part on the as- 
sumption that judges were more likely to 
intepret definitions of subject (person) cate- 


gowes as disjunctive than definitions of ob- 
jec (nonperson) categories. This experiment 

_ provided some support for this assumption. 

d udges may not be able to state clearly 
thcir interpretations of category definitions. 
Moreover, asking them to do so may intro- 
duce experimental demands to respond simi- 
larly to all category definitions, thereby de- 
creasing the probability of detecting the dif- 
ferences expected. Thus, a more indirect mani- 
festation of judges’ interpretation of category 
detinitions was obtained. Suppose, as in Case 
3 of Figure 1, that judges are told to assume 
that “tribes are primitive, communal, agri- 
cultural societies,” and are then asked to es- 
timate both the likelihood that “tribes are 
agricultural societies,” and the likelihood that 
“agricultural societies are tribes.” If judges 
treat the definition of tribes as conjunctive, 
they should estimate the first statement to be 
very likely to be true, and this estimate 
should be much greater than their estimate of 
he likelihood that the second statement is 
true. However, if judges treat the definition 
as disjunctive, the first of these estimates 
should be relatively lower and not necessarily 
much different from the second. Thus, the 
greater the difference between the two esti- 
mates, the more likely it is that judges are 
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treating the definition as conjunctive rather 
than disjunctive. 

The third experiment was based on this 
line of reasoning. Specifically, if definitions of 
subject (person) categories are more likely 
to be treated as disjunctive than are defini- 
tions of object (nonperson) categories, dif- 
ferences analogous to those described above 
should be less when subject than when object 
categories are involved. 


Method 


Thirty-one introductory psychology students par- 
ticipated in the study. Stimulus materials were con- 
structed using three of the pairs of subject and 
object categories considered in Experiment 1: tribes 
and bees, students and classes, and urbanites and 
magazines. Stimulus items consisted of the defini- 
tions given to these six categories under deductive 
evidence conditions of the first experiment (e.g, 
“tribes are primitive, agricultural, communal soci- 
eties”; “bees are flying, furry, buzzing insects,” ete.). 
In each case, judges were told to assume that the 
stimulus statement was true and then, based on this 
assumption, to infer the likelihood that each of six 
response statements was true. Three of these state- 
ments (eg, “a tribe is a primitive society,” “a 
tribe is an argricultural society,” and “a tribe is a 
communial society”) pertained to the likelihood 
that members of the subcategory belonged to cach 
of the three superordinate categories. The remain- 
ing three items (eg, “a primitive society is a 
tribe,” “an agricultural society is a tribe,” and “a 
communal society is a tribe”) pertained to the like- 
lihood that members of each superordinate category 
belonged to the subcategory. The six stimulus items 
and the six response items pertaining to each were 
presented in random order in a questionnaire. Judg- 
ments of each item were made along a scale from 0 
(not at all likely) to 10 (extremely likely). The 
mean judgment of the three response items pertain- 
ing to membership in superordinate categories, and 
the mean judgment of the three response items per- 
taining to membership in the subordinate category, 
were calculated for cach stimulus item and used in 
subsequent data analyses. 


Results 


Judgments were analyzed as a function of 
three variables: item set, the category within 
each set to which the stimulus item pertained 
(subject vs. object), and proposition form. 
Subcategory members were judged more 


2 Details on the nature of these contingencies may 
be obtained from the second author. 

3The fourth pair used jn Experiment 1 was 
omitted from the experiment due to oversight. 
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likely to belong to superordinate categories 
(M = 9.70) than were superordinate cate- 
gory members judged to belong to the sub- 
category (M = 3.77), F(1,30) = 405.81, $ 
< .001. However, this difference was signifi- 
cantly less when subject (person) categories 
were involved (9.70 vs. 4.06) than when ob- 
ject (nonperson) categories were involved 
(9.70 vs. 3.47), F(1,30) = 9.58, p < .01. 
This latter result was not significantly con- 
tingent on item set, F(2,60) = 2.27, p> 
10. These data, therefore, support the hy- 
pothesis that the definitions of subject cate- 
gories are generally perceived as more dis- 
junctive than are the definitions of object 
categories, 


GENERAL Discussion 


The results of this series of studies con- 
firmed many of the results reported by Abel- 
son and Kanouse (1966) and the assump- 
tions underlying their interpretation. The fac- 
tors that affected the acceptance of generali- 
zations based on inductive evidence were 
those that affected assumptions about the uni- 
versality of the generalizations to be made, 
whereas those that affected the acceptance of 
generalizations based on deductive evidence 
were those that affected assumptions about the 
manner in which the categories described in 
the generalization are defined. In addition, a 
general positivity bias, leading to greater 
acceptance of generalizations about favorable 
relations than of generalizations about un- 
favorable relations, seemed to occur regard- 
less of the type of evidence presented. 

The acceptance of a generalization de- 
pended on whether the evidence presented 
was specific to the subject or to the object 
category of the generalization. This contin- 
gency seemed to result from apparent ten- 
dencies for judges to assign different implicit 
quantifiers to subject and object categories 

and to interpret the definitions of those cate- 
gories differently. However, an ambiguity in 
interpreting these data arises from the fact 
that in the studies reported here, like those 
conducted earlier by Abelson and Kanouse, 
subject categories pertained to groups of per- 
sons (tribes, students, et al.), whereas object 
categories pertained to nonpersons or objects 
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hesized 
judges 


(bees, classes, et al.). We have hyp 
that the apparent assumptions th 


make about subject and object cate. vies re- 
flect the assumptions they make abo) classes 
of persons and nonpersons, respectiv How- 
ever, future research should attem) > dis- 
tinguish between the effects of evic per- 


taining to subject and object cate S; 


one hand, and the effects of evide e per- 
taining to person and nonperson cate vies on 
the other, 

Whereas the results reported in t paper 
are generally consistent with thos: rorted 
by Abelson and Kanouse (1966) » Con- 
tingencies in the conclusions draw n the 
basis of this prior research should |} noted. 
First, the process of making inferenc about 
subjective relations on the basis o object- 
specific deductive evidence appears || akin 
to that employed in an impression-! nation 
task in which judges are called on to 'aluate 
an object on the basis of informati about 
its attributes. As a result of this pr ‘ss, ac- 
ceptance of generalizations is greate .n this 
case than when the deductive evide e per- 
tains to other types of relations, or an the 
evidence is subject specific rather — an ob- 
ject specific. 

A second contingency is suggested -y indi- 
cations that when the categories in ‘he gen- 
eralization to be drawn are abstrac:. judges 
appear to infer that subject categories per- 
tain to a smaller proportion of instan:s than 
do object categories. This tendency is the 
reverse of that obtained when the categories 
involved describe concrete, familiar groups. 


Conceivably, when categories are abstract, 
judges are relatively less likely to interpret 
the subject category as pertaining to persons 
and relatively more likely to interpret the 
object category as pertaining to persons. If 
categories of persons are considered to be 
more heterogeneous than categories of non- 
persons, and therefore are assigned less uni- 
versal implicit quantifiers when generaliza- 
tions are made about them, the differences 
obtained under this condition would be ac- 
counted for. In any event, these data suggest 
that the greater effect of object-specific in- 
ductive evidence than of subject-specific in- 
ductive evidence obtained in Experiment 1 
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would » occur when the categories of the 
genera!’ ‘ion to be evaluated are abstract. 
Genera ions about the relations between 
abstrac. -ategories are not intrinsically of 
much chological interest. However, the 
possibi that judges make different assump- 
tions ıt the universality of these categor- 
ies th iey do about the universality of con- 
crete -gories has methodological implica- 
tions ause it suggests that the results of 
reseai' that uses abstract information, for 
the } ose of eliminating the idiosyncratic 
effect of past experience on inferences, may 
actus v not generalize to conditions in which 
more oncrete information is involved. 

Al ough the present research was ostensi- 
bly — ncerned with generalizations based on 
diffe nt types of evidence, the factors identi- 
fied as affecting these generalizations had 
litt! directly to do with the specific implica- 
tior of this evidence. The manner in which 
fac rs interact with the amount of confirm- 
iny and disconfirming evidence to affect con- 
clu ‘ons is a matter for further consideration. 


Ge ‘ob et al. (1973) found that the willing- 
nes to accept conclusions appeared to be an 
ac litive function of the numbers of confirm- 
in: and disconfirming pieces of evidence pre- 
seated. However, it seems reasonable to sup- 
pose that the magnitude of these effects will 
depend on whether the evidence presented is 
inductive or deductive and whether it is sub- 
ject or object oriented. For example, if judges 
assign less universal quantifiers to object 
caiegories than to subject categories, then 
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increasing amounts of confirmatory inductive 
evidence may have less effect when it is 
object specific than when it is subject specific. 
This and other possibilities are worth explor- 
ing. 
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A Longitudinal Study of the Genetics of Personality 
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A longitudinal twin study was conducted to determine whether personality 
traits with significant heritability in adolescence remain so in adulthood. A 
subsample of a group of twins who had been administered the Minnesota 
Multiphasic Personality Inventory and the California Psychological Inventory 
in adolescence was readministered the same two inventories 12 years later. 
The subsample was found to be representative of the sample from which it 
was drawn. Different patterns of significant heritability were found for the 
two ages studied: Several personality traits demonstrated evidence of significant 
heritability in either adolescence or adulthood, while others demonstrated 
evidence of significant heritability at both ages. In addition, genetic influences 
on the change from adolescence to adulthood were found for several person- 
ality traits. The results raise important questions for the interpretation of twin 
research and the understanding of genetic influences on personality development. 
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There has been renewed interest in the 
biological bases of personality (e.g., Buss & 
Plomin, 1975; Claridge, Canter, & Hume, 
1973; Eysenck, 1967). One consequence of 
this is an increased reliance on the twin 
method in determining the importance of 
genetic influences on personality traits (for 
reviews of twin studies of personality, see 
Mittler, 1971, and Vandenberg, 1967). Most 
twin studies of personality have used ado- 
lescent twins, because of the relative ease with 
which such samples can be collected. An as- 
sumption of many of these studies has been 
that results from one age would be compara- 
ble with results from other ages. Although 
often implicit, this assumption has been 


This research was supported in part by grants 
from the Milton Fund to the third author and from 
the Department of Psychology and Social Relations 
of Harvard University, and by a National Institute 
of Mental Health Graduate Traineeship to the first 
author and a Scottish Rite Fellowship for Research 
in Schizophrenia to the second author. 


We thank Andrea L. Megela and Sarah W. Bart- 
lett for valuable assistance at several stages of the 
research, Ronald S. Wilson for advice regarding data 
analysis, and n C. Loehlin for a critical reading 


of the ms 
Requests fox 
Dworkin, Dep 
Relations, Harva 
Cambridge, Massac 


cprints should be sent to Robert H. 
rent of Psychology and Social 
| University, 33 Kirkland Street, 
husetts 02138, 


clearly stated by Buss and Plomin 75) 
and Vandenberg (1967). These author: have 
argued that personality traits that arc «nder 
substantial genetic control at one age ould 
remain so at other ages and should show 
longitudinal stability, Although not incom- 
patible with this position, others have em- 
phasized that the extent to which traits are 


under genetic control (heritability estimates) 
may be expected to differ from one «ze to 
another (Cavalli-Sforza & Bodmer, i971; 
Gottesman, 1963, 1974; Thompson, 1967). 
For example, Morton (1975; Rao, Morton, 
& Yee, 1974) has recently presented evidence 
suggesting that the heritability of IQ is very 
much lower in adults than in children; he 
argued that questions about heritability are 
Meaningless without specifying the age group 
studied. 

The issue is an important one. If twin 
studies at one age are inconclusive regarding 
genetic influences at other ages, then studies 
of personality using adolescent twin samples, 
although of interest in their own right, may 
yield little or no information with regard to 


+The terms heritability and genetic variance will 
be used interchangeably in this article. Strictly 
speaking, the terms are not equivalent; the twin 
study method as it is typically used estimates gè- 
netic variance (Cavalli-Sforza & Bodmer, 1971)- 
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nce of genetic influences on adult 


persona! i Indeed, even when personality 
traits are examined in an adult twin sample 
(eg., Eaves & Eysenck, 1975), the age range 
studied should perhaps be as narrow as 
feasible, ‘nce it might be the case that the 
degree eritability varies at different ages 
within span of adulthood. It is also pos- 
sible whether a trait is under genetic 
contro! t all varies as a function of the age 
at wh’ it is studied. If the latter possibil- 
ity pr | to be true, reviews of twin studies 
that estigate, without regard to age, 
whet! a given personality trait shows a 
signif ənt degree of heritability in several 
differ: -t samples would have to be reexam- 
ined, aking into account the ages of the 
twins studied, 


Ti- question of the comparability of esti- 


mate; of the heritability of personality traits 
obtained at different ages has been examined 
by everal authors who have divided their 


twit. samples into older and younger groups 
(Biss, Plomin, & Willerman, 1973; Claridge 
et ol., 1973; Partanen, Bruun, & Markkanen, 
1966; Willerman, 1973). Results from this 
type of investigation have been equivocal, 
often because of the small sample sizes re- 
sulting when twin pairs are divided into two 
groups. Because these investigations were not 
initially designed to examine the question of 
ave effects in the heritability of personality, 
the age analyses typically involved dividing 
the sample at the mean age and hence did not 
usually compare meaningful age groups. An- 
other strategy has been to administer, to one 
age twin sample, a personality measure for 
which twin data already exist for a different 
age (win sample. In only two instances has 
this type of comparison been made. Reznikoff 
and Honeyman (1967) administered the 
Minnesota Multiphasic Personality Inventory 
(MMPI) to a small sample of adult twins 
and compared their findings with Gottesman’s 
(1963) results for adolescent twins. They 
concluded that the comparison substantiated 
Gottesman’s conjecture that the extrapolation 
of his findings on adolescents to adults would 
Probably not be appropriate. Horn, Plomin, 
and Rosenman (1976) compared their results 
on the heritability of the California Psycho- 
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logical Inventory (CPI) scales in an adult 
male sample with those obtained by Nichols 
(1966) for high school boys. They concluded 
that the average differences between the in- 
traclass correlations for monozygotic (MZ) 
and dizygotic (DZ) twins were approximately 
the same in both samples. We calculated the 
Spearman rank-order correlation coefficient of 
the heritabilities obtained in the two studies, 
a more sensitive index of the extent to which 
the results of the studies are similar, and 
found it to be nonsignificant (rs = .29). 

Given the sensitivity of estimates of 
heritability to sampling considerations and 
numerous genetic and environmental factors 
(Fuller & Thompson, 1960), conclusions 
about the comparability of such estimates 
obtained from these types of cross-sectional 
investigations are understandably inconclu- 
sive. In order to determine whether the per- 
sonality traits that show evidence of signifi- 
cant heritability at one age are also under 
significant genetic control at another, a longi- 
tudinal investigation was conducted, A sub- 
sample of a group of twins who had been 
administered two personality inventories in 
adolescence was readministered the same two 
inventories approximately 12 years later, 
There has not been a previous longitudinal 
investigation of the heritability of personality 
traits using personality inventories, and it was 
hoped that the results of this study would 
speak to the questions raised above regarding 
the nature of genetic influences on personality 
at different points in the life cycle and the 
extrapolation of the results of twin studies 
from one age group to another. 


METHOD 

Subjects 

In 1961 and 1962 Gottesman administered the 
MMPI and CPI to a sample of 178 high school age 
same-sex twin pairs in the Boston area (Gottesman, 
1965, 1966). Of the 147 pairs for which both twins 
had valid MMPIs in adolescence (sce Gottesman, 
1966, for the criteria used in determining validity), 
43 pairs were Jocated at follow-up and readminis- 
tered the two inventories. One of these pairs was 
dropped from further analysis because of an invalid 
MMPI for one of the twins at follow-up. Of the 
42 remaining pairs of twins with valid MMPIs at 
both initial and follow-up testing, 25 pairs were MZ 
(16 female, 9 male) and 17 pairs were DZ (8 fe- 


male, 9 male). 
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The original twin sample was collected through 
cooperating school systems, advertisements in neigh- 
borhood newspapers, and Mothers of Twins Clubs. 
Additional sampling and demographic data are 
reported by Gottesman (1966). The zygosity of most 
of the pairs in the original sample was established 
by extensive blood grouping. For reasons of econ- 
omy, some pairs who were clearly MZ or DZ were 
not blood typed. The average age of the 42 retested 
pairs when seen in adolescence was 15.9 years and 
when seen at follow-up was 27.9 years. 


Procedure 


At both the initial and follow-up contacts, most 
twins were administered the MMPI and CPI in 
groups of varying size. At follow-up several twins 
were individually tested. Both inventories were com- 
pleted in a single session, the CPI always being 
administered first. A few twins were mailed the two 
inventories with instructions on how to complete 
them in their homes. Biographical information was 
collected at both ages. Twins were paid for their 
participation at both contacts. 


Data Analysis 


Evidence of significant heritability in the MMPI 
and CPI scales was assessed by two complementary 
techniques (Gottesman, 1963). For each personality 
scale, the intraclass correlation coefficients for the 
MZ pairs and the DZ pairs were calculated with the 
formula suggested by Haggard (1958). The MZ 
intraclass correlation coefficients (Rxz) and the DZ 
intraclass correlation coefficients (Rpz) for each 
scale were then transformed into z scores. Whether 
Ruz is significantly greater than Roz, as the genetic 
hypothesis predicts, was tested with a one-tailed test. 
In addition, the F ratio consisting of the DZ within- 
pair mean square divided by the MZ within-pair 
mean square, which has been widely used as a sig- 
nificance test for the presence of genetic variance in 
twin data, is also reported for each scale. Because 
there is some dispute in the literature regarding the 
relative merits of these two approaches (e.g., com- 
pare Christian, Kang, & Norton, 1974, with Jinks & 

Fulker, 1970), both are presented here. 

Quantitative estimates of heritability are not re- 
ported. Loehlin, Lindzey, and Spuhler (1975) and 
Eaves (1972) have shown that the confidence limits 
around estimates of heritability are very wide with 
Samples of the size typically used in twin studies. 
Given the small sample size on which our data are 
based, we have limited our analyses to determining 
the presence of significant genetic variance for the 
personality traits studied and have not attempted 
a numerical estimate of the extent of genetic variance 
present. However, both Holzinger’s (1929) and 
Falconer’s (1960) estimates of heritability can be 
easily calculated from the data we present, Another 
consequence of the small sample size is the impos- 
sibility of analyzing the data for sex differences, 
although we realize that they are important for some 
traits (Gottesman, 1963, 1966). 
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RESULTS 

Representativeness of the Retested Suv imple 

Since the sample of retested pairs © only 
29% of the sample of pairs with valid ^ [PIs 
at the initial testing, the demonstra‘ that 
this subsample is a representative on `s es- 
sential for interpreting subsequent fin’ «s, In 
Figure 1 the mean MMPI profiles fo {males 
and females for the entire sample of | airs 
and for the retested subsample at ado ence 
are presented. The norms used ar ‘hose 
given by Dahlstrom, Welsh, and Da’ trom 
(1972) for 16-year-olds, without K  rrec- 
tion. As can be seen from the figure ado- 
lescence the subsample that was ret: >d in 
adulthood was very similar to the en ado- 
lescent sample. The slight but con ently 
greater elevation of the entire samp’ when 
compared with the subsample is a ı lt of 
the fact that the subsample was dra. from 
only those pairs in the sample havi: valid 
MMPIs. This is made especially ar by 
the differences between the subsan and 
entire sample on the L and F sca since 
these were the scales used in establi: ng va- 
lidity. Figure 1 also demonstrates w} many 


that 
ntative 
se, the 
This 
ns and 


twin studies have neglected to exan 
the sample of twins studied is repr 
of the population in general, in thi: 

population of 16-year-old adolesc: 

is true for both the entire sample oi 

for the subsample. 

Although the comparison of 
MMPI profile of the subsample a: 
cence with the mean MMPI profil 
entire sample has demonstrated substantial 
similarity, another method exists for assessing 
the representativeness of the subsample. This 
is to determine whether the personality scales 
that showed significant evidence of genetic 
variance in the subsample at adolescence are 
the same as those that showed significant 
evidence of genetic variance in the entire 
sample. For the original sample of pairs with 
valid MMPIs at adolescence, six of the 
MMPI clinical scales and six CPI scales 
demonstrated statistically significant evidence 
of genetic variance (see Gottesman, 1965, 


mean 
idoles- 
of the 


1966, and revised values in Table 1). For the 
retested subsample’s adolescent data, four 
of these six MMPI scales and three 0 


| 
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Fıcure 1. Mean Minnesota Multiphasic Personality Inventory profiles for retested subsample 
« adolescence and entire sample at adolescence. (T` scores, with a mean of 50 and a standard 
deviation of 10, are based on norms for 16-year-olds, without K correction.) 


the èe six CPI scales were also significant. 
Gi en the small size of the retested sample, 
th se results provide further evidence of its 
re sresentativeness. 


L omogeneity of Variance 


Inasmuch as heterogeneity of the total vari- 
ances of the MZ and DZ pairs invalidates the 
model on which twin study data analysis is 
besed (Kempthorne & Osborne, 1961), the 
F ratio of these variances for each scale at 
each age was calculated (the method sug- 
gested by Haseman and Elston, 1970, was 
used). Of the 66 two-tailed tests calculated, 
5 were significant at the .05 level. Only 2 of 
these, the tests for MMPI scales Pa and A at 
adolescence, were for scales that demonstrated 
evidence of significant genetic variance. 


Comparison of Adolescent and Adult 
Twin Data 


With respect to the adolescent subsample 
that was retested in adulthood, Table 1 
shows MZ and DZ intraclass correlation co- 
efficients and F ratios for the 10 MMPI clin- 
ical scales, the K scale, and four research 
scales. These data can be compared with the 
F ratios (presented in the last column of the 


adolescent section of the table) for the entire 
adolescent sample having valid MMPIs (cf. 
Gottesman, 1965, for preliminary analysis). 
In both the subsample and the entire sample, 
the D, Pd, Pa, and Sc scales show evidence of 
significant heritability. Because of sampling 
fluctuations, the Mf scale is significant in the 
subsample but not in the entire sample, while 
the Hy and Si scales are significant in the 
entire sample but not in the subsample. 

The adolescent data for the subsample 
demonstrate that the significance of the F 
ratio and the significance of the difference 
between Ryy and Rpz do not always corre- 
spond. This is a function of both the small 
sample size and, presumably, some nonsig- 
nificant heterogeneity of variance. Another 
consequence of the small sample size is that 
several of the Rpzs are zero or negative when 
there is evidence of significant heritability, a 
finding that with a larger sample would be 
considered counter to the genetic expectation. 

When the adult MMPI results presented 
in Table 1 are examined, a different pattern 
of heritabilities emerges. The clinical scales for 
which there was evidence of significant heri- 
tability in adolescence are not under signifi- 
cant genetic control in adulthood. For three 


scales, Ma, K, and Es, there is evidence of 
significant heritability in adulthood but not 
in adolescence, Two scales, A and Dy, dem- 
onstrate evidence of significant heritability at 
both ages. 

Table 2 reports the adolescent MZ and DZ 
intraclass correlation coefficients and F ratios 
for the 18 CPI scales. These data are for the 
subsample that was retested in adulthood and 
can be compared with Gottesman’s (1966) F 
ratios (presented in the last column of the 
adolescent section of the table) for the entire 
adolescent sample having valid MMPIs. In 
both the subsample and the entire sample, 
the Do, Sy, and Sa scales show evidence of 
significant heritability. Because of sampling 
fluctuations, the To scale is significant in the 
subsample but not in the entire sample. This 
is also the case for Ai and Je, although there 
was evidence of significant heritability for 
these scales in the female subset of the origi- 
nal sample. Three scales, Sp, So, and Gi, were 
significant in the entire adolescent sample but 
not in the subsample. 

As was true for the MMPI, when the adult 
CPI results presented in Table 2 are com- 
pared with the adolescent data, different pat- 
terns of heritability emerge. The only CPI 
scale with significant genetic variance in both 
adolescence and adulthood is Do. The results 
for the Sc and Gi scales are not clear-cut: 
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Both are significant in adulthood bi! not in 
the adolescent subsample; however, ‘here is 
evidence of significant heritability for the Gi 


scale in the entire adolescent sample and for 
the Sc scale in the female subset of entire 
sample (Gottesman, 1966). Both these 
findings are most simply interpreted! »s a re- 
sult of sampling fluctuations, and it `: prob- 
ably best to consider both of these les as 
exhibiting evidence of heritability du both 
adolescence and adulthood, in additio: to the 
Do scale. Five scales for which t! was 
evidence of significant heritability in :doles- 
cence, Sy, Sa, To, Ai, and Je, are no under 
significant genetic control in adultho:.’ Two 
scales, Wb and Py, demonstrate ev e of 
significant heritability in adulthood not 
in adolescence. 

The results of the preceding analy s are 
summarized in Table 3. The different | ‘terns 
of significant heritability that were found 
in adolescence and adulthood are a ction 
of changes in both MZ and DZ sin ‘arity. 
As shown by the intraclass correlati and 
within-pair mean squares in Tables | and 2, 
MZ similarity tends to increase v age, 
while DZ similarity tends to decrease ‘or the 
MMPI and CPI scales that are signiñcant in 
adulthood but not in adolescence. tor the 
MMPI scales that are significant in adoles- 
cence but not in adulthood, DZ similarity 


TABLE 1 


ADOLESCENT AND ADULT MINNESOTA MULTIPHASIC PERSONALITY INVENTORY INTRACLASS 
CORRELATION COEFFICIENTS AND F RATIOS 


Adolescent Adult 
MZ DZ MZ DZ 
within- within- within- within- 
oa RSA pait pair F Fo pair pair 
Scale wz pe OMS MS (17,25) (68,79) Ruzo Ros MS MS 
Hypochondriasis (Hs) 36.48 39.09 1.00 
Depr ession, (D) $410 92.27 1go 149 So 32:30 
steria (Hy) 35, 69.62 1.51* 16 40 53.36 
Psychopathic Deviate (Pd) 32, 3 Bii 
Max ulinity-Femininity a as EG Seer gis 
Ww) “1737.82 106.97 
Paranoia (Pa) 19 5256 161.21 tag -27 3130 
Psychasthenia (Pt) A4 51.30 7715 1146 19 715 
Schizophrenia (Sc) -24 47.08 100.44 1.48* $3 food 
Hypomania (1a) c31 057.76 91.32 117 D8 37.44 
Social Introversion (Si) 36 3304 50.56 1.69* 125 -44.14 
f -29 53,62 47.32 113 =; 47. 
Anxiety (A) —.04 49.20 54.06 ae “oo. 28.42 
Repression (R) —.04 81.22 92.03 1.24 1250.58 
Ego Strength (Es) 14 43.24 72.50 1.35 —06 30.12 
Dependency (Dy) 21 53.92 72.85 (85 Tios 26:26 


Note, MZ = monozygotic; DZ = dizygotic. 
A Significance levels given in Ruz € 


umn are for Ryz > Rpz. 


b Revised values for entire adolescent sample (cf. Gottesman, 1965, for preliminary analysis). 


*p <,05. 
p< .01. 


a 


IZ = monozygotic; DZ = dizygotic. 


aS 
“ f 


‘or entire adolescent sample (Gottesman, 1966). 


tencs to increase with age, while there is no 
consistent trend in MZ similarity. For the 


ance levels given in Ruz column are for Ruz > Roz. 


score from adolescence to adulthood is under 
significant genetic control. That the A and 
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; TABLE 2 
SCENT AND ADULT C. ` Di 
PAGESE PSYCHOLOGICAL INVENTORY ĪNTRACLASS CORRELATION 
s o OEFFICIENTS AND F RATIOS 
Adolescent Adult 
MZ 
Me MZ DZ 
saii 4 = within- within- 
r pair i F 
Ruze Roz MS (17, 25) (68,79) Ruzi Ro MS MS (17,25) 
Dominan 62 45 3462 7641 22 
. Re $ : $ i L214 aos 70% 
Capacity us (Cs) 64 164 49.74 67.14 1.36. 1.34 40 ee ae 
a ee gpi Sro Ht 
Seli-Acce; Sa) 166 44 38.78 83.32- 2428 1 2 3400 F809 1.08 
e > i 5 ie 3.3 12e 1858 x 
Sense of icing (Wb) 156 135-8420 87.29 1.04 1.11 sda ia IES 255 
Responsi (Re) a7 i61 25.94 48.74 1.88 1.35 343 4864 66.59 L37 
Söctalizati i (So) ‘51 i3L 4184 83.35 1,99 1,484 169 31.86 46.06 1,45 
Sell-Con sc SI, 26 4904 68.03. 1.39, 1-38 i 40,98 106.50 2,60% 
Good Im on Gi) “a “33 51.82 55.85 1.08 1.60"  .46* Eris 10800 LAA 
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scales that are significant in adolescence 
bu’ not in adulthood, MZ similarity tends to 
decrease with age, while there is no consistent 
trend in DZ similarity. The relationship be- 
tween changes in twin similarity with re- 
peated measurements and heritability has been 
discussed by Thompson and Wilde (1973). 


Twin Similarity in Personality Change 


Wilson (1968, 1974) has developed an 
analysis of variance for longitudinal twin data 
that yields values for Ruz and Rpz reflecting 
twin similarity in the pattern of change over 
age. With data for two ages, as in the present 
study, this analysis is equivalent to an analy- 
sis of variance of difference scores between 
the two ages (Winer, 1971). The 15 MMPI 
scales and the 18 CPI scales were each analyzed 


for adolescence-to-adulthood change by this 
3 and whether Ruz was significantly 


greater than Rpz was tested for each scale. 
For five MMPI scales, Hs, A, R, Es, and Dy, 
and two CPI scales, Wb and To, MZ similar- 
ity was significantly greater than DZ simi- 
larity at the .05 level, one-tailed test. This 
indicates that for these scales, the change in 


Dy scales show significant heritability in the 
retested subsample in both adolescence and 
adulthood as well as significant heritability for 
adolescence-to-adulthood personality change 
supports the importance of genetic influences 
on these scales. Data for the Hs, R, Es, Wb, 
and To scales suggest that the development 


TABLE 3 


SCALES WITH SIGNIFICANT / RATIOS OR Ruz > Ron 
IN ADOLESCENCE AND/OR ADULTHOOD 


Scales with significant Ps or Ruz > Roz 


Adolescence 


In- Adolescence Adulthood and 
ventory only only adulthood 
MMPI D, Pd, Pa, Sc, Ma, K, Es A," Dy* 
Mf" Hy, Si? 

CPI Sy, Sa, To Wh, Py Do, See Gi 
Sp,» So,» 
Ais I 


Note. For first and third columns, scales had significant F“ 
ratios or Ruz > Roz in both the entire adolescent ont and 
the adolescent subsample, unless otherwise noted. MMPI = 
Minnesota Multiphasic Personality Inventory; CPI = Cali- 
fornia Psychological Inventory. 

^ Significant in subsample only. 

ignificant in entire adolescent sample only. 4 
e Significant in female subset ‘of entire adolescent sample only. 
4 Significant in subsample and female subset of entire adoles- 


cent sample only. 
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of some traits may have a significant genetic 
component even though cross-sectional assess- 
ments of the heritability of these personality 
traits are not significant at both ages. 


Discussion 


The most important result of this study 
is that different patterns of significant heri- 
tability were found in adolescence and adult- 
hood. Three meaningful clusters of scales 
were found to be under significant genetic 
control in adolescence but not in adulthood: 
MMPI clinical scales originally constructed 
to assess psychopathology, CPI scales that 
are related to sociability, and CPI scales 
associated with achievement and intelligence. 
Exceptions to these generalizations were the 
MMPI Ma scale, significant in adulthood 
only, and the CPI Do scale, significant at 
both ages. A fourth group of scales was found 
to be under significant genetic control in 
adulthood but not in adolescence: several 
MMPI and CPI scales commonly thought to 
reflect emotional stability, self-confidence, and 
insightfulness. The characterization of the 
scales that demonstrated significant heritabil- 
ity in both adolescence and adulthood is less 
clear-cut. The three scales best representing 
the first factors of the MMPI and CPI (scale 
A for the MMPI and Sc and Gi for the 
CPI; Dahlstrom, Welsh, & Dahlstrom, 1975; 
Megargee, 1972) are included in the group of 
five scales for which there was evidence of 
significant heritability at both ages. This sug- 
gests that the major sources of variance in 
these two inventories correspond to traits 
that have a significant degree of genetic vari- 
ance in both adolescence and adulthood. 

In general, these patterns of significant 
heritability are supported by other investiga- 
tions. In a cross-sectional twin study of the 
MMPI using a Minneapolis adolescent twin 

sample, Gottesman (1963) found evidence of 
significant heritability for the D, Pd, Pt, Sc, 
Si, and A scales. With the exception of Pt, 
these scales were also under significant ge- 
netic control in the Boston adolescent sample. 
As with our adult sample, Reznikoff and 
Honeyman (1967) failed to find significant 
heritability in their adult sample for the 
MMPI scales that are significant in the 
Minneapolis and Boston adolescent samples. 
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Nichols (1966) reported substantial ©» ritabil- 
ity for the group of CPI scales re ‘ted to 
sociability and, to a lesser extent, or the 
scales related to achievement and int. ‘igence 
in a large sample of adolescent tws. Al- 
though Horn et al. (1976) have stu: ed the 
CPI in 45- to 55-year-old twins, the © -bstan- 
tially younger age of our adult twin- nakes 
any comparison inconclusive. 

It is important to point out that v le we 
have discussed the results in term: -f the 
presence or absence of significant her bility 
at one or both ages, the small san size 
involved makes it impossible to dis. \guish 
between the presence of genetic varia > that 
is not statistically significant and th: sence 
of any genetic variance at all. In othe vords, 
we have not shown that personalii traits 
that are under genetic control at one may 
be under no genetic control at anotho How- 
ever, we have demonstrated that i! © are 
substantial changes with age in w! per- 
sonality traits show evidence of si: !icant 
heritability.? 

How can such age effects in gen vari- 
ance in personality traits be explaine Since 
a change in any of the component: of the 
total phenotypic variance will affect |! value 
of the genetic variance, several ex)! -ations 
of the results are possible. Perhaps t ap- 
pealing, but also most difficult to pove, is 
that the changes in genetic varianc with 
age are a function of gene regulatio: (see 
Gottesman, 1974), genotype-environnicnt in- 
teraction and correlation, and/or epistasis 
(interactions between genes at two or more 


loci). Tt is also possible that when compared 
with either initial or final levels, development 
is associated with an increase in genetic influ- 
ences on a trait (Gottesman, 1963; Vanden- 
berg & Falkner, 1965). If this were the case, 
it would be expected that a greater number 


2 Because the adult sample is considerably smaller 
than the original adolescent sample, certain scales 
(Hs, Si, and Ac) have F ratios that while not sig- 
nificant in the adult sample, would have been sig- 
nificant in a sample the size of the larger adolescent 
sample. However, even if we consider these scales as 
exhibiting significant genetic variance in adulthood, 
the patterns of significant heritability described 
above are not substantially altered. 


DN 


of trai‘ are under genetic control in adoles- 
cence, was found in this study, since this 
is a iod when rate of development is 
acceler: ied, 

Cha ¿es in other components of the pheno- 
typic . riance could also account for the re- 
sults. . viler and Thompson (1960) pointed 
out t heritability will decline both as a 
funct of increases in the environmental 
varia’ +, which can be brought about by 
relax’ . environmental controls, and as a re- 
sult changing from an expressive environ- 
men! © a suppressive one. Heritability can 
be i ceased by corresponding reductions in 


the ¿© vironmental variance and by changing 
from : suppressive environment to an expres- 
sive ic. It can also be argued that MZ twins 
are: ore similar in personality than DZ twins 
bece se their shared environment is more 
simi ır. As adolescent twins become adults 
this similarity is less likely to occur; thus 
the appears to be a reduction in the herit- 
abi y of personality traits. This explana- 
tioi fails to account for the finding of sig- 
nif ant heritability for several MMPI and 
CI’ scales in adulthood but not in adoles- 
ce: ce and for the meaningful patterns that 
emerged. 

One final interpretation of the results should 
be mentioned. That a given MMPI or CPI 
scale exhibits significant heritability at one 
age but not at another might be the result 
of he scale measuring two different traits at 
the two ages (or conceivably, one trait at one 
age but nothing reliable at the other). Cor- 
rela. ional and multivariate longitudinal analy- 
ses that can be expected to shed light on this 
Question are in progress. 

It is not possible from the data collected 
in this study to determine which of the expla- 
hations offered above account for the results. 
However, this research has furnished the first 
Set of data suggesting that age is a variable 
that must be considered in future research on 
the genetics of personality. It is no longer 
Possible to assume that studies of the heri- 
lability of personality traits in one age group 
Yield valid information with regard to other 
äge groups, and it is no longer meaningful to 
teview twin studies of personality without 

taking into account the ages of the different 
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samples studied (Scarr, 1969; Vandenberg, 
1967). Prospective longitudinal twin studies 
with large samples and longitudinal adoption 
studies have the potential to specify in greater 
detail the nature of the changes in the herit- 
ability of personality with age and thereby 
answer the many questions that this research 
has raised. 
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1 recent review of implicit personality 


theo. y research, Schneider (1973) suggested 
thai greater attention be given to perceptual 
cate /ories that individual perceivers charac- 
teri ically use. This theme has often been 


stressed by others as well (Beach & Wer- 
the'mer, 1961; Cronbach, 1955; Hastorf, 
Richardson, & Dornbusch, 1958; Shrauger & 
Al rocchi, 1964). Nevertheless, there have 
beon few detailed studies of individual per- 
ce vers’ implicit theories, the study of Dreiser 
bv Rosenberg and Jones (1972) being a re- 


_ cent exception. 


Schneider (1973) also noted that research- 
ers have had relatively little success in relat- 
ing implicit theories systematically to other 
personality variables, although a number of 
studies have found differences in trait attribu- 
tion tendencies among different types of per- 
ceivers. For example, Wiggins, Hoffman, and 
Taber (1969) identified distinct types of 
judges, differing with respect to several per- 
sonality variables, who weighted cues differ- 
ently in judging intelligence. 

Several investigators have proposed that 
perceivers will differ in implicit personality 
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fillment of the requirements for the doctoral degree. 
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Individual Needs and Salient Constructs 
in Interpersonal Perception 


Jerome H. Cantor 
Herbert H. Lehman College, City University of New York 


Salient perceptual dimensions employed by each of 21 subjects were obtained 
by individual factor analyses of subjects’ Role Construct Repertory Test proto- 
cols. An inductive procedure was employed to relate each individual’s per- 
ceptual dimensions to his or her principal needs, as indicated by objective and 
projective need measures. On the basis of the inductive procedure, two rules 
were formulated, providing approximations of the relationships between per- 
ceivers’ perceptual categories and their personality characteristics, These rules 
were tested, and evidence was found for the incorporation of perceivers’ own 
values and standards in their appraisal systems for others, Some characteristics 
of individuals’ implicit personality theories are discussed, 


theories, or trait attribution tendencies, as a 
consequence of their own standing on the 
traits being rated, However, studies by Bene- 
detti, Morgan, and Bessemer (Note 1) and 
Secord, Backman, and Meredith (1962) were 
unable to find direct relationships between a 
perceiver’s possession of particular traits and 
consistent tendencies to attribute those same 
traits to others. 

It may be that perceiver possession of par- 
ticular traits is related to trait attribution 
tendencies in a “nonlinear” manner; that is, 
perceiver characteristics may affect tendencies 
to attribute traits, but not the same traits that 
the perceiver possesses to a significant degree. 
Consider an individual with a high need for 
autonomy. He might not respond differentially 
to others in terms of whether or not he per- 
ceived them as being autonomous, Rather, his 
mean ratings of others, as well as his tenden- 
cies to assume covariation among traits in 
others, might be related to a dominant-au- 
thoritarian versus egalitarian-permissive di- 
mension; that is, to whether he perceived 
others as allowing him to be autonomous or 
not. Thus, dominance-permissiveness might 
be more “central” for him as a trait in others 
and be more highly intercorrelated with other 
traits. And, as a consequence of the saliency of 
this trait in others for him, he might rate 
others on it with greater variability than he 
would rate them on nonsalient traits. 

The present study investigated the nature 
and extent of relationships between salient 
constructs employed in interpersonal percep- 
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tion by individual perceivers and personality 
characteristics (needs) of those perceivers. 
Following Koltuy (1962) it was hypothesized 
that these salient constructs account, in a fac- 
torial sense, for covariation in attributed 
traits within the individual perceivers’ implicit 
personality theories. Because of the way in 
which the constructs were derived—from the 
perceivers’ free-response descriptions—it can 
be proposed that the obtained personality fac- 
tors are at least as valid for those individual 
perceivers as are any others that might be ob- 
tained by factor analyzing their responses on 
preselected trait scales. Several writers (Beach 
& Wertheimer, 1961; Shrauger & Altrocchi, 
1964) have noted that when such scales are 
used, there is no guarantee that the traits 
rated are salient or even relevant for the 
raters, Moreover, as Rosenberg and Jones 
(1972) observed, even if dimensions obtained 
from free-response descriptions could be recast 
into the typically emergent semantic differ- 
ential factor structure, a great deal of indi- 
viduality in dimensional content would be lost. 

In the present study, each perceiver was 
asked to supply constructs for and rate a 
representative group of others on those con- 
structs. A number of studies (e.g., D’Andrade, 
1965; Hakel, 1969; Mulaik, 1964; Norman & 
Goldberg, 1966; Passini & Norman, 1966) 
have shown that differing stimulus character- 
istics of observed persons exert relatively little 
influence on factor structures emerging from 
trait ratings, providing that representative 
trait rating scales are used. Studies within the 
context of Kelly’s (1955) personal construct 
theory (e.g., Bannister & Mair, 1968; Fjeld 
& Landfield, 1961; Pederson, cited in Maher, 
1965) have shown relatively high consistency 
of constructs (traits) and superordinate con- 
structs (factors) elicited in free-response in- 
quiries, even when the figures described are 
varied. In addition, work by Bannister and 
Mair (1968) indicates that the method used 
here, in which 15 constructs were elicited from 
each perceiver, tapped by far the greatest part 
of these perceivers’ implicit theories. 

METHOD 

Subjects 

Subjects were 21 undergraduate and graduate stu- 


dents at Columbia University, 11 men and 10 women. 
All were between the ages of 18 and 27 years, with 
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a mean age of 20.5 years. All subjects were native- 
born Americans who spoke English. None w-re psy- 
chology majors or had ever had experienc psy- 
chological work and none had ever been trea by or 
consulted with a mental health professional were 
paid volunteers and were told that they wer: aking 
part in a study of personality and how pec hink 
about other people. The subject selection crit were 
intended to minimize idiosyncrasy in percep cate- 
gories due to cultural variation, psychologic: ilad- 
justment, or exposure to “professional” categ ition 
systems, 

Measures and Procedures 

Salient perceptual dimensions. The percep cate- 
gories employed by subjects were obtained leans 
of a modification (Mehrabian, 1968) of Role 
Construct Repertory Test (Reptest) origina’! intro- 
duced by Kelly (1955). As individually pri d to 
subjects here, the Reptest consisted of thr. parts. 
First, role descriptions were supplied for 15 ficant 
and representative persons likely to have 1 en- 
countered by subjects, including persons lik to be 
evaluated unfavorably as well as favorably. sub- 
jects provided the names (first name, la vitial) 
of the actual people in their own lives most 
closely fitted the role descriptions. 

Second, these names were then presented eata 
time to the subjects in 15 sorts. The subj were 
asked to say, in a word or phrase, one was which 
two of the people were alike yet differen m the 
third. An inquiry was made at the time « minis- 
tration to make explicit the connotative m igs of 
the phrases used by the subjects. 

In the third part of the test, the sul s each 
listed their 15 people and 15 constructs. n they 
rated all of the people on all of the const s using 
a 7-point scale. The subjects assigned scor: »ccord- 
ing to the degree to which their people w harac- 
terized by the construct given. This procedu: yielded 
a 15 X 15 person-construct matrix for eac! rject. 

In the subsequent analysis, persons wi disre- 
garded, and 15 X 15 intercorrelation matr! es were 
obtained for each subject’s set of ratings on his con- 
structs. A principal-components factor sis was 
then performed separately on each subj matrix. 


Factors with eigenvalues greater than 1.00 were de- 
termined, and the others were dropped from the 
analyses. The resulting factor structures were then 
rotated to the varimax criterion. 

Three to five factors were obtained for each sub- 
ject. The mean percentages of variance in individual 
subjects’ construct systems accounted for by the sev- 
eral factors were as follows: first factor, 41.1%; sec- 
ond factor, 20.5%; third factor, 11.9%; fourth fac- 
tor, 9.5% (n=15); and fifth factor, 7.4% (n= 6). 


1 In the interest of brevity, details of the individual 
factor analyses have been omitted. In addition, only 
three examples of the perceptual dimensions generated 
by individual subjects are included here. Additional 
details on these may be obtained from the author. 


Kelly’s (1955) theory, the factors ob- 
analyses can be considered “super- 


ordinate” ructs in that they subsume the con- 
structs on which the subjects made their 
ratings. ] igs of original constructs on the ob- 


tained su dinate constructs were at times quite 
high, re: 70 to .90 in many instances. The 
superord onstructs identified for subjects were 
consider constitute their salient perceptual di- 
mension: iese were interpreted in terms of the con- 
structs t oaded on them, and short descriptions of 
each su t's salient dimensions were prepared. These 
descript: is closely adhered to the original phrases 
used by subjects, An example is given below: 
Ti rimary dimension which Carolyn uses in 
eval: ing people is whether they are highly mo- 
p tiva ambitious, persistent, and intellectually 
= cur 5, People perceived as being this way are also 
seer as enjoying reading and as having a high 
lev. of confidence. To some extent, they are also 


see’ as being positive and dogmatic. Carolyn does 
no. define the opposite pole of this dimension, 

\ second dimension which Carolyn employs is 
v ether people are kind and open, as opposed to 
F ing distant and aloof. Those seen as kind and 
(pen are seen as having the ability to listen with 
anderstanding. They are also seen as being aware 
and perceptive, People seen as distant and aloof are 
also characterized by a harsh manner, and by 
coming on strong. 


eed measures, The Edwards Personal Preference 
Schedule (EPPS) was used as one measure of the 
sul ‘ects’ individual need patterns. It is a 225-item 
for -d-choice inventory providing an assessment of 
15 ajor needs identified by Murray (1938). The 


EPF was designed to minimize the influence of social 
desir jlity in self-reports of needs. In the present 
“stud: the EPPS was administered to each subject to 
dete: ‘ne his or her five prepotent needs. These were 
the i)» EPPS variables on which the subject ranked 
highe in percentile terms relative to the norms for 
college tudents (Edwards, 1959). 

The hematic Apperception Test (TAT) was used 
to pro. te another independent assessment of needs. 
This p: ‘ective technique was employed to aid ex- 

‘Ploratic:. for relationships between perceptual cate- 
gories a: i needs, Ten TAT cards were administered 
to each ¿f the subjects with standard instructions. 


Cards 1, 2, 3BM, 4, 6BM, 7BM, 10, 13MF, 14, and 


20 were us. for the men, Cards 3GF, 6GF, and 12F 
Were substituted for 3BM, 6BM, and 7BM in the 
Administrations to women subjects, The other cards 


" Were the same, 
„The TAT protocols were evaluated by four clini- 
an-raters in terms of the same 15 need variables as- 
Sessed by the EPPS, Using global assessment of entire 
Protocols, each rater determined the five needs most 
Salient for each subject. The five needs most fre- 
quently chosen by the raters for each subject were 
considered the consensus TAT needs for that subject. 
HS consensus was used as the TAT measure of 
Subjects’ needs in the study, 
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Reliability of the obtained TAT needs was evalu- 
ated by determining concordance of raters on needs 
of individual subjects. An interagreement matrix 
among raters on 105 needs (21 Subjects X 5 Needs) 
yielded an overall interagreement percentage of 
56.0%: 353 actual agreements out of 630 possible 
agreements (105 Needs X 6 Comparisons Among 
Raters). 

In summary, use of both the EPPS and consensus 
TAT provided two sets of 105 needs, 5 for each sub- 
ject on each measure. 

Exploratory procedure. Relationships between sub- 
jects’ salient perceptual dimensions and their need 
patterns were then determined by an exploratory in- 
ductive procedure. A panel of five clinical psycholo- 
gists was given the subjects’ perceptual dimensions, 
and each clinician was asked to infer each subject’s 
needs from his or her dimensions. The clinicians in- 
ferred 5 needs for each subject, selecting needs from 
the EPPS list of 15, Successes by the clinicians in in- 
ferring needs against the criterion need measures were 
then examined and the inference procedures they suc- 
cessfully used for their predictions studied. Analysis 
of the “practitioner’s rule of thumb” was recently 
advocated by McGuire (1972) as a way of generating 
hypotheses. 

On the basis of the clinicians’ results, two “rules” 
were formulated relating perceptual dimensions and 
needs. These rules were then tested by having two 
other largely naive panels repeat the inference task, 
one using the rules and the other left to its own 
devices. 


RESULTS 
Accuracy of Clinicians’ Inferences 


Since the clinicians used only the subjects’ 
perceptual dimensions in making their infer- 
ences, the extent to which they were able to 
predict subjects’ needs against the criterion 
measures provided an estimate of the “re- 
latedness” of dimensions and needs. For each 
subject, the 15 needs described by Edwards 
were considered a sample (N) of which 5 had 
the characteristic (k) of being associated with 
high percentile scores on the subject’s EPPS. 
Each clinician’s choice of 5 needs was con- 
sidered a sample (n) drawn from (N) of 
which (x) choices were successes, that is, cor- 
responded to high EPPS needs. The variable 
(x) was hypergeometric. The same model was 
applied to successes in predicting TAT needs, 
substituting the 5 consensus TAT needs for 
the 5 high EPPS needs. Since the predictabil- 
ity of needs of the total 21-subject group was 
of primary interest, successes on individual 
subjects’ needs were summed across all sub- 
jects. For each clinician, success in need pre- 
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TABLE 1 
SUCCESSES BY CLINICIANS IN INFERRING NEEDS OF 
SUBJECTS AGAINST CRITERION MEASURES 
TAT EPPS 
Clinician x A x s 
1 50 FOE 32 —.74 
2 43 1,96** 38 74 
3 39 98 44 ZZ Ore 
4 AL 1.47 40 1.22 
5 42 1.72* 38 74 
Consensus 42 1,72* 43 1.96** 
TAT = — 36 24 
Note, TAT =Thematic Appreception Test; EPPS 


= Edwards Personal Preference Schedule; x = the number of 
successful inferences made by clinicians, 
* p < 05, one-tailed test. 
** p < .03, one-tailed test, 
*#t p < ,02, one-tailed test. 
werk p < 001, one-tailed test, 


diction across subjects was evaluated by the 
statistic: 
a- Ez) 


fA E ra 


y/ var 
where var 7 = (yar x/21) with (x) hyper- 
geometric, For a sample size of 21, the statis- 
tic z was approximately normally distributed. 

Z scores were computed for individual 
clinicians’ total successes against each need 
measure, In addition, the clinicians’ consensus 
was treated as a sixth clinician, and z scores 
were computed for successes by taking the 
consensus against each measure. The total 
number of needs predicted by each clinician 
and the consensus was 105, Across all needs, 
the probability of successful prediction by a 
clinician was 1/3, and the expected number 
of successes 35. Results of the clinicians’ in- 
ferences are given in Table 1. A comparison 
between the two need measures is also in- 
cluded. 


? 


Clinicians’ Inference Procedures 


It was hypothesized that inference “rules” 
used by clinicians that led to successful pre- 
diction would mirror the ways in which needs 
and perceptual categories are actually related. 

Most of the clinicians relied on a general 
“saliency” criterion in making their infer- 
ences. If a subject gave attention to a par- 
ticular concern in his dimensions, for example, 
the extent to whicl `r people were domi- 
nant, then the subject was presumed to have 
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some “corresponding” need, sucli as either’ 


dominance or deference. While this saliency 
criterion was often too indefinite to lead to 
successful prediction, a more resed version 
of it was employed with somewhat greater 
success by Clinician 3, who predicted EPPS 
needs at the .02 level. This clinic: 1 inferred 
needs on the basis of predomina: concerns, 
but particularly when these con seemed 
to express positive values for t subjects, 
His theory was that “we value anc regard as 
important in others what we valu | regard 
as important in ourselves.” Hence, erceivers 
were likely to appraise others in ter. of these 
characteristics. 

Clinician 1, who predicted TAY needs at 
the .001 level, used the perceptual © :tegories 
to construct “psychodynamic mols” fori 
each subject. He then inferred the subjects’ 
needs from the intervening mode ver than 
“directly” from the dimensions. His t -ocedure 
is illustrated by his handling of the | ollowing 
subject’s categories: 

Peter’s primary dimension involves whe!!cr people 
are members of the liberal upper class, in lated by 
wealth, or whether they are politically »-actionary 
rednecks. The first group of people is < as being 
interesting and intelligent. They are articulate and 
wise and one can learn from them, They are also 
seen as worldly and cosmopolitan and e an ait 
of success and knowledge. Also, they are outwardly 


Successful and content and have analytical, sharp 
minds. The politically reactionary rednecks, on the 
other hand, are seen as narrow-minded and conserva- 
tive, with a limited perspective. They are middle class 
and traditionalist, 

Peter’s second dimension is whether people are fol- 
lowers, without initiative, or whether they are’ 
principled and independent. The followers are seen as 
secking leaders and as being helplessly dependent. 


Clinician 1 reasoned that Peter was añ 
“authoritarian personality” who “identified 
with the aggressor.” The “liberal upper class” 
represented a group that he could “apprentice 
himself to” and eventually become a part of. 
On this basis, Clinician 1 correctly predicted 
four of Peter’s five TAT needs (p < .02): | 
dominance, aggression, autonomy, and exhibi- 
tion. ; 

Another procedure used by the clinicians in- 
volved considering subjects’ perceptual cate- 
gories as having direct relevance to the fulfill- 
ment of their needs. Subjects were seen as 
sometimes appraising others in terms © 


others were receptive or com- 
them. Thus, subjects who 

ie in terms of their kindness or 
often correctly seen by the 
having a need for succorance. 
> categories focused on whether 
controlling or inhibiting were 
tly seen as having a need for 


whether t' 
plementar 
evaluated 
empathy 
clinician: 
Subjects 
others \ 
often c 
autonon 


Som« rect inferences were made by the 
clinicia: on the basis of “stylistic” rather 
than so ‘antive aspects of the subjects’ cate- 
gories, © example follows: 

One ; of Lyvia’s primary dimension is defined 
by the owing characteristics: being knowledgeable, 
both a \emically and intellectually; being painstak- 
ing, ex biting academic thoroughness and precision; 
being holarly and dedicated to academic pursuits; 
being edicated, to concepts, causes, and ideologies; 
havin self-discipline, accomplishing things that have 
fo g done; having great academic intelligence; and 


beine articulate, having the ability to speak precisely 
and intellectually, being expressive. Lyvia does not 
define the opposite pole of this dimension. 

Several clinicians reasoned that the pain- 
taking and diligent attributes described in 
thi: subject’s primary dimension had an over- 


dor: “atonement” quality, They inferred that 
she iad a need for abasement, which was in 
fact ¿er highest TAT need. 
Testi. v of Inference Rules 
Tw. rules of inference were abstracted from 
the cli cians’ procedures. These were as fol- 
lows: ‘Je 1: If a person places a strong posi- 
live v ve on some characteristic in other 
people. ` + or she is likely to have a need cor- 
,tespon’ ‘+ to (similar to) that characteristic. 


Rule 2: People appraise others in terms of 
ose others will be receptive to them 

{fill their needs. 

member panels, consisting largely 
‘veshmen in an introductory psy- 
J chology co irse, were used to test the rules by 
tepeating the same need inference task that 
the clinicians had performed. One panel was 
given the rules with some illustrations of their 
‘pplication. The second panel was given in- 
Sttuctions orienting them to the task but giv- 
§ no hint as to how they were to infer 
Needs,? Results for the two groups are given 
ih Table 2.3 
Successes of members of all three inference 
nels—rules” panelists, “nonrules” panel- 
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TABLE 2 


SUCCESSES BY “RULES” AND “‘NONRULES’’ PANELISTS 
IN INFERRING SUBJECTS’ NEEDS AGAINST 
CRITERION MEASURES 


TAT EPPS 
Panelist x z x z 
Rules 
1 37 49 43 1,96** 
2 36 24 41 1.47 
3 43 196%". 47. 2,94%% 
4 42 1:72" 45 2.450" 
5 37 49 42 1.725 
Consensus 42 1.72* 48 rM En 
Nonrules 
1 47 2.94** 29 —1.48 
2 37 49 45 2.450" 
3 39 98 38 74 
4 39 98 40 1,22 
5 40 1.22 37 49 
Consensus 41 1.47 38 74 
Note. TAT = Thematic Appreception Test; EPPS 


Prel ference Schedule; x = the number of 
e by panelists, 


= Edwards Pi 


+t p < L001, one-tailed test, 


ists, and clinicians—against each need mea- 
sure were compared by means of the Mann- 
Whitney U test. Against the EPPS, rules pan- 
elists achieved more successes than nonrules 
panelists (n = 5 for each group; U = 3, p< 
.03) and also surpassed clinicians (n = 5 for 
each group; U = 3, p < .03). The clinicians 
did not differ from nonrules panelists against 
this measure. With respect to the TAT, no 
group differed significantly, although the com- 
parison for clinicians with both student groups 
was U = 5, p < .08. Use of the rules thus ap- 
peared to aid in the prediction of subjects’ 
needs as evaluated by the EPPS but not as 
those needs were determined by the TAT. 


Evaluation of Need Measures 


The differing utility of the inference rules 
for prediction against the two need measures, 


2 The instructions to panelists have not been in- 
cluded here for the sake of brevity. They are avail- 
able from the author. 

3 The “rules” panel consisted of three students and 
two psychologists (Panelists 4 and 5). The psycholo- 
gists were included to ensure that the rules could be 
applied equally well by trained and untrained indi- 
viduals. Their performances appeared indistinguishable 
from those of the student “rules” panelists. The “non- 
rules” panelists were all students. 
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TABLE 3 


SALIENT NEEDS OF SUBJECT: 
BY EPPS ann T4 


As INDICATED 
le 


Need EPPS TAT 


Nurturance 10 
Achievement 14 
Endurance 7 
Change 
Dominance 
Succorance 
Autonomy 
Order 
Tntraception 
Aggression 
Exhibition 
Deference 
Affiliation 
Abasement 
Heterosexuality 


omnube onp 


ANS eahweeconnnrare 


Note. EPPS = Edwards Personal Preference Schedule; 
TAT = Thematic Apperception Test. 


and the fact that the need measures identified 
needs independently of each other (see Table 
1), suggested comparison of these “criterion” 
measures with each other. The five needs ob- 
tained for each subject on each measure were 
summed across subjects. For the total subject 
group, discrepancy scores were obtained that 
simply consisted of the number of subjects 
having each EPPS need as a salient one minus 
the number of subjects having the same need 
as a salient one according to the TAT. The 
data, ranked by the discrepancies, are given 
in Table 3. 

These data suggest that the EPPS tended to 
overidentify socially desirable needs, such as 
nurturance, achievement, and endurance, in 
comparison with the needs identified by the 
TAT. Similarly, in comparison with the TAT, 
the EPPS tended to underreport socially ques- 
tionable needs such as sexuality, self-criticism 
(abasement), and possibly passivity (defer- 
ence). Regardless of the validity of either 
need measure, it does appear that EPPS needs 
were more likely to be congruent with the 
personal ideals held by the subjects. 

Rule 1 instructed panelists to scan subjects’ 
perceptual dimensions for positively valued 
characteristics in others and to predict subject 
needs similar to those characteristics. Since 
EPPS needs likely to be congruent with sub- 
jects’ ideals were in fact predicted with some 
success, it appears that to some extent the 
perceptual dimensions incorporated or re- 
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flected ideals that the subjects held for o:hers 
and for themselves. Subjects, to some degree 
at least, appraised and evaluated others in 
terms of values and standards they aspire: to 
in their own lives. 

While perceptual dimensions were apn=r- 
ently reflective of subjects’ personal vas 
and ideals, the extent to which they v-e 
instrumentally used to satisfy subjects’ <s- 
sentially “receptive” needs is less clear, In 
some instances, dimensions were used to ic n- 
tify characteristics in others that were ^- 
tentially need fulfilling for perceivers. I v- 
ever, neither the EPPS nor the TAT prov. -d 
a sufficient appraisal of subjects’ recep ve 
needs. Hence, the extent to which percep ul 
dimensions incorporated criteria relevar’ to 
meeting these needs could not be determ od. 


Discussion 


This study indicated that among th 
terminants of impressions formed are er- 
ceptual dimensions related to perce’ ers’ 
motivational characteristics. Although `` ap- 
pears well-established that perceiver’ im- 


de 


plicit personality theories contribute re- 
sultant impressions (e.g, Hakel, 169; ~ 
Hamilton & Gifford, 1970; Mulaik, 1964; 
Passini & Norman, 1966; Wiggins, Ho mnan, 
& Taber, 1969; Tupes & Christal, Nos 2), 
the extent to which they do so is as yet © «gely 
undetermined. The relative contributions of 
stimulus characteristics of observed po sons, 
interaction contexts, perceiver sets, ano con- 
stant perceiver processes have not bees de- 
lineated. 
The inferential finding here that perceptual 
dimensions tend to reflect perceivers’ values 
; are 


and ideals suggests that such dimensio! 
especially instrumental for social interaction. 
Insofar as attitudinal similarity is a principal 
basis for interpersonal attraction (e.g., New- 
comb, 1961), then rewarding social interac- 
tion would be facilitated by the discovery of 
others who share the perceivers’ orientations. 
Perceivers can be considered as more or less 
continually evaluating others for compati- 
bility. Hence, they would tend to incorporate 
into their perceptual dimensions characteris- 
tics that made for compatibility, namely, ones 
reflecting their own attitudes and values (€.8. 
the desire for achievement). 


f he examples given above suggest, ap- 
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prai+ is of others may involve highly indi- 
vidu lized systems of categorization, with a 


gre: diversity existing among perceptual di- 
me: ions regarding both content and tone. 
Th. esearch method used here contrasts with 
tho attempting to discern highly abstract 
cov :nonalities in perceivers’ ratings of others. 
Th re is no reason why the study of per- 
ce) ors? implicit personality theories should 
no simultaneously proceed along both idio- 


grephic and nomothetic lines. However, at 
tives, dimensions derived from group data 
h: ve been tacitly assumed to represent indi- 
v luals’ dimensions as well. The highly indi- 
y dualized dimensions revealed in the present 
study suggest that this assumption may not 
be useful, 
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Differential Response to Anticipated Crowding: Psychologic: 


Effects of Social and Spatial Density 


Andrew Baum and Stuart Koman 
Trinity College, Hartford, Connecticut 


In order to assess the different psychological processes mediating perceptions 
of large numbers of people and inadequate space and to identify different 
behavioral responses to social and spatial density, an experiment was conducted 
examining the independent and interactive effects of anticipated social and 
spatial density. It was predicted that when stimulus features of the setting 
aroused perceptions of large numbers of people, subjects would not respond to 
spatial variation but would be more immediately concerned with consequences 
of high social density. It was also predicted that the expectation of social 
Structure would moderate response to social density but would not influence 
responses to spatial density. Furthermore, it was predicted that responses to 
high social densities would be oriented toward withdrawal, while response to 
spatial density, mediated by sex of subject, would be more aggressive. Data 
collected generally confirmed these expectations, suggesting that social and 
spatial density have consequences for mood and behavior that operate accord- 


ing to different dynamics. Implications of these findings were discussed. 


The expanding literature on crowding and 
human behavior is characterized by what ap- 
pear to be inconsistent experimental findings 
and conceptual systems. A major cause of this 
confusion is a general failure to distinguish 
between the psychological effects of social 
and spatial density. The present research is 
concerned with a more adequate distinction 
between the antecedent conditions and conse- 
quences of social and spatial density, 

Density, crowding and pathology are ele- 
ments of a complex interpersonal process 
characterized by a syndrome of stress and 
related coping behaviors (cf. Stokols, 1972). 
Density is best used to describe the physical 
conditions established by variable amounts 
of space and numbers of people; as density 
increases, an individual may be confronted 
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with larger numbers of others and/or w 
diminishing amounts of space (Saegert, N 
1). The experience of crowding and sub 
quent behavioral response are thus affecic’ 
some combination of the social and 

conditions of high density. Spatial inad 

may be experienced as “being cramped nd 
may result in arousal and response best ed 
for the attainment of goals in small spa es. 
In addition, activity in small spaces may ùe 
characterized as arousing to the extent ihat 
people get in each other’s way and constrain 
behavior (e.g, Sundstrom, 1975) or sre 
forced to maintain inappropriate interaction 
distances and invade each other’s persona! 
space (Worchel & Teddlie, 1976). The cou 
sequences of diminishing space are at this 
point similar to those of increasing numbers 
of people, as they both threaten interpersonal 
control; frequent, unwanted, and uncontrol- 
lable interaction is likely to induce stress as 
groups grow larger (e.g., Baum & Valins, in 
press; Calhoun, 1970; Milgram, 1970) and 
is associated with coping responses directed 
toward minimizing social contact or reestab- 
lishing control over social encounter. Al- 
though the physical conditions of high density 
can cause all of these psychological conse- 
quences, specific effects of density are medi- 
ated by the nature of the physical setting 
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DIFFERENTIAL RESPONSE TO ANTICIPATED CROWDING 


(ex, arrangement of space, room size), the 
exis ng social environment (e.g., group size, 
cob siveness, goals), and the individual (e.g., 
pe onal space requirements, tolerance for 
fr- ‘ration). As Stokols (1972) has noted, 
tl experience of crowding varies with these 
si ational and psychological mediators. When 
t) salient stimulus features of high density 
a use perceptions of inadequate space, the 
e veriential and behavioral effects should be 
c ‘erent than those caused by stimulus fea- 
tres promoting frequent interaction with 
] :ge numbers of people. 
This analysis suggests that studies varying 
vnsity by means of manipulating room size 
reate conditions, defined as spatial density, 
hat are different from those created by stud- 
ies varying density by manipulating group 
e and social density (cf. Loo, 1973), Ex- 
perimental manipulations of room size have 
yielted a fairly consistent pattern of sex dif- 
ferences, which suggests that men respond 
more aggressively to spatial limitation than 
do women (Epstein & Karlin, 1975; Freed- 
man, Klevansky, & Ehrlich, 1971; Freedman, 
evy, Buchanan, & Price, 1972; Ross, Lay- 
in, Erickson, & Schopler, 1973; Stokols, 
all, Pinner, & Schopler, 1973). There is 
| mited evidence of aggressive response 
toward others when space is restricted 
okols et al., 1973; Sundstrom, 1975) and 
nger evidence that both males and fe- 
les experience physiological arousal and 
d comfort in small rooms (Epstein & Karlin, 
1°75; Hopstock, Note 2). Furthermore, per- 
ceptions of crowding in settings characterized 
by spatial conditions of density are influenced 
by properties of the setting such as color, 
complexity, and arrangement of space, which 
either make the setting appear larger or di- 
veri attention from the consequences of in- 
adequate space (Baum & Davis, in press; 
Desor, 1972; Worchel & Teddlie, 1976). 
Studies of social density, which emphasize 
the social consequences of high density by 
varying group size, report somewhat different 
findings. Males and females respond compara- 
bly to high social densities (Baum & Valins, 
in press), but their behavior does not reveal 
evidence of aggressive response to large num- 
bers of others. Rather, these studies report 
evidence of stress, negative interpersonal af- 
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fect, active avoidance of others, and social 
withdrawal (Baum & Greenberg, 1975; Baum 
& Valins, in press; Griffitt & Veitch, 1971; 
Saegert, in press). Responses to high social 
density do not appear to be influenced by 
properties of the setting related to percep- 
tion of its spatial adequacy but are mediated 
by social structuring that reduces the proba- 
bility of social interaction or that allows in- 
dividuals to maintain adequate regulation of 
their social experiences (Baum, Harpin, & 
Valins, 1975; Schopler & Walton, Note 3). 
The primary mediators of experience and 
behavior in settings that are characterized 
by high social densities are variables influenc- 
ing the nature, frequency, and intensity of 
social contact and affecting individuals’ abil- 
ity to regulate social interaction. 

This discussion of relevant literature sug- 
gests that the salient stimulus properties 
characterizing conditions of social and spatial 
density have different consequences for mood 
and behavior. Experiential and behavioral re- 
sponses to high density vary with the nature 
of perceived antecedent conditions. Aggres- 
sive behavior, for example, may enable an 
individual to control more space and thereby 
reduce stress induced by high spatial density. 
Interpersonal aggression, however, may be 
accompanied by increased social interaction 
and thus may not be an effective way to deal 
with stress aroused by the presence of large 
numbers of others, Withdrawal and passive 
response, on the other hand, seem to represent 
effective coping strategies in socially dense 
situations. 

Evidence of the differential processes of 
crowding caused by social and spatial ante- 
cedents is limited. Few studies have indepen- 
dently manipulated room size and group size, 
and these studies have encountered unfore- 
seen problems related to extreme room sizes 
(Marshall & Heslin, 1975; Rall & Saegert, 
Note 4). The present research independently 
varied group and room sizes using the method 
developed by Baum and Greenberg (1975) 
in studying the perceptual and behavioral re- 
sponses of subjects anticipating crowding. 
Subjects expecting to participate in an ex- 
periment with nine other people, regardless 
of room size, should experience more discom- 
fort, perceive more crowding, assume more 
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isolated seat positions, and should avoid so- 
cial interaction with others as the group be- 
gins to gather when compared with subjects 
expecting only four other participants, In 
addition, anticipated group structure, manip- 
ulated by informing half of the subjects that 
the experimenter would appoint a group leader 
and would describe rules governing social 
interaction, was predicted to interact with 
anticipated group size. It was hypothesized 
that the stress and social withdrawal induced 
by the anticipation of large group size would 
not be generated when social structure was 
expected; because structuring reinforces in- 
dividuals’ ability to regulate social interac- 
tion, these subjects should be not concerned 
with frequent interaction and social overload. 
Similarly, subjects anticipating participation 
with small groups should not anticipate so- 
cial overload. As a result, structure expec- 
tancy should not influence the affective or 
behavioral responses of subjects expecting to 
participate with only four others, 

It was also hypothesized that the effects of 
spatial density would be apparent only when 
anticipated group size was small. When await- 
ing small numbers of others, subjects in small 
rooms should report more crowding and dis- 
comfort than those waiting in larger rooms. 
This manipulation of spatial density was ex- 
pected to produce effects similar to those 
typically reported by studies observing small 
groups in small rooms: Rather than with- 
drawing, men in this condition should mani- 
fest more aggressive behavior by assuming 
central seat positions, while women should 
not respond to varying room size. Consistent 
with relevant literature, sex of subject should 
not interact with anticipated group size, and 
anticipated social structure should not inter- 
act with room size. Verification of these hy- 
potheses will provide support for the notion 
that the consequences of density vary in ac- 
cordance with the differing stimulus features 
of the setting and that the psychological pro- 
cesses associated with salient social and spatial 
antecedents differentiate between the two. 


METHOD 


Subjects 


Subjects were 32 male and 32 female undergradu- 
ates at Trinity College who received $1.50 for par- 
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ticipating. Subjects were recruited by te’ 
after having been selected at random from the 7 
ity College student directory. 

A college-age male experimenter conducted ‘he 
study in the laboratory, In addition, two colo c- 
age males served as confederates in each session v» h 
male subjects, and two college-age females sery:d 
in a similar role with female subjects. 


Design 


In order to assess the relative effects of room 
size, anticipated group size, and social structu c, 
two levels of each variable were crossed with (e 
sex of subjects in a 2X 2 X 2X2 experimental d 
sign, Subjects were told to expect a 5-person grow > 
session or a 10-person group session in either 
small or large room, and half of the subjects 
each condition were told to expect a structured 
sion in which a group leader would be appoin 
and rules for session behavior would be 
by the experimenter. No mention of social si: 
was made in sessions involving remaining s 
In each condition, half of the subjects were mo! 
and the other half were females, With this excr 
tion, condition assignment was randomly determin: 

Six major dependent variables were assessed: 
spatial position taken by subjects in the room, | 
amount of facial regard between subjects and « 
federates, the number of conversations initiated 
subjects, subjects’ evaluations of the experime: 
room, their anticipation of the session, and 
reported discomfort in the setting. 


Procedure 


As with previous studies of anticipated crow 
(Baum & Greenberg, 1975), the procedure for this 
experiment was directed toward convincing 
jects that they would be participating in groups « 
previously determined sizes and toward maintaini 
this expectation during data collection, Arriving 
alone, subjects encountered the experimenter, who 
was seated at a table next to a large white partition 
that divided the room into two sections. Thc 
tition was positioned at one of two places c: 
sponding to the size of room variable. The room in 
which the experimenter was seated was furnis 
with a second, larger table, on which were s 
tered either 5 or 10 envelopes and 5 or 10 pencils. 
In addition, a box of clipboards was placed on the 
table where subjects were sure to see it, and a 
matrix for an N-person bargaining game showing 
either 5 or 10 participants was attached to the par- 
tition, After being greeted by the experimenter, 
subjects were asked their names and were checked 
off on a list on the table in front of the experi- 
menter. This procedure was exaggerated to ensure 
that subjects were cognizant of the fact that there 
were either 5 or 10 same-sex participants listed. 

Once this had been accomplished, subjects were 
told that they would be participating in a group 
experiment, that they were the first to arrive, and 
that either four or nine other subjects were coming 
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Ficure 1. The experimental room, with seat posi- 
ions numbered from 1 (peripheral) to 6 (central). 


» the session. In addition, half were told that once 
eryone had arrived, a group leader would be ap- 
ointed and rules governing behavior in the session 
vould be explained. Subjects were then asked to 
leave their coats and books with the experimenter 
and were told that the experiment would begin as 
soon as the others had arrived. They were then 
shown into the experimental room created by the 
partition and asked to have a seat anywhere they 
wished. 
The experimental room was either large (2.7 X 4.6 
n) or small (15 4.1 m), affording participants 
ith 12.42 m® or 6.15 m? and was completely de- 
vid of decor or furnishings except for 10 straight- 
lock chairs arranged in an inverted U (see Figure 
Tn all conditions chairs were positioned so that 
¿e were approximately 15 cm between them, ex- 
for the chair at the bend of the U, which was 
roximately 60 cm from the nearest chairs. The 
{. wall was a solid one-way mirror from about 
1 off the floor to the ceiling, and subjects could 
se a heavy black curtain on the other side of the 
gi that appeared to be completely closed, Actually, 
the curtain was open far enough to permit observa- 
ticu of the room from an adjacent laboratory. The 
l was created by the partition, which could 
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be ved so as to vary the width of the experi- 
menial room, and the narrow wall was defined by 
a movable partition with which the length of the 


room was varied. Subjects were instructed to sit in 
any seat they wished, and in the 5-person condi- 
tions, the presence of 10 chairs was explained in 
terms of another experimental condition, The ex- 
enter did not enter the experimental room at 
ime but merely ushered subjects into the room. 
e waiting alone in the room for 2 minutes, 
jects heard the first confederate arrive in the 
other room, The greeting and instructions to the 
first confederate were the same as those for the sub- 
jects and could be clearly heard in the experimental 
room. As with the subjects, the first confederate 
was shown into the experimental room and was told 
to take a seat anywhere he wished. The confederate 
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then took a seat two chairs away from the subject. 
After 2 minutes of waiting together, subject and 
confederate heard the second confederate arrive, 
The second confederate was also greeted and in- 
structed in the same way as were the subjects, and 
after entering the experimental room, he selected 
a seat so that the first confederate was between 
him and the subject. 

After an additional minute of waiting, the experi- 
menter entered the room with three clipboards, 
explaining that there were several preexperimental 
forms that had to be completed and that while they 
were waiting for the other subjects to arrive, the 
three already present could begin answering them. 
The first sheet was blank, lined paper on which 
participants were asked to number one line for each 
person in the experiment for use once the session 
had begun. The sheets that followed were a series 
of questionnaires asking participants to describe their 
own states at the time and to rate the experimental 
room and their expectations regarding the experi- 
mental setting. The first form was used to assess 
subject discomfort and was explained as a preexperi- 
mental state assessment used to determine personal 
arousal levels and other personal variables that could 
influence the data. The second questionnaire, assess- 
ing subjects’ perceptions of the experimental room, 
was explained as an attempt to control for possible 
environmental influences on the data. The third 
form, assessing subjects’ expectations regarding the 
nature of the experimental session, was explained as 
an attempt to control for different subject expecta- 
tions. Once all of the questions related to filling out 
these forms had been answered, the experimenter 
left the room and allowed the participants to com- 
plete them. Upon completion of the questionnaires, 
the experimental session was terminated, and sub- 
jects were thoroughly debriefed. 


Observation 


As soon as subjects were seated in the experi- 
mental room, an observer began observation through 
the two-way mirror, During the first 2 minutes of 
observation, subjects remained alone, and notes were 
made concerning seat position and body orientation, 
With the arrival of the first confederate in the ex- 
perimental room, the observer began continuous ob- 
servation of subjects for 2 minutes, noting facial 
regard, verbal behavior, and changes in body orienta= 
tion, With the arrival of the second confederate, 
the observer began a third minute of continuous 
observation, terminating observation when the ex- 
perimenter entered the room with the clipboards, 
The observer was not aware of experimental con- 
dition during observation. 


Confederates 


Confederates in this study played constant roles: 
The first male confederate was always the same 
person, as were the first and second female and male 
confederates. Behavior in the experimental setting 
was standardized, Confederates were instructed to 


530 


behave in a consistent manner with each subject, 
responding minimally to statements or questions by 
subjects. In addition, they were cautioned against 
jnitiating eye contact or conversation with subjects. 
Finally, confederates were told to look around the 
room when “completing” the questionnaires and to 
behave as if they were actually working on the 
questions. 


Dependent Variables 


During observation, data regarding subjects’ seat 
position, verbal behavior, and the amount of facial 
regard by subjects were recorded as dependent vari- 
ables. Seat position was determined by numbering 
chairs in descending order from the head of the U 
(see Figure 1). Higher “value” seats were considered 
to be more socially central, aggressive positions, 
whereas lower value seats were seen as more indica- 
tive of withdrawal or removal from the social cen- 
ter. Facial regard was defined as subject's looking 
at the face of confederates, and instances of talk- 
ing were defined as subject-initiated topic changes 
or comments following at least 10 sec of silence. 
Subjects’ comfort in the experimental setting was 
assessed on a 7-point scale as an item on the per- 
sonal state assessment questionnaire, Ratings of sub- 
jects’ anticipation of the experimental session were 
obtained by asking subjects to describe on a series 
of 7-point scales how confusing and interesting they 
expected the session to be. Finally, impressions of 
the experimental room were obtained by asking sub- 
jects to rate the room on 7-point scales along eight 
dimensions; subjects rated the room in terms of 
its temperature, how pretty it was, its illumination 
level, stuffiness, size, adequacy, and in terms of 
how crowded it appeared and how much physical 
discomfort it caused them. 


RESULTS 


Four-way analyses of variance were per- 
formed on most of the data collected, with 
the individual subject as the basic unit of 
analysis, Self-report data, assessed on 7-point 
(1-7) scales, are reported as means with 
higher values indicating more of the dimen- 
sion being measured. Subsequent comparison 
of interaction means was based on the pro- 
cedures suggested by Tukey (Myers, 1966). 


Manipulation Checks 


The manipulations of anticipated group 
sizes, spatial density, crowding, and social 
structure appear to have been successful. 
When asked to number a line on their sheets 
for each participant in the session, 91% of 
the subjects anticipating 5-person sessions 
numbered lines from 1 to 5, while 84% of 
those expecting 10 people in the session num- 
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bered lines from 1 to 10. Of those numbering 
inappropriately, 1 failed to include himself 
(numbering from 1 to 9) while all other 
misunderstood the instructions and did 
number any lines, These data indicate t) 
the manipulation of anticipated group 
was successful. 

Data regarding perceived size of the 
perimental room indicated that the manipu 
tion of room size was effectively perceived | y 
subjects. Those participating in the lary 
room rated it significantly larger (X = 3.5 
than did subjects waiting in the smaller roo 
(X = 1.97), F(1,48) = 38.06, p < 0 
Similarly, the small room was rated as mo 
inadequate than the large room (small 5 
3.5, large X = 2.62), F(1, 48) = 5.09, 

.05. Finally, subjects’ perceptions of 
structure manipulation influenced their s 
jective ratings of the expected session. Sı 
jects who were told that a group leader wou 
be appointed and that rules for the sessi 
would be presented reported that they íi 
the session would be less confusing (5 
2.56) than did subjects who were told nı 
ing about the structure for the experim: 
session (X = 4.44), F(1, 48) = 31.05, / 
001. 

Since both rooms were relatively sma! 
was assumed that people would anticip. * 
crowding with nine others in the room, wh 
anticipating crowding with only four oth 
in the small room. To assure that this as- 
sumption was valid, subjects were asked | 
indicate how crowded the room would be if 
5 or 10 people were present, Regardless of N 
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TABLE. 1 
MEAN SUBJECTIVE RATINGS OF CROWDING 


Anticipated 
group size 
5 10 
Variable persons persons 
Room size 
Large 1.68* 2.88* 
Small 3.75* 3.44 
Structure expectancy 
Expected 2.50 1.94* 
Not expected 2.93* 4.38* 


Note. Room size and structure expectancy were crossed in 
the experimental design, but for clarity they were collapsed 
and presented here as separate influences, 

* Significant mean differences (Tukey's 


s procedure for 
comparison of unconfounded means), p < .05, 
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co ition, subjects indicated that they would 

have felt crowded (X = 5.1) had there been 

10 people in the room, When asked how 

crowded they would feel if 5 people were 

p sent, subjects in the small room indicated 

rore crowding (X = 4.5) than did those in 
large room (X = 3.25). 


vception of Crowding 
While predicted effects of group size and 
i om size reached significance, F(1, 48) = 
5, p < .05 and F(1, 48) = 24.73, p < .001 
ee Table 1), each was qualified by a series 
interactions. The predicted Structure X 
roup Size interaction was the only structure 
“pectancy interaction to reach significance, 
gesting that subjects expecting large-group, 
v astructured sessions felt more crowded than 
cd those awaiting large-group, structured 
essions, F(1, 48) = 14.35, p < .001, regard- 
less of room size. A Group Size X Room Size 
interaction, which indicated that room size 
influenced perceptions of crowding only for 
those subjects expecting small groups, F(1, 
48) = 14.35, p < .001, further suggested that 
ye expectation of numbers of people was of 
i -latively greater importance in determining 
s bjects’ perceptions. Subsequent comparison 
o unconfounded means indicated that struc- 
ti © expectancy produced a significant differ- 
e only among subjects expecting large 
gv p interaction and that room size was ef- 
fec ve in shaping these perceptions only 


an: og subjects expecting small groups (p < 
05 

| was also predicted that sex differences 
reg: ling perceived crowding would be mani- 
fest -uly when spatial consequences of density 


were solated and subjects anticipated small- 
group interaction in the small room. A sig- 
ificant Group Size X Room Size X Sex of 
interaction verified this prediction; 
small groups were expected in a small 
room, men reported more crowding than did 
women, F(1, 48) = 4.54, p < .05 (see Table 
2). Comparison of unconfounded means 
strengthens this finding, indicating that men 
and women responded comparably in all con- 
ditions except the small-group, small-room 


situation (p <.05). No other higher order 


effects were found, 
Additional evidence of the differential rela- 


oo 


4 Structure expected 


male subjects 
© Structure not expected =o 


~~~ female subjects 


PERCEIVED CROWDING [7-pt. scale! 


3-person 
small room 


10-person 
large room 


Ficure 2. Relationships between salient antecedent 
conditions of density and intervening sex of subject 
and structure expectancy variables, as suggested by 
data reflecting perceived crowding. (Because the 
large room was exactly twice the size of the small 
room, anticipation of 5-person groups in the small 
room and 10-person groups in the large room is 
associated with equivalent expected physical density), 


tionships among these variables is provided 
by a comparison of the 5-person, small-room 
condition and the 10-person, large-room sit- 
uation. Because both room size and antici- 
pated group size were varied on a ratio of 
2:1, the anticipated physical densities were 
identical. As a result, any differences between 
these conditions may be attributed to the 
salient stimulus conditions of the densities as 
expected. As can be seen in Figure 2, when 
spatial limitation was salient in the 5-person, 
small-room condition, reported crowding 
varied as a functin of sex rather than struc- 
ture; men felt more crowded than did wo- 
men, but structure expectancy did not influ- 
ence reported crowding. However, when social 
consequences of density were salient in the 
10-person, large-room condition, this was re- 


TABLE 2 
MEAN SUBJECTIVE RATINGS OF CROWDING 


S Large room Small room 
Antes Men Women Men Women 
5 persons 1.88 1.50 4.75* 2.15 
10 persons 3.12 2.62 3.38 3.50 


* Significant mean difference (4.75 > 3.38, 2.75, 1.88, 1,5), 
p <.05. 
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TABLE 3 


MEAN SEAT POSITION SELECTED BY SUBJECTS 
ENTERING THE EXPERIMENTAL Room 


Anticipated 
group size 
pect ea, 


10 


Variable persons persons 
Room size 
Large 3.56* 2.88 
Small 4.44* Dig 
Structure expectancy 
Expected 4.13 yy id 
Not expected 3.88* 1.88* 


Nole. Room size and structure expectancy were crossed in the 
experimental design, but for clarity they were collapsed and 
presented, here as separate influences, 

* Significant mean differences, P < -05. 
versed; men and women comparably re- 
sponded, while subjects not anticipating struc- 
tured sessions felt more crowded than did 
those expecting structure. 


Seat Position 


Findings regarding subjects’ seat position 
generally followed a pattern similar to data 
for perceived crowding. A main effect of group 
size, F(1, 48) = 23.29, p < .001, qualified by 
a Group Size X Structure Expectancy inter- 
action, F(1, 48) = 10.9, p< 01, indicated 
that subjects expecting large-group sessions 
tended to sit in more peripheral positions than 
did subjects expecting small groups unless 
anticipating a structured session, When struc- 
tured sessions were expected, subjects sat 
further toward the center of the seating ar- 
rangement, approximating positions assumed 
by subjects expecting small groups. Structure 
expectancy did not influence seat selection 
among subjects expecting small groups, but 
‘size influenced seat position only when 
group . was small F(1, 48) = 4.13, p< 
(05. Subsequent mean comparisons support 
these analyses, indicating that seat position 
antly different among subjects ex- 
in the large and small 
subjects expecting nine 
nd unstructured sessions 
eat-position selections are 


was signifie 
pecting four others 
rooms and among 
others in structuri 
(p < 05). Mean 
given in Table 3. 
The predicted ( 
Sex of Subject int 
cance, F(1, 48) = 


ze X Room Size X 
.pproached signifi- 
< 07 (see Table 


4), and subsequent comparison of uncon- 
founded means indicated that while respond- 
ing comparably in most conditions, men sat 
in more central, aggressive seat positions tan 
did women in the small-group, small-room 
condition (p < .05). Men in this condi! on 
generally assumed more central seat positics 
than did subjects of either sex in any ot! or 
conditions. No other higher order effects w ʻe 
found. 

Comparison of the two “equivalent d 
sity” conditions represented in Figure 3 si- 
ports these analyses. Anticipation of crowa. 5 
in small groups caused by the size of te 
room (5-person, small-room condition) caus’ | 
males to assume central seat positions, wh 
females maintained relatively neutral p 
tions. Structure expectancy did not influ 
seat selection in this condition. Howe 
when anticipated crowding was caused by | 
expectation of nine others (10-person, la: 
room condition), structure expectancy ir 
enced seat position, but sex of subject did 


Discomfort 


Generally, subjects’ reported discon 
followed the same pattern as did findings 
perceived crowding and seat position. 
though neither anticipated group size 
room size exerted a main effect on the dais 
a main effect of structure expectancy, Kii, 
48) = 17.89, p< .001, qualified by a Group 
Size X Structure Expectancy interaction, 
F(1, 48) =8.58, $< 01, indicated 
structure expectancy influenced reported 
comfort only when anticipated group 
was large. As can be seen in Table 5, suv- 
jects anticipating large, unstructured groups 
reported greater discomfort than did subjects 
expecting large, structured groups, but struc- 


TABLE 4 
Mean SEAT POSITION SELECTED f 


Large room Small room 


Anticipated — 
Men 


group size 


Women 
3.88 
3.00 


Men Women 


5.00* 
2.50 


3.62 
2.88 


3.50 
2.88 


5 persons 
10 persons 


Note. Central seats are indicated by larger numbers, with a 
range of 1-6. 

* Significant mean difference (5.0 > 3.88 = 3.62 = 3.50 
> 2.50), p <.05. 


male subjects 
------ female subjects 


à Siructure expected 
& Structure not expected 
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Fıcure 3, Relationships between salient antecedent 
aditions and intervening variables, as suggested 
seat position data. 


ure expectancy did not influence these rat- 
ings when groups were expected to be small. 
An Anticipated Group Size X Room Size in- 
teraction indicated that room size influenced 
reported discomfort only when groups were 
small, F(1, 48) = 9.89, p < .01. The Group 
ize X Room Size X Sex of Subject inter- 
ection, indicating that men reported more 
d scomfort in the small room than did women 
wien small groups were expected, did not 
a roach significance. 

hese findings are further supported by the 
cc ‘parisons shown in Figure 4, When sub- 
jes expected 5-person groups in the small 
roc n, reported discomfort varied with sex but 
nc. with structure expectancy. However, when 
the antecedent conditions were social and 
sul cts were concerned with large numbers 
of |. ‘ential interactants, reported discomfort 
was ‘=duced by the expectation of structure 
but was not influenced by the sex of the 
subject. 
Faciai Regard 
It was predicted that subjects would show 
less al regard for the two confederates 
when anticipating crowding. Although neither 
anticipated group size nor room size manipu- 
lations produced a significant main effect, 
comparison of subjects in conditions expected 
to generate crowding (e.g., large-group, no- 
Structure conditions and men in the small- 
group, small-room condition) with those in 
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conditions in which crowding was not ex- 
pected (e.g., large-group, structured condi- 
tions, women in the 5-person, small-room con- 
dition, and small-group, large-room condi- 
tions) yields a trend in the predicted direction. 
Considered this way, subjects anticipating 
crowding looked at the confederates less of- 
ten (X = 7.33) than did subjects who were 
not anticipating crowding (X = 21.60). 

The Anticipated Group Size X Room Size 
interaction indicated that varying space in- 
fluenced facial regard only when anticipated 
group size was small, F(1, 48) = 11.52, p< 
-01, As can be seen in Table 6, subjects ex- 
pecting large groups showed slightly more 
facial regard for the confederates in the 
small room, while those expecting small groups 
looked at the confederates considerably less 
often in the small room than in the larger 
space. In addition, structure expectancy ex- 
erted a significant main effect, F(1, 48) = 
9.96, p < .01, but was qualified by a Struc- 
ture X Group Size X Room Size interaction, 
F(1, 48) =4.47, p< .05. While subjects 
looked at others more often when expecting 
a structured session than when they were not, 
this was not so in the 5-person, small-room 
situation. Structure expectancy increased fa- 
cial regard for subjects anticipating large 
groups, and for subjects expecting small 
groups in the large room, but did not affect 
facial regard shown by subjects in the small- 
group, small-room conditions. The effects of 
anticipating large groups on facial regard, 
then, were mediated by structure expectancy 
and were not influenced by varying room size. 
No significant effects were found for measures 


TABLE 5 
MEAN SUBJECTIVE RATINGS or DISCOMFORT 


Anticipated 


group size 
5 10 
Variable persons persons 
Room size 
Large 2,56* 3.56 
Small 4.31* 3.50 
Structure expectancy 
Expected 3.25 2.50" 
3.62 4.56* 


Not expected 


Note. Room size and expectancy were crossed in the experi~ 
mental design, but for clarity they were collapsed and pre- 
sented here as separate influences. 

* Significant mean differences, p < .05. 
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A Structure expected 
@ Structure not expected 


male subjects 
a ae temale subjects 


REPORTED DISCOMFORT {7-pt. scalel 


10-person 
large room 


-person 
small room 


Ficure 4. Relationships between salient antecedent 
conditions and intervening variables, as suggested 
by reported discomfort. 


of subjects’ initiation of conversation, and no 
sex differences along either of these dimen- 
sions were found. 


Perceptions of the Experimental Setting 


Perception of the experimental setting was 
assessed on a series of 7-point scales. It was 
predicted that subjects anticipating crowding 
would perceive the setting differently than 
would those not expecting crowding and that 
these perceptions would be consistent with 
the nature of the crowding anticipated. To a 
limited extent, the data obtained support this 
contention, Subjects expecting nine others in 
the session rated the room as being warmer 
(X = 4.47) than did subjects expecting four 
others (£ = 3.47), F(1, 48) = 8.00, p < .01. 
Room size influenced perceptions of the de- 
gree of comfort provided by the room when 


TABLE 6 


MEAN NUMBER OF SECONDS SPENT LOOKING 
AT CONFEDERATES 


Anticipated group size 


Room size 5 10 
tructure expectancy persons persons 
Larg 

Expected 30.62 27.00 

Not expectéd 12.88 6.88 
Small 

Expected 5.40 32.00 

Not expected 6.75 9.12 
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5-person groups were expected (sma! oom X 
= 5.37, large room X= 4,0), F(1, 48) = 
5.67, p < .05, but did not influenc: s rat- 
ing when 10-person groups were exp 1, No 
other effects were found for these dim: sions, 
and no effects were found when coi ering 
the ratings of the room’s stuffiness, (ce of 
illumination, or overall attractivenes:. low- 
ever, women perceived the room as — nifi- 
cantly more ugly (X = 5.60) than di. men 
(X = 5.10), F(1, 48) = 4.67, p < .05. 
Discussion 

This research was concerned with th- osy- 

chological differences between the expe nce 


of too many people and the experien — of 
spatial restriction. It was predicted tha 
jects would experience crowding diffe 
when high social densities were percep 
salient than when high spatial densities ere 
salient, When the salient stimulus featur of 
the setting arouse expectations of inade: 
space, dynamics that influence the adec 
of the space (e.g., sex) should media‘. re- 
sponses. When the nature of the situatio 
phasized the presence of many other 
pectancy of social structure was hypoth: zed 
as an important mediating variable. 


The data generally confirm these ex -ta- 


tions. Whenever subjects anticipated arge 
groups (high social density), they rej «ted 
more perceived crowding and assumed nore 
peripheral seat positions than did sul ects 


anticipating smaller groups (low social 
sity), The intensity of response to higii s 
cial density was not influenced by varying 
amounts of space (room size) but was medi- 
ated by the expectation of social struclure; 
subjects anticipating large, unstructured group 
sessions sat in more isolated seats, reported 
more crowding and discomfort, and looked 
at others less frequently than did subjects 
anticipating large, structured group sessions. 
As predicted, structure expectancy did not 
mediate the consequences of high spatial den- 
sity. Instead, sex of subject mediated re- 
sponse to high spatial density and did not 
interact with anticipated group size. 

These findings suggest that when the salient 
conditions of high density arouse perceptions 
of inadequate space, experiential and behav- 
ioral responses are different from those caused 
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DIFFERENTIAL RESPONSE TO ANTICIPATED CROWDING 


by \\imulus features promoting frequent in- 
ter «tion with large numbers of others, High 
so: al density is likely to arouse social over- 
lc | and is experienced as a threat to inter- 


rs with whom one must interact. As a 
lt, people attempt to withdraw and isolate 
t selves from the group, thereby minimiz- 
the likelihood of an encounter with others 
eestablishing their control over the social 
vation, However, expectations of social 
icture negate many of these consequences; 
cause structure reinforces individuals’ abil- 
to regulate social interaction, the freq- 
ency and intensity of large-group interaction 
ı a Structured setting is not as stressful as 
social contact in large, unstructured groups. 
hen subjects anticipated large, structured 
roups, they did not experience the stress and 
liscomfort reported by subjects in large, un- 
structured groups; in addition, they did not 
seek to avoid involvement with others by as- 
suming seat positions and maintaining in- 
frequent facial regard, as did subjects who 
were not expecting externally supplied struc- 
ure, 
High spatial density, on the other hand, 
i likely to arouse perceptions of inadequate 
s ace, and although aware of structure when 
i. was introduced, subjects anticipating small 
g ups were not influenced by it. Because the 
p lems associated with inadequate space 
(. ., inappropriate distancing, behavioral 
co straint) reflect different dynamics, situa- 
tic val elements influencing the regulation of 
so: al interaction do not necessarily influence 
iential and behavioral response to high 


aon regulation because of the number of 
€ 


ex] 
spa density. Rather, properties or pro- 
cesse- that influence perceived spatial extent 
or adequacy mediate spatial consequences of 


ed with sex influence the perceived 
of space; since women seem to pre- 
fer smaller interaction distances than do men, 
small rooms are more likely to appear inade- 
quate or inappropriate to men than to women. 
The present finding supports this notion. 
Consistent with previous research, men ex- 
perienced more crowding and responded more 
aggressively to high spatial density conditions 
than did women. 
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These data also suggest that social conse- 
quences of high density are more salient than 
spatial consequences. Whenever 10-person 
groups were considered, experiences and re- 
sponses were mediated by structure expec- 
tancy and social expectations, suggesting a 
pattern of withdrawal and interaction avoid- 
ance. Room size variations became percep- 
tually salient and influenced experience and 
response only when anticipated group size 
was small. It is possible that spatial density 
is experienced as arousal and intensifies on- 
going processes in the setting (e.g., Freed- 
man, 1975; Saegert, in press), whereas social 
density is experienced as a more immediate 
and stressful threat to social control, 

However, the present study provides only 
limited evidence for such a notion; since the 
smaller room failed to intensify the stressful 
anticipation of large numbers of others, it 
cannot be concluded that spatial density op- 
erates solely as an intensifier variable. Rather, 
the findings for small-group expectations sug- 
gest an interesting linkage with arousal-inten- 
sification hypotheses, and when combined 
with findings on large-group expectations, 
they provide a means for reconciling many 
contradictory findings. Studies varying room 
size have examined different processes and 
have reported different findings than have 
studies of varying group size, These findings 
become more consistent and predictable when 
the psychological effects of social and spatial 
density are recognized and systematically 
considered. 

When considering antecedents of crowding 
stress, distinctions between amounts of space 
and numbers of people are analogous to dis- 
tinctions between physical and social environ- 
ments (e.g., Stokols, Note 5). High-density 
response can be conceptualized as a func- 
tion of either or both of these factors, but 
equating them has not proven functional. 
Crowding is a joint experience of social and 
spatial conditions, and in the long run these 
distinctions may prove incomplete, as high 
density is most likely to be characterized by 
elements of both. However, as long as meth- 
odological distinctions based on isolated ante- 
cedents persist, it is necessary to recognize 
the psychological differences between social 
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and spatial density, Rather than ignoring the 
inconsistencies in experimental findings, a 
perspective emphasizing the numerous poten- 
tial consequences of high density, the psycho- 
logical differences among them, and the neces- 
sity of considering specific intervening con- 
ditions in different ways can facilitate our 
understanding of the complexities of the 
crowding process. 
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Physical Attractiveness and Marriage Adjustment 
in Middle-Aged Couples 


Bernard I. Murstein and Patricia Christy 


Connecticu 


t College 


The subjects were 22 middle-aged, middle-class couples for whom three mea- 
sures of physical attractiveness were obtained: photos, self-evaluations, and 


evaluations by spouse. In addition, each 


subject took the Locke-Wallace Mar- 


riage Adjustment Scale. It was hypothesized that (a) members of couples would 


be matched for physical attractiveness, (b) 
be correlated with marriage adjustment, 


equity of physical attractiveness would 
(c) perception of the partner as more 


attractive than the self would be correlated with marriage adjustment, and 


(d) attractiveness, in general, would be 


correlated with marriage adjustment. 


The first hypothesis was clearly supported, the second rejected, and the third 
was supported for husbands’ marriage adjustment only, but it was suspected of 
being artifactual, The fourth hypothesis was confirmed for subjective measures 


of attractiveness in relation to husbands’ 


The ubiquitous role that physical appear- 
ance plays in interpersonal attraction has been 
amply demonstrated in a host of studies (Ber- 
scheid & Walster, 1974), but little attention 
has been paid to the role of physical attrac- 
tiveness within marriage. One unanswered 
question is whether equity of physical attrac- 
tiveness exists in marriages of some duration. 
Most of the early literature, however, focused 
on experimental paradigms of contrived dates. 
Initial findings suggested that equity was not 
a factor, each subject desiring the most at- 
tractive partner he could get regardless of his 
own degree of attractiveness (Berscheid & 
Walster, 1974). After further research, how- 
ever, this failure of equity was shown to be a 
function of the contrived nature of the experi- 
ments. Actual courting or very recently mar- 
ried couples have been shown to possess 
greater than chance equity of attractiveness 
(Murstein, 1972). 

Silverman (1971) reported very high simi- 
larity of physical attractiveness in dating cou- 
ples. However, the judges knew who was dat- 
ing whom and may have inadvertently suc- 
cumbed to “halo” effects in their judgments. 


The authors are appreciative of the aid of Steven 
Gunn in giving the manuscript the benefit of his 
criticism, 

Requests for reprints should be sent to Bernard I. 
Murstein, Box 1581, Department of Psychology, Con- 
necticut College, New London, Connecticut 06320. 


537 


but not wives’ marital adjustment. 


Moreover, no control group was used to de- 
termine whether the high reported equity of 
attractiveness was greater than chance, As 
Udry (1971) has pointed out, the narrow 
range of scores might have ensured that even 
“random matings would produce most simi- 
larly rated couples” (p. 23). 

Murstein (1972) avoided both of these diffi- 
culties with a group of 99 premarital couples 
who were photographed and rated themselves 
and their partners on attractiveness. Judges 
rated the photos without knowledge of who 
was courting whom. For all variables a con- 
trol group was formulated by randomly pair- 
ing men and women. Results conclusively dem- 
onstrated that couples were matched for de- 
gree of attractiveness as gauged by photos and 
self-evaluations but not by perception of part- 
ners. A replication for self-attractiveness and 
perceived partner attractiveness measures 
again substantiated “self” matching but again 
failed to confirm matching by respective per- 
ception of partner (Murstein, 1976). 

The question now arises as to whether in 
actual marriage, and particularly after a con- 
siderable number of years, couples will still be 
matched for attractiveness, Unfortunately, 
there is only indirect evidence on this ques- 
tion. Stroebe, Insko, Thompson, and Layton 
(1971) argued that physical attractiveness is 
of much less importance regarding marriage 
than dating. Their college subjects viewed pic- 
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tures of individuals judged by a panel as 
differing in attractiveness and answered, 
among other questions, whether they would 
consider the person for a date and whether 
they could consider him as a marriage part- 
ner. Physical attractiveness was found to play 
a greater role for possible dating than for 
possible marriage. 

Cavior and Boblett (1972) took an oppo- 
site stance, arguing that dating was less of a 
commitment than marriage. They stated that 
a man in his quest for sexual satisfaction 
may take a woman less attractive than him- 
self if he cannot immediately find a more 
attractive one. However, since marriage has 
much more importance, individuals will hesi- 
tate to marry those beneath them in attrac- 
tiveness. Unlike Stroebe et al. (1971), their 
subjects were actually married or dating cou- 
ples from college. They found a correlation of 
.73 between married couples’ attractiveness 
but one of only .19 between dating couples. 

By extension of these findings, it could be 
thought that correlations for attractiveness 
between long-married couples would be small 
because many other factors than physical 
attractiveness would have entered the relation- 
ship of the marriage partner and influenced 
whether they remained married after one or 
two decades. On the other hand, if individuals 
continued to be concerned about physical at- 
tractiveness, those marriages in which one 
partner lost much more attractiveness than 
the other should undergo greater strain than 
those in which equity continued. 

The position we take, similar to that of 
Cavior and Boblett, is that physical attrac- 
tiveness continues to be an important consid- 
eration throughout much of marriage. Conse- 
quently, we predict a substantial positive cor- 
relation between spouses based on initial 
matching and a tendency for strain and possi- 
ble marital disruption to ensue when one mem- 
ber’s physical attractiveness diminishes at a 
more rapid rate than that of the other. 

Although we have so far referred to objec- 
tive (judges’) ratings of physical attractive- 
ness, it is possible to question whether these 
ratings are the most meaningful measures to 
compare with marriage adjustment. If beauty 
is in the eye of the beholder, then attractive- 
ness should also be investigated via the sub- 
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ject’s perception of his partner's attra 
ness. It is also conceivable that an indiv 
al’s choice of partner and his adjustme 


marriage are related to his image of hi i 


attractiveness. For this reason the self-c: it 
of attractiveness was also included i r 
study. 

The literature on the role of attracti 5S 
in marriage adjustment, unlike that ts 


effect on attraction, is almost non ti 
Berscheid and Walster (1974) point « ut 
the beautiful are believed to be high t 
socially desirable qualities, just as th y; 
like Shakespeare’s Richard III, are ed 


to have personalities as evilly shaped ‘ir 
bodies. Presumably, therefore, the | ful 
ought to be able to “cash in” on th: set 
to extract many pleasures from lifc in 
marriage beauty may be quickly ad to, 
and this fact, in addition to the ep ral 
nature of beauty, may limit the in of 
attractiveness on marriage. Further ere 
may be costs in being married to an ac- 
tive spouse. If a husband is not h me 
himself, he might be expected to r his 
contribution to the marriage by wait and 


and foot on his “princess”; thus, t jar- 


rator of a Trinidadian song advises you 
want to be happy and never blue, © ways 
marry a woman uglier than you.” 

The only published study we found on the 
subject was that of Kirkpatrick and Cotton 
(1951). They had students rate couples’ at- 
tractiveness in marriages in which they 
thought the individuals were either well ad- 


justed or poorly adjusted. The couples’ mean 
age was approximately 30 years. In two sepa- 
rate studies they found that well-adjusted 
wives were more attractive than poorly ad- 
justed ones. Slightly less strong evidence was 
found for concluding that equity in physical 
attractiveness and husband attractiveness were 
both related to good marital adjustment. The 
main weakness in this study is that both at- 
tractiveness and adjustment were assessed by 
the same individual. There was no control for 
possible “halo” effects in which beauty and 
good adjustment might be assumed to go to- 
gether. 

The only other relevant study we have 
found is the unpublished work of Berscheid, 
Walster, and Campbell, to which reference is 


—_—_ ——_ 
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made in Berscheid and Walster (1974). Pho- 
tos of men and women from the University of 
Minnesota between 1933 and 1936 were com- 
pared with interviewers’ estimates of the sub- 
jects’ marital satisfaction 20 years later. 
Berscheid and Walster report a “faint and 
marginally significant” (p. 201) negative rela- 
tionship between the women’s attractiveness 
and their marital adjustment 20 years later. 
No relationship was found for men. In a 
personal communication, however, Walster 
(Note 1) stated that a follow-up failed to 
support even this marginal significance. The 
role of both equity of attractiveness and at- 
tractiveness within marriage thus remains far 
from determined. 

There is another measure of attractiveness 
that seems worthy of study. Although equity 
between partners in objectively determined 
physical attractiveness may be found to be 
associated with marital adjustment, if we 
consider the perception of self and of the part- 
ner by the same subject, equity may not be 
as desirable to the perceiver as inequity which 
benefits him. If, for example, a person per- 
ceived his spouse as superior to himself in 
attractiveness, he should be more pleased 
than if he perceived his partner as equal to 
himself in attractiveness. 

In a study with premarital couples, Mur- 
stein (1976) found that men perceiving their 
partner as above them in attractiveness were 
much higher in agreement between various 
self- and partner concepts than men who per- 
ceived themselves as equal or superior to their 
partners in attractiveness. Women showed 
weaker evidence of a relationship between 
their responses to personality questionnaires 
and physical attractiveness. Agreement be- 
tween self- and partner concepts was some- 
what more likely to be manifested by those 
women who perceived equity between them- 
selves and their partners on attractiveness in 
comparison to women who perceived them- 
selves as inferior to their partners. Murstein 
attributed these findings to superior male 
power in marital choice, which enables the 
man to get partners he believes are more 
attractive than himself, whereas the women 
are willing to settle for perceived equity. 

The greater power of men in marital choice 
is somewhat, though not entirely, dissipated 


once they are married. Once the individuals 
are committed to each other, it is much more 
difficult to “uncommit” oneself. Once “settled 
in” to marriage, therefore, both partners may 
find their happiness associated with percep- 
tion of their spouse as superior to them in 
many areas, Years ago Kelly (1941) found 
that marital satisfaction was associated with 
both marital partners perceiving their partner 
as superior to them in interpersonal qualities, 
Accordingly, we predicted that, for both men 
and women, perceiving the partner as su- 
perior in attractiveness would be associated 
with marital adjustment. 

In sum, we can state the following four hy- 
potheses: (a) Middle-aged married couples 
will show a greater than chance equity for 
physical attractiveness; (b) Equity of physi- 
cal attractiveness is associated with high mari- 
tal adjustment; (c) Perception of the partner 
as more attractive than the self will be associ- 
ated with good marital adjustment; (d) At- 
tractiveness is associated with marital adjust- 
ment, 


METHOD 
Tests 


The measure of marital adjustment employed was 
the Locke-Wallace Short Form (Locke & Wallace, 
1959), which is generally regarded as the most popu- 
lar scale of marriage adjustment, with some demon- 
strable validity and reliability. The measures of 
physical attractiveness were (a) ratings of photos of 
the subjects taken separately in repose with Polaroid 
color film, (b) a 5-point self-rating for physical 
attractiveness, and (c) a 5-point rating of the spouse 
for physical attractiveness. 


Subjects 


The subjects were 22 married couples of a small 
New England coastal town who had an interest in 
historical houses, They volunteered to participate in 
an anonymous study on marriage and physical at- 
tractiveness in exchange for a tour of a historical 
house given by the second author. The subjects were 
white, mainly middle or upper-middle class, and 
ranged in age from 28 to 59 years, with the means 
and standard deviations as follows: men’s M = 39.7 
(SD =7.7); women’s M =37.6 (SD=7,7). They 
had been marricd 15.4 years on the average (SD = 
9,4) and had a mean of 2.8 children (SD = 1.2). The 
means and standard deviations of the marriage ad- 
justment scores were as follows: men’s M = 120.2 
(SD = 14.4); women’s M = 124.9 (SD = 12.7). None 
of these differences are significant. Locke and Wal- 
lace (1959) gave the test to two small groups of 
men and women who were approximately 30 years 
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of age. The mean score of the well-adjusted group 
was 135.9, and that of the maladjusted group was 
71.7. From a comparison with these data, our sub- 
jects would appear to be relatively well adjusted. 

The tests and ratings were administered to the 
subjects at their homes by the second author, who 
Was present so that communication between hus- 
band and wife was impossible. The couples were then 
photographed separately by the second author who 
again stressed that the photographs would be anony- 
mous and rated by judges not familiar with the cou- 
ples. 


RESULTS 


The ratings of attractiveness of the photos 
were obtained as follows: The judges (one 
middle-aged male professor and two male and 
two female graduate students) were instructed 
that in the general population a rating of 5 
(extremely good looking) would be obtained 
by 8% of the population, a rating of 4 (bet- 
ter than average) by 17%, a rating of 3 
(average) by 50%, a rating of 2 (somewhat 
below average) by 17%, and a rating of 1 
(considerably below average) by 8%. All 
judges viewed the pictures without knowing 
who was married to whom. After viewing all 
22 pictures of one sex to get a general over- 
view, they proceeded to look at each photo 
separately and record their scores. Interjudge 
reliability was computed by randomly assign- 
ing two judges to one group and three to the 
other, obtaining the mean scores for each sub- 
ject by each group, correlating the mean 
scores for each subject between the two groups 
of judges, and “correcting” the correlation by 
the Spearman—Brown prophecy formula. The 
corrected r was .59. 

Comparison of the three physical-attrac- 
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tiveness measures showed that the 

mean scores were achieved for photo: 

2.7; women, 2.7), with self-evaluation 
higher (men, 3.3; women, 3.1), and t} 
ception of partner scores being highest - 


bands perceived wives as 3.9; wives pe d 
husbands as 3.7). The order of these 15 
is identical to that reported earlier bs - 
stein (1972) for premarital couple y 
indicate (using the photograph rating r 


objective criterion) that most subject - 
estimate their own attractiveness so t 


and overestimate their partner’s attrac S 
still more. 
The test of equity in physical attrac ess 


may be made by examination of Tabl 
results show that Hypothesis a is cl p- 
ported in that all three measures of al 


attractiveness—photo, self, and | ed 
partner—are significantly correlated us- 
band and wife. Since the Pearson r m ive 
a high correlation even when equ of 
attractiveness did not exist, Hypothe yas 
tested by means of the intraclass co ion 
(Hays, 1973, pp. 534ff). The results al- 


most identical to those found with t ar- 
son r, the husband—wife photo, self, « jer- 
ceived partner correlations being res; 
-60, 39, and .40. The first value exc: the 


Ss 


-O1 level of significance, and the latie: two 
exceed the .05 level; hence, Hypothesis a is 
clearly confirmed. 

Of interest also, as shown in Table |, is 
the finding that both men and women evinced 


moderately high reality contact in thei: 
assessment as evidenced by the correla 


TABLE 1 


CORRELATION MATRIX FOR HUSBANDS’ AND WIVES’ PHOTO, SELF, AND PERCEIVED SPOUSE 
ATTRACTIVENESS RATINGS AND CORRELATION WITH MARRIAGE ADJUSTMENT (MA)* 


Husband's 
Item photo Wife's photo Selin Selfw Wifen Husbandw 
Husband’s photo 60%" 46" 27 —.02 10 
Wife’s photo 40* 42* —.03 —.04 
Self .39* 34 07 
Selfw 33 34 
Wifex 40* 
MA Husband -00 —.14 -08 AZ .41* 62** 
MA Wife 27 —.09 04 03 -08 .20 
Nole. The subscripts H and W refer to whether the husband or wife is doing the perceiving. 


n= 
*p <.05. 
p <.01. 
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of their self- and photo-attractiveness ratings. 
If we utilize the photo ratings as our objec- 
tive measure of attractiveness, Table 1 indi- 
cates clearly that neither the husband’s per- 
ception of his wife nor her perception of him 
was related to their objective attractiveness. 

Hypothesis b, that equity of attractiveness 
between spouses correlates with marital ad- 
justment, involves correlating the absolute dis- 
crepancies of our three attractiveness variables 
with husbands’ and wives’ marital adjust- 
ment. The results are shown in Table 2 within 
parentheses for the first three variables listed. 
No support is found for the hypothesis with 
respect to husbands’ or wives’ marital adjust- 
ment. However, contrary to hypothesis, abso- 
lute discrepancy in self-concepts is related to 
the husbands’ marital adjustment.’ Although 
not related to the hypothesis, the same cor- 
relations considered with regard to sign are 
also shown in Table 2, but none of these reach 
significance. 

Hypothesis c, that perception of the part- 
ner as superior to the self in attractiveness is 
correlated with marital adjustment, may be 
tested by examination of the fourth and fifth 
lines in Table 2, The hypothesis is verified 
for the husbands’ marital adjustment for both 
absolute and signed discrepancies but not for 
any scores relating to wives’ marital adjust- 
ment. Inequity between husbands’ and wives’ 
discrepancy scores (Line 6) is significantly 
correlated with husbands’ marital adjustment 
(absolute discrepancy) but not with wives’ 
marital adjustment. 

Hypothesis d, that attractiveness is associ- 
ated with marital adjustment, may be tested 
by examining the correlations in Table 1. 
These correlations indicate support only for 
each partner’s perception of the spouse, and 
then only for the correlation with the hus- 
band’s marital adjustment; thus, Hypothesis 
d receives only partial support. 


Discussion 


Our data indicate that couples are matched 
for all three measures of physical attractive- 
ness. However, neither photo nor self measures 
of attractiveness are correlated with marital 
adjustment. Neither do photo ratings relate to 
perceived partner attractiveness. But, percep- 


TABLE 2 


CoRRELATION OF PHYSICAL ATTRACTIVENESS 
DISCREPANCY SCORES WITH LockE-WALLACE 


MARRIAGE ADJUSTMENT (MA) SCORES 
Variable Husband's MA Wife's MA 
Photon-Photow «39 (,06) 
Selfu-Selfw 00 (21 
Wifeu-Husbandw 10 (—.08) 
Wifeu-Selin ( .04 (01) 
Hushandw-Selfw 253 (72) +18 (19) 
Wifen-Selfu- 
Husbandw-Seliw —12 (51) —.11 (1.9) 


Note. Signed discrepancies are outside of parentheses, and 
absolute discrepancies are within parentheses, Subscripts H or 
W indicate whether perception is by husband or wife. 

*p <.05, 

**p <01. 


tion of the spouse as attractive by the man or 
woman is related to the man’s marriage ad- 
justment. 

If one member of the couple likes the other, 
he (she) tends to perceive the other as attrac- 
tive. However, marriage adjustment is not a 
simple function of perceived attractiveness or 
liking. The strength of the relationship of 
physical attractiveness to marriage adjust- 
ment depends on the importance of physical 
attractiveness to the individual’s evaluation 
of his marriage. 

It seems plausible that men are more con- 
cerned than women about physical attractive- 
ness, both in themselves and their spouses. 
“Body beautiful” magazines, for both men 
and women, are largely bought by men. Thus, 
it is perhaps not so surprising that both the 
man’s perception of his wife as attractive and 
her perception of him as attractive correlated 
positively with his marital adjustment, but 
neither was correlated with her marital ad- 
justment. 

Perceiving the partner as above the self in 
attractiveness was found to be related to mar- 
riage adjustment of the husband, but this cor- 
relation is not particularly superior in mag- 
nitude to the correlation achieved just by the 
perception of the partner without considera- 
tion of the self-concept. This finding, plus the 
fact that discrepancy scores do not enable us 
to tell how much each component contributes 
to the correlation, makes us loath to conclude 


1 Because of the significance contrary to hypothesis, 
all correlations relating to this hypothesis are tested 
by the two-tailed rather than one-tailed test. 
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that the discrepancy score contributes any 
more predictive variance above that of the 
first component (perception of partner) alone. 
Apparently, idealization of the spouse alone 
can account for the correlation with marriage 
adjustment. Accordingly, we shall not discuss 
further the significant findings reported in 
Table 2. 

Our data do have some bearing on the 
Cavior-Stroebe controversy over whether at- 
tractiveness is more important for marriage 
as opposed to dating. Since our data show 
equity in attractiveness for middle-aged cou- 
ples, Cavior and Boblett’s position is sup- 
ported. 

In addition, our data do not support Kirk- 
patrick’s (Kirkpatrick & Cotton, 1951) find- 
ing that attractiveness (as measured by a 
judge) and equity of attractiveness are associ- 
ated with marriage adjustment. However, we 
have indicated that their findings were sus- 
pect because of certain weaknesses in their 
design. 

Last, the relationship of our findings to 
that of Murstein (1976) with premarital cou- 
ples should be considered. He found that the 
man’s perception of his girlfriend as above 
him in attractiveness was associated with good 
courtship progress, but the woman’s percep- 
tions with respect to these two variables did 
not significantly influence the couple’s court- 
ship progress. The perception of the partner’s 
attractiveness alone by either sex, however, 
was not related to either similarity of person- 
ality perceptions or to courtship progress. 

The finding of the importance of perceived 
partner attractiveness without reference to 
self-concept in the present study and not in 
the marital-choice study may reflect the fact 
that shopping for a marital partner is more 
exchange oriented than is maintaining a mar- 
riage. In this respect, it is noteworthy that 
the man’s perception of the partner as more 
attractive than him may lead him to believe 
that he has a good bargain and lead him to 
propose marriage. The absence of such a rela- 
tionship for the nubile woman may reflect 
her lower status in society, so that choosing a 
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partner for attractiveness is a less available 
option (Murstein, 1974), 

The common theme in both the cited pre 
marital studies and the present study is 
for men, perceived physical attractiy 
seems to be associated with satisfaction w 
the relationship. At what point beyond m 
age, if ever, it ceases to be importani 
topic for future research. 


REFERENCE NOTE 


1, Walster, E. Personal communication, Oct 
1974. 
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Children’s Preferences for Individualistic, Cooperative, 


and Competitive Outcomes 


Charles G, McClintock and Joel M. Moskowitz 
University of California, Santa Barbara 


Variations in individualistic, cooperative, and competitive choice behavior were 
observed in 342 children, aged 5 to 84 years, in three task settings: individual- 
istic, coordinative, and conflictual, as a function of age, sex, and the outcomes 
afforded self and other. As hypothesized, children across ages and sex 
displayed two dominant motivational orientations in the individualistic setting: 
own gain (individualism) and relative gain (competition) maximization. Also 
as hypothesized, the likelihood of joint gain (cooperative) choices increased 
with age in the coordinative setting, but, contrary to expectation, the frequency 
of competitive choices did not increase with age in the conflictual setting, 
partly because even the youngest children made predominantly competitive 
choices. Boys were observed to be more competitive than girls in the individual- 
istic and conflictual settings, but these differences, though significant, were 
small. Variations in the outcomes available to self and other produced the pre- 
dicted effects upon children’s preferred outcomes. Overall, the findings indicate 
that young children systematically take other’s outcomes, as well as the task 
setting, into consideration in making choices that afford valued commodities to 


self and other, 


One of the fundamental facts of social ex- 
istence is that in the calculus of decision mak- 
ing, human actors do and, in fact, must take 
into consideration how their behaviors affect 
both their own and others’ outcomes. This state 
obtains because humans are often reciprocally 
dependent upon one another regarding the 
attainment of valued outcomes and are mu- 
tually aware of this interdependence. This 
form of outcome interdependence has been 
previously described by Thibaut and Kelley 
(1959) as one of mutual fate and/or behavior 
control. 

The awareness of outcome interdependence 
may be uniquely human, although mutual 
outcome control occurs within and between 
all biological species. Although lower organ- 
isms often learn to modify their behavior as a 
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consequence of the effect it produces on oth- 
ers, it seems likely that many of their social 
behaviors are genetically programmed. For 
humans, social acts such as competition, coop- 
eration, and altruism are more likely learned. 
This does not mean that one can rule out the 
possible influences of genetic factors in deter- 
mining human social behavior (Alexander, 
1975), but socialization seems a more viable 
explanation than biological evolution for most 
human social acts (Campbell, 1975). 

During the past decade, there has been an 
increasing attempt made to define and assess 
systematically preferences that human actors 
display in settings of outcome interdependence 
regarding the distribution of own and others’ 
outcomes (Kuhlman & Marshello, 1975; Mac- 
Crimmon & Messick, in press; McClintock, 
1972a, 1972b; Messick & McClintock, 1968). 
There has also been an attempt to determine 
variations in the rate at which an observer 
can detect different patterns of preferred own/ 
other outcome distributions by another 
(Thorngate, Note 1). Finally, Goffman 
(1959), Jones (1964), and others have consid- 
ered the types of strategies actors employ in 
settings of outcome interdependence to mod- 
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ify the behaviors of others in order to achieve 
their more valued outcomes. 

The present investigation defines and mea- 
sures changes in three own/other outcome 
preferences in young Children in three dif- 
ferent settings. The following three general 
expectations serve as the conceptual basis for 
the study: 


1. Young children initially attempt to maxi- 
mize their own outcomes regardless of the 
implications of their acts for the outcomes of 
others with whom they share interdependence. 

2. Through experience with interdependent 
others, young children learn that they must 
take others’ outcomes into account in order to 
maximize their own. Thus, cooperating or 
sharing, competing to win, aggressing against 
others, affording other more positive outcomes 
than self, and so on become learned strategies 
in the service of attaining valued own out- 
comes. 

3. Finally, learned strategies that take into 
account both own and other’s outcomes may 
become autonomous social motives. Thus, 
maximizing joint gain, relative gain, other’s 
gain, and the like may become valued ends in 
themselves. Thus, a child may learn to prefer 
less of a valued commodity for self if that 
option provides him more of it than someone 
else (competition), if it provides more for self 
and other jointly (cooperation), if it provides 
more to other (altruism), if it minimizes the 
difference in amount obtained by self and 
other (equality), and so on. 


Given these three expectations, variations 
in children’s preferences for own, relative, and 
joint gain maximization in an individualistic, 
a coordinative, and a conflictual setting are 
observed as a function of age, sex, and game 
class, the latter variable to be defined subse- 
quently. The individualistic setting is one in 
which the child must forego own gain maximi- 
zation in order to maximize relative gain 
(competitive outcomes) or joint gain (coop- 
erative outcomes). The coordinative and con- 
flictual settings require the child to forego 
own for joint or relative gain maximization 
on given trials to obtain a valued outcome at 
the end of a game. 
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Age 

There have been numerous studies that de- 
scribe the effects of age upon the occurrence 
of competitive and cooperative behaviors in 
children. Most of these have been observa- 
tional studies that do not provide systematic 
information concerning changes in the relativ 
dominance of various outcome preferences fo 
self and other as a function of setting ar 
age. Recently, in a series of investigation 
(e.g., Kagan & Madsen, 1971, 1972; Madse 
1972) Madsen and Kagan have, in a syste! 
atic manner, observed that older children 
more competitive than younger children o' 
tasks that require coordination for the ma» 
mization of own and joint gain. Furthermo: 
these researchers have observed that the r 
of acquisition of competitive orientali 
varies as a function of the children’s cu! 
background. In general, urban children | 
been observed to be more willing than : 
children to forego own and joint gain 1 
mization for relative gain. 

In another series of cross-cultural inv 
tions (McClintock, 1972a, 1974; McC 
& Nuttin, 1969; Toda, 1975), childrer 
a number of cultures have also been foi 
be increasingly willing to forego ow: 
joint gain maximization for relative gain 
comes. Again, cultural differences in th: 
gree of preference for competitive out 
were observed at various ages. Hov 
neither of these prior series of studies pe 
a clear differentiation between preference 
cooperative, individualistic, and compe 
outcomes, and neither assesses whether 
preferences reflect children’s goals or merely 
differing strategies in the service of own gain 
maximization. Veroff (1969), in another re- 
search program, reports that children 5-6 
years old or less behave egocentrically and do 
not employ others’ outcomes as a standard 
against which to evaluate the “goodness” of 
their own outcomes. Veroff argues and pre- 
sents supporting data that children subse- 
quently enter a second developmental phase 
in which social comparison of outcomes leads 
to more competitive forms of behavior and, 
finally, that older children pursue both indi- 
vidualistic and competitive outcomes. 

In the present study, the dominance of pref- 
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erences for individualistic, competitive, and 
cooperative outcomes of children ranging in 
age from 5 to 84 years are assessed within 
three outcome settings. On the basis of prior 
findings, it is hypothesized that with increas- 
ing age, children in the individualistic setting 
will be more willing to forego own for relative 
outcomes and that, across ages, own and rela- 
tive gain preferences will strongly dominate 
joint gain preferences. Further, it is hypothe- 
sized that in the conflictual setting, choices 
reflecting a preference for competitive out- 
comes will increase with age, whereas in the 
coordinative setting, choices reflecting a pref- 
erence for joint gain will increase with age. 
Thus, it is anticipated that as children grow 
older, competition becomes a more dominant 
autonomous motivational orientation, and they 
learn to utilize more effectively relative and 
joint gain choices as strategies for obtaining 
valued own outcomes. 


Sex 


There has been considerable research on 
variations in children’s interests and behaviors 
as a function of sex, but relatively few stud- 
ies have assessed differences in the develop- 
ment of social motives and strategies, The 
findings of several empirical inquiries relate to 
differences in own/other outcome preferences. 
For example, in their extensive research on 
child rearing practices, Sears, Maccoby, and 
Levin (1957) found that girls develop a sense 
of conscience and social responsibility earlier 
than boys. This would suggest that girls are 
more likely to display cooperative and altru- 
istic behaviors earlier than boys. Szal (1972) 
conducted a very careful study of cooperative 
and competitive behavior in boys and girls in 
a task requiring turn taking and coordination 
of effort. She found strong evidence that girls 
were more cooperative, boys were more com- 
petitive with same-sex partners, and boys in 
mixed-sex dyads become more cooperative, 
whereas girls became more competitive. On 
the basis of Szal’s observations, one might 
expect that boys in the conflictual setting or 
task would be more competitive than girls, 
and the latter would be more cooperative in 
the coordinative task. One might expect on 
normative grounds that competition as an au- 
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tonomous social motive would emerge earlier 
and more strongly in boys. 


Case 


A third independent variable (within sub- 
jects) in the present investigation is that of 
case, In prior studies (e.g., Messick & Mc- 
Clintock, 1968; MacCrimmon, Note 2), pay- 
offs have been systematically varied in terms 
of whether a chooser’s outcomes afford him 
more or less than others. Case I games are 
defined such that, regardless of the choice 
made, a chooser is afforded as much or more 
of some commodity than other. Hence, a game 
in which a child is afforded a choice between 
three marbles for self and three for other 
versus two marbles for self and one for other 
would be defined as a Case I game. Case II 
games are ones in which the noncompetitive 
outcome affords’ other more of some commod- 
ity than self. Hence, if the above example 
game were modified so that the choice that 
afforded three marbles to self now afforded 
four marbles to other, the game could be de- 
fined as falling into Case II. Prior research 
has indicated that choosers are more likely to 
prefer competitive to own or joint gain out- 
comes when the noncompetitive choice affords 
other more than self. These observations are 
consistent with the findings of Messick and 
Thorngate (1967) that subjects in comparing 
combinations of own/other outcomes behaved 
in such a way as to indicate that being behind 
was more aversive than being ahead was re- 
warding. In the present investiagtion, it is ex- 
pected that children will also show stronger 
preferences for competitive outcomes in Case 
II than in Case I games across the three set- 
tings. 


Settings 


The present study requires children to indi- 
cate a preference for one of two own/other 
outcome combinations in three settings: (a) 
an individualistic setting in which there is no 
strategic requirement for a particular choice, 
(b) a conflictual setting in which competitive 
choices are required in order to obtain a val- 
ued commodity, and (c) a coordinative one in 
which cooperative choices are required to ob- 
tain the commodity. In comparing outcome 
preferences between these three settings, it is 
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TABLE 1 


THe MOTIVATIONAL DOMINANCE OF Four CLASSES OF 
TWO-CHOICE DECOMPOSED GAMES WITH EXAMPLES 
OF OUTCOMES EMPLOYED IN PRESENT STUDY 


Game classes 


Generic = 

form J.R(O) OJ.R ORJ ORJ 

Outcome A uB AFB A B A B A B 
Own ar a: 4 4 4 3 5 4 eck 
Other's br be 44 EEN 24 a 


Nole. O = own gain; R = relative gain; J = joint gain. 


expected that children will display more co- 
operative outcome preferences in the coordina- 
tive than in the other two settings and more 
competitive outcome preferences in the con- 
flict than in the other two settings. Finally, 
in the individualistic setting, children are ex- 
pected to show a stronger predisposition to- 
ward own gain maximization than relative or 
joint gain. However, we anticipate that there 
will be a strong indication that relative gain 
maximization functions as an autonomous mo- 
tivational orientation in all age groups, par- 
ticularly in Case II games, where to maximize 
own gain one must afford other a higher out- 
come than self. 


THE MEASUREMENT oF SoctAL MOTIVES 


In the present investigation, two-choice de- 
composed games are employed to assess own/ 
other outcome preferences. In a two-choice 
decomposed game, each child is given a choice 

of two options, A or B, each of which specifies 
a payoff to self (a; vs. a2) and to other (by 
vs. bə). It is assumed that in a given instance 
the child will select one of the two sets on 
the basis of the three criteria discussed earlier. 
Individualistically oriented children should 
select that set of outcomes where a; > as or 
ay > a;. Children who prefer that set of out- 
comes that maximizes relative gain should 
choose A, if a1 — bı is greater than az — bs, or 
B if the reverse obtains. If a child is intent 
upon cooperating and maximizing joint out- 
comes to self and other, then he or she should 
select that alternative with the greater sum, 
namely, A, if a; + bi > az + bs, and B if the 
reverse obtains. The logic of two-choice de- 
composed games is spelled out in greater de- 
{ail in Messick and McClintock (1968). 
Table 1 illustrates the four classes of de- 
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composed games employed in the present in- 
vestigation. Each game class denotes a unique 
configuration of the choices dictated by the 
three own/other outcome preferences being 
investigated. The code used to summarize 
these patterns is described as follows: The 
letters O, R, and J designate own gain, rela- 
tive gain, and joint gain. If one of these let- 
ters is in parentheses, then the two alterna- 
tives are equal with respect to that criterion 
for all games of this type. In effect, this own/ 
other outcome preference is controlled across 
choices. A period inserted between two letters 
indicates that the criterion or motive on one 
side of the period dictates one choice (either 
A or B), whereas the criterion on the other 
side of the period dictates the choice of the 
other alternative. All four examples of game 
classes in Table 1 are Case I games. 

There are a number of inferences regarding 
preferences that can be made as a function of 
the outcome properties of the four game 
classes employed in the present study. First, 
if, as assumed, children’s choices are strictly 
a function of the three outcome preferences of 
individualism (O), cooperation (J), and com- 
petition (R), then they should consistently 
choose that alternative that is optimal in 
respect to all three, the ORJ choice, in the 
ORJ game class. To the extent that children 
are competitively oriented, they should select 
the ORJ alternative in the ORJ game class, 
the OR alternative in the OR.J class, and 
the R alternative in both the J.R(O) and 
OJ.R classes. If they are individualistic in 
orientation, they should prefer the ORJ choice 
in the ORJ class, the OJ alternative in the 
OJ.R class, and the OR choice in the OR.J 
class. Finally, if cooperatively oriented, chil- 
dren should again prefer the ORJ choice in 
the ORJ class, the J choice in the OR.J class 
and the J.R(O) class, and the OJ choice in 
the OJ.R game class. It is the patterns of 
these preferences that serve as the dependent 
variable in the present investigation. 


COGNITIVE REQUISITES 


Tn the present study children between the 
ages of 5 and 8} are asked to express their 
preferences between two sets of outcomes for 
self and other. In order to express systematic 
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preferences consistent with the three own/ 
other outcome orientations under considera- 
tion, certain numerical abilities are requisite. 
To express an individualistic orientation, the 
child must have the capability to make a 
more-than/same/less-than judgment in regard 
to his own outcomes. Joint gain maximization 
or cooperation requires a child to add each 
set of own and other’s outcomes and decide 
which affords the greatest total amount of 
some commodity to self and other. Relative 
gain maximization requires children to sub- 
tract other’s from own outcomes and then 
make a comparison between two sets of dif- 
ferences to ascertain which provides the great- 
est relative advantage (or smallest disadvan- 
tage). 

Prior research, particularly that conducted 
by Iijima (1965, 1966, 1967), indicates that 
by 4 years of age children are generally able 
to make more-than/less-than judgments accu- 
rately in simple situations and that 6-year- 
olds can perform those operations necessary 
for joint and relative gain maximization. In 
the present study, all children were tested in- 
dividually to ascertain whether they could 
perform on request those forms of numerical 
operations necessary for expressing own, rela- 
tive, and joint gain outcome preferences. The 
number of objects that the child could select 
for self and for other ranged between 1 and 
5. For own gain maximization, the child was 
asked, “Show me the board on which you get 
the most marbles.” (Each one of two boards 
had 1-5 objects for self and 1-5 for other.) 
For joint gain maximization, the child was 
asked, “Show me the board on which we to- 
gether get the most marbles.” For relative 
gain maximization, the child was asked, “Show 
me the board on which you get more marbles 
than I do.” In the latter instance, on only one 
of the two boards did the child receive more 
marbles than other. 

The overall results of this testing indicate 
that children at all the ages tested are capa- 
ble of making those discriminations necessary 
for expressing own, relative, and joint gain 
preferences. Across the initial presentations of 
four own/other outcome combinations, the 
overall error rates were under 15% for each 
of the questions in each age group. By the 


fourth game, the error rate was less than 5% 
for each question for the children 4-5 years 
old, and the overall error rate across ques- 
tions and ages was less than 2%. 


METHOD 
Subjects 


Subjects were 180 boys and 162 girls drawn from 
all kindergarten through second-grade classes of three 
public elementary schools. In addition, third-grade 
children were used for pretesting and training the 
experimenters. Subjects were distributed according to 
age and sex as follows: 5 to 6 years—48 boys, 44 
girls; 6 to 7 years—5O boys, 49 girls; 7 to 8 years— 
56 boys, 51 girls; 8 to 84 years—26 boys, 18 girls. 
Six subjects did not participate in the second session 
due to scheduling problems. Same-sex and same-age 
dyads were formed with the constraint that subjects 
were not best friends according to teachers’ reports. 
The experimenters were carefully trained upper divi- 
sion university women. 


Apparatus and Tasks 


A two-choice decomposed game display was used 
for the three social settings (individualistic, coordi- 
native, and conflict) and for the numerical. ability 
task. This apparatus consisted of a long rectangular 
piece of wood to which two smaller boards were 
affixed. Each of these smaller boards had a yellow 
side and a blue side, and each side of the two boards 
had five holes in which the experimenter could place 
a predesignated number of marbles or beads that 
corresponded to the distribution of own and other’ 
outcomes for the appropriate decomposed game for“ 
that trial, 

In all three social settings, a child was assigned 
the same Wblor-coded side on each of the two smaller 
decomposed game boards. On a given trial, the ex- 
perimenter placed a prearranged number of marbles 
(or beads for girls in the individualistic setting) on 
each of the smaller boards corresponding to the own 
and other outcomes for a particular decomposed 
game. Thus, each of the two smaller boards con- 
tained a pair of own and other's outcomes with the 
chooser’s two possible own outcomes closest to him 
or her. 

In the individualistic setting, each child was pro- 
vided with an opaque jar in which to place the 
marbles (or beads) afforded him by both his own 
and other’s choices and a personalized bag within 
which he could take them home after the session. In 
the conflictual setting, a relative outcome display 
board was present. This display board consisted of 
two vertical adjacent slots, color-coded to be iso- 
morphic with those that the children were assigned 
on the choice boards. Each child placed the marbles 
afforded him by his own and other’s choices in his 
own slot. Thus, both children could know their 
relative outcome position after each game. In the 
coordinative setting, a joint outcome display was 
present. This display board consisted of two boxes 
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of different color: a large green box with 43 holes 
and a small black box with 6 holes in which the 
marbles afforded both subjects could be placed to- 
gether to provide joint outcome visual feedback of a 
cumulative nature after each game. Subjects were 
instructed to attempt to fill together first the large 
and then the smaller box. 


General Procedure 


Children were taken two at a time from their class- 
room to an experimental trailer. In the first session, 
for half of the dyads children were tested in separate 
rooms on the numerical ability task. On this task, the 
child and the experimenter were seated on opposite 
sides of a table with the two-choice decomposed 
game display between them. After completing this 
task, the two children were brought together and 
seated at opposite sides of the table facing each other 
with the game board in between them. The experi- 
menter was at the end of the table. The dyad then 
was tested in the individualistic setting. At this 
time, a second set of dyads was brought to the 
trailer and received testing in the individualistic set- 
ting without prior testing on the numerical abilities 
test. 

The second session was held approximately 2 weeks 
later. The dyads formed during the first session were 
maintained. Children who had not received the nu- 
merical ability task in the prior session were tested. 
Half of the dyads were then randomly assigned to 
the conflict setting and half to the coordinative set- 
ting. In these two settings, the seating arrangement 
~s- Was similar to that of the individualistic setting. 

During performance in each of the three settings a 
“no talking light” was turned on, and subjects were 
instructed not to talk to each other while the light 
was on. 


Individualistic Setting 


In the individualistic setting each child was in- 
structed that he could keep those marbles (or beads) 
that his or other’s choices afforded him, Children 
alternated making choices on blocks of four trials 
with each child making 20 choices in all. Each trial 
block contained one game from each of four game 
classes: ORJ, OJ.R, OR.J, and J.R(O). Within a 
dyad, each child received different instances of par- 
ticular games within a given trial block to rule out 
the possibility of behavioral imitation. Over the ses- 
sion both children received the same 20 games, but 
not in the same order, Of the five games presented 
from each of the OR.J and ORJ game classes, two 
were Case II and three were Case I; of the five 
games from the J.R(O) and OJ.R game classes, 
three were Case II and two were Case I. Each child 
was asked the color of marbles (beads) he (she) 
wished for himself or herself whenever it was his 
(her) turn to choose. The children were instructed 
that “the marbles (beads) you put in your jar you 
can keep and take home with you.” 
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Conflict and Coordinative Settings 


In the second session, half of the dyads performe:! 
in the conflict setting and half in the coordinati 
setting. The actual decomposed games used in th 
two settings were identical with those employed 
the individualistic setting. In these two settings, chi 
dren played five blocks of eight games each, t} 
chooser alternating after each game, The order 
presentation of games differed from the individu 
istic setting but was constant between the con 
and coordinative settings. Thus, dyads in these t 
settings received the same games in the same ordi 
Again, within dyads, each child was confronted 
the same games, but the order was varied. Wo 
child also responded to one decomposed game ‘~ 
each game class and case in each trial block. 

In the conflict setting children were instructed 
follows: 


In this game you will not keep any marbles 
you will win chewing gum. The person who 
the most marbles in his/her slot and gets 

to that orange line at the top will win 2 s 


chewing gum. 


Before each trial block the experimenter place: 
sticks of chewing gum at the top of the relatis 
come display board. At the end of each trial bic 
eight games, the child with the most marbl 
given these two sticks of gum, and the display 
was emptied. If the outcome was tied, an ad 
two games were played to determine the wim 
tie breakers were structured to ensure that onc 
would win, and tie breakers are not included as ps: 
of the data. At the end of the session, all chi 
were given some gum even if they had not won 
of the five games played. 

In the coordinative session, children were instructed 
as follows: 


In this game, you will not keep any of the 
marbles you win, but you will win some chewing 
gum. You will always put your marbles together 
in these boxes. First, you have to fill up the green 
box. If you both put lots of marbles in the green 
box and fill it up, you will each get one stick of 
chewing gum. If you get so many marbles that 
you fill the green box and you also fill the black 
box, you will each win a second piece of chewing 
gum. 


The experimenter then put one stick of chewing gum 
on each side of the green box and one on each side of 
the black box. At the end of each trial block of 
eight games, the chewing gum was distributed accord- 
ing to the joint outcome, and the display board was 
emptied. 


RESULTS 
Outcome Preferences in the Individualistic 
Setting 


All children, as members of dyads, partici- 
pated in the individualistic setting. The de- 
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pendent variables in all settings are the pref- 
erences that children expressed for various 
combinations of own and other’s outcomes. 
These preferences are assessed in four classes 
and two cases of decomposed games in which 
choices reflecting cooperative, competitive, or 
individualistic motivational orientations are 
combined, contrasted, or controlled. The first 
game class is a single dominance game, ORJ; 
the second controls own gain as a choice cri- 
terion, J-R(O); the third is the decomposed 
form of the classic Prisoner’s Dilemma game, 
OR.J; the fourth is the decomposed form of 
the Maximizing Difference game, OJ.R. 

Results are reported in terms of the propor- 
tion of choices for which R is a component. 
The meaning of an R choice, of course, varies 
as a function of game class. For example, in 
the ORJ class, an R choice would indicate 
that the child was selecting an alternative that 
maximized all three motivational dispositions. 
An R choice in the J.R(O) game class im- 
plies that the child prefers relative to joint 
gain when own gain is controlled, and so forth. 

Game case is treated as a second within- 
dyad variable. Case I defines those decom- 
posed games in which the non-R choice affords 
the chooser as many or more marbles or beads 
than other, whereas Case II defines those 
games in which this choice affords more mar- 
bles or beads to other. The major between- 
subject variables in the present investigation 
were age (5-6, 6-7, 7-8, and 8-8} years old) 
and sex (boy and girl dyads). Hence, the 
overall factorial design for each setting was a 
4X2X4X2 (Age X Sex x Game Class X 
Case), with game class and case treated as re- 
peated measures. The basic unit for all analy- 
ses is the dyad, 

The individalistic setting provided children 
with no specific orientation concerning the 
goals of the game, except they were informed 
that they would keep those marbles (beads) 
that were afforded them by their own and 
other’s choices. Given this setting, four hy- 
potheses were made. (a) O and R predisposi- 
tions would be dominant, and hence the game 
classes would be ordered as follows in terms 
of an increasing proportion of R component 
choices: OJ.R, J.R(O), OR.J, and ORJ. (b) 
The proportion of R component choices would 
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be greater for all game classes in Case II than 
in Case I games, (c) Boys would make a 
higher proportion of R choices than girls. (d) 
R choices would increase as a function of age. 

The level of R-choice responding by game 
class and case is presented in Figure 1. The 
data clearly support Hypotheses a and b 
above. There is a significant game class ef- 
fect, F(3,489) = 314.13, p < .001, and the 
order was as predicted. There is also a signifi- 
cant case effect, F(1, 163) = 107.61, p< 
.001, in the predicted direction, namely, more 
competitive choices in Case II where the non- 
R choice affords other more marbles or beads 
than self. In the ORJ game class, the ORJ 
alternative received 90% of the choices sup- 
porting the expectation that O, J, or R is the 
major base for choice preference. For the 
OR.J game class, 88% of the choices were for 
the OR alternative, confirming the expecta- 
tion that O and R are the principal motiva- 
tional orientations in the present setting. For 
the J.R(O) game class, 68% of the choices 
were R choices indicating that when O is not 
available as a choice criterion, R tends to 
dominate over J. Finally, for the OJ.R game, 
45% of the choices were for R, indicating 
that when O is in opposition to R, each is pre- 
ferred about equally as a choice criterion. This 
obtains, even when in the present setting an 
R choice implies having fewer marbles 
(beads) at the end of the game than if one 
consistently makes O choices. 

The effect of the case manipulation was to 
increase the preference for R outcomes and 
hence decrease the absolute number of mar- 
bles (beads) one would obtain for self. The 
case manipulation had its strongest effect in 
the OJ.R game. In Case I 27% R choices 
were made, whereas 57% were made in Case 
II. This finding indicates that children are 
particularly likely to forego maximizing mar- 
bles (beads) for self when the own maximiza- 
tion choice affords other more marbles (beads) 
than self, This strong shift in preference as a 
function of case clearly illustrates how the 
consideration of other’s outcomes affects a 
child’s definition of his own outcomes. The 
strength of the case manipulation in the pre- 
vious game class is reflected in an overall 
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Ficure 1. Proportion of R choices in individualistic 
setting by game class and case (n = 171 dyads). 


Case X Game Class interaction, F (3,489) = 
32.32, p < 001. 

The between-subjects analyses consider 
whether age and sex affect choice behavior 
across case and game class. The data indicate 
that, contrary to expectation, there were no 


systematic differences in the proportion of R 


choices as a function of age. There does exist 
a marginally significant sex difference, F(1, 
163) = 3.62, p = .06, with male dyads mak- 
ing more R choices than female dyads. There 
is also a significant interaction between class, 
age, and sex, F(9,489) = 2.27, p= .02. 
Given this interaction, a Sex X Case X Age 
analysis was made within each game class. 
The results of these analyses again yielded 
weak effects for sex and a marginal age effect 
in one game class, Males made more R choices 
in the ORJ game, F (1, 163) = 3.41, p = .07; 
in the J.R(O) game, F(1, 163) = 2.85, p= 
.09; and in the OJ.R game, F(1, 163) = 3.45, 
p = .07. Age had a marginal effect in the 
OJ.R game, F(3, 163) = 2.29, p=.08. An 
inspection of the data indicates that boys in 
the youngest age group made fewer R choices 
than those in the older groups. This did not 
obtain for girls, which is reflected in an Age 
X Sex interaction for this game class, F(3, 
163) = 2.41, p = .07. 

Thus, in the individualistic setting, the pre- 


dicted class and case differences were ob- 
served. However, the expected difference in 
competitive responding between boys and girls 
was only marginal, and there were no signifi- 
cant developmental changes in the motiva- 
tional orientations of children as reflected in 
their choice preferences as a function of age 

After all children had completed making 
choices in the individualistic setting, approxi- 
mately equal numbers of male and 
dyads were assigned either to the cc 
or coordinative settings. The games wert 
peated, but the rules governing the outc 
of the settings were modified, 


Outcome Preferences in the Conflictual and 
Coordinative Settings 

In the conflictual setting, children 
afforded sticks of gum if, after a ser 
games, they obtained more marbles 
they did not keep) than the other dyad 
ber. Given this definition of outcome 
setting becomes a zero-sum game in whi 
choices are the more rational means for a 
ing the prize of gum. It was expected t) 
proportion of R choices would increase 
setting as compared to the individuali 
Furthermore, as in the individualistic 
it was predicted that the frequency of | 
ponent choices by game class would diff: 

R choices would increase in the fol 
ordering: OJ.R, J.R(O), OR.J, ORJ. 
also predicted that more R choices wo 
made in Case II than Case I games, an 
difference would be greatest for the OR.| 
games in which an R choice implies foregoing 
own gain maximization (O). 

Figure 2 indicates that the predicted 
ing by class obtained, F(3, 213) = 50.78, p 
-001, as well as the predicted case effect, F (1, 
71) = 44.40, p< .001. The expected large 
effect for case in the OJ.R game also obtained 
and is reflected in a significant Case X Class 
interaction, F(3,213) = 18.57, p < .001. It is 
of interest to note that all points in Figure 2 
exceed 75% R choices except for Case I OJ.R 
games. In this instance, children are somewhat 
unwilling to forego own gain for relative gain 
even though the latter is the more rational 
means for winning the gum. 

The between-subjects analyses indicate that 
boys make somewhat more frequent R choices 
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than girls, F(1, 71) = 4.15, p = .05. No sys- 
tematic differences in preference were ob- 
served as a function of age. Analyses per- 
formed within game classes indicate that boys 
more frequently chose the alternative with an 
R component in the ORJ game, F(1,71) = 
2.99, p = .09, and in the OJ.R game, F(1, 
71)°= 3.87, p = .05. Thus, in the conflictual 
setting, the expected class and case effect are 
again observed. Furthermore, as in the indi- 
vidualistic setting, boys tend to make some- 
what more R choices than girls. And again, 
the predicted differences as a function of age 
are not observed. 

In the coordinative setting, the most effec- 
tive manner in which to achieve the prize of 
gum is for each player to make J choices and 
thereby maximize the total number of marbles 
afforded both self and other. Again, it was ex- 
pected that games would be ordered in terms 
of the proportion of R choices as in the two 
other settings. Figure 3 indicates that this 
ordering did indeed obtain. The high propor- 
tion of OR choices (60%) in the OR.J game, 
though J is the more rational choice, reflects 
the difficulty children have in foregoing both 
own and relative gain even when joint gain 
choices are more rational. Still the proportion 
of OR choices in this game class is substan- 
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Ficure 2. Proportion of R choices in conflict setting 
by game class and case (n=79 dyads). 
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Ficure 3. Proportion of R choices in coordinative 
setting by game class and case (n = 86 dyads). 


tially less in the coordinative than in the con- 
flictual or individualistic settings. 

Case was again expected to have a signifi- 
cant, but smaller, impact in terms of produc- 
ing more R choices in Case II than Case I 
games. Though the overall case effect is sig- 
nificant, F(1, 78) = 8.43, p < .005, the case 
manipulation accounts for considerably less of 
the variance in this instance (.3%) than ob- 
tains for the individualistic (3.1%) and con- 
flict (4.4%) settings. In the coordinative set- 
ting, there was again a significant interaction 
between case and class, F (3, 234) = 21,02, p 
<.001, but unlike the other two Settings, 
there was a case reversal. There were more R 
choices in Case I of the OR.J game than in 
Case II. An examination of the outcomes for 
the OR.J games indicates that in Case II 
games, selecting the J (non-R) choice pro- 
duces a higher total joint outcome than select- 
ing the J choice in Case I. This fact may in- 
crease the attractiveness of non-R choices in 
this particular setting. 

In the between-subjects analyses, the effects 
of age and sex were again examined. Across 
game class and case, there is no strong effect 
for sex. Boys tend to make slightly fewer 
choices than girls with an R component, F(1, 
78) = 2.45, p = .12. However, for the first 
time there is a sizable variation in choice be- 


havior as a function of age. Older children 
tend to be more J oriented than younger chil- 
dren, F(3, 78) = 4.89, p < .004. Analyses per- 
formed within each game class indicate in- 
-creases in choices with a J component as a 
function of age in each game class: ORJ, 
F(3,78) = 3.13, p= .03; ORJ, F(3,71) = 
4.17, p= .009; J.R(O), F(3,71) =7.71, p 
< .001; and OJ.R, F(3, 78) = 5.74, p = .002. 


Outcome Preferences Between Settings 


So far we have considered choice behavior 
as it reflects outcome preferences within each 
of three settings. In the present section, we 
compare preferences in the individualistic ver- 
sus conflictual and individualistic versus co- 
ordinative settings. It should be noted that 
this is again a within-subject analysis and 
that participation in the individualistic setting 
always precedes participation in the other 
two settings, For the latter reason, the follow- 
ing results are subject to the limitations of 
order effects. A fixed order was maintained in 
the present investigation because the set estab- 
lished in the conflictual or coordinative set- 
tings would be difficult to eliminate should the 
more neutral individualistic setting follow. 

Analysis of variance procedures were em- 
ployed to make the above setting compari- 
sons. For these comparisons, the following 
design is employed: 2 X 4 X 2 X 4 X 2 (Set- 
ting X Game Class X Case X Age X Sex), 
with setting, game class, and case treated as 
repeated measures. The dependent measure is 
again the proportion of choices with an R 
component, and the dyad is the unit of analy- 
sis. 

Strong main effects for setting were ob- 
tained for both comparisons with significantly 
more R choices in the conflictual setting 
(84%) than the individualistic setting 
(73%), F(1, 71) = 51.45, p < .001, and sig- 
nificantly fewer R choices in the coordinative 
(50%) than the individualistic setting, F(1, 
78) = 103.14, p < .001. If one omits the ORJ 

game in which J and R predispositions imply 
the same choice, the proportion of choices 
with an R component in the coordinative set- 
ting is even lower (32%). 

Also observed in the above analyses are 
several significant interactions that are, by 
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and large, consistent with the previously re- 
ported findings: R choices change as a func- 
tion of age in the coordinative but not in the 
individualistic setting, F(3,78) = 6.93, p < 
001; boys tend to be more competitive in 
the individualistic and less competitive in the 
coordinative setting than girls, F(1,78) = 
5.48, p = .02; the case manipulation has a 
differing effect in the coordinative setting 
than in the individualistic setting for the vari- 
ous game classes, F(3,234) = 13.57, p< 
-001; and a significant interaction is observed 
between the individualistic versus coordinative 
settings, game class, and age, F(9, 234) = 
4.98, p < .001. The nature of this interaction 
is evident in Figure 4. For three of the game 
classes, OJ.R, J.R(O), and OR.J, the pattern 
of choices as a function of age is quite dif- 
ferent in the coordinative than in either the 
individualistic or conflictual settings. For the 
ORJ, this difference does not obtain, though 
it should be noted again that for the ORJ 
game, all three predispositions lead to the 
same choice. Hence, regardless of setting, sex, 
or age, the proportion of R component choices 
in this game class should and indeed does re- 
main high. 

Figure 4 also provides a basis for illustrat- 
ing the strong congruence, particularly in the 
older age groups, between the type of setting, 
the outcome structure of the game class, and 
children’s choice behaviors. Thus, 8- to 84- 
year-old children in the conflictual setting 
make predominantly R choices in the OJ.R 
game class, those in the coordinative setting 
make primarily OJ choices, and those in the 
individualistic setting distribute their choices 
about equally between the two. Confronted 
with the J.R(O) game class, the older chil- 
dren in the conflictual setting again make pre- 
dominantly R choices, as do those in the indi- 
vidualistic setting, whereas those in the co- 
ordinative setting make principally J choices. 
For the OR.J game class, children in the con- 
flictual and individualistic setting make prin- 
cipally OR choices, whereas those in the co- 
ordinative setting make about the same num- 
ber of OR and J choices. Even older children 
in the coordinative setting, however, make 
44% OR choices, indicating again the diffi- 
culty children have in giving up both own 
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Ficure 4. Proportion of R choices by game class, setting, and age across case and sex. 


and relative outcomes even when selecting 
joint outcomes is a more rational strategy for 
obtaining a valued end. 


Discussion 


The principal finding of the present investi- 
gation is that children as young as 5 years of 
age systematically take into consideration out- 
comes to self and other in making choices. 
Namely, children from 5 to 84 years old vary 
their preferences in decomposed games as a 
function (a) of the own/other outcome struc- 
ture of the classes of game, (b) of whether or 
not they are afforded more or less points than 


other as defined by the case manipulation, and 
(c) of whether the setting is structured in in- 
dividualistic, conflictual, or coordinative terms. 
Because the variations of choice between 
classes of decomposed games are systematic, 
there is reasonably strong evidence that de- 
composed games themselves provide a viable 
means for assessing the own/other outcome 
preferences of young children in various set- 
tings of outcome interdependence. These ob- 
servations are consistent with prior empirical 
findings that obtain for adults. 

As regards the three general expectations 
proposed at the beginning of this article, the 
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data provide little information regarding the 
first, which asserts that children initially 
maximize own outcomes regardless of the 
impact of their choices on other’s outcomes. 
Children as young as 5-54 years old in the 
present study were as willing to forego own 
for relative gain maximization as those 8-84 
years old in the individualistic setting. Hence, 
to explore Expectation 1, it appears necessary 
to examine the preference behavior of children 
younger than 5 years of age. 

Expectation 2 asserts that children learn to 
employ relative and joint gain maximization 
as strategies toward achieving valued out- 
comes. The data partially support this asser- 
tion. For both the coordinative and conflictual 
settings, we observe clear shifts in preferences 
for outcomes that are consistent with increas- 
ing the likelihood of achieving a valued out- 
come. In the conflictual setting, there is no 
evidence for learning as measured by variabil- 
ity in competitive choices as a function of age. 
It appears that by the age of 5, children have 
a clear understanding of the appropriate 
choice strategy in a competitive task. Again, 
it appears to demonstrate the acquisition of a 
competitive choice strategy, and children 
younger than 5 will have to be observed. As 
regards a choice strategy for a coordinative 
game, there is strong evidence that children 
are learning to maximize joint gain within 
the ages sampled, namely, there is an increase 
in joint gain choices as a function of age. 

As noted above, there is also some evidence 
for Expectation 3. All age groups, particu- 
larly in Case II games, appear willing to fore- 
go own gain for relative gain maximization in 
the individualistic setting. This is most read- 
ily apparent in Case II games of the OJ.R 
game class where, in the individualistic set- 
ting, children made more choices that maxi- 
mized relative than own and joint gain. Thus, 
it would appear that children are willing to 
forego own gain (marbles or beads) in cer- 
tain instances to obtain more of a commodity 
than other. 

Finally, the data indicate that in the indi- 
vidualistic and conflictual settings, boys tend 
to be more competitive than girls. However, 
the magnitude of this effect, though statisti- 
cally significant, is quite small, and there is 
a sizable overlap in the distribution of com- 


petitive choice behavior for boys and girls. 
Hence, one should be extremely cautious in 
asserting that boys are more competitive than 
girls in those tasks that form the bases of 
the present investigation. 
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The present experiment was designed to study the physiological basis of the 
proposition that psychopaths are indifferent to the feelings of others. Young 
male subjects from a prison population were divided into groups according to 
their Minnesota Multiphasic Personality Inventory (MMPI) Pd score and then 
subdivided on the basis of MMPI Welsh Anxiety Scale (WAS) scores. Subjects 
in each group observed a model exhibiting either mild or severe distress to elec- 
tric shock. Heart rate and skin conductance were measured over 15 trials in each 
subject. We hypothesized that the different clinical groups would exhibit differ- 
ent physiological emotional response patterns, The results showed that low-Pd 
subjects were more autonomically responsive to an emotional stimulus 
(modeled distress) than high-Pd subjects from the same prison population. 
Moreover, high-WAS subjects were more responsive than low-WAS subjects. 
The level of distress exhibited by the model had no effect. 5 


One defining characteristic of psychopaths 
is a deficiency of affect (Hare, 1970; Maher, 
1966). Although this characteristic has been 
investigated in numerous clinical studies, con- 
siderable controversy surrounds the possible 
physiological concomitants of this emotional 
defect. For instance, Schachter and Latané 
(1964) have suggested that psychopaths may 
actually be more sensitive to emotional stim- 
uli than normals, but because of habituation 
to their higher autonomic arousal state, their 
threshold for emitting an emotional response 
is greater than normals’. In a recent study, 
Sutker (1970) compared the galvanic skin 
response of noninstitutionalized psychopaths 
with nonpsychopaths in a vicarious condi- 
tioning paradigm. A tape-recorded presenta- 
tion of the numbers 1 to 7 in ascending order 
served as the anticipatory stimulus. At Num- 
ber 4 a trained model reacted as if shocked. 
This study found that psychopaths responded 
less in anticipation of the stimulus (i.e., at 
Number 3) than nonpsychopaths but, para- 
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doxically, they exhibited significantly greater 
responses to the actual occurrence of mod- 
eled distress than did nonpsychopaths. 

The present experiment was designed to 
study further the physiological basis of the 
proposition that psychopaths are indifferent 
to the feelings of others. This experiment dif- 
fered from Sutker’s in that the effect of 
neuroticism as well as psychopathy was in- 
vestigated in the present study and no antici- 
patory stimulus was used. A psychopathic 
group, which is formed on the basis of Min- 
nesota Multiphasic Personality Inventory 
(MMPI) Pd scale scores, could be partly 
composed of what Hare (1970) identified as 
neurotics acting out within the antisocial 
aggressive population. If this is true, it would 
seem advisable to study the neuroticism di- 
mension also in an experiment dealing with 
the responsivity of psychopaths. In the pres- 
ent experiment all subjects were institution- 
alized, and psychopathy was operationally 
defined on the basis of MMPI-Pd scale scores. 
Neuroticism was similarly defined on the basis 
of MMPI-Welsh Anxiety Scale (WAS) scores. 


METHOD 
Subjects 


The subjects were male prisoners recruited in the 
summer of 1972 from an educational program at 
Manning Correctional Institute, Columbia, South 
Carolina. Subjects were told that they would be 
excused from work during their participation in 
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TABLE 1 


MEANS AND STANDARD DEVIATIONS or MMPI Pd 
AND WAS Scores ror EACH or THE Groups IN 
i THIS EXPERIMENT 


MMPI-Pd MMPI-WAS 

Group M sD M SD 
High-anxiety/high-Pd 35:68 2.45 25.25 3.17 
High-anxiety/low-Pd 25.93 2.77 24.12 2.55 
Low-anxiety/high-Pd 34.37 2.52 11.43 2.13 
Low-anxiety/low-Pd 24.75 2.19 10.00 3.41 


Note. MMPI = Minnesota Multiphasic Personality In- 
ventory; WAS = Welsh Anxiety Scale, 


the experiment and that, at its conclusion, the results 
of their MMPI scores would be made available to 
them if they desired. Subjects who participated in 
the experiment were paid $1 each (although promise 
of payment was not used as an incentive in recruit- 
ing subjects). From an initial pool of 121, four 
groups of 16 subjects were formed on the basis of 
Pd and WAS scale scores. The procedure used in 
forming these groups was as follows: First, subjects 
were divided into four groups on the basis of whether 
they were above or below the mean on each scale. 
Then the 16 subjects who represented the most 
extreme values on both scales were selected from each 
of these four groups. Table 1 gives the mean and 
standard deviation of both scales (MMPI Pd and 
WAS) for each group. Table 2 summarizes £ tests 
on differences between each pair of mean Pd scale 
scores (upper half of table) and mean anxiety scale 
scores (lower half), Inspection of this table reveals 
that the differences between means of the high and 
low groups were significant with respect to the 
dimension that was different (Pd or anxiety) but 
Were not significant with respect to the dimension 
that was constant. Finally, each of these four groups 
was subdivided into groups of eight randomly as- 
signed subjects per group to observe either high or 
low levels of modeled distress. 


Apparatus 


The physiological responses were recorded on a 
Grass Model-5 polygraph. Skin resistance (SR) was 


measured in ohms using a calibrated (Grass) 5P1 
preamplifier; a Grass 5P4 preamplifier was used to 
record electrocardiograms. The paper speed was 10 
mm/sec. The model was a paid graduate student 
associate of the experimenter and was unknown to 
any of the subjects prior to the experiment, An iso- 
lated 60-Hz ac shock source with maximum output 
limited to .4 mA served as a signal stimulus to the 
model. Shock electrodes were 18 mm in diameter and 
were placed approximately 15 cm apart on the 
model’s forearm. A white noise blower provided 
ventilation and sound attenuation. 


Procedure 


Subjects were escorted one at a time in random 
order to the prison library, As soon as the subject 
was seated, the experimenter addressed both subject 
and model as follows: “You are both subjects in a 
study about the effects of electric shock. This study 
is interested in two things—what will happen to a 
person when he is shocked? and what will happen to 
another person who sees that person getting 
shocked?” The experimenter then explained to the 
subject that he would not be shocked and that the 
wires to be attached to him were for recording 
purposes only. However, he was invited to test the 


shock that would be delivered to the model. All 


subjects accepted the invitation to test the shock 
and appeared to be convinced of the validity of the 
shock procedure. For this test, a stimulating current 
of approximately .3 mA for 4 sec was passed through 
the electrodes. After the recording leads were at- 
tached to the subject and the stimulating leads con- 
nected to the model, the experimenter asked the 
subject and model to sit quietly for a few minutes. 
During this time the experimenter continuously 
monitored the subject's heart rate (HR) and SR 
record. The experimenter and recording equipment 
were located in the same room with the subject and 
the model. However, a solid wooden partition (room 
divider) obscured the experimenter and the record- 
ing equipment from the subject’s view. When the 
SR baseline had stabilized for at least 1 minute the 
experimenter administered the first trial. A trial 
consisted of delivering a signal shock to the model. 
Two levels of shock were used, corresponding to the 
two levels of distress to be modeled. Distress level 


TABLE 2 
} VALUES ON DIFFERENCES BETWEEN ‘TREATMENT GROUPS MEANS 


MMPI-Pd 
High-anxiety/ High-anxiety/ Low-anxiety/  Low-anxiety/ 
MMPI-WAS high-Pd low-Pd high-Pd low-Pd 
High-anxiety/high-Pd 10.21** 1.44* 12.89** 
High-anxiety/low-Pd 1,07* 8.72** 1.29* 
Low-anxiety /high-Pd 14.02 14.798 7 11.17 
fs Low-anxiety/low-Pd 12.71** 12.84** 1.378* 


Note, MMPI = Minnesota Multiphasic Personality Inventory: WAS = Welsh Anxiety Scale. Upper right half of matrix cone 
tains ¢ values for MMPI Pd scale scores; lower left half, for MMPI anxiety scores, 
* 


ms. 
+p <0, 
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was the only treatment condition known to the 
model. The model was instructed to make his re- 
sponses as homogeneous as possible within each dis- 
tress level. The same model was used with all sub- 
jects; however, no data regarding the homogeneity 
of the model’s responses were obtained. After each 
trial, the SR baseline was permitted to stabilize again 
before presenting the next trial. Fifteen trials were 
presented to each subject. Although the experimenter 
knew which distress level each subject was observ- 
ing, he did not know either the Pd or anxiety group 
to which the subject belonged. 


RESULTS 
Data Reduction 


Heart rate was recorded for 10 beats pre- 
ceding the stimulus and 30 beats following 
the stimulus. The duration of blocks of five 
beats was recorded in millimeters of record- 
ing paper at a paper speed of 10 mm/sec. 
Only the first two poststimulus blocks (i.e., 
the first 10 poststimulus heart beats) were 
analyzed. Both the prestimulus and poststim- 
ulus HR duration measures were approxi- 
mately normally distributed. 

SR baseline values were obtained from 
the settings of the bridge potentiometers, and 
SR change following stimulus presentation 
was computed at the maximum pen deflec- 
tion point. Inspection of the frequency distri- 
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level as a function of experimental treatment and 
blocks of trials (SRL = log transformed baseline 
skin resistance). 
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butions together with a comparison of group 
means and standard deviations of the SR 
baseline and SR response data revealed a sig- 
nificant departure from normality. Therefore, 
a logarithmic transformation was performed 
on the SR baseline data. The same transfor- 
mation was used on the SR “stress level” 
data; the latter was obtained by adding the 
baseline (prestimulus) level to the response 
(algebraic change from baseline level). These 
logarithmic transformations resulted in ap- 
proximately normal distributions (skewness 
= .177 and .138, respectively). 

Correlation coefficients were computed be- 
tween the log transformed SR baseline and 
stress level scores and between prestimulus 
and poststimulus HR durations. These corre- 
lation coefficients were used to compute 
autonomic lability scores for SR and HR 
(after Lacey, 1956). The autonomic lability 
score (ALS) measures were uncorrelated with 
prestimulus SR and HR, 


Response Measures 


Four dependent measures resulted from the 
data reduction procedures described above. 
These measures and their abbreviations are as 
follows: (a) prestimulus (tonic) heart rate 
level (HRL); (b) ALSs derived from heart 
rate data (HR-ALS); (c) log transformed 
baseline skin resistance (SRL); (d) ALSs 
derived from log transformed skin resistance 
data (SR-ALS). Conceptually, SRL and HRL 
might be thought of as reflecting arousal 
produced by the experimental situation, 
whereas SR-ALS and HR-ALS may reflect 
arousal associated with presentation of the 
stimulus (modeled distress). Extending this 
interpretation further, the ALS measures may 
be regarded as emotional response indicators, 
perhaps reflecting the degree of identification 
of the experimental subjects with the model 
exhibiting distress, 


Analysis 

Separate 2 X 2 X 2 X 15 repeated-measures 
analyses of variance were performed on the 
HRL, HR-ALS, SRL, and SR-ALS data, In 
the SR data several significant effects were 
obtained. The effect of trials was not signifi- 
cant in SRL but reached significance in the 
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SR-ALS data, F(14, 784) = 1.712 Figure 1 
shows SRL as a function of blocks of trials 
for each of the eight experimental groups. 
Low-Pd subjects had generally higher SRLs 
than high-Pd subjects, F(1, 56) = 8.10, For 
low-Pd subjects SRL varied directly with 
modeled distress level, whereas the opposite 
effect was found in the high-Pd group, F(1, 
56) = 6.89. Figure 1 also shows that SRLs 
were relatively constant across trials in six of 
eight experimental groups, including all of 
the high-Pd groups. However, two of the low- 
Pd groups exhibited variable SRLs. The high- 
anxiety group, which observed mild distress, 
showed SRL decreases over trials, while SRLs 
of the low-anxiety group, which observed the 
higher distress level, increased over trials. 
These two trends produced significant inter- 
actions of Pd X Distress X Trials, F(14, 784) 
= 2.82, Anxiety X Trials, F(14, 784) = 2.15, 
and Distress x Trials, F(14, 784) = 2.42. 

Figure 2 shows the main effects of Pd and 
anxiety in the SR-ALS data. The effect of Pd 
is seen by comparing the left half of the 
figures to the right half, and the effect of 
anxiety is shown by comparing the solid bars 
to the hatched bars. Inspection of this figure 
reveals that the low-Pd group was more re- 
sponsive than the high-Pd group, F(1,56) = 
9.79, and that high-anxiety subjects responded 
more than low-anxiety subjects, F156) = 
5.79, The level of observed distress produced 
no significant differences except in the high- 
anxiety, low-Pd group. This group responded 
significantly more to the low distress level 
than to the high level, F(1, 56) = 4.54. 

HRL increased slightly as a function of 
trials, F(14, 781) = 2.11.2 However, HRL did 
not vary as a function of Pd, anxiety, distress 
level observed, or the interactions of these 
variables. The analyses of HR-ALS produced 
no significant effects. Moreover, a ¢ test com- 
paring mean HRL with mean poststimulus 
HR over all test trials and groups was not 
significant, 


Discussion 


The clinical literature contains at least two 
viewpoints concerning the affective responsiv- 
ity of psychopaths. One is that psychopaths 
chronically oyerrespond to autonomically 


50}- - - 


AUTONOMIC LABILITY SCORE (SR) 


LOW Pd HIGH Pd 


Ficure 2. Mean skin resistance (SR) autonomic 
lability scores of high- and low-anxiety/high-Pd 
subjects and high- and low-anxiety/low-Pd subjects, 


arousing stimuli (Schacter & Latané, 1964), 
This chronic overarousal may produce more 
rapid adaptation to emotional stimuli and, in 
addition, may raise the threshold at which 
emotional behavior is emitted. Studies sup- 
porting this view find that psychopaths have 
a greater unconditioned response to an emo- 
tional stimulus but do not condition well 
(e.g., Sutker, 1970). 

A second hypothesis suggests that psycho- 
paths may be chronically underreactive so 
that their autonomic responsivity threshold is 
higher than nonpsychopaths. If this hypothe- 
sis is true, then psychopaths should exhibit 
a smaller unconditioned response than non- 
psychopaths. The results of the present ex- 
periment are generally consistent with this 
second hypothesis. Psychopaths in this experi- 
ment were significantly less responsive than 
nonpsychopaths, and no differential adapta- 
tion rate was found between psychopaths and 
nonpsychopaths. 

Although the present study was not con- 
ceived as a conditioning experiment, previous 
studies have employed conditioning para- 


1A rejection region of .05 was adopted for all 
Statistical comparisons. 
df reflect missing data, 
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digms, This suggests a possible resolution of 
these conflicting findings (viz., in studies in- 
= volving conditioning a warning stimulus was 
used), and it is conceivable that the presence 
of an anticipatory stimulus increased the 

magnitude of the conditioned response in 

psychopaths. In our experiment no warning 

stimulus was used so that the autonomic 
response must be regarded as a response to 
the modeled distress alone. 

The possibility that neuroticism may influ- 
ence the autonomic responsivity of psycho- 
paths and nonpsychopaths was also tested in 
this study. It was found that the subjects 
with high WAS scores were more autonomi- 
cally responsive to the modeled distress stim- 
ulus than subjects low on this measure, re- 
gardless of psychopathy level. 

A number of investigators (e.g., Schacter & 
Latané, 1964; Valins, 1963) have reported a 
paradoxical increase in arousal as assessed by 

SHRL of psychopaths during mildly stressful 
stimulation. This finding, which is considered 
further evidence favoring the hypothesis of 
chronic overarousal in psychopaths, was not 
supported by the results of the present ex- 
periment. 

The possibility that HR may be used as a 
measure of motivation or emotion has been 
widely studied. At present, agreement regard- 
ing the possible psychological significance of 
this variable is lacking (Elliot, 1974). How- 
ever, since the results of several studies (Craig 
& Lowery, 1969; Deane, 1961) suggest that 
HR changes may reflect arousal, it was hoped 
that HR would provide a sensitive index of 
responsivity to the affective stimulus in the 
present study. But no systematic differences 
were found between the various treatment 
groups in HR-ALS, It is interesting to note 
that most studies in which HR was recorded 
used either direct stimulation of the subject 
(e.g., Lacey, 1967) or required a motor re- 
sponse from the subject (e.g., Higgins, 1971). 
Studies that employed vicarious stimulation 
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of the subject usually did not study HR. The 
lack of a systematic HR response in the 
present experiment may be due to the fact 
that arousal was vicariously induced. How- 
ever, it is possible that a more refined (e.g., 
beat by beat) analysis of HR might produce 
a better emotional response measure. SR 
appears to be more sensitive to subtle manip- 
ulations (e.g., vicarious stimulation) than 
average HR as assessed in the present study. 
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Impact of Fear of Success and Sex-Role Attitudes on Women’s 


Competitive Achievement 
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A study of 91 college-aged dating couples compared the impact of sex-role at- 
titudes and Horner’s measure of fear of success on women’s achievement. As 
predicted, women with traditional attitudes performed significantly better on a 
verbal task when working as a team with their boyfriend than when working in 
individual competition against the boyfriend; nontraditionals showed an op- 
posite pattern. A similar interaction of fear of success and condition failed to 
reach statistical significance. Some evidence was found that women who com- 
bine traditionalism and high fear of success may be most affected by variations 
in achievement settings. Data did not support Horner’s prediction that boy- 
friends’ attitudes affect women’s fear of success. Further, fear of success was 
not correlated with women’s own sex-role attitudes, career aspirations, college 
grades, SAT scores, or self-ratings of “intelligence.” In contrast, sex-role tradi- 
tionalism was associated with significantly lower career aspirations, lower SAT 
verbal scores, and lower self-ratings of “intelligence.” Sex-role attitudes thus 
appear to have greater impact than fear of success on women’s achievement 


both in the laboratory and in daily life. 


Horner’s (1970, 1972) theory of “fear of 
success” provides a new approach to under- 
standing problems of women’s achievement. 
Horner suggests that many women are in con- 
flict about intellectual and professional 
achievement. Success—especially in tradition- 
ally masculine fields—can be a mixed blessing 
for women, Although such success brings the 
attainment of a desired goal, it may also lead 
to negative outcomes, such as social rejection 
or loss of femininity. As a result, bright 
women may actually have a fear of success. 

Despite the popularity of Horner’s theory, 
Tresemer (1974) concludes that it is largely 
unsubstantiated by research. Most studies 
have been limited to testing Horner’s predic- 
tion that fear of success is more prevalent 
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among women than men, Following Horner, 
women’s fear of success has typically been 
assessed by coding stories written to a single 
verbal cue depicting a woman’s outstanding 
success in a masculine field, such as, “After 
first term finals, Anne finds herself at the top _ 
of her medical school class.” Men respond 4 í 
an identical cue with a male protagonist, 
(e.g., “John”). Horner (1970) reported nè 
striking sex difference: 65% of college women ^ 
but only 9% of college men showed fear of 

success in their stories. Later research has, 

however, failed to replicate this finding. In te \ 


A 


studies reviewed by Tresemer (1974), the 
median rate of fear of success was 47% for 
women and 43% for men. In over half the 
studies, fear of success was actually more 
common among men than women. 

More important in testing Horner’s theory 
are studies of the effects of fear of success on 
actual achievement behavior. Horner pre- 
dicted that competitive situations would im- 
pair the intellectual performance of women 
who fear success. In her initial study (1968, 
1970), Horner found that college women 
with fear of success performed better on a 
verbal skill task when working alone than in 
mixed-sex competition. Women who did not 
fear success showed an opposite pattern, After 
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reviewing the few fear of success studies that 
have included behavioral measures, Condry 
and Dyer (1976) conclude that “there is 
little support for the contention that Hor- 
ner’s measure differentiates women who will 
do well or poorly in mixed-sex competitive 
situations.” 

The present study had two main objectives. 
First, given the inconsistent results of studies 
using Horner’s measure, this study assessed 
the usefulness of sex-role attitudes as an al- 
ternative predictor of women’s achievement 
behavior. Second, this study attempted to 
clarify the impact of mixed-sex competition 
on women’s performance by distinguishing 
types of competition that might facilitate as 
well as impair the performance of women who 
fear success. 


Sex-Role Traditionalism 


The question of whether a measure of sex- 
role attitudes might be as good a predictor of 
performance as Horner’s projective measure 
has not been investigated. According to Hor- 
ner (1970, 1972), fear of success is based on 
specific sex-role concerns, such as fear of loss 
of femininity. Earlier studies (e.g., French & 

_ Lesser, 1964; Houts & Entwisle, 1968) have 
wigown that sex-role attitudes can mediate the 
‘onship of females’ achievement motiva- 

nd performance. It seems plausible that 

-role traditional women might perform 
much like women who fear success, and there 
is some evidence that fear of success may be 
more prevalent among college women with 
traditional sex-role orientations (Alper, 1974; 
O’Leary & Hammack, 1975). A demonstra- 
tion that fear of success and sex-role attitudes 
are in some way distinct or that the projec- 
tive measure is uniquely able to predict 
achievement behavior would do much to af- 
firm the validity and usefulness of Horner’s 
measure. The present study compares the 
relationship of fear of success and sex-role 
attitudes to women’s performance in both 
noncompetitive and competitive achievement 
settings. 


Competitive Achievement 


Fear of success is conceived as a latent 
motive that hinders performance only when 
aroused by some aspect of the situation, Hor- 


LETITIA ANNE PEPLAU 


ner singled out competition as most likely to 
arouse fear of success. This study hypothe- 
sized that competition per se is not the crucial 
factor. Rather, the possible violation of sex- 
role standards associated with some forms of 
competition is of prime importance. 

A basic tenet of traditional sex-role ideol- 
ogy is the norm of male intellectual and 
occupational superiority. Mead (1949) ar- 
gues that American males derive their self- 
esteem in part by outperforming females. 
Bernard (1973) emphasizes that women are 
taught to want to “Jook up to” their hus- 
bands, a stance facilitated by the fact that 
traditionally men are not only taller but also 
older, better educated, and have higher earn- 
ings and higher occupational status. College 
women who find themselves in academic com- 
petition with men may worry about jeopar- 
dizing their femininity, especially if they are 
excelling. Concerns about preserving at least 
an illusion of male superiority may affect 
women’s achievement behavior. 

Possible violations of the norm of male 
superiority should be most likely in situa- 
tions of individual competition, where a 
woman competes directly against a man and 
where her level of performance will be made 
public. According to Horner (1970), nega- 
tive outcomes might be further increased if 
the competitor were an “important” man, 
such as a boyfriend. Not all mixed-sex compe- 
tition threatens sex-role standards, however. 
In a situation of team competition, where a 
woman works with a partner to outperform 
another team and where her own individual 
score will not be made public, negative con- 
sequences should be minimized. Team compe- 
tition might be even more acceptable if the 
woman’s teammate were a boyfriend, so that 
her achievement could be viewed as helping 
a loved one, a behavior consistent with the 
traditional feminine role. 

It was predicted that fear of success and 
sex-role traditionalism would affect women’s 
performance in similar ways. In a team-com- 
petition condition, high fear of success and 
traditional attitudes were expected to en- 
hance performance; in an individual-competi- 
tion condition, high fear of success and tradi- 
tionalism were expected to decrease perform- 
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ance. In words, type of competition 
would interact with fear of success and also 
with sex-role attitudes. 

Ways in which these two personality varia- 
bles might jointly affect women’s behavior 
were also examined. Recently, Alper (1974) 
has argued that sex-role traditionals and 
liberals do not represent motivationally 
homogeneous groups. She suggests that an ad- 
ditional distinction between women who are 
in conflict about femininity and those who 
are not in conflict may be useful. For in- 
stance, Alper speculates that in Horner’s 
original study, those low-fear-of-success 
women who performed well when competing 
may have held liberal sex-role attitudes. This 
study provides a direct comparison of the 
behavior of four groups of women who are 
either liberal or traditional in their attitudes 
and who, as reflected in Horner’s measure of 
fear of success, are either experiencing con- 
flict about femininity or not. 

Finally, the present study investigated the 
effects of fear of success and sex-role attitudes 
on achievement behaviors outside the labora- 
tory as measured by college grades, SAT 
scores, and career aspirations. 


METHOD 
Participants 


Participants were members of 91 college-aged dat- 
ing couples who were taking part in a 2-year study 
of dating relationships.1 Students were recruited for 
the larger project from four different colleges in the 
Boston area. The 91 couples had been “going with” 
their current partner for an average of 14 months, 
The modal couple was a sophomore woman dating 
a junior man. The mean age was 20.0 years for 
women and 21.2 for men. Students were paid for 
their participation. 

The grade-point average for both men and women 
was approximately B, Forty-five percent of the 
women had grades of B+ or better. In this respect, 
they were comparable to Horner’s (1968, 1970) 
sample in which 40% of the women were honors 
students. Fewer than 10% of women in’ this study 
had grades of C or lower. About 65% of the women 
and 70% of the men planned on some graduate 
education (master’s degree or higher). SAT scores 
were available for 84 women. Their mean quanti- 
tative score was 595, and their mean verbal score was 
613. In short, the sample consisted of relatively able 
young women for whom professional careers were 
generally a realistic possibility, 
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Procedure 


Experimental sessions. In the summer of 1972, stu- 
dents attended two sessions conducted by a female 
experimenter, At Session 1, measures of fear of suc- 
cess and of performance in a noncompetitive situa- 
tion were obtained. Students came to one of six 
administrations of Session 1 with their dating part- 
ner but were seated separately in the small audi- 
torium. Students first wrote stories to five verbal 
cues.? The single fear of success cue came third. For 
women, the cue read, “Diane has just received word 
that she was one of three students in the state to 
get a perfect score on her LSAT (Law School Ad- 
missions Test.)” For men, the same cue with the 
name “Tom” was used. After completing the stories, 
students worked individually on the timed scrambled 
words task used by Horner (1968). They were told 
that their performance would be taken as an accu- 
rate measure of their ability but that results would 
be completely confidential. Students did not expect 
to find out their own or their partner’s score. In- 
structions and timing for both the verbal cues and 
the scrambled words task were identical to those 
employed by Horner (1968) and were designed to 
create a neutral (nonarousing) testing condition. 

About a month later, couples returned for Session 
2 and were randomly assigned to either an indi- 
vidual-competition or team-competition condition. 
Five separate administrations of each condition were 
run. All students were administered the verbal facil- 
ity task used by Horner. Scores on this task and 
the scrambled words task from Session 1 are highly 
correlated.* Participants in both the team and indi- 
vidual conditions worked individually on the task 
and were seated identically, next to their partner but 
with one seat intervening so they could not read 
their partner’s responses. The only difference be- 
tween conditions was the instructions (both oral 
and written) about how the task would be scored. 
In the individual-competition condition, members of 
the couple were told that their scores would be di- 


1 Participants for this study came from a larger 
sample of 231 couples taking part in a longitudinal 
study of dating relationships conducted by Zick 
Rubin and his colleagues at Harvard University. The 
sample has been described more fully in Peplau 
(1973). 

2 Because of growing publicity about the “Anne” 
medical student cue, Horner has advocated use of 
other cues depicting women succeeding in nontradi- 
tional fields. The “Diane” cue was developed by 
Horner and used at her suggestion (Horner, Note 
1). The four other cues used in this study (e.g., 
“With her wedding day only a month away, Carol 
is full of anticipation”) cannot be scored for fear of 
success using Horner's (1968) scheme. 

3 Horner (1968) reports the overall correlation of 
the scrambled words task and the verbal facility 
task for males and females combined to be r= .67, 
p<.001 (p. 68). 
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rectly compared. Tests would be scored immediately, 
and at the end of the session they would be told 
their own score and the score of their dating part- 
ner. In the team-competition condition, students 
were told they would receive only a single team 
score based on their own and their partner’s joint 
performance. In addition, they were told that they 
and their dating partner had been paired with an- 
other couple team. At the end of the session, they 
would receive their own team score and the score of 
the competing team. 

After completing the verbal facility task, students 
indicated how well they thought they had done, how 
important it was to them to do well, how much they 
liked the task, and how well they would like to have 
done, They also estimated the number of words 
they thought they had made. To determine if 
students were familiar with Horner’s work, addi- 
tional open-ended questions inquired about the pur- 
pose of the Study, why it was important to have 
couples participate in the research, and whether they 
had ever participated in or read about similar 
research.* At the close of Session 2, a debriefing was 
held in which the purpose of the study was discussed. 

Scoring fear of success. The law school cue was 
scored for fear of success imagery according to 
Horner’s (1968) criteria, Fear of success was scored 
as present if any fear of success imagery ie O 
or absent if none was found. (The terms jz and 
low are used interchangeably with present and 
absent.) A male and female rater scored each story 
independently. Coders were in agreement on 86% of 
the women’s stories and on 88% of the men’s stories. 
In cases of disagreement, stories were discussed and 
a joint resolution reached for the final score. 

Questionnaire measures. As part of the larger study 
of dating relationships, both members of each couple 
had independently completed identical versions of a 
40-page questionnaire. Included were several mea- 
sures of sex-role attitudes. A 10-item sex-role tradi- 
tionalism scale asked the extent of agreement or dis- 
agreement with statements such as, “Women could 
run most businesses as well as men could” and “In 
marriage, the husband should take the lead in 
decision-making.” Coefficient alpha for the scale 
based on responses of 91 couples was .83 for women 
and .81 for men. Another question asked students to 
rank their personal preference ‘for four marriage 
options: remaining single or having a marriage in 
which the wife has a full-time career, works part- 
time, or is a full-time homemaker. Additional ques- 
tions assessed support for the goals of the women’s 

movement and students’ personal career aspirations. 


RESULTS 


The majority of women responded posi- 
tively to the cue about Diane's outstanding 
achievement. Fear of success was present 1n 
the stories of only 30% of women, and in 
44% of the stories men told about Tom’s 
achievement. These rates of fear of success 
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differ from those reported by Horner but are 
consistent with results of 22 studies reviewed 
by Tresemer (1974). No significant relation- 
ship was found between fear of success and 
any measure of sex-role attitudes. Women 
high on fear of success had a mean sex-role 
traditionalism score of 2.5 compared with a 
mean of 2.4 for women low on fear of suc- 
cess. High-fear-of-success women were just as 
favorable to the goals of the women’s move- 
ment (mean 4.8) as low-fear-of-success women 
(mean 4.6). Several checks for curvilinearity 
also failed to find a systematic relationship 
between fear of success and sex-role attitudes. 
Additional analyses investigated whether fear 
of success was more frequent among women 
whose attitudes and career goals were dis- 
crepant. Results indicated that fear of success 
was not more prevalent among traditional 
women who wanted a career or among liberal 
women who wanted to be full-time home- 
makers than it was among women with more 
congruent attitudes and goals. (Similar analy- 
ses found no significant relationships between 
men’s fear of success and their sex-role atti- 
tudes or achievement behavior.)° 


Noncompetitive Achievement 


All women performed a verbal-skill task in 
a noncompetitive setting. Mean scores classi- 
fied both on fear of success and sex-role tra- 
ditionalism (split at the median) are pre- 
sented in Table 1. A 2 X 2 unweighted means 


4 Three students, one male and two females, indi- 
cated that they had learned about a Thematic Ap- 
perception Test story-writing procedure in a psy- 
chology class. Only one female student alluded to 
Horner’s work. None of the women indicated in 
their responses that they saw a connection between 
the story writing and the intellectual tasks, The fact 
that students were participating in a longitudinal 
project, had already filled out several questionnaires, 
and took part in several other experiments during 
the summer sessions may have made the purpose of 
this study less obvious. 

5 Since a sizable proportion of men in this sample 
showed fear of success, it was possible to examine 
the impact of fear of success on men’s performance. 
Analyses similar to those reported for women found 
no significant relationship between men’s fear of 
success or their sex-role traditionalism and their 
performance in any condition. A discussion of the 
men’s performance is found in Pleck (in press). 
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analysis of variance indicated a Fear of Suc- 
cess X Traditionalism interaction, F(1,87) = 
5.6, p < .02. In the noncompetitive condition, 
women who combined high fear of success and 
traditional attitudes were the highest scoring 
group, scoring over 10 words better than 
women with nontraditional attitudes or low 
fear of success. A Scheffé test for post hoc 
comparisons between means (Keppel, 1973) 
indicated that of the four groups of women, 
only the high-fear-of-success traditionals dif- 
fered significantly from other groups in mean 


‘performance scores (p < .01). 


Horner (1968, 1970) viewed performance 
in a noncompetitive condition as indicative 
of between-subjects differences in initial abil- 
ity. Consequently, she used change scores 
from noncompetitive to competitive condi- 
tions to test predictions about fear of suc- 
cess. The present results indicate that a 
noncompetitive setting may not provide a 
baseline ability measure and may, in fact, 
enhance the performance of traditional high- 
fear-of-success women. If these women were 
truly higher in ability, we might expect them 
to have higher college grades or SAT scores 
than other women. No evidence was found, 
however, that traditional high-fear-of-success 
women score higher on any of these measures 
of performance outside the laboratory. Only 
one significant relationship emerged: Sex-role 
traditionalism was negatively correlated with 
SAT verbal scores (r = —.22, p < .05), lend- 
ing further support to the possibility that 
noncompetitive settings facilitate performance 
for traditional women. Horner’s baseline’ in- 
terpretation of noncompetitive performance 
seems dubious, and hence her use of change 
scores is questionable. It seems more reason- 
able to view noncompetitive and competitive 
settings as distinct achievement ecologies and 
to analyze performance in the two settings 
separately. 


Competitive Achievement 


At Session 2, women performed in either 
a team-competition or individual-competition 
condition. Table 2 presents mean scores for 
women in these two conditions classified by 
fear of success and sex-role traditionalism. A 
2 X 2 X 2 unweighted means analysis of vari- 


TABLE 1 


MEAN NUMBER OF WORDS PRODUCED ON SCRAMBLED 
Worps TASK IN THE NONCOMPETITIVE 
CONDITION (SESSION 1) 


Fear of Success 


Traditionalism* Low High 
Liberal 46,2 (35) 40.1 (12) 
‘Traditional 43.9 (28) 56.3 (16) 


Note. Number of women in each cell is given in parentheses, 
a Scores on the 10-item sex-role traditionalism scale were 
split at the median, 


ance indicated no main effects for condition, 
fear of success, or traditionalism. The predic- 
tion of a Fear of Success X Condition inter- 
action was not supported. Although there was 
a tendency for high-fear-of-success women 
to perform better than low-fear-of-success 
women in the team condition (M = 41.9 vs. 
40.6) and to perform worse in the individual 
condition (M = 36.3 vs. 38.8), the inter- 
action was not significant, (1,83) = 1.8, 
p = .18. The only significant finding was the 
predicted Traditionalism X Condition inter- 
action, F(1, 83) = 15.3, p < .001. In the team 
condition, traditionals scored higher than lib- 
erals (M = 43.9 ys. 38.5); in the individual 
condition, traditionals scored lower than lib- 
erals (M = 34.4 vs. 40.7). No other intet- 
actions approached significance (all Fs < 1). 

Although the analysis of variance found no 
significant effects for fear of success, inspec- 
tion of scores in Table 2 suggests that those 
women who combine sex-role traditionalism 
and fear of success may be most affected 
by condition, performing worse than other 


TABLE 2 
MEAN NUMBER or WORDS PRODUCED ON VERBAL 
Vacitrry TASK ION:2 BY FEAR OF 
TRADITIONALISM, 
AND TYPE OF COMPETITION 


Traditionalisms 


Traditional 


Type of competition Liberal 
Individual 
Low fear of success 41.1 (18) 36.4 (14) 
High fear of success 40.2 (6) 32.4 (9) 
Team 
Low fear of success 38.6 (17) 42.6 (14) 
High fear of success 38.5 (6) 45.3 (7) 


Note. Number of women in each cell is given in parentheses. 
«Scores on the 10-item sex-role traditionalism scale were 
split at the median, 
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women in the individual condition and better 
than other women in the team condition. A 
Schefié test for post hoc comparisons indi- 
cated that within the individual condition, 
high-fear-of-success traditionals performed 
significantly worse than low-fear-of-success 
liberals (p < .05), though not worse than the 
two other groups of women. Within the team 
condition, performance of high-fear-of-success 
traditionals did not differ significantly from 
that of other women (p > .05). 


Evaluation of Performance 


Women were quite accurate in estimating 
how well they had done on the verbal facility 
task. The correlation between actual number 
of words created and the woman’s estimate 
of how well she did (r = .62, p < .001) and 
how many words she thought she had made 
(r=.71, p < .001) were sizable. A 2 X 2 
X 2 unweighted means analysis of variance 
of type of competition, fear of success, and 
traditionalism was performed for each re- 
sponse measure. Mirroring the pattern of ac- 
tual performance, traditionals evaluated their 
performance more positively in the team con- 
dition, and liberals rated their performance 
higher in the individual condition; the Tradi- 
tionalism Xx Condition interaction was signifi- 
cant, F(1,83) = 4.4, p < .05, for how well 
women thought they did. A similar interac- 
tion was found for ratings of liking for the 
task; traditionals enjoyed team competition 
more, and liberals enjoyed individual compe- 
tition more, F(1,83) = 4.5, p< .05. Two 
measures of competitiveness, how well the 
woman wanted to perform and how strongly 
she wanted to do better than her opponent, 
were affected by condition but not tradi- 
tionalism. All women showed a greater desire 
to do well in team competition than in indi- 
vidual competition, F(1,83) = 5.2, p< .05. 
Similarly, women indicated greater desire to 
outperform their opponent in the team than 
individual condition, F(1, 83) = 11.1, p < .01. 
Ratings of how important it was to do well 
were unaffected by condition or traditional- 
ism. Finally, no significant main effects of 
fear of success were found on any response 


measure, nor was there an interaction of 
, 
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fear of success and either condition 


traditionalism. 


Other Correlates of Fear of Success and 
Traditionalism 


Extensive questionnaire data availah w 


couples provided an opportunity to co re 
the relationship of fear of success an x= 
role attitudes to characteristics of the an 
and of her dating partner. Fear of suc: tid 
not relate to any measure of the w vs 
ability or achievement goals outside |! Jab- 
oratory, including college grades, SAT » ores, 
attitudes toward having a career, © ans 
for graduate school. While sex-role ti ion- 
alism did not relate to grades or SA ath 
scores, it was correlated with low: SAT 
veral scores (r = —.22, p < .05), lest nter- 
est in a career (r = —.45, p< .00 and 
lower educational aspirations (7 = ae 
<.05). Self-ratings showed a similar erm, 
Traditionalism was associated with ev ting 
the self lower on “intelligence” (7 24, 
p < .05), but higher as a “desirable m+ viage 
partner” (r= .28, p< .05). Sex-role iradi- 
tionalism was also correlated signi®cantly 
with other measures of conservatism, sich as 


higher frequency of church attendance. less 
permissiveness toward premarital sex, less 
support for the goals of the women’s move- 
ment, and higher scores on a 10-item scale of 
authoritarian submission. 

The data provided no support for Horner’s 
(1970, 1972) prediction that fear of success 
is related to characteristics of a woman’s 
dating partner. Women who fear success 
were no more likely to be dating traditional 
men, nor were their boyfriends 1 
to oppose the women’s moveme? 
of women with high and low fear 01 ; 
were equally likely to prefer a career wile oF 
a full-time homemaker. It may be that char- 
acteristics of the dating partner are jess im- 
portant than the mere fact of having a boy- 
friend. Since all women in this sample were 
in dating relationships, it was not possible to 
test this notion. Horner also predicted that 
fear of success would be less prevalent in 
relationships where the male’s intellectual su- 
periority is clear. No relationship was found 
between the woman’s fear of success and her 


ess 
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hoyfriend's grades, SAT scores, or differences 
between his scores and hers, Women’s subjec- 
tive ratings of their boyfriends’ “intelligence” 
also failed to relate to fear of success. Nor 
was there a relationship between the woman’s 
fear of success and her boyfriend’s fear of 
success. In contrast, there was considerable 
matching in couples on sex-role traditional- 
ism (r = 48, p < 001). Women with liberal 
attitudes were significantly more likely to 
date men who support the goals of the 
women’s movement (7 = .42, p < .001), pre- 
fer a career wife (r = .41, p < .001) and are 
liberal on other measures. 


Discussion 


This study examined the effects of fear 
of success, sex-role attitudes, and competition 
on women’s intellectual performance, The 
clearest finding is the significant link between 
women’s sex-role attitudes and their perform- 
ance both in the laboratory and in their daily 
lives. Women with liberal attitudes have 
higher educational and career aspirations, 
rate themselves as more “intelligent,” and in 
fact have higher SAT verbal scores. Tradi- 
tional and liberal women differ in the type of 
achievement setting that facilitates high per- 
formance: Traditional do better in noncom- 
petitive situations or in team competition; 
liberals do better in individual competition. 
Interestingly, traditionals and liberals do not 
differ in grades in college, This suggests that 
when women set a personal goal such as com- 
pleting college, liberals and traditionals are 
capable of equally high levels of perform- 
ance. Traditionals and liberals may, however, 
structure their college experience differently. 
The data indicate, for example, that tradi- 
tionals are significantly more likely to major 
in “feminine” fields such as nursing or educa- 
tion. It is possible that traditionals may at- 
tempt to avoid direct academic competition, 
for instance, by selecting large, impersonal 
classes, 


Do Women Fear Success? 


Results of this study provide only limited 
support for Horner’s theory, Outside the lab- 
oratory, fear of success was not related to 
women’s educational plans or to measures of 
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achievement, such as SAT scores or college 
grades. Horner’s prediction that women with 
fear of success date men who disapprove of 
achieving women was not supported. In a 
laboratory setting, fear of success had no inde- 
pendent effect on behavior. Some evidence was 
found, however, that among women with 
traditional sex-role attitudes, fear of success 
may affect performance. When working non- 
competitively, traditional women with fear of 
success perform significantly better than all 
other women. When working in individual 
competition, traditional women with fear of 
success tend to perform worse than other 
women. These data support Alper’s (1974) 
suggestion that women’s achievement behav- 
ior is affected by both sex-role orientation 
and conflict about femininity. Perhaps when 
Horner (1968) conducted her original study 
in 1965 the majority of women she tested 
held traditional sex-role attitudes. (She did 
not actually assess this, of course.) During 
the past decade, there has been considerable 
change in sex-role attitudes. Today perhaps 
only women who maintain traditional beliefs 
about the feminine role are affected by fear 
of success, 

The data clearly indicate that women high 
on Horner’s measure do not have a broad 
fear of achievement. Although women with 
high and low fear of success may differ in the 
kind of setting that facilitates maximum per- 
formance, an image of high-fear-of-success 
women as intellectually disabled by achieve- 
ment conflicts is unwarranted. Nor do women 
high on Horner’s measure have a generalized 
fear of mixed-sex competition. Working as 
part of a mixed-sex team does not impair 
their performance, although working in indi- 
vidual competition against a boyfriend may. 
What fear of success appears to represent is 
a particular sensitivity to the sex-role impli- 
cations of achievement settings and a concern 
that achievement behavior be manifested in 
sex-role-appropriate ways. 


Fear of Success Versus Sex-Role Attitudes 


In this study as in others, the effects of 
fear of success are small and elusive. Sex- 
ole attitudes emerge as a more substantial 
“a “consistent determinant of women’s per- 
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formance. It is unclear whether the limited 
impact of fear of success is due to methodo- 
logical weaknesses or to conceptual difficulties. 
A serious problem of Horner’s measure is its 
uncertain reliability (Condry & Dyer, 1976). 
Low reliability could explain both the 
weak effects of fear of success on perform- 
ance and the failure of the measure to relate 
to questionnaire responses. 

Although it is currently fashionable to view 
women as in conflict, ambivalent, or anx- 
ious about achievement, such conflict models 
(Horner, 1970; Bardwick, Douvan, Horner, 
& Gutman, 1970) may be inadequate. An 
alternative view worthy of more attention is 
a theory of sex-typed achievement (e.g., 
Stein & Bailey, 1973). This perspective pro- 
poses that achievement motivation and be- 
havior are channeled into sex-role-appropriate 
domains. Traditionally, the proper achieve- 
ment domain for women has been the area of 
social skills associated with homemaking or 
“feminine” careers such as nursing. The 
present data are consistent with a sex-typed 
model of achievement. Women who adhere to 
traditional sex-role beliefs do differ in career 
goals from women who espouse less tradi- 
tional attitudes. Further, traditional women 
achieve at their highest level in a noncom- 
petitive or team setting, situations that may 
be viewed as compatible with traditional 
femininity. Liberal women show a different 
pattern, excelling in a more masculine condi- 
tion of individual competitive achievement. 

It appears that Horner’s measure is of 
limited usefulness. The popularity of Horner’s 
theory is based in part on the belief that 
fear of success has significant impact on 
women’s lives and professional careers. At 
present, there is little evidence to support 
this assertion. Speculations about the effects 
of fear of success outside the laboratory must 
be made most cautiously. Researchers con- 
cerned with women’s achievement might do 
well to emphasize sex-role attitudes rather than 
fear of success and to investigate alternative 
theories of women’s achievement. 
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The present study was designed to examine some interpersonal correlates of 
the field dependence construct within a naturalistic setting. By means of a 
factorial design, 40 field independent and 40 field dependent female participants 
in a weight reduction clinic were individually counseled by a male interviewer; 
these interviews were systematically varied in terms of (a) the verbal feedback 
(accepting vs. neutral) offered for clients’ self-disclosing statements and (b) 
the physical proximity of the interviewer to the client (personal vs. social 
distance), Behavioral and self-report measures of affect and a measure of long- 
term behavioral compliance were taken, During the interviews, field dependent 
clients were found to be less distancing vis à vis the interviewer and more 
willing to comply to his dieting recommendations; over a 5-week dieting period, 
however, they were less successful in losing weight. Other findings suggested 
that these field dependent clients were more sensitive to the emotional climate 
of the setting. Results were interpreted in terms of contrasting means of coping 
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with dependency feelings. 


In over two decades of published research, 
the field dependence construct has demon- 
strated powerful potential for predicting a 
wide range of psychological phenomena. In 
particular, it has found application to a sec- 
ond construct introduced by Witkin, namely, 
self-differentiation (Witkin, Dyk, Faterson, 
Goodenough, & Karp, 1962). This variable 
designates the degree to which an individual 
perceives his attributes and activities to be 
distinct from those of others, a sense of “sepa- 
rate identity” (Witkin et al., 1962). Further 
reflected in this construct is the degree to 
which an individual regulates and controls his 
interpersonal transactions. The individual with 
a field dependent or relatively undifferentiated 
perceptual style is characterized as developing 
relationships in which others assume some re- 
sponsibility for his words and actions; with- 
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out such external support, he is likely to feel 
vulnerable and anxious. In contrast, the field 
independent perceiver is said to foster rela- 
tionships that allow him to maintain a sense 
of separateness and independence. He can be 
described as one who prefers to minimize the 
amount of social influence and control in his 
life. For those with a more articulated percep- 
tual style, intimate social situations may be 
anxiety arousing. 

An impressive amount of empirical evidence 
has accumulated that tends to support these 
characterizations. Several studies have re- 
vealed that field dependent subjects are more 
socially dependent, more influenced by au- 
thoritative opinion, and less self-reliant than 
are field independent individuals (Freedman, 
O'Hanlon, Oltman, & Witkin, 1972; Kon- 
stadt & Forman, 1965; Linton, 1955; Witkin 
et al., 1962). Related studies on sensory and 
social deprivation have shown that field de- 
pendent subjects are more disturbed by these 
kinds of stressors (Silverman, Cohen, Shma- 
vonian, & Greenberg, 1961; Zuckerman, 
1968). Other findings have suggested that 
those with a field independent style maintain 
greater psychological distance and personal 
space (Justice, 1970; Pollack & Kiev, 1963; 
Schimek, 1968). Finally, there are data indi- 
cating that field independent subjects perform 
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more effectively on autonomous and imper- 
sonal tasks, while field dependent subjects are 
superior in interpersonally oriented activities 
(Lefcourt & Seigel, 1970; Quinlan & Blatt, 
1972). 

The present study was designed to expand 
this set of findings by examining some behay- 
ioral correlates of field dependence within a 
naturalistic setting. The particular context 
employed was a weight reduction clinic where- 
in clients were offered dieting help in exchange 
for their participation in research. As dis- 
cussed by Janis (1975), this choice of context 
provides opportunities for studying interper- 
sonal processes that are often unavailable 
under more restricted laboratory conditions. 
One important feature of the present setting 
was the potential for establishing a relatively 
high degree of psychological intimacy and de- 
pendence: The design called for a dyadic 
interaction between a female client and male 
interviewer in which personal topics were ex- 
plored and support offered for a difficult deci- 
sional conflict. If field independent and field 
dependent perceivers have contrasting atti- 
tudes and feelings about closeness and de- 
pendence, this context should readily evoke 
them. 

Based on previous empirical and theoretical 
contributions, it was generally expected that 
field dependent clients would be more posi- 
tively affected in this social situation. Affec- 
tive reactions of primary interest consisted of 
psychological distancing, interpersonal attrac- 
tion, and dependency longings. With reference 
to these dimensions, field dependent clients 
were expected to express: (a) less psychologi- 
cal distancing, (b) more attraction, and (c) 
greater dependency longings vis à vis the in- 
terviewer. 

While these differences in affective reactions 
were expected to emerge during the dyadic 
interactions, differences on outcome variables 
were also hypothesized. The procedures of the 
clinic involved asking clients to adhere to 
dieting recommendations for a 5-week period 
without direct contact with the interviewer. 
Since it is assumed that field dependent cli- 
ents have a greater need for the physical 

presence and emotional support of others, 
these clients should be relatively less compli- 
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ant and, consequently, less succe losing 
weight under these conditions. 

A second and related aim of resent 
study was to explore the interactiv cts of 
field dependence with a theoreticali evant 
aspect of the interpersonal context, | amely, 
the emotional availability of the ini: iewer. 
Previous studies have suggested that well- 
being and effective functioning of | de- « 


pendent individuals are relatively : de- 


pendent upon a supportive social envi “ment 
(Freedman et al., 1972; Konstadt & | xman, 
1965). In the present context, this situational 
dimension was systematically manip lated 
both verbally and nonverbally: by ying 
the nature of the verbal evaluative Í rack 
offered to the client for her self-di sing 
statements and by varying the interpi: onal 
distance between the client and intervie» or. 
The significance of evaluative feedh on 


interpersonal processes has recently bee’. ar- 
ticulated by Janis (1975). He theorized ‘hat 
receiving accepting feedback from a cou: slor 
for self-disclosures results in increased iising 
for and dependence on the counselor; ihese 
reactions, in turn, are said to enhance behav- 
ioral compliance to recommendations made 
by the counselor. If half of the clients in this 
study received positive verbal feedback (i.c., 
explicit statements of approval and accept- 
ance), while the other half received affectively 
neutral feedback, an interaction with field 
dependence should obtain, Field dependent 
clients should show comparatively more sensi- 
tivity to the nature of the verbal feedback 
offered, being more positively affected when 


receiving accepting feedback and more ad- 


versely affected when receiving neutral: feed- 
back. Field independent clients, hypothetically 
having a greater need to fortify their personal 
space and self boundaries, should show more 
positive reactions when the interviewer is 
emotionally neutral. 

While the communicative value of spatial 
behavior is not fully understood, several theo- 
ries have postulated the existence of univer- 
sally learned associations between human 
physical proximity and the attribution of such 
affective dimensions as nurturance and warmth 
(Hall, 1966; Mahler, Pine, & Bergman, 1975; 
Palombo & Bruch, 1964). Data from a variety 
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of research methodologies have documented a 
direct relationship between the physical close- 
ness of social interactants and the amount of 
their attraction and involvement toward each 
other (Evans & Howard, 1973). On the basis 
of these theoretical and empirical contribu- 
tions, it seemed that the degree of emotional 
support attributed to an interviewer could be 
manipulated by varying the distance of the 
interviewer from the client. In the present 
setting, interpersonal distances of 2 feet (.61 
m) and 5 feet (1.52 m), corresponding to 
Hall’s “personal” and “social” distances, were 
selected (Hall, 1966). The interactive effects 
of this nonverbal dimension are expected to 
parallel those of evaluative feedback. 


METHOD 
Overview 


The main and interactive effects of field depen- 
dence were examined by means of a 2X2X2X2 
factorial design. In the context of a diet clinic, 
structured interviews were individually administered 
to 40 field dependent and 40 field independent female 
clients. These interviews were varied in terms of (a) 
the verbal feedback given by the interviewer (acept- 
ing vs. neutral) and (b) the physical proximity of 
the interviewer to the client (personal vs. social 
distance). Clients were randomly assigned to one of 
two trained male interviewers. 

Primary dependent variables were measures of 
nonverbal behaviors considered to reflect psycho- 
logical distancing, attraction, and dependency needs. 
Other dependent variables included a 24-item ques- 
tionnaire designed to tap these same affective dimen- 
sions, a measure of experienced discomfort, and a 
measure of long-term behavioral compliance, 


Subjects 


Subjects were recruited by means of notices placed 
in the local newspapers advertising the services of a 
weight reduction clinic. They were informed that 
these services were free in exchange for their partici- 
pation in research on dieting. The only restrictions 
placed on respondents were that they be female, 
have no weight-related medical problems, and be no 
more than 45 pounds overweight. 

Subjects were 80 residents of the New Haven 
area whose ages ranged from 19 to 67 years, with a 
mean of 39.7 years. All clients expressed an interest 
in losing between 15 and 45 pounds. No differences 
across experimental conditions on these variables 
were obtained. An intelligence index, based upon 
level of completed schooling and husband’s occupa- 
tional status, also yielded no significant differences 
across conditions. 
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Procedure 


The clinic consisted of a comfortably furnished 
interviewing room adjoined, on one side, by a smaller 
waiting room and, on the other side, by an equip- 
ment room in which video recording apparatus was 
stored. At the scheduled time, one of the interview- 
ers met the client in the waiting room, introduced 
himself, and ushered her into the interviewing room. 
Here he described the overall clinic procedures, col- 
lected demographic information, and administered a 
test of field dependence. After her height and weight 
had been recorded, the client was asked to sit by a 
small table displaying some mimeographed dieting 
leaflets. The interviewer explained that he wanted to 
explore several personal topics that might be rele- 
vant to the problems of overeating and dieting. He 
added that this portion of their meeting would be 
videotaped and momentarily left the room to turn 
on the recorder. When he returned, he administered 
a 20-minute, “high disclosure” interview that focused 
on such matters as the personal meanings associated 
with food and eating, attitudes about one’s personal 
appearance, and ways of coping with such feelings 
as frustration and boredom. Assignment to the spe- 
cific verbal feedback and physical proximity condi- 
tions of the interview were randomized for the first 
60 clients; for the last 20 clients interviewed, assign- 
ment was made on the basis of ensuring equal num- 
bers across these experimental conditions, Following 
the interview, the client was asked to complete 19 
questionnaire items reflecting her involvement and 
attraction vis 4 vis the interviewer. She was asked to 
place this questionnaire in a box next to her chair 
so that the interviewer could not see her responses, 

At this point, the client was handed mimeographed 
materials consisting of lists of kinds and amounts of 
food appropriate for maintaining a 1,200-calorie diet 
and a series of planned daily menus. At the same 
time, she was asked to listen to a prepared tape 
recording that reviewed all of the dieting informa- 
tion in detail and presented specific recommendations 
for her to follow over a 5-week period. 

Each client was told that the clinic wanted to be 
kept informed of her progress during the following 
5 weeks. A plan was described that required her to 
complete and mail to the clinic five weekly reports, 
indicating how successfully the diet was followed and 
what difficulties were encountered. Finally, she was 
asked to answer five additional questions assessing 
both her willingness to comply with the dieting 
recommendations and her interest in returning to the 
clinic and seeing the same interviewer after 5 weeks. 


Experimental conditions 


Field dependence. The Embedded Figures Test, 
administered according to the manual by Witkin, 
Oltman, Raskin, and Karp (1971), was used to assess 
field dependence. Clients were directly informed that 
the test was being employed to assess its value in 
predicting dieting success. 

The score derived from the" test is the time, 
averaged over 12 presentations, needed to outline a 
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an embedding con- 
180 sec with a mean 
dependence and field 
dichotomized at the 


simple geometric figure within 
text. Scores ranged from 23 to 
of 100.9 sec. To form the field 
jndependence groups; scores were 
median of 102 sec. 

Evaluative feedback. In th 
condition, the interviewer provi 
“pm glad you're answering 
openly”) and acceptance (€.8-s 
would be difficult”). In addition, 
sitional comments as, “Let’s go on 
In contrast, in the neutral feedba 
interviewer remained silent or offered on. 
tional comments. 

Physical proximity. 
were used in conducting t 


e accepting feedback 
ded both praise (e-g., 
these questions so 
“J can see how that 
he made such tran- 
to another topic.” 
ck condition, the 
ly transi- 


Two large upholstered chairs 
he interviews: Interpersonal 


distance was manipulated by varying the position of 
the interviewer's chair to ensure that the client’s ori- 
entation to the room would be constant across all 
experimental conditions. At the personal distance, 
the interviewer's chair was positioned to allow 2 
feet (.61 m) between client and interviewer (mea- 
sured knee to knee) ; at the social distance, the inter- 
actants sat 5 feet (1,52 m) apart. At both distances, 
the chairs were angled 30° from a direct, face-to- 
face orientation. 

In order to isolate the effects of this one nonverbal 


dimension, both interviewers were trained not to 
e interview that might 


emit behavioral cues during th 

reflect their attitudes toward the client. They were 
instructed to sit upright, minimize facial expressive- 
ness and gestures, and maintain eye contact with the 


client. 


Dependent Measures 


Nonverbal behaviors. A videotape camera was sta- 
tioned unobtrusively in a small opening in the wall 
separating the interviewing room from the equip- 
ment room. It recorded a head-to-knee, front view 
of the seated client during the first 10 minutes of 
the interview. The nonverbal behaviors subsequently 
abstracted from these recordings had all been previ- 
ously established in the literature a5 indicative of 
psychological distancing, attraction, OF dependency. 
Nonverbal positions included for analysis were eye 
contact, forward leaning, arm crossing, and leg 
crossing: Behavioral acts consisted of shoulder shrug- 
ging, head nodding, hair grooming, smiling, “palms- 
up” gesturing, lip and tongue movements, and hand- 


to-mouth movements. 
A glossary defining 
train two independent raters. 
scored all 80 videotapes, while t 
f 20 to assess inte: 


these behaviors was used to 
One of these raters 


he author scored a 
rrater reliability. 


random sample 0 ri 
Scoring was performed with the audio track turned 
off to prevent scoring biases caused by speech 

ored as the total 


Positions were SC 
ds. With the exception of eye con- 
scores were transformed into 
score of 1 being assigned if the 
of 0 if not. Acts were 
ences during the 


content or tone. 
duration in secon 
tact, these position 
dichotomous data, a 
behavior occurred and a score 
scored as the frequency of occurri 


10-minute interview segment. To norma! 


pe coi Vx-+.5 transformations w per- 
Speech disturbance ratio. Viewing bologi 
distancing as a defensive or Orr ; Eo 
it was expected that anxiety would be rr ae 
this affective dimension. The speech di bs A 
ratio (Kasl & Mahl, 1965) was chosen aa E 
Herat measure to assess Clients’ ed on 
ort during the interview. Transcripts were le 
ne second, fifth, and eighth minutes of ea: Sed 
view. Then by reading the transcripts ai i : 
to the audio channel of the tapes, a rate are 
speech irregularities across the ‘tires Ri a 
a As with the nonverbal behaviors, a y sity 
E n coran by having the author « «pute 
fe e for a randomly selected sampl of 20 
„Questionnaire. A 24-item questionnaire 
signed. to tap attitudes okela ý ; a 
(e.g., “The interviewer wanted to ON t uch 
about me”), attraction (e.g., “How reich ee 
De e person who interviewed you?”), and i 
ae prepn with dieting recommendation: ¢.g. 
Bates a ae will actually follow the ; ned 
a or eac! | day? Ja All items were Like: vpe 
es where 1 indicated little item endorseme ind 


7, maximal endorsement. 
Toe x j 
PR behavioral compliance, This ble 
as success or failure i i 
as e in returning h 
clinic after 5 week: as A 
à s and showi 
See wing at least a 5-) ind 


RESULTS 
Manipulation Checks 


Š ARa the effectiveness of the evalu- 
ve eedback manipulation, an analysis of 
variance was performed on clients’ ratings of 
a questionnaire item concerning how ae a 
ing the interviewer seemed. As ante) 
a interviewer was felt to be more accepting 
y clients in the positive feedback conditio 
F(1, 64) = 4.86, p < .05. i 
No direct check was made on the physical 
proximity manipulation, since a question TR 
cerning interpersonal distance might ve 
raised a credibility issue for the clinic o- 
ever, in a sample of 20 pilot subjects all cor- 
ee indicated whether they were sitting a 
ere or social distance from the inter- 


Dependent Variables 


pene agreement in scoring the non- 
ee behaviors and the speech disturbance 
; A satisfactory, with product-moment 
orrelations ranging from .88 to ,99, In order 


these © 
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to gain conceptual clarity, this set of behav- 
ioral variables and the set of 24 self-report 
questionnaire items were each subjected to a 
principal-components factor analysis followed 
by varimax rotation. In both analyses, the 
major criteria used to judge completeness of 
factoring were the number of factors with 
associated eigenvalues greater than one and 
the number of factors judged to be psycho- 
logically meaningful. 

With respect to the set of behavioral mea- 
sures, a three-factor solution, accounting for 
42.3% of the total variance, appeared most 
compelling (see Table 1). The first behavioral 
factor (17% of the variance) seems to reflect 
psychological distancing and tension. Arm and 
leg crossing, the absence of forward leaning, 
and shoulder shrugging have all been viewed 
as instrumental in preserving one’s personal 
space (cf. Sommer, 1969), The positive load- 
ing of the speech disturbance ratio suggests 
that this distancing behavior is defensive, that 
is, designed to reduce anxiety. The only varia- 
ble that seemed difficult to interpret was smil- 
ing behavior, this frequently being associated 
with interpersonal warmth. In the present set- 
ting, however, which demanded the revelation 
of personal information to a stranger, smiling 
may have defensively functioned as a means 
of masking negative feelings toward the task 
or interviewer (cf. Ekman & Friesen, 1975), 

The second behavioral factor (14.1% of 
the variance) appears to reflect aspects of 
aroused dependency needs, including “oral” 
or nurturance concerns (lip and tongue move- 
ments and hand-to-mouth movements), feel- 
ings of helplessness (“palms-up” gesturing), 
and wishes for greater closeness to the inter- 
viewer (forward leaning), 

Eye contact and head nodding, both loading 
highly on the third behavioral factor (11.2% 
of the variance), suggest that this is an inter- 
personal attraction dimension. Hair grooming, 
which loaded negatively, is expressive, of at- 
tention directed toward the self rather than 
others (cf. Freedman et al., 1972). 

The set of 24 self-report items also yielded 
a three factor-solution, which accounted for 
49.6% of the total variance (see Table 2). 
Inspection of the factor loadings for these 
items indicates that the first self-report fac- 
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TABLE 1 
FACTOR ANALYSIS OF THE BEHAVIORAL VARIABLES 


Factor 
Variable 1 2 3 
Leg cross 
Smile 
Shoulder shrug 
Arm cross 


Speech disturbance 
Forward lean 


Palms-up gesture 
Lip and tongue moyement 
Hand-to-mouth movement 


Eye contact 
Head nod 
Hair grooming 


tor (27.6% of the variance) is also clearly an 
interpersonal attraction dimension. The six 
questionnaire items that loaded most highly 
on the second factor (13.7% of the variance) 
reflect concerns that one’s personal space had 
been violated by the interviewer. More spe- 
cifically, this dimension involves concerns 
about having been asked for and about having 
revealed too much personal information. The 
third self-report factor (8.3% of the vari- 
ance) seems to reflect willingness to comply 
and agreeability. Items loading highly express 
commitments to follow the dieting recommen- 
dations and appreciation of the services of 
the clinic. The content of these items suggests 
that this dimension may more generally reflect 
evaluative dependence, that is, the need to 
obtain approval by acting in a socially desir- 
able manner. 

For each client, two sets of standardized 
factor scores were computed, the first set 
representing the three behavioral dimensions 
and the second set representing the three self- 
report factors. A correlation matrix of these 
factors and the weight loss measure is dis- 
played in Table 3. This table reveals a signifi- 
cant association between the behavioral dis- 
tancing factor and the self-report dimension 
about personal space. The negative correla- 
tion indicates that clients who engaged in 
more distancing behaviors tended to express 
fewer concerns about the information they 
disclosed. This suggests that the distancing 
defensively serves to protect the individual 
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TABLE 2 
Factor ANALYSIS OF THE SELF-REPORT [TEMS 


Factor 
Item description 1 2 3 

Interviewer seemed warm and 

friendly 17 00 -29 
Interviewer was personally inter- 

ested in me 65S —.03 39 
Liking the interviewer 65 —.02 38 
Open in talking about personal 

feelings 65 —.05 
Interviewer was accepting 61—21 
Feeling involved with the interviewer 58 00 
Interviewer understood me PV 
Interviewer was too impersonal 

and reserved —.57 -39 
Did not reveal enough about myself —.66 16 
Interviewer did not really want to 

know me =70 =10 —.09 
Revealed too much about myself —.02 B5. 01 
Questions were too personal —.2 E OF 
Interviewer wanted to know too 

much J3 16 —.25 
Became too involved with the inter- 

view 040 TS ala 
Felt too detached from the inter- 

viewer =51 57 =.03 
Interviewer seemed too friendly =H Al 28 
Interviewer was helpful 37 04 65 
Optimistic about losing weight -09 -09 64 
Satisfied with the interview 37 108 61 
Wish to return to the clinic in $ 

weeks —01 —11 59 
Will follow the recommended planned 

menus NS 00 59 
Will follow the silent thought plan a03 =.01 49 
Wish to see same interviewer in 5 

weeks 30.0247 
Will adhere to the stricter diet ghey Av eye 


from experiencing intrusions into his personal 
space, Distancing behavior also correlated 
negatively with self-reported willingness to 
comply; that is, clients who behaviorally with- 
drew from the interviewer also indicated less 
readiness to follow his recommendations. A 
third significant association obtained between 
the behavioral and self-report dimensions of 
interpersonal attraction. The correlation be- 
tween the behavioral dependency factor and 


TABLE 3 


INTERCORRELATIONS AMONG THE BEHAVIORAL FACTORS, 
Setr-Report FACTORS, AND WEIGHT Loss MEASURE 


Variable 1 RE E V.) 

1. Behavioral distancing 
2. Personal space con- 

cerns —.22* 
3. Behavioral depen- 

dency 00. —.07 
4, Willingnessto comply —.29** 00 —.04 
5. Behavioral attraction .00 “1200s 12, 
. Self-reported attrac- 
2 ee —.09 00 10 .00 .22* 
7. Weight loss criterion. Ol —.17 —.01 .08 00 —.04 


and 5 represent the three behavioral 


Note. Variables 1, 
mand. 4, and 6 are the three self-report di- 


factors, and Variables 
mensions, 

*p <.05. 

Dp <0 
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the self-reported willingness-to-comp!y dimen- 
sion was negligible, suggesting that t)«:° rep- 
resent different aspects of social dependence, 
Effects of Field Dependence 

Initially, 2 X 2 X 2 X 2 (Field Depo «lence 
X Evaluative Feedback X Physical Pi «mity 
x Interviewer) factorial analyses of ance 
were performed on these dependent m sures. 
Since no significant interviewer effec. were 
obtained, this factor was subsequen: col- 
lapsed; the following is a presentatior the 
significant main and interactive effects © field 
dependence. Cell means for the dependent 
measures are shown in Table 4. 

Tt was generally anticipated that fie’ nde- 
pendent clients would be less psycholo: cally 
involved vis à vis the interviewer. Con. stent 
with this prediction, they were found en- 
gage in significantly more distancing be! viors 
during the videotaped interview, F(i )= 
10.14, p < .005. 

The hypothesis that field dependen ents 
would show greater attraction for the uter- 


viewer was not empirically confirm: , al- 
though the means of both the behavi: and 


self-report attraction factors were in |- ex- 
pected direction. 

With respect to expressions of depe:. ency 
needs, the field dependent clients, as «.tici- 
pated, showed a greater willingness to « ‘here 
to the dieting recommendations, F(1,”?2) = 
4.28, p < .05. That this behavior refiected 
more the need to receive social approva than 
an actual commitment is suggested by the 


findings from the measure of long-term be- 
havioral compliance. Supporting the predicted 
effect, the data revealed that 47.5% of the 
field independent clients returned to the clinic 
after 5 weeks and showed a weight loss of at 
least 5 pounds, in comparison with only 25% 
of the field dependent clients, F(1,72) = 
4.47, p < .05. 

Thus some empirical support was obtained 
for three of the four predicted main effects 
of field dependence. In addition, hypothesized 
interactions of field dependence with the emo- 
tional climate of the social setting were also 
partially confirmed. Because a_ significant 
three-way interaction for the behavioral dis- 
tancing factor obtained, F(1, 72) = 9.36, p < 
.005, analyses of the simple interactions at 
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TABLE 4 


CELL MEANS FOR ALL MEASURES 
Across TREATMENT GROUPS 


Accepting Neutral 
feedback feedback 
Per- Per- 

sonal Social sonal Social 
dis- dis- dis- dis- 


Measure tance tance tance tance 


Field dependent 


Behavioral distancing 07 05 —1.01 


Personal space concerns 99 =.27 -=.13 
al dependency 23 —.50 67 
ss to comply 37 —.18 AT 
al attraction -63 —.03 —.30 
Self-reported attraction 0S AS —.29 
Weight loss criterion® 10 20 30 
Speecl; disturbance ratio» 40.2 53.2 42.7 


Behavioral distancing 17 00 
Personal space concerns ld 12 
Be ioral dependency —.24 26 


ness to comply 
oral attraction 
Self-reported attraction 
Weight loss criterion 
Speech disturbance ratio 


30° 40 
Bb Ons 


* Percentage of clients returning to the clinic after 5 weeks 
with a 5-pound weight loss. 
b Number of speech disturbances per 1000 words, 


each interpersonal distance were calculated. 
These findings indicate that when the inter- 
views were conducted at the personal dis- 
tance, a significant interaction in the expected 
direction obtained, F(1,36) = 4.81, p < .05. 
When offered acceptance and approval, field 
dependent clients showed less distancing than 
when the interviewer was emotionally neu- 
tral; in contrast, field independent clients 
showed more distancing behaviors when given 
positive feedback. 

At the social distance, a second simple 
interaction obtained, F(1,36) = 4.56, p< 
:05. Unexpectedly, field dependent clients were 
less distancing when the interviewer was af- 
fectively neutral. 

Given that the behaviors loading highly on 
the distancing factor have been viewed as 
serving self-protective needs, a separate 
analysis was performed on the speech disturb- 
ance variable to assess differences in anxiety 
experienced by the field dependent and field 
independent clients. A significant triple inter- 
action also obtained on this measure, F(1, 72) 
= 3.54, p< .06; consequently, analyses of 
the simple effects at each level of physical 
proximity were computed. When the inter- 
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viewer sat at the personal distance, the num- 
bers of speech irregularities of field dependent 
and field independent clients were differen- 
tially affected by accepting feedback, F (1, 36) 
= 3.59, p< .10. Field independent clients 
emitted more disturbances when receiving ac- 
cepting feedback, while field dependent clients 
expressed fewer. Both this interaction and 
that obtained on the behavioral distancing 
factor support the view that the kinds of ten- 
sion in an interpersonal context differ for the 
field independent and field dependent indi- 
vidual. The results suggest that an inter- 
viewer who sits near and provides accepting 
feedback negatively affects the field indepen- 
dent individual; for the person with a global, 
undifferentiated perceptual style, an inter- 
viewer who sits near and does not provide 
accepting feedback is anxiety arousing. 

Two additional interactions provide support 
for the expectation that field dependent cli- 
ents would be more sensitive to the emotional 
cues emitted by the interviewer. A significant 
Field Dependence X Evaluative Feedback 
interaction obtained on the behavioral attrac- 
tion factor, F(1,72) = 3.70, p< .06. This 
effect derived primarily from the differential 
reaction of field dependent clients to the na- 
ture of the verbal feedback they received, 
These clients showed more attraction and 
responsivity when offered accepting in com- 
parison with neutral feedback, F(1,36) = 
3.35, p < .10. Field independent clients were 
not significantly affected by this situational 
variable, 

On the behavioral measure of aroused de- 
pedency needs, a significant Field Dependence 
X Physical Proximity interaction obtained, 
F(1, 72) = 6.62, p < .05. Again, only the field 
dependent clients reacted differently across the 
two levels of the situational variable. As an- 
ticipated, they showed greater needs for nur- 
turance and closeness when the interviewer 
sat at the farther, social distance, ¥ (1,36) = 
7.29, p < 05. 


DISCUSSION 


The results of this study extend the work of 
Freedman et al. (1972) by revealing that a 
wide range of gestures and postures are asso- 
ciated with perceptual style. Of particular 
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interest are the data indicating a relationship 
between field dependence and distancing be- 
haviors. In comparison with those having a 
global perceptual approach, the field indepen- 
dent clients tended to express more distancing 
during the dyadic interactions. This finding 
corroborates data reported by Justice (1970), 
who found that field independent subjects 
maintain greater personal space when com- 
municating with another person. If these 
“Droxemic” behaviors are conceptualized as 
regulators of personal involvement in social 
contexts (Argyle & Dean, 1965; Scheflen, 
1972), then the present data provide further 
support for the association between perceptual 
style and preferred degree of psychological 
intimacy with others. 

Speculations can be offered on variables that 
may mediate this relationship. Schimek 
(1968), for example, has suggested that field 
independent individuals impose greater dis- 
tance in order to avoid the arousal of any 
intense affect. An interpretation from the 
present data can be forwarded that is consis- 
tent with, but more narrowly focused, than 
this. The present formulation posits that the 
observed differences in proxemic behaviors 
reflect contrasting means of coping with de- 
pendency feelings. Dynamically, the field in- 
dependent individual may be regarded as 
“counterdependent,” that is, having a need to 
avoid feeling influenced by and dependent 
upon others. The field dependent is viewed as 
openly accepting of dependency feelings and 
needing to foster dependency-gratifying rela- 
tionships (cf. Nevill, 1974). 

Empirical support for this view comes first 
from the significant interactions obtained on 
the behavioral distancing factor and the 
speech disturbance measure. The present field 
setting, across all experimental conditions, 
was considered a context likely to arouse some 
dependency feelings: Clients were explicitly 
seeking and receiving help and support for a 
personal problem. The combined situational 
effects of accepting feedback and close physi- 
al proximity, theoretically elaborated earlier, 
e expected to enhance maximally these 
feelings of dependency upon the interviewer. 
For the field independent clients, this degree 
of intimacy did evoke, as anticipated, their 
strongest reactions of discomfort and with- 
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drawal. Field dependent clients, i contrast, 
were not adversely affected in this e treat- 
ment condition. This fits the char rization 
of their being more comfortab| h and 
desirous of dependency-gratifyin »terper- 
sonal settings. These field depend clients 
did express more distancing in ti neutral 
feedback—personal distance and epting 
feedback-social distance conditions, © both 
of these contexts, the interviewer have 
appeared contradictory or inconsi= “at by 
expressing support via one com. icative 
channel but aloofness in another char 1. This 
mixed message may be particularly saten- 
ing for those who seek support from others. 

The two other significant interac s, in- 
volving the attraction and dependence \chay- 
ioral factors, indicate that the field « ndent x 
clients are relatively more sensitiv: o the 
presence or absence of cues reflecting emo- 
tional availability of the interview This 
greater reactivity of field dependent “rsons 
to the emotional climate, which extew: ~ find- 
ings reported by Konstadt and rman 
(1965) and Nevill (1974), is compat with 
the formulation of their greater exie:) =l di- 
rectedness and social dependence. 

The results from the measures of © :npli- ~ 
ance also lend support to the postula dy- 
namics associated with perceptual sty». The 
field dependent clients expressed greate; vead- 
iness to adhere to dieting recommen: «tions 
during the interview but were less behaviorally 
compliant during the dieting period. One way 


of understanding these main effects involves 
Janis’ formulation (Janis, 1975) of the crisis 
likely to develop at the termination of a dy- 
adic relationship. He postulated that a client, 
feeling attracted to and dependent on a coun- 
selor, may react negatively, exhibiting anger 
and depression, at the threat of termination; 
these affects can then lead to a renunciation 
of the values and norms communicated by 
the counselor. In the present context, the field 
dependent clients may have more strongly 


appreciated the supportive aspects of the | 


clinic and felt more abandoned during the 
dieting period when contact with the inter- 
viewer was only indirect. While there is no 
direct evidence for this conclusion, it is in- 
teresting to note the additional finding that 
field dependent clients who showed more de- 


= 
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pendency behaviors during the interview sent 
in fewer weekly reports (r = —.32, p < .05). 
Perhaps when opportunities for receiving di- 
rect support are withdrawn, field dependent 
clients more strongly reject requests for com- 
pliance, Clearly, the present set of findings 
points to the need for further research—pos- 
sibly where social dependency arousal is di- 
rectly manipulated (cf. Nevill, 1974)—in 
order to verify the hypothesized association 
between perceptual style and contrasting 
means of coping with dependency needs, 

Finally, a methodological note about the 
use of nonverbal behaviors may be broached 
here. Theoretically, these behaviors are re- 
garded as signals of emotion and attitudes 
that may be subject to censoring from direct 
verbal expression. In a study designed to 
evoke rather intense and possibly conflict- 
laden affect, this method seemed particularly 
advantageous. The obtained relationships of 
these nonverbal behaviors to each other (as 
reflected in the factor analysis) and to other 
events in the study (i.e., the self-report mea- 
sures and the field dependence construct) 
further validate their use as indicators of emo- 
tion. Moreover, in comparison with the self- 
report measures, the nonverbal behaviors 
appeared to be more sensitive and fruitful. 
Given the well-known criticisms of pencil-and- 
paper measures of affect, the recording of 
nonverbal behavior may become increasingly 
valued as an effective and unobtrusive means 
for assessing emotion, particularly in real-life 
settings. 
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Friendship and Disclosure Reciprocity 


Valerian J. Derlega, Midge Wilson, and Alan L. Chaikin 
Old Dominion University 


It was predicted that reciprocity 
encounter would be more likely 


of self-disclosure during a single laboratory 
between dyads composed of strangers than 


close friends. The prediction was confirmed, using female college students as 


subjects. When friends were the initial disclosers, 


the intimacy level of the 


subjects’ responses was independent of the intimacy of the disclosure input. In 
contrast, strangers varied their level of intimacy as a direct function of the 


intimacy of disclosure input. 
sults of previous studies of 


The results suggest caution in extrapolating the re- 
disclosure reciprocity (using strangers as subjects) 


to the processes involved in the development of intimate relationships. It was 
suggested that future studies should focus more attention on disclosure patterns 


in already established relationships. 


A consistent finding in the literature on 
self-disclosure is the phenomenon of disclo- 
sure reciprocity, what Jourard (1964) called 
the “dyadic effect.” In laboratory studies, the 
best predictor of the level of intimacy that 
B will use in disclosing to A is the level of 
intimacy with which A initiated the com- 
munication. (For literature reviews, see Alt- 
man, 1973; Chaikin & Derlega, 1974; Cozby, 
1973; Goodstein & Reinecker, 1974.) Dis- 
closure reciprocity is critical in the growth 
and development of friendships. Theories on 
the acquaintance process, including incremen- 
tal exchange theory (Huessmann & Levinger, 
1976; Levinger & Snoek, 1972) and social 
penetration theory (Altman & Taylor, 1973), 
assume that mutual disclosure spirals upward 
as a relationship develops, with individuals 
disclosing more and more about themselves 
at each encounter. 

One problem with almost all experimental 
studies on self-disclosure is that they have 
been limited to observing the phenomenon 
between strangers who are meeting each other 
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for the first (and usually last) Under 
these conditions, the second pers nds to 
reciprocate disclosure, no matter he timate 
the initiator has been. Indeed, eral re- 
searchers have been unable to nstrate 
that too high a disclosure lev produces 
withdrawal rather than recipro :ty (€g; 


Savicki, 1972). 

In a theoretical paper on disclosure reci- 
procity, Altman (1973) has speculated that 
reciprocity of exchange during a ngle en- 


counter is more likely to occur aming pass- 
ing strangers than among two pe! com- 
mitted to a relationship. Altman that 
the “social ‘norm of reciprocity’ per- 
ceived obligation to reciprocate disclosure— 

t role 


is hypothesized to play a more import 
in the early stages of a social bond than 
later stages” (p. 10). Persons involved 
close relationships may base their disclosure 
patterns on a number of considerations and 
may not be as concerned about matching 
disclosures. 

We agree with Altman’s theorizing. Early 
in a relationship, such as in encounters in- 
volving strangers and casual acquaintances, 
immediate reciprocity may be necessaty to 
prove one’s trustworthiness and to propel the 
relationship into more intimate stages. Be- 
tween close friends, however, mutual respect 
and trust have been established over a long 
time period. The friends have moved beyond 
stereotyped rules governing how acquain- 
tances should behave (cf. Levinger & Snoek, 
1972). An individual can respond to a friend’s 


in 
in 
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high disclosure in several ways, depending on 
what is appropriate at the moment. He can 
listen quietly, display sympathy, show con- 
cern, or diyulge personal information if such 
disclosure seems relevant. A relationship be- 
tween friends rests on a foundation of pre- 
viously established trust; the relationship 
should not be adversely affected by one 
partner’s failure to reciprocate immediately. 

Situational considerations also tend to in- 
crease reciprocity among passing strangers. 
Since no future encounters are anticipated, it 
may be easier to reciprocate disclosure to a 
stranger than to a friend, The stranger, who 
will not be seen again, poses no threat. In 
contrast, an individual may be cautious in 
reciprocating intimate disclosures to a friend. 
A long-term commitment exists, and one 
might not wish to risk losing a friend’s 
esteem by disclosing something negative. 
Simmel’s (1950) account of disclosure to a 
“passing stranger” develops this theme. Based 
on the preceding considerations, we predict 
that disclosure reciprocity will be greater in 
social encounters between strangers than 
between close friends, especially when reci- 
procity would necessitate revealing intimate 
information, 

It is worthwhile noting that every pub- 
lished experimental study on disclosure reci- 
procity has used strangers as subjects, despite 
the fact that most of these studies (including 
our own) have been concerned with gen- 
eralizing results from encounters between 
strangers to encounters between friends. 


MetHop 
Subjects 


Forty-eight female subjects were recruited from 
undergraduate classes for an experiment involving 
communication processes. In order to participate, 
each subject was requested to bring a close female 
friend with her to the lab. All participants were 
given either experimental credit, as part of a class 
requirement for introductory psychology classes, or 
$2. The decision to use female subjects was based 
on the fact that the available subject pool was 
overwhelmingly female. 


Design 


The design was a 2 X 2 factorial. The independent 
variables were whether the subject’s partner was a 
friend or a stranger and whether the partner’s dis- 
closure was low or high in intimacy. All subjects 
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were randomly assigned to the conditions. The 
major dependent variable was judges’ ratings of the 
intimacy of the subjects’ self-disclosures. 


Procedure 

Each subject brought a female friend to the ex- 
periment (i.e., “someone who knows your deepest 
and most personal thoughts”). Upon arriving at the 
study, the subject was either assigned her friend or 
a complete stranger (a confederate) as a partner in 
an experiment on communication processes. Sub- 
jects were told that the study involved passing 
information back and forth by note. Through a 
bogus drawing, the subject’s partner was always 
selected to send the first note. The experimenter 
emphasized to both the subject and her partner that 
the initial writer could send a note on any subject 
she wished and that the second person, the subject, 
could respond in any way she liked.t The experi- 
menter then asked the subject to wait in a cubicle 
while her partner composed her message, which the 
experimenter would deliver. While she waited, the 
subject filled out some questionnaires, just to keep 
her busy. 

The subject was soon given the partner’s note, 
which was a self-disclosure of either low or high 
intimacy (see below for the details of this ma- 
nipulation). The subject then wrote a note of her 
own for her partner. Following this, the subject 
filled out an impression questionnaire, which asked 
her to rate the partner. The subject was then 
debriefed and dismissed. 

Friendship manipulation. Although all subjects 
brought a friend to the experiment, the friend par- 
ticipated only half the time. In the friendship con- 
dition, after the subject entered a booth, the friend 
was taken to a different booth where another 
experimenter was waiting. This experimenter ex- 
plained the true nature of the study and asked 
the friend to serve as a “confederate” by writing a 
note to her friend, in her handwriting, that was 
either high or low in intimacy (see following section 
for details of how this was accomplished). These 
persons also expected that their notes would be 
copied and shown to at least one other subject in 
the experiment..No one refused our request to write 
the note and all seemed eager to cooperate. 

In the stranger condition, the subject, upon ar- 
riving at the lab with her friend, was met by the 
experimenter and another person (a confederate) 
who was presented as having been scheduled with 
a friend for the same time, The confederate èx- 
plained that unfortunately her friend had canceled 
at the last minute. The experimenter then offered 
the following solution. She briefly summarized the 
experiment to them, as we have described it for the 
friend condition. She explained that the reason two 


1JIn this way, we hoped to reduce the demand 
characteristics usually present in laboratory studies 
on disclosure in which subjects are explicitly asked 
to talk about themselves. 
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sets of friends were scheduled was that each girl in 
each pair was to write a note to a girl in the 
other pair, presumably a stranger. The usual pro- 
cedure would have to be modified, she went on, 
since one friend had not shown up. One girl in the 
set of friends would have to be dropped. The ex- 
perimenter asked the subject to participate and re- 
quested the subject’s friend to wait outside the lab 
until completion of this part of the experiment. 
Actually the friend was never used in this condition, 
but she did receive her promised $2. In this way, 
the subject was paired with a stranger. The con- 
federate copied in her own handwriting, unseen by 
the subject, the note the friend in the previous ses- 
sion had written to her friend-subject. This procedure 
permitted us to “yoke” subjects in the friend and 
stranger conditions who were run back to back. 

Both received the identical note, which was either 
low or high in intimacy. 

Intimacy manipulation. Each friend was asked to 
write a note that was high or low in intimacy. To 
ensure standardization of intimacy (but maintain 
generality of content within intimacy levels), she 
was asked to write on any one of three high- 
intimacy topics or three low-intimacy topics taken 
from the Taylor and Altman (1966, Note 1) 
intimacy-scaled stimuli. High-intimacy topics in- 
cluded “times I have been tempted to steal some- 
thing” and “times when I have wished I could 
change something about my physical appearance.” 
Low-intimacy topics included “situations that bore 
me” and “my favorite subjects in school.” To give 
the friend a feeling for high and low intimacy, she 
was shown the scale used by the authors for the 
system scoring intimacy and a dictionary definition 
of intimacy. Sample notes for each of the three 
topics were also given to the friend to clarify what 
was meant by high or low intimacy. In writing the 
note, however, each friend was encouraged to 
express it in her own words, drawing on her past 
experiences as appropriate. 

Dependent measures. The primary dependent mea- 
sure was a rating of the intimacy of the subjects’ 
self-disclosures. Two independent raters who were 
blind as to both hypotheses and conditions rated 
each disclosure along a 9-point scale (1 = little in- 
formation about self given, 9 = extremely intimate 
information). Each judge was trained in using the 
scale, which emphasized that as the uniqueness 
and/or the guarded nature (e.g., counternormative 
behavior, confession of weaknesses) of the disclosure 
increased, so did the intimacy level. The average 
for the two raters (the interjudge reliability was 
.76) was used as the unit of analysis. (To test the 
effectiveness of the intimacy input manipulation, the 
same two independent, blind judges who scored the 
subject's self-disclosures rated the partner’s self- 
disclosure on a 9-point scale in the low- and high- 
intimacy input conditions.) 

Other dependent variables included the number of 
words in each disclosure and impression ratings in 
which each subject rated her partner on 9-point 
scales for trustworthiness, unusualness of the note, 
and ability to predict her behavior in a crisis. The 


questionnaire also asked the subject as a masi 
tion check to rate how well she knew her po 


RESULTS 
Manipulation Checks 
Subjects were asked to indicate o a 
9-point scale how long they had known | + 
partner (1 = just met, 9 = years). Subi is 


in the friend condition indicated know 1g 
their partner significantly longer (8.2) © ən 
did subjects in the stranger condition ( a 
F(1, 44) = 508.9, p < .001. (Except for ii. / 
tests reported below, dj = 1,44, in this 
port.) There was a Friend X Intimacy Iı 
interaction on this measure, F = 8.7, p < .( 
Friends reported knowing their part: 
somewhat better in the high-intimacy (8 
than in the low-intimacy input (7.5) cov 
tions, whereas no differences appeared 
these ratings in the stranger condition (i 
vs. 1.0). 

The results of a ż test for independer 
groups indicated that the partner’s intimac 
ratings were significantly greater in the high 
intimacy (7.5) than the low-intimacy (4.2) 
input conditions, based on judges’ blind 
ratings, 1(22) = 6.9, p < .001. The standard 
deviation was .75 in the low-intimacy and 
1.4 in the high-intimacy condition. There 
was no significant difference in the number of 
words used in the low- and high-intimacy 
notes, (22) = 1.9, p > .10. 


Self-Disclosure Dependent Measures 


Figure 1 presents the two judges’ averaged 
ratings of intimacy in the four treatment 
conditions. Using a between-subjects analysis 
of variance, an Intimacy Input x Friend- 
ship interaction was found, F = 5.8, p < 05. 
Post hoc comparisons (Newman-Keuls tech- 
nique) revealed that intimacy scores for 
friends were not influenced by intimacy input, 
whereas intimacy scores for strangers were 
influenced. The intimacy ratings for friends 
were almost identical in both the low- (4.4) 
and high- (4.1) intimacy input conditions. 
When subjects believed that they were com- 
municating to a stranger, intimacy ratings 
were higher in the high- (6.2) than in the low- 
(4.2) intimacy input conditions ($ < .05). 
Also, the difference in intimacy means be- 
tween friends and strangers was significant 
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under high-intimacy input (p < .05). There 
was a marginally significant intimacy main 
effect (F= 3.5, p<.07), with intimacy 
ratings tending to be higher under high- 
intimacy than low-intimacy input. A friend- 
ship main effect occurred (F = 4.2, p < .05), 
with intimacy ratings being higher for stran- 
gers than for friends. This reversal in the 
usual prediction about the effect of friendship 
on self-disclosure resulted, of course, from 
the strangers’ high intimacy ratings in the 
high-intimacy input condition. 

An internal analysis was performed to 
determine the correlation between the inti- 
macy of the note received by each subject 
and the intimacy of her subsequent response. 
In the friend condition, intimacy input was 
not related to intimacy of response, that is, 
no reciprocity occurred (r= .15, ms). In 
contrast, reciprocity was quite strong in the 
stranger condition: The correlation between 
intimacy received and intimacy given was .64 
(p < .01). 

An analysis was made of the number of 
words written. There was a marginally sig- 
nificant tendency for strangers to write more 
(88.1) than friends (71.0, F = 3.8, p < .055). 
No other effects were obtained on this 
measure. 


Questionnaire Data 


There was a significant intimacy input 
main effect on the rating of the unusualness 
of the note sent by the partner (F = 8.3, p 
< 01). Subjects rated the note as being more 
unusual in the high-intimacy (4.8) than in 
the low-intimacy (2.8) condition. There were 
several friendship main effects, which seem to 
reflect the effectiveness of this manipulation. 
Friends, in comparison to strangers, were 
more trusted (F = 26.2, p< .001; 7.1 vs. 
2.3) and more predictable in terms of how 
they might react in a crisis (F = 45.1, p 
< .001; 7.2 vs. 2.8). There were no inter- 
action effects incorporating these pencil-and- 
paper dependent measures. 


DISCUSSION 


The results show that patterns of self- 
disclosure in a laboratory encounter are dif- 
ferent between friends and strangers. Stran- 
gers used the disclosure input—particularly 
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Ficure 1. Intimacy of subject disclosure as a 
function of degree of friendship and level of 
intimacy input. 


in the high-intimacy input condition—as a 
cue to govern their own response. Friends, 
on the other hand, maintained approximately 
the same low to moderate degree of intimacy, 
regardless of the level of disclosure input. 
Although low intimacy was reciprocated, high 
intimacy was not. 

The findings for the friendship main effect 
on the intimacy ratings do not mean that 
strangers generally disclose more than friends, 
Rather, they tend to confirm our view that 
the obligation to reciprocate high-intimacy 
input during the same encounter may be 
stronger between strangers than between 
friends. 

It seems that behavior patterns and under- 
lying mechanisms differ considerably for 
strangers and friends. In contrast to stran- 
gers, relationships between friends are prob- 
ably guided by a pattern of mutual trust and 
spontaneity. Friendships may not require a 
constant monitoring of one another’s behavior 
to assess what is obligatory. Behavior be- 
tween friends (particularly in terms of self- 
disclosure) can be nonintimate or intimate, 
depending on its usefulness in meeting mutual 
needs at an advanced stage of relationship. 
A friend may not have to reciprocate dis- 


on 


closure during the same interaction to prove 
that the revealer’s trust was well placed. 

In this study we have examined reciprocity 
only within a single session. Our decision to 
limit the study to one encounter was based 
on the wish to compare our results with past 
laboratory studies on self-disclosure. This 
methodology, however, has both advantages 
and disadvantages. Our note-passing proce- 
dure is more artificial than a face-to-face 
interaction, but the latter would have forced 
the friends to perform as actresses, without 
even a rehearsal! An alternative methodol- 
ogy, high in external validity, might have 
been a field study using interviews and recol- 
lections by people of past encounters with 
strangers and friends. A recent unpublished 
study by Rubin and Shenker (Note 2) used 
such a methodology to compare disclosure 
patterns among roommates and hallmates in 
a dormitory. However, such studies only re- 
sult in correlational data, with all the prob- 
lems of interpretation that such data produce, 
We settled on the present design as the 
best way to investigate the problem in as 
controlled a fashion as possible. 

Our use of 12 different notes as disclosure 
input for each intimacy level is unusual. In 
most studies, one or two standardized notes 
at each level are used.? Our use of 12 notes 
avoids a problem found in these studies in 
which intimacy input and the content of the 
disclosure are confounded. At the same time, 
yoking each friend’s note by using it again 
in the same intimacy condition for one stran- 
ger added control to the intimacy input ma- 
nipulation. In addition, our results indicated 
that there was little variability in the inti- 
macy inputs within conditions: The standard 
deviation was only .75 in the low-intimacy 
condition and 1.4 in the high-intimacy 
condition. 

This study is the first to investigate experi- 
mentally in the laboratory the effects of 
friendship on the exchan of self-disclosure. 
Although strangers are more readily acces- 
sible as subjects (particularly in experiments 
conducted on college campuses), results from 
these studies are probably limited in under- 
standing relationships between friends. Re- 
search designed to test theories about the 
acquaintance process has unduly emphasized 
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social relationships between strangers. 
hope that this study encourages othe: to 
investigate experimentally disclosure pa 
in established relationships. 


2 During pilot testing, we tried using stand: 
notes for both friends and strangers. Howev no 
matter what we tried, it was found that the 
(especially the high-intimacy ones) were 
credible in the friend condition. This proble 
not occur when individualized notes were us 
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What the Shadow Knows: Person Perception in a 
Surveillance Situation 


James M. Olson, John C. Barefoot, and Lloyd H. Strickland 
Carleton University, Ottawa, Canada 


Under one of three task conditions, male subjects observed a prescribed be- 
havior sequence performed by a female confederate, Covert-surveillance subjects 
followed her in public, having been told that she was randomly chosen as a 
target and was unaware of the surveillance, Overt-surveillance subjects actively 
followed the target, but knew of her role as confederate. Inactive subjects 
viewed a videotape of the behavior sequence. It was hypothesized (a) that 
active participation in surveillance would lead to emotional arousal in the ob- 
server (b), that overt-surveillance subjects would find the task more aversive 
than would subjects in the other conditions, and (c) that impressions of the 
target would be related to the affective experience of the observers. Self-reports 
revealed that overt-surveillance subjects experienced the task as more aversive 
than did inactive subjects. Covert-surveillance subjects also reported aversion, 
but this was coupled with a high level of reported pleasurable excitement. Im- 
pressions of the target were more negative in the overt-surveillance condition 
that in the other conditions. Discussion centered on the possible causes for the 
diverse emotional experiences and the distinctiveness of surveillance as a form 


of social interaction. 


Several investigators (e.g., Argyle & Wil- 
liams, 1969; Goffman, 1969; Strickland, 
1958) have noted that the act of surveillance 
may place demands on the observer which 
influence the way information about the tar- 
get is processed. For example, Strickland 
(1958) studied the attributions of subjects 
who played the role of supervisor in a simu- 
lated industrial setting. He found that they 
placed less trust in a worker who had been 
under surveillance than in an equally produc- 
tive worker who had not been monitored. 
Strickland hypothesized that subjects saw sur- 
veillance as a force that motivated task per- 
formance, leading them to doubt that the 
monitored worker was as instrinsically moti- 
vated as the worker who performed well in the 
absence of this extrinsic pressure. 
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In the course of replicating the Strickland 
study (Strickland & Barefoot, Note 1), it was 
found that subjects frequently reported that 
they felt uneasy about performing the pre- 
scribed monitoring task, citing their view 
that surveillance was a coercive and some- 
what antisocial act. These observations raise 
questions that are unrelated to the original 
Strickland hypothesis but are nevertheless 
important for the understanding of surveil- 
lance. If some subjects felt uneasy in a simu- 
lated industrial setting where surveillance 
tends to be sanctioned by both society and 
the participants, what will observers experi- 
ence when conducting surveillance in a less 
innocuous setting? How will this emotional 
experience influence the observer’s perceptions 
of the target? The experiment reported below 
is designed to provide information relevant to 
these questions by assessing the experience 
and perceptions of observers who “shadowed” 
a person who was performing normal activi- 
ties in a public setting.” 

In addition to the socially approved nature 


1It should be emphasized that this task differs 
from the Strickland task on a number of dimensions 
and, therefore, behavior of the subjects in the two 
studies cannot be compared directly. 
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of Strickland’s (1958) surveillance setting, it 
is also important to note that his subjects 
performed their surveillance overtly (i.e., the 
targets were fully aware of the monitoring). 
This is true of many surveillance settings, but 
some monitoring acts are carried on covertly 
without the target’s knowledge. Moreover, re- 
search on other marginally counternormative 
acts suggests that the variable of monitor 
anonymity may be an important mediator of 
the observers’ experience. For example, a 
number of studies have established that sub- 
jects are more willing to comply with requests 
following the commission of a transgression, 
an effect which has been ascribed to the sub- 
ject’s feelings of guilt (Krebs, 1970). It has 
also been found that this “guilt and compli- 
ance” effect does not occur if the subject’s 
transgression remains undiscovered (Silver- 
man, 1967; Wallace & Sadalla, 1966), sug- 
gesting that public identifiability may be a 
prerequisite for the experience of the negative 
emotional state. These studies suggest that 
any negative emotions produced by participa- 
tion in overt surveillance might not be pres- 
ent in subjects who do their monitoring co- 
vertly. 

It is also possible to suppose that observers 
might experience surveillance as pleasurable 
or thrilling. Research on the phenomenon of 
deindividuation (Festinger, Pepitone, & New- 
comb, 1952; Singer, Brush, & Lublin, 1965) 
has demonstrated that subjects may be at- 
tracted to mildly counternormative activities 
when they are not highly identifiable as indi- 
viduals. In addition, Berscheid and Walster 
(1974) have discussed instances in which re- 
actions to danger may be labeled as pleasur- 
ably exciting, and the act of surveillance has 
a number of components which might facili- 
tate such a labeling. There is the novelty of 
the act, the opportunity to perform a nor- 
mally forbidden act, the sense of superiority 
inherent in being able to “see without being 
seen,” and, perhaps, an opportunity to iden- 
tify with the “spy,” a figure who is fast be- 
coming a folk hero in our society (Taylor, 
1975). 

In summary, active participation in surveil- 
lance should lead to emotional arousal in the 
observer but the nature of that arousal should 
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depend on the overtness of the act, Th- fact 
that surveillance is viewed as at leas: =ar- 
ginally counternormative should lead i b- 
server to feel nervous and apprehensive ` the 
surveillance is performed overtly, but ti: il- 
ity to remain anonymous should alleviate ‘his 
aversiveness if the surveillance is perfo: «ed 
covertly. In fact, covert surveillance mig.. be 


experienced as exciting and positive. 

Of course, questions about the emo! al 
experience of the observer derive muci. of 
their significance from the possibility at 
those emotions may influence the obser- ~'s 
impressions of the target. Much psycholo «al 
research has been devoted to the demon 
tion of a relationship between affect and |“ 
ception, so it is reasonable to expect the 
server’s impressions to be related to the e 
tion generated during surveillance. The not 
likely form of the relationship is positiv”, 
with more favorable impressions being form 
in the conditions which generate more positi 
affect (Lott & Lott, 1974). 


METHOD 
Overview 


In a 2X3 factorial design, male subjects were 
asked to observe the performance of a standardized 
behavior sequence by one of two female confederates 
under one of three task conditions. Coyert-surveil- 
lance subjects followed the confederate, having been 
told that she had been chosen at random ant 
unaware of the experiment. Subjects in the overi 
surveillance condition also followed the confederate 
but were aware of her role as the experimenter’s 
accomplice. Subjects in the inactive condition did 
not actively follow the target but simply viewed a 
videotape of the behavior sequence. 

Following the period of surveillance, measures wer 
taken of the subjects’ experienced emotions and thei 
impressions of the target. All participants were 
completely debriefed. 


Subjects 


Male students were recruited from introductory 
psychology courses and given course credit for their 
participation. The data of 2 of the 62 subjects who 
participated were discarded. One of these subjects 
voiced suspicion about the randomness of the choic: 
of target, and the other lost sight of the target during 
the experiment and never found her again. The 
remainder were randomly assigned to condition to 
achieve 10 per cell. 


Confederates 


Two female age peers of the subjects served as 
targets. They were extensively trained to emit the 
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behavior sequence as invariantly as possible and 
were not informed of either the condition assign- 
ments of the subjects or the hypotheses under test. 


Task Variable 


Subjects began the experiment in a laboratory 
testing room where the experimenter told them that 
he was interested in “impression formation in sur- 
veillance situations.” He stated that most past sur- 
veillance research had been done in the lab with 
subjects viewing unnatural behavior in an unnatural 
setting, but his desire was to study “impressions that 
esult from naturally occurring behaviors in natural 
ituations.” Subjects in the covert-surveillance con- 
dition were then told the following: 


I ami therefore going to ask you to follow someone 
around, watch them, and then see what sort of 
impressions you form of them. . . . We'll randomly 
choose someone and have you observe him or her 
for a while. All I want you to do is observe the 
person. Don’t talk to him or her, and don’t lose 
him or her. Just follow them around wherever 
they go and watch them. You can observe them 
from whatever distance feels most comfortable to 
you. Then when your observation period is over, 
come back here to this room and TIl give you 
some impression ratings to fill out about the person 
you watched. 


Subjects in the overt-surveillance condition were 
given identical instructions except that the sentence 
ibout the random choice of the target was omitted 
and the following statement was used in its place: 
“The person I am going to have you watch is work- 
ing for me; she is part of this experiment. What she 
does while you are watching has already been deter- 
mined by me and practiced by her.” 

In the inactive condition, subjects were simply 
told that they were to watch a film, form impressions 
of the target, and later complete the impression rat- 
ing scales, 

Subjects in the overt- and covert-surveillance con- 
ditions were then given a stopwatch and told to 
observe for as close to 15 minutes as possible. Addi- 
tonal details about the task were added and covert- 
surveillance instructions included various comments 
designed to enhance the credibility of the deception 
(e.g., what to do if the target left the campus). All 
subjects were then given the opportunity to withdraw 
from the experiment. None chose to do so. 

Subjects in the active-surveillance conditions were 
then led to a staircase about 40 m from the testing 
room. The experimenter opened the stairwell door 
while saying, “To ensure that the choice is random, 
let's say that the next person to enter through this 
door coming our way will be the person you ob- 
serve.” He then allowed the subject to see that no 
one was in the stairwell and let the door close. 

The confederate, hidden in the stairwell, was able 
to hear the experimenter’s statement and walked 
through the door 30 seconds after the cue unless an 
earlier entrance was made necessary by the approach 
of someone else. 


Once the confederate came out of the stairwell, the 
experimenter left the subject alone to observe for 
the next 15 minutes. At the end of the observation 
period, the subject returned to the testing room to 
answer the questionnaire containing the dependent 
measures. 

Subjects in the inactive condition were taken di- 
rectly from the testing room to an adjacent room 
where a Sony AV-3600 Videocorder was connected to 
a 16-inch Sony video monitor. The experimenter 
turned on a videotape of the same behavior sequence 
seen by the subjects in the active surveillance condi- 
tions and left the room. When the tape was over, 
the subject turned off the machine and returned to 
the testing room to complete the questionnaire. 


Confederate’s Behavior Sequence 


The confederate proceeded from the stairwell to a 
nearby foyer and sat down near a window. For the 
first minute she stared out the window. During the 
next 2 minutes, she became increasingly “fidgety,” 
tapping her feet and manipulating her rings, her 
watch, her books, and her hair. She then stood and 
left the foyer, walking down a set of stairs to a 
system of tunnels used for interbuilding travel. For 
the next 24 minutes she walked through the tunnels 
to the university library, where she ascended the 
stairs to the fourth floor and sat at a table near a 
window. She took a reference book from a shelf at 
the table and opened it, but for the next 3 minutes 
she appeared to be unable to concentrate. She grew 
increasingly upset and finally took a tissue from her 
purse and wiped her eyes. After composing herself, 
she directed her attention to the book and read 
until the subject terminated his observation. At no 
time in the sequence did she establish eye contact 
with the subject. 


Dependent Measures 


Subjects were asked to indicate on 7-point scales 
(1=not at all; 7 = extremely) degrees of nervous- 
ness and guilt they experienced during the observa- 
tion. They were also asked to indicate on 7-point 
scales the degree to which the following adjectives 
described their reactions to the task: fun, nerve- 
racking, involving, unpleasant, exciting, and guilt- 
producing. For purposes of analysis these ratings 
were combined into two composite indices. The 
thrill index was composed of the ratings on the 
“fun,” “involving,” and “exciting” scales, whereas 
the aversion index was derived from the ratings on 
the “nervous,” “guilty,” ‘“nerve-racking,” “un- 
pleasant,” and “guilt-producing” scales.* 

In order to assess the subject’s impressions of the 
target, he was asked to characterize her on 14 trait 
adjectives, again using 7-point scales. These ratings 
were combined into the following four indices, which 


2 While these indices were composed on an a priori 
basis, their composition received strong support 
from a subsequent factor analysis of the data. 
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TABLE 1 
MEANS AND UNIVARIATE Fs ror THE TASK EFFECT 
Condition means 
Covert sur- Overt sur- 
Index veillance veillance Inactive 
Emotions 
‘Thrill 5.30 4.30 3.85 8. $ 
Aversion 2.61 2.95 1.93 5.2 
Impressions J 
Untrustworthiness 2.79 3.64 3.13 5.14 
Sociality 435 3.50 3.83 2.6 
Emotional maladjustment 3.66 4.41 3.90 3.6 
Respect 4.73 4.58 4.88 S 
Confidence 3.70 5.45 4.15 5.7 
Note, Range of possible scores = 1 to 7, 
adj = 2,54, 
*p <10. 
wp <05. 
wk > < 01, 
wee DD < 00L 


are listed with their components: (a) untrustworthi- 
ness index: unreliable, untrustworthy, insincere, hon- 
est (reversed scoring); (b) sociality index: friendly, 
cold (reversed scoring), likable; (c) emotional-mal- 
adjustment index: maladjusted, compassionate (re- 
versed scoring), easily upset, weak; (d) respect 
index: respectable, intelligent, mature.* 

Finally, subjects rated their confidence in their 
reported impressions and were asked two open-ended 
questions about what they observed and how they 
felt during the surveillance. 


RESULTS 
Overall Analysis 


Multivariate analysis of all indices revealed 
that there was no between-confederate effect, 
F (7,48) < 1, and no interaction between the 
task and confederate variables, F(14,96) = 
1.18. Both roots associated with the test of the 
task main effect were significant, F(14, 96) 
= 4,38, p < 001 and F(6, 48.5) = 3.34, p < 
Pel Ba 

Table 1 presents the means of the indices 
that composed this effect along with the re- 
sults of the univariate analysis of each index. 


Emotion Indices 

Univariate tests for both emotion indices 
were significant. Multiple comparisons using 
the Newman-Keuls test (criterion set at p < 
.05) revealed that the covert-surveillance sub- 
jects scored higher on the thrill index than 
either the inactive subjects or the overt-sur- 
veillance subjects. Overt-surveillance subjects 
did not differ from those in the inactive condi- 
tion. On the aversion index, both of the 
active-surveillance conditions produced higher 


ratings than the inactive, but they © not 
differ from each other. 

The picture that emerges from the res; uses 
to both the rating scales and the open- ded 
questions is one of quite different en» onal 
experiences in each condition. Subject the 
inactive condition were apparently bo «| or 
at least unemotional. They had low sc. s on 
both of the emotion indices and this i sup- 
ported by their open-ended responses. | hey 
tended to report that the task was “du” or 
that they “didn’t feel anything.” Many sub- 
jects wrote about the target’s possible emo- 
tions rather than their own, indicating that 
the request to describe their feelings may have 
been an irrelevant question since they hac few 
emotions to report. 

The overt-surveillance subjects felt ill at 


ease during their task. They reported high 
levels of aversion and little thrill. The open- 
ended responses paralleled their ratings with 
comments such as the following: “I felt very 
self-conscious about people around me know- 
ing that I was following her”; “I felt under a 
great deal of stress and was very glad that 
she never looked to see if I was staring at her 
or not.” Many subjects clearly felt awkward 
or embarrassed about their behavior, but the 
most frequent comment was that they felt 
“conspicuous.” 

On the other hand, covert-surveillance sub- 


3 The statistics and associated probabilities for the 
multivariate tests are Rao’s F approximation to 
Wilk’s Lambda. 
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jects reported high levels of both aversion and 
thrill. These apparently conflicting emotions 
are seen in the open-ended responses as well: 
“I was nervous at first, but then I began to 
enjoy it, sort of cat and mouse”; “It was fun 
but nerve-racking”; “I felt sort of like Man- 
nix. I was, however, a little nervous at first as 
I thought she might have realized I was watch- 
ing her. Anyway, it was great fun.” In these 
cases, it is almost as if the nervousness in- 
luced by the fear of discovery serves to 
heighten the enjoyment and make the task 
exciting. 
Impressions 

The tests summarized in Table 1 demon- 
strate that the method of observation affected 
the impressions of the target as well as the 
emotional experience of the task. Covert- 
surveillance subjects formed more favorable 
impressions of the target than did overt- 
surveillance subjects on the untrustworthiness, 
emotional maladjustment, and sociality in- 
dices, although the test was only marginally 
significant on the sociality variable. The data 
of the inactive subjects are intermediate to 
those of the two active-surveillance condi- 
tions. In order to test differences in overall 
impressions, multivariate multiple compari- 
sons tests were employed.‘ The covert- and 
overt-surveillance conditions differed from 
each other, F(4, 54) = 5.88, p < .001, and 
the impressions formed in the overt-surveil- 
lance condition were less favorable than those 
in the inactive condition, /(4,54) = 3.22, p 
< .02. The comparison between the covert- 
surveillance condition and the inactive condi- 
tion failed to reach significance, (4,54) < 1. 


Impression Confidence 


‘Table 1 also reveals a significant effect of 
the task on the subjects’ rated confidence in 
their impressions. Multiple comparisons using 
the Newman-Keuls test (criterion set at p < 
05) found that the overt-surveillance sub- 
jects were more confident than the subjects in 
either the covert-surveillance or inactive con- 
ditions, who did not differ from each other. 

This finding was unexpected in the sense 
that it runs counter to previous research on 
attribution processes (e.g., Strickland, 1958), 
which suggests that overt-surveillance sub- 
jects should be the least confident because 
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they knew that the target’s behavior was par- 
tially determined by external pressures and 
was, therefore, relatively uninformative about 
the target’s personality. Of course, it is possi- 
ble that the overt-surveillance subjects might 
have followed the target more closely, and 
actually obtained more information about her. 
In order to test this explanation, the subjects’ 
open-ended descriptions of the behavior se- 
quence were coded for completeness and accu- 
racy. There were no significant differences 
between conditions on this measure, F'(2, 54) 
< 1, lessening the likelihood that the manip- 
ulations led to differential accuracy, even 
though the subjects differentially reported 
their confidence, 


Discussion 
Emotions 


It is clear from these results that participa- 
tion in active surveillance can generate emo- 
tional arousal in the subject, but the nature of 
the affect depends upon the conditions under 
which the surveillance takes place. When the 
surveillance was open and known to the tar- 
get, observers reported that they felt nervous 
and apprehensive. When the target was un- 
aware of the surveillance, the apprehension 
remained, but the character of the experience 
was changed so that the subjects reported 
what we have described as a “thrill.” Al- 
though both active-surveillance conditions had 
their aversive aspects, the open-ended re- 
sponses suggest that the source of the appre- 
hension differed between conditions. Overt- 
surveillance subjects claimed that they felt 
conspicuous and that they were concerned 
about the reactions of bystanders. They 
seemed to feel as if they were the targets of 
observation as well as initiators, and the 
resulting aversion may have been due to 
evaluation apprehension. On the other hand, 
the uneasiness of the covert-surveillance sub- 
jects seems to have stemmed from a different 
source—fear of discovery by the target. As 
long as they were successful in remaining 
undiscovered (which they were), their aver- 
sion should have contained fewer components 
of the evaluation apprehension which was 


4Stevens (1972) has noted that multivariate con- 
trasts made after a significant overall test need not 
constitute an orthogonal set. 
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apparently experienced by the overt-surveil- 
lance subjects. This line of reasoning also 
suggests that the self-reports of the covert- 
surveillance subjects may have been partially 
due to the knowledge that they had success- 
fully completed a difficult task. 

Although the experiences of the overt- and 
covert-surveillance subjects were distinctly 
different, this difference cannot be unambigu- 
ously ascribed to the single variable of target 
knowledge because the manipulation varied 
other aspects of the relationship as well. By 
informing the overt-surveillance subjects that 
the target was a confederate, we also manipu- 
lated the implied consent of the target to the 
surveillance and informed the subject that the 
behavior sequence was planned rather than 
spontaneous. This is true of the inactive condi- 
tion as well. A purer manipulation of target 
knowledge would have entailed a condition in 
which the target was apparently chosen at 
random but behaved as if she had “discoy- 
ered” the presence of the observer during 
course of the behavior sequence. In fact, such 
a condition was included in the original design 
of the experiment, but pretesting revealed that 
this treatment was so disturbing that ethical 
considerations forced us to abandon it. This 
leads us to believe that, if anything, the 
comparison between the overt- and covert- 
surveillance conditions is an underestimate of 
the effect of target knowledge. 


Impressions 


Reports of subjective state were not the 
only variables affected by the task manipula- 
tion. Reported impressions of the target were 
also found to differ between conditions, with 
the target seen more negatively in overt-sur- 
veillance than in the covert-surveillance or 
inactive conditions. 

The potential practical implications of this 
finding make it imperative to include a note 
of caution about the generality of these re- 
sults. The effects of several major variables 
must be studied before the applicability of 
these results can be assessed. Three are of 
particular concern. Foremost among these is 
the question of adaptation over time. It is 
very possible that the emotional experience of 
surveillance (and the resulting impressions) 
would change as the observer became more 
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accustomed to the act. The exciting cts of 
covert surveillance are especially 1 to di- 
minish over time. 

Second, it must be remembered tit the 
surveillance in this experiment took pl:-+ ina 
benign context. Surveillance which hac! ‘gnifi- 
cant and possibly negative consequen. s for 
the target might heighten ego-threat a feel- 
ings of guilt in the observer, resultin: in an | 
entirely different emotional experien.. The 
effects of such emotions on impressi s are 
unknown. 

Another concern stems from the fa.‘ that 
time and resource constraints forced us to 
limit our study to one combination of par- 
ticipants’ sexes, male observers an male 
targets. Variations on the sex of pa: vants 
are imperative in future studies. 

These are only a small sample of th varia- 
bles that may be important in dete vining 
impressions in surveillance situation aria- 
bles such as the inherent social desi ty of 
the target’s behavior and the degr truc- 
ture in the observers task may have 
significant effects on the observer otions 
and the generality of the findi: this 
study. 

Confidence 

Although the impression ratin; ere gen- 
erally in line with expectations, © > puzzling 
aspect of the results was the {ct that the 


overt-surveillance subjects expressed more 
confidence in their impressions chan did the 
subjects in the other conditions. The differ- 
ence between the overt-surveillance and in- 
active conditions can be easily ascribed to the 
fact that inactive subjects were basing their 
impressions on stimuli that were less immedi- 
ate, but this does not account for the confi- 
dence differences between the overt- and 
covert-surveillance conditions. Here we must 
remember that overt-surveillance subjects 
knew that the behavior sequence had been 
planned by the experimenter. This knowledge 
may act as a demand characteristic because 
the subjects knew that the experimenter was 
aware of what they had observed and may 
have thought that he was expecting the sub- 
ject to come to a conclusion. In other words, 
the subject may have thought that the experi- 
menter expected a confident impression and 
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he reported one. If this line of reasoning is 
correct, it raises important methodological is- 
sues about research in social perception. If 
subjects are less willing to form confident im- 
pressions from spontaneous behavior than 
from experimenter-prescribed behavior, experi- 
ments which do not ask subjects to judge 
spontaneous behavior may overestimate the 
degree to which people are willing to draw 
strong inferences from the behavior of others. 

It is also possible to apply a similar expla- 
nation to the results concerning impressions. 
Perhaps overt-surveillance subjects reported 
negative impressions simply because they 
thought the experimenter expected them to do 
so. The absence of negative impressions in 
the inactive condition (where this demand 
was also present) argues against this interpre- 
tation, but the explanation cannot be com- 
pletely ruled out until it is explicitly tested. 


Surveillance as a Form of Interaction 


Despite the remaining questions, the data 
of this study demonstrate that the surveillance 
situation may be seen and experienced by the 
participants as one which is quite different 
from normal interaction; yet this is only one 
of the psychological dimensions that give sur- 
veillance acts their distinctiveness. Argyle and 
Williams (1969) have noted that the observer 
and the observed are faced with quite differ- 
ent cognitive tasks, which may cause them to 
adopt different cognitive sets and draw differ- 
ent inferences from the same information, 
Strickland (1958) has studied the attribu- 
tional dilemma of an observer who realizes 
that the behavior under observation may be 
partially determined by the pressures of sur- 
veillance itself. Another relatively unexplored 
aspect of surveillance stems from the fact that 
some forms of surveillance involve the ex- 
change of information in the absence of the 
rules and rituals that normally govern our 
interactions, For example, the prescriptions 
about saving face and maintaining the face of 
others (Goffman, 1967) are inapplicable to 
surveillance settings. The knowledge that the 
normal interaction rules have been suspended 
may cause subjects to treat the information 
differently and may contribute to the novelty 
of the task. Perhaps this knowledge contrib- 
uted to the subjective experience. reported by 


the subjects in the covert surveillance condi- 
tion, 

Although the above arguments are intended 
to support the assertion that surveillance is a 
distinctive form of interaction, it must be 
acknowledged that it also has much in com- 
mon with the way we gather information 
about others during normal interaction (Goff- 
man, 1969). The study of surveillance, there- 
fore, may shed light on psychological processes 
in normal interactions as well as special sur- 
veillance settings. 


REFERENCE NOTE 


1. Strickland, L. H., & Barefoot, J. C. Surveillance 
and attribution processes. Paper presented at the 
meeting of the American Psychological Association, 
New Orleans, August 1971. 
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Differential Social Perception and Attribution of Intergro.p 
Violence: Testing the Lower Limits of 
Stereotyping of Blacks 


Birt L. Duncan 
University of California, Berkeley 


In a modified 4 X 4 factorial design with race (black-white) of the harm-doer 
and race (black-white) of the victim as the major factors, the phenomenon of 
differential social perception of intergroup violence was established. White sub- 
jects, observing a videotape of purported ongoing interaction occuring in an- 
other room, labeled an act (ambiguous shove) as more violent when it was 
performed by a black than when the same act was perpetrated by a white, 
That is, the concept of violence was more accessible when viewing a black 
than when viewing a white committing the same act. Causal attributions were 
also found to be divergent. Situation attributions were preferred when the 
harm-doer was white, and person (dispositional) attributions were preferred 
in the black-protagonist conditions. The results are discussed in terms of per- 
ceptual threshold, stereotypy, and attributional biases. 


Tt is now generally accepted that systems of 
values and beliefs are important determinants 
of individual behavior. However, we know 
little “about the lower limits of these influ- 
ences, about their capacity to reach toward 
the regions of human experience which appear 
as immediately given, as the incontrovertible 
evidence of our senses” (Tajfel, 1969, p. 322). 
The many studies related to the “new look” 
in perception have adequately demonstrated 
that perception is not necessarily “veridical” 
and that an individual’s perception of a stimu- 
lus object or event is influenced by his par- 
ticular needs, wishes, and expectations of the 
moment—we tend to perceive what we wish 
or expect to perceive (Merton, 1957). A clas- 
sic experiment in the “new look” tradition 
was performed by Bruner and Goodman 
(1947) who found that the perceiver tended 
to accentuate the size of valued objects (i.e., 
coins) and that this tendency was especially 
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pronounced among subjects with a p ack- 
ground. The literature concerning (i. rela- 
tionship between motivation and perce: ‘on is 
very extensive (Allport, 1958; Brune: 1958; 
Secord & Backman, 1964). The reseay. ` find- 
ings to date indicate that the social -d cul- 
tural background of an individual doc. affect 
the manner in which he perceives the world 


around him. 

Kelly (1955) has formulated a theory of 
personality that leads to predictions similar to 
those derived from theoretical formulations 
based on social and cultural determir of 
social perception, although he emphasizes the 
construed replication of events and cognitive 
aspects of behavior rather than motivation. 
For Kelly, every man is a “scientist” inter- 
ested in the accurate prediction and control 
of future events. His predictions are deter- 
mined by the manner in which the individual 
has construed the replication of past events. 
Kelly’s theory of personal constructs in some 
ways resembles Brunswick’s (1956) formula- 
tion of the probabilistic use of stimulus cues. 

Basic to Kelly’s theory is the function of 
constructs (or concepts) in determining how 
a person construes a stimulus object. Simi- 
larly, Bruner (1957) contends that the process 
of perception involves other processes, such as 
inference, categorization, judgment, and pre- 
diction. According to Bruner, a person neces- 
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sarily “knows” the world only in terms of 

previously formed categories or concepts, All- 

port (1958) emphasizes the role of categoriza- 

tion in the development and maintenance of 

stereotypes: 

The human mind must think with the aid of cate- 

gories [the term is equivalent here to generalizations]. 

Once formed, categories are the basis of normal pre- 

judgement. We cannot possibly avoid this process. 

Orderly living depends upon it. 

Ve may say that the process of categorization has 

live important characteristics. 

(1) It forms large classes and clusters, for guiding 
our daily adjustments . . . 

(2) Categorization assimilates as much as it can to 
the cluster... 

(3) The category enables us quickly to identify a 
related object... 

(4) The category saturates all that it contains with 
the same ideational and emotional flavor . . . 

(5) Categories may be more or less rational. (All- 
port, 1958, pp. 19-21) 

In order to avoid a lengthy discussion of 
the distinction between a category and a 
stereotype, we shall here accept the definition 
of stereotyping offered by Tajfel (1969, p. 
423): “the general inclination to place a per- 
son in categories according to some easily and 
quickly identifiable characteristic such as age, 
sex, ethnic membership, nationality, or occu- 
pation, and then to attribute to him qualities 
believed to be typical of members of that cate- 
gory.” The relationship between this definition 
of stereotyping and Allport’s five character- 
istics of the process of categorization is obvi- 
ous. 

One of the stereotypes frequently applied to 
blacks is that they are impulsive and given to 
crimes and violence, If one believes that 
blacks are more prone to violent acts than 
whites, it is reasonable to assume that the con- 
cept of violence is more accessible when view- 
ing a black than when viewing a white com- 
mitting the same act (see Bruner, 1957). In 
other words, the threshold for labeling an act 
as violent is lower when viewing a black actor 
than when viewing a white actor. The present 
study attempts to test this proposition. 

The perception of intergroup violence was 
chosen as the experimental situation because 
of its contemporary relevance and the high 
probability that it is of concern to people in 
their day-to-day activities. Many members 
of our society feel extremely threatened by 


the possibility of intergroup violence, recalling 
the outbursts of the late sixties and the scat- 
tered recent outbursts such as in Boston, 
Massachusetts. However, it is not meant, by 
implication, that social perception is unique 
to race relationships. Given the increased 
polarization of various groups in our society 
along several dimensions (e.g., age, sex, poli- 
tics, dress) differential social perception is 
probably contributing to the misunderstand- 
ing among many groups. 

Asch (1948), employing written communi- 
cation, performed a now classic experiment 
that is relevant to the hypotheses to be 
tested. He attributed identical statements to 
various persons and found that the meaning 
of the statement varied with the hypothetical 
author, A statement written by Thomas Jef- 
ferson was usually interpreted positively, but 
the same statement was viewed negatively 
when attributed to a leading Communist fig- 
ure. Asch has argued that the subjects’ re- 
sponses did not indicate a reevaluation of the 
statements but actually represented a change 
in the object of judgment (Asch, 1940). 

One purpose of the present design was to 
establish the phenomenon of differential social 
perception in regard to intergroup violence. 
That is, subjects are likely to label an act as 
more violent when it is performed by a black 
than when the same act is perpetrated by a 
white. The study also seeks to go one step 
beyond perceptual phenomena in the Bruns- 
wick (1956) sense by looking to what causal 
locus intergroup behavior becomes attributed. 
At a theoretical level, it concerns the process 
through which an individual assigns causes to 
the behavior he observes and the consequences 
of his resulting beliefs about the causality, 
Heider’s (1958) attribution theory derives 
directly from Brunswick’s treatment of per- 
ception. Just as the perceiving individual must 
integrate the highly variable cues given the 
stimulus representation in order to “infer” the 
relatively unchanging object that gave rise to 
them, the attributing individual must inte- 
grate the cues given in order to infer the more 
stable factors that gave rise to them. Kelley’s 
(1967) conceptualization, which is derived 
directly from Heider, suggests the cognitive 
processes that might be engaged in by the 
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individual during this “inference.” Another heated discussion and ambiguous shove near ‘he end 

purpose of the experiment was to look at the of the tape. 

individual’s interpretations of the intergroup Subjects 

violence behavioral sequence in the context of aie PARAE A f 
; H H 4 = e subjects were white undergraduat At é 

stimulus, situation, and person attributional University of California, Teyine, They were r-uited 

choices postulated by Kelley. That is, the py responding to signs that displayed an « ‘er of 


purpose was to determine whether it was $1.50 for participating 30 minutes in a stud called 


3 P 


something about the stimulus that caused the “interpersonal behavior research.” Subjects vərtici- 
individual to act in that way (violently), Pated individually and were mandom aai ondi. 
. . = y r u mal- 
something about the person, or something Ha nie ia y 
about the situation (particular circumstances). 
Are the observers (subjects) willing to take Procedure 
the behavior more ox less at “face value,” 2 When the subject arrived for his appointn at, he 
reflecting a stable disposition (person attri- found a white experimenter tampering wi the 
butions), and attach insufficient weight to video system in the taping room, The camera ` 3s on 
circumstantial determinants (situation attri- ma thp ee a eles s a ig eee 
butions) of the behavior, as has been demon- iey RRNA Peta Nes belo me 
strated in previous research (Jones & Harris, television screen). The experimenter, becomir nore 
1967) for both black and white harm-doers? serious, stated, “Before anything else, let me p you 
If the threshold for labeling a behavior as for participating in this study.” Whereupon sub- 
violent is lower when the harm-doer is black, bas a r RE e a camer? E 
2 A a y was p 
pull the See perceptual incompleteness hold promised $1.50. The experimenter continuc fol- 
for “slim-evidenced” person attribution in the lows: 
case of white harm-doers? 
There are actually three people scheduled sach 
session. I scheduled you a few minutes © ad of 
MerHop the other two so I could familiarize you h the 
Stimulus Materials set-up. You will be observing the other ty ople 
> by means of the closed circuit TV system ich I 
tae Pa we was ag two black) from have just shown you. You will be watch: from 
eee ‘eave a S S ra a script, loosely another room because in the past we hay ‘ound 
constructed in order to ai opt their respective patois. subjects felt more at ease if they weren being 
The content of the scripts, however, was not altered i F 
i 4 i Ais directly observed. 
by this adaptation. The script was initially developed 
around two of the “dilemma of choice” items on the The subject was then led down the hal! ‘o an 


risk-taking questionnaire constructed by Wallach and 
Kogan (1962). These hypothetical decision situa- 
tions offered a scenario in which an altercation be- 
tween the confederates could be developed. Twelve 
versions (4 race-relation conditions X 3 confederate 
stimulus conditions) of the script were videotaped. 
Thus, every possible confederate pairing by race, 
harm-doer, and victim variables was provided. 


dimensions in the factorial matrix were 
k-white) of the harm-doer and the 
te) of the victim. The experimental 
of a videotape of two males dis- 

ft problem (Brown, 1965, Chap. 
subject who observed the tape 

at the discussion was actually 

r room. The subject was asked 

ior of the “actors” six times at 


observation booth, being made aware by the eaperi- 
menter of the cord (bogus experimental prop) from 
the taping room to the booth. After the subject was 
seated in the booth, the experimenter presented the 
subject with the dependent measure rating forms 
saying, 


Here are your instructions. You can read them 
while I’m talking to the other two subjects, but 
first I» want to show you the rating scales and 
tell you a little about our research. This study is 
part of a project aimed at developing a new sys- 
tem for rating of interpersonal behavior. As you 
can see this new system involves quite simple 
categories of behavior. This is because it is intended 
for nonprofessional observers, Note that your rat- 
ings occur immediately after I signal you with 
the buzzer. I will also be able to control the closed 
circuit TV from my booth next door, so I will 


turn it off briefly in order to let you complete 
your ratings. You won’t have a lot of time so 
respond quickly. Now you will do ratings of two 
series of interactions. The first is a practice. I will 
return after this first practice series to answer any 


to eval 

precise inti 
experimente 
measures wert 
rating), which w 


were signaled to him by the 
tape. The major dependent 
ts’ final evaluations (sixth 
red to coincide with the 
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questions you might have. Please do not leave this 
booth in order to ask me questions. I'll return to 
answer them. 


Independent Variables 


The subject was randomly assigned to one of the 
following experimental conditions (tapes): black 
protagonist-white victim; white protagonist—black 
victim; black protagonist-black victim; white pro- 
tagonist-white victim. All possible pairings of the 
confederates were made in order to avoid interpre- 
(ation of effects due to confederate differences, thus 
making 12 conditions. When the tape was started, 
the subject, looking at a TV monitor through his 
booth window, saw the following purported (bogus) 
live interaction: The experimenter, seating the two 
confederates in the taping room (seen earlier by the 
subjects), gave the following instructions: 


Why don’t you take seats over there. Would you 
please open the folders and take out the top 
sheet? Today you will participate in a study of 
decision making. You will be asked to read the 
two situations that are included in your folders. 
In each case the central person in the story must 
make a decision between two courses of action, 
both of which have positive and negative aspects. 
Your role is to act as advisor to the central 
person in the story and indicate the minimum 
probability of success that you would demand 
before recommending the alternative that might 
have the more desirable outcome, You should 
read the stories individually, come to some deci- 
sion on your own, and then reach a common deci- 
sion between the two of you. Any questions? I 
will be in another room but I'll be able to observe 
you by means of closed circuit TV. So, if you 
have any problems make it known and I'll come 
back, 


The “warm-up” risky situation discussed by the 
confederates on tape coincided with the practice 
rating of the subject. The hypothetical situation 
discussed concerned a man with a severe heart ail- 
ment who must seriously curtail his customary way 
of life unless he undergoes a delicate medical opera- 
tion that could either cure him completely or prove 
fatal, 

The subject was given two practice ratings, 
checked by the experimenter to ensure that the rating 
system was understood and used correctly. The 
“practice run” also established whether the manipu- 
lation was credible. Satisfied that the procedure was 
without difficulty, the experimenter went back to his 
booth, where the buzzer and controls for the video- 
tape monitors were located, and began the second 
rating session. Before leaving the subject he mumbled 
something about having to go check on the “other 
subjects” (confederates). 

The second rating sequence was developed around 
the situation involving an electrical engineer who 
might either stick with his present job at a modest 
but adequate salary or take a new job offering con- 


siderably more money but no long-term security. The 
confederates settled into the cooperative decision- 
making task. (For rating purposes, two cards labeled 
“A” and “B” were on the table next to the harm- 
doer and victim, respectively.) The following excerpts 
from the dialogue reflect the growing tension leading 
up to the ambiguous shoving event: 


B: Aw come on! You're saying that he should 
take the job when it’s likely that the new 
business will not survive. At least he can count 
on his company lasting. He may never be rich 
but at least he will be able to support his 
family and he can count on retiring with some 
money. 

A: You must be crazy! ... 
than his present firm. 

B: That’s not true! All companies have something 
like a cost of living increase built into their pay 
scales. 

A: Well, we don’t know. You're just guessing at 
that. It’s not in this story. The real point is 
whether he is willing to stay in some stagnant 
position or take a little risk and get ahead. 

B: But you're forgetting that the company will 
probably fold. 

A: You’re just too damn conservative. With an 
attitude like that you’ll never get ahead. 

B: What do you mean by that? 


a better pension plan 


The dialogue became more heated leading to B shoy- 
ing A, At that point the subject was signaled to rate 
the interaction. This behavior-rating sequence consti- 
tuted the major dependent variable. Almost immedi- 
ately after the shove sequence and signal by the 
experimenter, the subject's monitor was turned off, 
and the experimenter noisily hurried down the hall 
to the taping room as if something was wrong, 
returning shortly thereafter to the subject’s booth 
mumbling audibly about not expecting them (the 
confederates) to get so heated up. The experimenter 
then collected the six completed rating forms, and 
asked the subject to complete a short questionnaire 
before leaving. 


Dependent Measure Instructions 


The rating system employed as the dependent 
measure was a simplified version of the Interaction 
Process Analysis (IPA; Bales, 1970). The IPA was 
simplified by reducing the number of rating categories 
and requiring only periodic, not continuous, ratings 
of the interaction under consideration, 

The detailed instructions written on the IPA scor- 
ing pad with which the subject was made familiar 
read as follows: 


The observer should make one rating each time he 
is given a signal by the group supervisor or experi- 
mental technician. Upon hearing the signal the 
observer should rate the behavior which took place 
immediately prior to the signal. In making the 
rating the observer should do, in order, the fol- 
lowing things: (1) decide which person (A or B) 
was emitting the behavior to be rated and then 
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circle either A or B at the top of the rating sheet; 
(2) decide into which of the 10 major categories 
the behavior falls and circle the number of that 
major category; (3) indicate whether the behavior 
was of relatively low intensity or high intensity 
by making a check mark on the scale to the right 
of the major category being used; (4) fll out the 
three subscales under the major category being 
used. As soon as all these steps have been com- 
pleted, turn over the ñlled out rating sheet and 
get ready for the next rating signal. If the behavior 
to be rated does not fall into one of the ten cate- 
gories provided, circle #11 (Uncodable) and 
briefly describe the behavior. Use this category, 
however, only as a very last resort. It is impor- 
tant to note that each rating should be done on 
a separate sheet and that only one person (either 
A or B) and only one category of behavior (one 
of the ten categories) are scored on each rating. 
Also, a rating is to be done only when the signal 
is given. 


The 10 major categories were dramatizes, gives infor- 
mation, gives opinion, gives suggestion, asks for in- 
formation, asks for opinion, asks for suggestion, 
playing around, aggressive behavior, and violent be- 
havior, followed by an 8-point intensity scale. Each 
major category was followed by three 8-point sub- 
scales appropriately labeled as follows: provoked- 
unprovoked, intentional-unintentional, and conse- 
quential-trivial in the case of the aggressive behavior 
and violent behavior major categories; emotional- 
calm, enthusiastic-unenthusiastic, and show-off-mod- 
est for the dramatizes and playing around categories; 
tactful-tactless, intelligent-unintelligent, and auto- 
cratic-democratic for the major categories of gives 
information, gives opinion, and gives suggestion; 
inhibited-uninhibited, demanding-undemanding, and 
soft-loud for the major categories of asks for infor- 
mation, asks for opinion, and asks for suggestion. 

The final questionnaire contained items relevant to 
the attribution process used by the subject to make 
his final rating. The experimenter introduced the 
questionnaire by stating 


Pick out one sequence of behavior which left an 
impression on you and answer the questions on the 
form as they pertain to that interaction, Better 
yet, why don’t you use the last sequence of be- 
havior which you rated [reminding the subjects of 
the altercation in which B pushed A]. 


The experimenter gave the impression that any be- 
havioral sequence would do but had decided on the 
spur of the moment that the final interaction rating 
would be perfect. These questions were answered on 
a 9-point scale and concerned (a) the extent to which 
the observed behavior of A or B should be attributed 
to situational forces, (b) the extent to which the 
behavior of A or B should be attributed to him per- 
sonally (person), (c) the extent to which it should 
be attributed to the issue discussed (stimulus), and 
(d) other (some combination of a and/or b and/or c). 

After completing the final questionnaire, the subject 


BIRT L. DUNCAN 


was completely debriefed as to the purpes+ of the 
experiment. All of the deceptions were «uxrefully 
explained and their reasons were outlined. “he de- 
briefing attested to the effectiveness of the jocuipula- 


tions to the extent that the subject thougi 


indeed part of a triad and was privy to an ongoing 
interaction rather than a previously taped = quence 
of behavior. 
Quality of the Stimulus Tapes 
Forty students from University High Sch: erved 
as the treatment blind judges. They were e>: sed to 
the 12 stimulus tapes and asked to rate ti The 
following instructional rationale was offered 
It has been found that many people use | fol- 
lowing characteristics in describing othe sople. 
Each characteristic is represented graphica!\ by a 
scale which runs continuously from onc jeled 
extreme to the other with varying degr: eing 
indicated by numbers. Please indicate th ition 
on the scale that you believe best descr one 
of the subjects you have just observed b) ling 
the appropriate number. Do not restrict rself 
to a particular range on the scale, feel free i place 
your responses anywhere on the scale. You ould 
rate the subject on each scale; do not sk my. 
Also you should indicate how certain yo e of 
the accuracy of each of your ratings by ci one 
of the numbers on the certainty scale to right 
of the scale containing the adjective. Each :bject 
should be rated separately and two sets o` -cales 


are included for this purpose. 


The 9-point bipolar scales included sensitive 
ers-insensitive to others, critical of others-tol: 
others, competent—incompetent, nonaggressive 
sive, honest-dishonest, likable-not likable, c 
tive-cooperative, moral-immoral, individu: 
formist, hostile-affectionate, unfair-fair, uninte 
intelligent, and liberal-conservative. Include 
each scale were 4-point certainty ratings. 

Analysis of the treatment-blind judges’ rat 
the black confederate and white confederate for each 
stimulus tape revealed no between-condition differ- 
ences. If qualitative biases existed, particularly at 
the point in the tape of the altercation, the m 
expectation would be differences in the nonaggre 
aggressive ratings. Further, no between-confi 
differences were found. These findings are of consid- 
erable validation value and support the claim that 
the results to be reported are due to differential so- 
cial perception. 


RESULTS 


Of the initial 104 subjects, 3 left their ob- 
servation booth, and 5 misunderstood the ex- 
perimental instructions and filled out the 
ratings incorrectly. Consequently, the analysis 
reported below represents the responses of 96 
subjects, evenly distributed across conditions. 


SOCIAL PERCEPTION 


595 


TABLE 1 
MAJOR CATEGORY FREQUENCIES BY HARM-DOER/VICTIM RACE PAIRINGS 


Harm-doer / 


Vi 


im pairings 


Black-Black White-White 


Black-White White-Black 


Major 


category  Bi-Be BrBr Wi-W: WeWi 


BW: BOW: BeWi BeWs Wi-B: WrB: WB: WeB: 


0 0 1 2 0 1 0 0 2 2 2 2 
L 0 4 3 0 0 0 2 4 2 3 3 
2 2 3 1 1 1 2 1 1 3 1 3 
Violent 
behavior 5 6 0 2 3 6 6 5 1 1 2 0 
Note. Bi = Blac 


Confederate Number 2, 


The necessity for two sets of confederates to 
control for qualitative differences in the stim- 
ulus tapes formed a rather complex design 
for analysis purposes. Because a confederate 
could not be paired with himself, four empty 
cells were left, yielding 12 essential compari- 
son groups. 

The results of this experiment were parti- 
tioned in several different ways. Table 1 pre- 
sents the subject’s major category choice for 
all harm-doer/victim stimulus combinations 
for the sixth rating session, in which the 
provocation occurred. These data offer the 
most cogent evidence of the effectiveness of 
the experimental manipulation. That is, the 
stimulus event for the sixth rating session 
generated differential labels for the stimulus 
event, and these labels were distributed across 
appropriate major categories (i.e., playing 
around, dramatizes, aggressive behavior, and 
violent behavior). 

These gross frequency data of Table 1 are 
of considerable summary importance. Exam- 
ining the frequencies for the black-protagonist/ 
white-victim conditions, 75% of the subjects 
chose the violent behavior major category, 
x*(1) = 26.67, p < .01, whereas when the pro- 
tagonist was white and the victim was black, 
the behavior was labeled violent by 17%, 
x°(1) = 1.61, ns, of the subjects. Only 6% 
of the subjects perceived the behavior to be 
appropriately described by major categories of 
playing around or dramatizing when the 
perpetrator was black and victim white, com- 
pared with 42% in the white-harm-doer/ 
black-victim conditions. Further analysis of 
the major category choice shows that white- 
protagonist/white-victim conditions are dras- 


Confederate Number 1; Bz = Black Confederate Number 2; Wi = White Confederate Number 1; W: = White 
2 


tically different from the black-protagonist/ 
black-victim condition. The within-group 
(black—black) stimulus condition was labeled 
by 69%, x*(1) = 8.05, p < .01, of the sub- 
jects as violent compared with 13%, y2(1) = 
.49, ns, in the white-white conditions. This 
analysis from Table 1 suggests that the sub- 
ject’s perceptual threshold for labeling a be- 
havior as violent descended along the follow- 
ing stimulus-condition continuum: black-white 
> black-black > white-black > white-white 
(protagonist—victim). 

The scores of combined intensity ratings of 
major categories and subscales served as the 
dependent measure for the Race of Harm-doer 
X Race of Victim analysis of variance. As 
expected, there were substantial effects due to 
race of harm-doer, F(3, 80) = 28.23, p < .01, 
race of victim, F(3,80) = 7.05, p < .01, and 
a significant interaction, F (9, 80) = 14.13, p 
< .01. Individual comparisons performed on 
the data showed that subjects rated the stimu- 
lus behavior as more severe when perpetrated 
by a black against a white than subjects in the 
white harm-doer and black victim conditions 
(p < .01). In fact, subjects indicated that the 
black-protagonist/black-victim behavior was 
more virulent than the white-harm-doer/ 
white-victim conditions (p < .01). In all con- 
ditions where the white confederate is the 
harm-doer, the intensity of the same act per- 
formed by blacks became diluted descriptively 
to that of a little “horseplay.” 


Attributional Behavior 


Analyses of variance performed on the sub- 
ject’s attributions to causes for the “provoca- 
tion” behavior, which occurred in the sixth 
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@—@ Person Attributions 
O— — —O Situation Attributions 


Mean Person or Situation 
Attribution Scores 
> 
I 
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BlacklBlack 


White/White 


Black/White White/Black 


Harm-doer — Victim 


Race Pairings 


Ficure 1. Means for the measures of attribution to person and situation. 


rating session, revealed that the race of harm- 
doer versus race of victim independent varia- 
bles had marked effects on causal attributions. 

The dependent variable, something about 
the situation caused the behavior (of the pro- 
tagonist), yielded main effects of race of 
harm-doer, #(3,18) = 15.13, p< .01, and 
race of victim, F (3,80) = 4.57, p < .01, and 
a significant interaction, F(9,80) = 12.72, 
p< 0l. 

Comparisons between the mean data, using 
the studentized range statistic, showed that 


the subjects in the white-harm-doer/black- 
victim conditions and white-harm-doer/white- 
victim conditions differed significantly from 


the other conditions. The preference for the 
situation attribution causal mode was not 
evident in att ions to stimulus. 

Figure 1 shows that in the conditions where 
the harm-doer was black, there was a dramatic 
reversal in the attributional behavior from the 
white harm-doer conditions. Person attribu- 
tions were the causal locus, demonstrated by 
significant main efiects of race of harm-doer, 


F(3,80) = 8.89, p < .01, and race of vy tim, 
F(3,80) = 11.72, p< .01. The signi cant 
interaction, (9,80) = 6.39, p < .01, : vows 


the confluence of race of harm-doer and victim 


in determining attributional behavior. 


DISCUSSION 


The findings are disquieting, though they 
confirm intuitions and social indicators. White 
university subjects perceived the “somewhat 
ambiguous,” certainly less than blatant shove 
as violent (and labeled it thusly) for all con- 
ditions in which the black was the harm-doer, 
to a greater extent when the victim is white, 
but also when the victim was another black. 
Aggressive behavior, dramatizing, playing 
around, and so on (any category but violent 
behavior) were the labels applied when the 
harm-doer was white, even if the victim was 
a black or another white. Support was found 
for the hypothesis that the threshold for la- 
beling an act as violent is lower when viewing 
a black committing the same act. The data 
statistically described in several different ways 


SOCIAL PERCEPTION 


converge on the following perceptual threshold 
order (harm-doer listed first): black-white > 
black—black > white-black > white—white. 

If this finding is so readily available for 
college subjects, its generalizability to other 
subject populations can be expected to be 
even more dramatic. One may be tempted to 
ask, in the real world where violence is a fact 
of life, have blacks been the victims of mis- 
‘abeling or errors, in cases where there was a 
‘reasonable doubt” (i.e., low perceptual 
threshold acts)? In court testimonies, this 
could have grave consequences. 

In the present experiment the race of the 
harm-doer and the race of the victim were 
systematically varied, but all other aspects of 
the situation were constants. Subsequent ex- 
periments using the same basic paradigm need 
to explore the importance of other factors. 
The most obvious such factor is the nature of 
the harm inflicted. The harmful act in the 
present experiment is just one shove or push. 
‘This is a moderate degree of physical harm. It 
would be of interest to focus on the effects of 
(a) different types of harm and (b) varying 
degrees of harm. An example of the former 
type of research is suggested by the following 
question: Does the differential perception of 
violence as a function of race of the protag- 
onists apply only to physical harm, or is the 
same effect produced when the harmful act is 
verbal (i.e., character assassination)? The 
second type of research would involve sys- 
tematically varying the degree of harm in- 
flicted as, for example, in contrasting a single 
shove or push with two or three punches 
which cause the victim to fall to the floor. As 


the act becomes more extreme, it also be- 


omes more definite and there is less possibil- 
ity that attitudes and values can shape its 
perception and evaluation. However, there is 
some evidence that as the extent of harm 
increases, the need to somehow explain the 
harm also increases (Berscheid & Walster, 
1969), This may lead to a heightened differ- 
ential perception of violence as a function of 
race of the harm-doer. What this discussion 
intends is to point out the power of the experi- 
mental paradigm (bottling of prejudice or 
stereotypy in the laboratory). 

The results of the attribution data are even 
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more disquieting. In accord with current lit- 
erature (Jones & Nisbett, 1971) it was ex- 
pected that the observer subjects would tend 
to attribute the violent behavior in all harm- 
doer conditions to stable personal dispositions 
of the actor. When the harm-doer was black, 
dispositional attributions were indeed made. 
However, when the harm-doer was white, 
observer subjects preferred to see the act as 
under external constraints. How can this be 
explained? Jones and Nisbett (1971) suggest 
that the illusion that our reactions are percep- 
tions is sustained in part by the apparent con- 
sensus accompanying most of our reactions, a 
consensus that may rest as much on transmit- 
ted cultural norms as on the compelling fea- 
tures of objective “reality.” This comment 
needs no amplification. 

It would appear that the black man is im- 
bued (stereotyped, categorized, etc.) with 
such salient personality properties (e.g., given 
to violence) that these traits tend to engulf 
the field rather than be confined to their 
proper position, the interpretation of which 
requires additional data about the situation. 
Dispositions then are treated as causal and 
are “packaged.” Cronbach (1955) and Mischel 
(1968) recently spoke of these packages as an 
implicit personality theory, an intuition about 
how traits interact. These theories are thought 
to be relatively consistent across different 
stimulus persons as well as different observers. 
Why, however, does the observer subject go 
out of his way to attribute away from stable 
dispositions in the white harm-doer condi- 
tions? These are the data that contradict and 
are inexplicable. However we approach this 
puzzle, it remains an unpromising commen- 
tary. 
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Increasing Compliance by Legitimizing Paltry Contributions: 
When Even a Penny Helps 


Robert B, Cialdini and David A. Schroeder 
Arizona State University 


Two experiments were conducted in a door-to-door charity drive context to 
examine the effectiveness of a technique for solving the dilemma of small re- 
quests. The dilemma of small requests is that while they serve to make a target 
person’s compliance highly likely, they also tend to produce low-level payoffs 
for the requester. A procedure was developed to avoid the dilemma by legit- 
imizing rather than requesting the delivery of a minimal favor. Thus, it was 
predicted that a solicitor who implied that a very small favor was acceptable 
but not necessarily desirable would make it difficult for a target to decline to 
help and, at the same time, make it unlikely that the target would actually 
offer a low grade of assistance. In confirmation of this prediction, a door-to- 
door solicitor for charity was able to increase significantly the frequency of 
donations while leaving unaffected the size of the donations by adding the 
sentence, “Even a penny will help,” to a standard request for funds. Experi- 
ment 2 replicated this result and provided evidence for the legitimization-of- 


small-favors explanation of the effect. 


A growing number of studies have focused 
on the question of which factors affect a per- 
son’s willingness to comply with a request. 
Researchers have investigated the influence of 
such variables as mood states, both positive 
(eg, Isen & Levin, 1972) and negative 
(e.g., Cialdini, Darby, & Vincent, 1973), 
attraction (e.g., Goodstadt, 1971), norms 
(e.g., Cialdini, Vincent, Lewis, Catalan, 
Wheeler, & Darby, 1975; Regan, 1971), self- 
concept (e.g., Freedman & Fraser, 1966), 
modeling (e.g., Wagner & Wheeler, 1969), 
and cost (e.g., Darley & Latané, 1970). The 
last of these factors, cost of performing the 
request, seems perhaps the least interesting. 
After all, it is hardly a controversial state- 
ment that the cost of performing a request 
will affect the likelihood that a target person 
will comply with it. In support of this state- 
ment, the research evidence (e.g., Darley & 
Latané, 1970; Latané & Darley, 1970; Sued- 
feld, Bochner, & Wnek, 1972; Wagner & 
Wheeler, 1969) strongly suggests that one 
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way of enhancing the probability of a tar- 
get’s compliance with a request is to reduce 
its size considerably. 

It may well be, however, that small re- 
quests are so frequently complied with not 
just because they involve minimal costs and 
hence make it easy for a target to say yes, but 
because such requests also make it difficult 
for a target to say no. That is, when con- 
fronted with a minor request, a target person 
is hard put to argue for an inability to com- 
ply. It becomes awkward, for example, to 
claim that one does not have the wherewithal 
to provide the money, time, effort, etc., neces- 
sary to satisfy a request for minimal assist- 
ance. In addition to eliminating a variety of 
excuses for failing to help, small requests 
may be hard to refuse, since to do so would 
make it difficult for an individual to avoid 
giving the impression of being an especially 
unhelpful person. The problem of a minimal 
request strategy, however, is that very small 
favors, in themselves, are often unsatisfactory. 
Thus, while minimal requests are likely to 
lead to high percentages of compliance, they 
are unlikely to produce highly desirable out- 
comes for the requester. 

The dilemma of minimal requests, that they 
produce high levels of compliance but low 
levels of reward for a requester, may be 
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avoided by a skillful solicitor. For instance, 
Freedman and Fraser (1966) have demon- 
strated that a target person who complied 
with an initial request was at a later time 
more likely to comply with a substantially 
larger second request. A requester might be 
well advised, then, to ask for a miniscule 
first favor so that, upon the target’s agree- 
ment, the probability of a subsequent, more 
desirable favor would be enhanced. A second, 
more streamlined solution to the dilemma of 
minimal requests may exist as well. Rather 
than directly requesting a small favor, a so- 
licitor could simply legitimize the receipt of 
such a favor and thereby reap the benefits 
but not the disadvantages of minimal re- 
quests. That is, by implying that a minute 
favor is acceptable but not necessarily de- 
sirable, a requester could make it difficult for 
a target to refuse to provide some measure of 
help and could simultaneously make it un- 
likely that the target would offer a low level 
of assistance. To test this possibility, an ini- 
tial, exploratory study was performed in a 
naturalistic, door-to-door fund-raising context. 
Subjects, middle-class suburban home owners, 
were asked to contribute money to the Amer- 
ican Cancer Society in one of two ways. In 
the first method, a standard request for funds 
was made. In the second strategy, the stan- 
dard request was followed by the sentence, 
“Even a penny will help.” Because this phrase 
implied that a single penny was an accepta- 
ble donation but did not suggest, as small 
requests normally do, that it was what the 
requester desired, we expected that it would 
be an especially effective compliance tech- 
nique. It was predicted, then, that (a) target 
persons exposed to the request addendum, 
“Even a penny will help,” would be signifi- 
cantly more likely to decide to contribute but 
(b) once the decision to help had been made, 
these contributors would provide the same 
size donation as those exposed to the stan- 
dard request. 


EXPERIMENT 1 
Method 


Subjects. The subjects were 84 adult residents of a 
middle-income suburban housing area that had not 
been canvassed for funds by the American Cancer 
Society within the year. 
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Procedure. With the collaboration of ‘he local 
branch of the American Cancer Society ìir pairs 
of college-aged research assistants were -quipped 
with the identification badges, informatior “hures, 
and donation envelopes employed in Am ı Can- 
cer Society door-to-door funding drives pairs, 
a male and a female, approached a ho: id ad- 
dressed the first adult who came to the r. The 


experimental solicitor who was of the s Sex as 


the target person delivered the request funds. 
The experiment was conducted at times after- 
noons, evenings, and the weekend) d d to 
equalize the appearance of both male female 
targets in the sample. A solicitor introduc imself 
or herself and stated, “I’m collecting mone ‘or the 
American Cancer Society.” In half of th tances 
(control condition), the solicitor then d red a 
standard plea for funds, “Would you be ng to 
help by giving a donation?” In the oth ilf of 
the instances (even-a-penny condition), th icitor 
again gave the standard plea but added, sen a 
penny will help,” Two measures of hel; were 
taken: (a) whether a target gave a contrib) on and 
(b) size of the contribution. 
Results 

With respect to the percentage of contrib: and 
the total amount contributed in the two ment 
conditions, no differences were found due t -ex of 
the target person; hence, all analyses were co iucted 
with the sex factor collapsed. As predicted, t}. even- 
a-penny condition produced a significantly nigher 
frequency of compliance than did the contro) ondi- 
tion: Out of 42 possible subjects, 12 (28.69%) -f the 
control subjects gave, whereas 21 (50%) the 
even-a-penny subjects gave, x°(1)=4.0i p< 
:05. Further, among those persons who donated, 
there were no differences in the size of individual 
contributions between the conditions. In both condi- 
tions, a dollar donation was the median and modal 
contribution among those who gave; similarly ‘here 
was no difference in the mean donation per active 


ns 


contributor ($1.44 vs. $1.54), ¢(31) = 
Given this pattern of results, it is not surp 
that the even-a-penny condition ($30.34) pr 
be superior to the control condition ($18.55) along 
the practical dimension of total funds obtained. 


Discussion 


As we had expected, the addition of the 
sentence, “Even a penny will help,” to a 
standard request for funds substantially in- 
creased the likelihood of compliance with the 
request while leaving unaffected the size of 


* Because the donation amounts were not nor- 
mally distributed, all of the parametric tests re- 
ported in this article concerning monetary dona- 


eo involve log (X+1) transformations of the 
ata. 


INCREASING COMPLIANCE BY LEGITIMIZING PALTRY CONTRIBUTIONS 


individual contributions. It is our argument 
that this sentence produced an enhanced 
frequency of compliance because it legitimized 
the giving of paltry contributions and there- 
by made it difficult for a target person to 
decline to help. When the most minimal of 
monetary donations is said to be acceptable, 
excuses for failing to help that might ordi- 
narily be offered (e.g., “I can’t afford to give 
to all the various charities”; “Were too low 
on money this week,” etc.) become inapplica- 
ble. Further, the refusal to provide “even a 
penny” of aid might cause one to feel guilty 
or ashamed or might jeopardize one’s image 
as a helpful, socially responsible person. Thus, 
it is our contention that the even-a-penny 
condition made compliance less avoidable be- 
cause it rendered legitimate even the most 
minimal kind of help. 

However, an alternative explanation exists 
as well. Perhaps the success of the even-a- 
penny condition had nothing to do with the 
legitimization of small favors but rather was 
produced by an increased perception of the 
American Cancer Society’s need for assist- 
ance, That is, it is conceivable that the sen- 
tence, “Even a penny will help,” caused sub- 
jects who heard it to perceive the American 
Cancer Society to be in greater need of 
money than did subjects in the control condi- 
tion; for this reason, then, they may have 
been more likely to donate funds. In order to 
test this alternative interpretation and to 
test some implications of the legitimization 
hypothesis, a second, larger experiment was 
done. 

Experiment 2 involved four conditions. Two 
of the conditions (control and even-a-penny) 
vere designed to be similar to those of Ex- 
periment 1, so as to provide a replication of 
our earlier findings. A third condition (even- 
a-dollar) was included to test the assertion of 
the legitimization hypothesis that it is the 
legitimization of minimal favors that ac- 
counts for the effectiveness of the even-a- 
penny condition. Subjects in this condition 
heard a standard request for funds followed by 
the sentence, “Even a dollar will help.” Since, 
for the reasons already suggested, it should 
be easier to refuse to help when a dollar is 
presented as the lowest acceptable donation, 
we expected that subjects in this condition 
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would not show the high degree of compli- 
ance of those in the even-a-penny group and 
would not be significantly different from con- 
trol subjects.* A final condition (social legiti- 
mization) was run in order to compare the 
legitimization and need explanations. The 
ambiguity of the even-a-penny condition was 
that while it stated that very small donations 
were legitimate ones, it may have also implied 
that the charity agency was badly in need 
of funding. The social legitimization condi- 
tion avoided this interpretational problem by 
legitimizing minimal help in a way that did 
not suggest that the charity agency was des- 
perate for money. The solicitor in this condi- 
tion legitimized paltry donations by saying, 
“Weve already received contributions rang- 
ing from a penny on up.” Here, then, a penny 
donation was legitimized via the information 
that others had given only a penny. At the 
same time, the perception that the charity 
agency was severely needful of funds should 
have been greatly reduced. In keeping with 
the legitimization interpretation, it was pre- 
dicted that the compliance rate in the social 
legitimization condition would be similar to 
that of the even-a-penny condition and su- 
perior to that of the control and even-a-dollar 
conditions. 

An additional procedure was employed to 
assess the influence of the perception of need 
in our data. Some subjects in each condition 
were not asked to provide a donation but, 
instead, an indication of the American Can- 
cer Society’s need for money after hearing 
the request. In this way it was possible to 
test indirectly the need explanation of our 
results, 


EXPERIMENT 2 
Method 


Subjects. The subjects were 169 suburban home 
owners who resided in an area similar to that of 
Experiment 1. 

Procedure. Four mixed-sex pairs of college-aged 
solicitors who were different from those of Experi- 
ment 1 and who were blind to the critical experi- 
mental hypotheses approached target homes and 
addressed the first adult who came to the door. 
When approaching each of the 123 subjects peti- 
tioned for a donation, the solicitor said, “I’m col- 


2A dollar was chosen because it was the most 
frequently offered contribution of Experiment 1. 
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lecting money for the American Cancer Society.” In 
the control condition, the solicitor continued by 
making the standard request: “We've already re- 
ceived some contributions, and I wonder if you 
_would be willing to help by giving a donation.” In 
the eyen-a-penny condition, the standard request 
was followed by the sentence, “Even a penny will 
help.” In the even-a-dollar condition, the standard 
request was followed by the sentence, “Even a 
dollar will help.” In the social legitimization condi- 
tion, the solicitor modified the standard request 
somewhat: “We've already received some contribu- 
tions, ranging from a penny on up, and I wonder if 
you would be willing to help by giving a donation.” 
For these subjects the dependent measures, fre- 
quency and size of contribution, were the same as in 
Experiment 1. 

To the 46 subjects who were asked to provide a 
perception of monetary need, the solicitor repre- 
sented himself or herself as doing a survey on how 
people perceived the need for money of charity 
organizations that conduct door-to-door fund-raising 
drives. The solicitor then showed the subject a 
7-point “need for money” scale with categories rang- 
ing from 0 (not at all in need) to 6 (extremely in 
need) and asked: 


How much in need of money would you believe 
a charitable organization, such as the American 
Cancer Society, to be if one of its representatives 
came to the door and asked for a contribution in 
the following way? 


At this point, the solicitor provided, verbatim, the 
request sequence of one of the four conditions of 
the study. For these subjects the dependent mea- 
sure was the perceived degree of need they indicated 
on the scale. 


Results 


Table 1 presents the results for frequency 
of donation, total amount donated, and per- 
ceived need scores in each of our four groups. 
No reliable sex effects were obtained; how- 


TABLE 1 
PERCENTAGE OF SUBJECTS DONATING, TOTAL AMOUNT 


CONTRIBUTED, AND NEED SCORES 
PERIMENT 2 
Measure 
Total 
amount Mean 
ik given need 
Treatment % giving (in $) scoret 
Control 32.2 (10/31) 20.74 3.30 (10) 
Even-a-dollar 46.7 (14/30) 19.35 2.67 (12) 
Social 
legitim: 28.61 2.83 (12) 
Even-a-p 30 3.42 (12) 


Note. The need scores were obtained from a scale ranging 
from 0 (not at all in need) to 6 (extremely in need). Paren- 
theses indicate ns. 

a MSe = 1.52. 
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ever, there was a marginally signifi tend- 
ency for females to be more compliant in 
the even-a-dollar condition than male y°(1) 
= 2.14, p < .15. 

Need scores. A one-way analysis of iance 
on the need scores revealed that t were 
not reliably different from one anoth . F (3, 
42) <1. 

Donations. It was predicted that 1 ven- 
a-penny and social legitimization c ions 
would be alike in frequency of donati and 
that both would be superior to the | ntrol 
condition. It was also expected that th: even- 
a-dollar condition would not produc: reli- 
ably greater frequency of donations t! > the 
control condition. This pattern of pre ions 
seemed most appropriately tested by t of 
three orthogonal comparisons. The fir on- 
trasted the even-a-penny and social legi) iza- 
tion groups and found no difference, y) = 
.27, ns. The second compared the cont: and 


even-a-dollar groups and also found v sig- 


nificant difference, y7(1) = 1.34, ms. The 
third tested the combination of the e- 9-a- 
penny and social legitimization conc ‘ons 
against the combination of the contre: and 
even-a-dollar conditions and yielded a str gly 


significant effect, y°(1) = 5.93, p < .01 

As in Experiment 1, there were no ieat- 
ment differences in the size of individual con- 
tributions among those subjects who prow ied 
a donation. Again the median and modai con- 
tribution for those who gave was a dolla» in 
each condition; and again, there were no dif- 
ferences due to condition in the mean size of 
these contributions, F (3, 119) = 1.04, ns.° In 


8 While it might appear at first glance that the 
control condition mean contribution ($2.07) for 
subjects who donated is larger than that of the other 
groups ($1.74, $1.43, and $1.38 for the even-a- 
penny, social legitimization, and even-a-dollar con- 
ditions, respectively), this is a somewhat illusory 
result in that the size of the control mean is largely 
due to a single $10 donation that is five times as 
great as the next largest control donation. If this 
extreme contribution were removed, the mean of the 
other control donations would be $1.19. The similar 
procedure of removing the single largest contribu- 
tion resulted in comparable means for each of the 
other conditions (even-a-penny M=$1.25; social 
legitimization M = $1.26; even-a-dollar M = $1.24). 
The standard deviations for the various conditions 
both before and after removal of the largest con- 
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terms of the practical measure of total funds 
donated, the pattern of results also replicated 
that of Experiment 1. The conditions that 
legitimized small contributions, even-a-penny 
($31.30) and social legitimization ($28.61), 
produced a greater total than those that did 
not, control ($20.74) and _ even-a-dollar 
($19.35), £(119) = 2.28, p < .05. 


GENERAL DISCUSSION 


Both Experiments 1 and 2 provided evi- 
dence for a solution to the dilemma of small 
requests. By legitimizing rather than directly 
requesting minimal favors, it was possible to 
achieve high rates of compliance but not the 
low-level outcomes usually associated with 
such requests. Experiment 1 showed that in- 
forming request targets in a charity drive that 
paltry contributions were acceptable through 
the sentence, “Even a penny will help,” sig- 
nificantly increased the frequency of dona- 
tions without markedly decreasing the size of 
the donations. Experiment 2 replicated Ex- 
periment 1 and supported the legitimization 
mediator of the effect by demonstrating that 
the legitimization of small requests via mod- 
eling information (social legitimization) had 
effects upon donation behavior that were simi- 
lar to those obtained when solicitors used 
the sentence, “Even a penny will help.” Ex- 
periment 2 also indicated that it was the 
legitimization of a small donation that pro- 
duced the salubrious effect upon contributions 
in that the sentence, “Even a dollar would 
help,” failed to produce a greater compliance 
rate or total amount contributed than did a 
standard request control condition. Finally, 
‘xperiment 2 provided some evidence against 

perceived need interpretation of the data of 
Ë xperiment 1. While subjects’ perceptions of 
the charity agency’s need were generally in 
line with what would be expected from the 
various treatments, these perceptions oc- 
curred in a pattern that could not account 
for the donation data. In all, then, the legiti- 
mization-of-small-favors hypothesis appears to 


tribution are as follows: for control, SDs = 2.82 and 
47; for even-a-penny, SDs = 2.32 and 1.10; for so- 
cial legitimization, SDs =1.32 and 1.07; for even- 
a-dollar, SDs = .63 and .43, 


be supported by the data of these experi- 
ments. 

It is the case, however, that one aspect of 
the data allows for alternative explanations to 
the legitimization hypothesis. While, as pre- 
dicted, the two experimental conditions de- 
signed to legitimize small favors produced the 
highest compliance levels of Experiment 2, 
neither of those conditions elicited signifi- 
cantly more compliance than the even-a-dol- 
lar condition. It is conceivable, then, that the 
only meaningful difference in the data exists 
between the control conditions and the (un- 
differentiated) experimental conditions of our 
studies. Perhaps any addition to the standard 
wording of the control request would have 
produced an equivalent enhancement of com- 
pliance, and hence, the legitimization medi- 
ator may not be as strongly supported as we 
have suggested. Two considerations weaken 
such a possibility, however, First, while the 
three experimental conditions of Experiment 
2 were not significantly different among them- 
selves, they were differentially distinct from 
the control condition, Both the social legiti- 
mization, x*(1) = 6.46, p < .O11, and even-a- 
penny, x*(1) = 4.16, p< .041, conditions 
were superior to the control condition at con- 
ventional levels of significance, but the even- 
a-dollar condition was not, y*(1) = 1.34, ns, 
Second, it is not at all incompatible with the 
legitimization hypothesis that the even-a-dol- 
lar condition was somewhat superior to the 
control condition in frequency of compliance, 
We argued that a donation amount that was 
perceived as small and defined as acceptable to 
the requesters would lead to compliance be- 
cause noncompliance would be difficult to 
support with appropriate excuses and would 
be likely to jeopardize personal image. Earlier 
in this article it was predicted that these rea- 
sons for compliance would be less applicable 
to the even-a-dollar condition than the even- 
a-penny condition because the lowest accepta- 
ble donation was not as small. However, for 
many of our subjects, a one-dollar request by 
the American Cancer Society may have been — 
perceived as a small one. Indeed, among the 
donors in the control conditions of these 
nearly a third spontaneously gave 
with the 


studies, 
more than a dollar contribution, 
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smallest such donation being twice as large 
as the dollar request and the average being 
more than 34 times as great. Thus, while it is 
likely that all of the subjects in the even-a- 
penny conditions saw the penny request as a 
small one, it appears that a substantial por- 
tion of subjects in the even-a-dollar condi- 
tion may have seen the dollar request as a 
small one as well. Thus, the intermediate 
level of compliance obtained in the even-a- 
dollar condition is not inconsistent with the 
legitimization-of-small-favors model. None- 
theless, the possibility that the legitimization 
hypothesis does not best account for the re- 
sults of these experiments still exists, and 
future work should be directed to test its 
validity further. 

There is, of course, a practical value to 
our results as well. The findings suggest that 
a highly effective approach to asking a favor 
might be to legitimize but not specifically 
request trivial aid from a target person. It 
appears that such a procedure makes it un- 
likely that the target will refuse to help or 
offer substandard forms of assistance. Thus, 
the total amount of money collected in both 
experiments in the conditions designed to 
legitimize small favors ($90.31) was more 
than 14 times that of comparison conditions 
($58.64). Combining the probability levels 
for the appropriate tests of Experiments 1 
and 2 (i.e., even-a-penny vs. control in Ex- 
periment 1 and even-a-penny plus social 
legitimization vs. control plus even-a-dollar in 
Experiment 2) shows this to be a strongly 
significant effect (Z = 2.58, p < .01). It re- 
mains to be seen, however, whether our tech- 
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nique would be effective in contexts other 

than those involving monetary requests. 
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Communication Modality as a Determinant of Message 
Persuasiveness and Message Comprehensibility 


Shelly Chaiken and Alice H. Eagly 
University of Massachusetts—Amherst 


Subjects were exposed to either an easy- or difficult-to-understand persuasive 
message presented via the written, audiotaped, or videotaped modality. With 
difficult messages, both persuasion and comprehension of persuasive material 
were greater when the message was written as compared with videotaped and 
audiotaped. With easy messages, persuasion was greatest when the message was 
videotaped, moderate when audiotaped, and least when written, but compre- 
hension was equivalent regardless of modality. The persuasion and comprehen- 
sion findings, as well as results on other variables, were discussed within a 
framework that considered the effect of communication modality on the 
capacity for effective reception of information as well as on the tendency to 
yield to information. The communicator’s nonverbal expressions of confidence, 
manipulated within audiotaped and videotaped conditions, did not affect 


opinions. 


A relatively unexplored question in attitude 
change research concerns the effects of the 
medium or channel of transmission upon the 
effectiveness of persuasive communications. 
Most previous research indicates that live or 
videotaped messages induce greater opinion 
change than oral (i.e., audiotaped) messages 
(Frandsen, 1963), which, in turn, effect 
greater change than written messages (Can- 
tril & Allport, 1935; Haugh, 1952; Knower, 
1935, 1936; Wilke, 1934). However, a few 
studies show either no differential persuasive- 
ness as a function of modality or greater 
opinion change with written communications 
(McGinnies, 1965; Tannenbaum & Kerrick, 
954; Worchel, Andreoli, & Eason, 1975). In 
contrast to these persuasion findings, research 
on comprehension indicates that written 
entation of material generally results in 
eater comprehension than does audiotaped 
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or videotaped presentation (Harwood, 1951; 
Haugh, 1952; Wilson, 1974; Young, 1953), 
although this advantage may be manifest 
only with relatively complex material (Beigh- 
ley, 1952; Carver, 1935; Harwood, 1951; 
Westover, 1958). 

This greater capacity for the written mo- 
dality to convey information seems somewhat 
paradoxical in conjunction with the majority 
of persuasion findings when one considers the 
assumption that good reception of a message 
generally facilitates opinion change (e.g., Hov- 
land, Janis, & Kelley, 1953; McGuire, 1968, 
1972), an assumption that has received em- 
pirical support (e.g., Eagly, 1974; Haaland & 
Venkatesan, 1968; Janis & Rife, 1959; Mc- 
Guire, 1957). Although this relationship is 
generally considered to be the result of com- 
prehension influencing the amount of informa- 
tion that the message recipient obtains in sup- 
port of the message’s conclusion, it may also 
be a product of the negative affect generated 
by the recipient’s attempt to comprehend ma- 
terial that is not easily understood (Eagly, 
1974). 

This tendency for comprehension of mes- 
sage content to enhance persuasion suggests 
that the lack of consistency across previous 
modality studies may have been due to fail- 
ures to control for message comprehensibility. 
Because the written mode’s superiority in con- 
veying information is manifest primarily with 
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inherently  difficult-to-understand material, 
modality differences in persuasiveness may 
favor the written mode with difficult material, 
whereas the more usual advantage of the 
audiotaped and videotaped (or live) modali- 
ties may be limited to easy material. 

To examine this idea, subjects were pre- 
sented with either an easy- or difficult-to- 
understand message in either the written, 
audiotaped, or videotaped mode. Modality 
differences in comprehension favoring written 
over audiotaped and videotaped messages were 
predicted only for difficult material. There- 
fore, in difficult conditions, written messages 
were expected to be more persuasive than 
audiotaped and videotaped messages. With 
easy material, which was expected to be com- 
prehended equally well across modalities, it 
was anticipated that persuasion would be 
greater in videotaped compared with audio- 
taped and in audiotaped compared with writ- 
ten conditions. 

Underlying these predictions are the as- 
sumptions that (a) people more readily yield 
to (McGuire, 1968, 1972) or accept informa- 
tion transmitted in a videotaped compared 
with audiotaped compared with written mode, 
and (b) with difficult messages, these yield- 
ing differences are outweighed by the written 
mode’s persuasive advantage, which stems 
from its superiority in conveying difficult ma- 
terial. The assumption of modality differences 
in yielding is supported by the majority of 
opinion change findings cited earlier and by 
research suggesting that people are more 
critical of and perceive as less valid material 
that is written as opposed to material pre- 
sented via audiotape or videotape (e.g., Car- 
ver, 1935; Maier & Thurber, 1968). Further, 
the study explored two potential explanations 
for modality differences in yielding. Both ex- 

planations, one involving distraction and the 
other dealing with perception of the com- 
municator, are:based on possible functions 
served by nonverbal cues, which are present in 
both audiotaped and videotaped modes but 
absent in the written modality.* 

The distraction explanation suggests that 
vocal and visual cues may distract the mes- 
sage recipient from message content. To ac- 
count for the presumed modality differences 
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in yielding, this distraction must lessen the 
recipient’s critical attitude toward the mes- 
sage, probably by reducing the ability to 
counterargue (Keating & Brock, 1974; Oster- 
house & Brock, 1970), yet not be so exireme 
that it interferes with message comprehension 
(Baron, Baron, & Miller, 1973; Haaland & 
Venkatesan, 1968; Insko, Turnbull, & Yan- 
dell, 1974). To examine the role of distrac- 
tion, both counterarguing and perceived dis- 
traction were assessed. 

A second explanation for modality differ- 
ences in yielding is that nonverbal cues in the 
videotaped and audiotaped modalities may 
affect perception of the communicator and, in 
particular, may increase the salience of as- 
pects of the communicator’s image that facili- 
tate persuasion (cf. Williams, 1975). To ex- 
plore this idea, the study assessed subjects’ 
perceptions of the communicator.” In addition, 
because nonverbally expressed confidence is 
one aspect of the communicator’s image that 
has been shown to enhance persuasion (Lon- 
don, 1973), this variable was manipulated 
within audiotaped and videotaped conditions. 
If typical modality differences in yielding 
(video > audio > written) are a product of, 
the communicator’s image, these differences 
should be maximized with confidently pre- 
sented messages and minimized or reversed 
with nonconfidently presented messages. 

In summary, the study examined whether 
differences in message comprehensibility ac- 
count for modality effects in persuasion and, 
as a secondary focus, explored possible de- 
terminants of presumed modality differences 
in yielding. 


1The distraction and communicator-image expla- 
nations for yielding differences were deemed reason- 
able from the standpoint of examining modality 
effects in a laboratory context. It seems likely, how- 
ever, that to explain modality effects in other con- 
texts, additional mechanisms, which are perhaps more 
salient outside of the laboratory, may need to be 
considered. For example, differing degrees of credi- 
bility may be attributed to modalities as a result of 
their history of use in a particular culture. 

2 This dependent variable, as well as the perceived 
distraction measure mentioned above, may reflect 
differences in message comprehensibility in addition 
to modality effects in yielding. 


COMMUNICATION MODALITY AND MESSAGE PERSUASIVENESS 


METHOD 
view 


Subjects indicated their opinions and other re- 
ponses after receiving an easy- or difficult-to-under- 
tand persuasive message via written, audiotaped, or 
videotaped esentation. Within audiotape and 
videotape conditions, the communicator delivered the 
ntly or nonconfidently. 


Subjects 


^A total of 321 University of Massachusetts psy- 
logy students (154 males, 167 females) partici- 
1 and received extra credit toward their course 
lcs. Seven subjects were eliminated because they 
uspected an influence attempt (1) or doubted 
nenticity of the cover story (6); 15 more 
domly discarded in order to equalize cell 

ies. 


lure 


pating in numbers ranging from 2 to 7 
ubjects were told that the psychology 
nt was helping a New England law school 
e the ability of its law students to argue legal 
Che experimenter explained that the law school 
ently implemented a training program and 
dd i if the program were evaluated positively, 
the ov school wanted its students to serve as student 
aw ı community legal clinics. The experimenter 
marized what the subjects’ participation 
itaile reading background information about a 
case, reading (or listening to, or viewing) a ran- 
domly selected law student’s “case discussion,” giv- 
ing their own opinions on the case, and responding 
to a “case evaluation” questionnaire. 
The experimenter then allowed subjects 5 minutes 
» read handouts that provided background facts 
he (fictional) “Victoria Company Case,” 
any-union dispute over the management’s 
to pay a traditionally awarded Christmas 
to workers. The handout gave both union 
company positions in the case (i.e., that workers 
should vs. should not be compensated for the lost 
bonus) and concluded by stating that because an 
agreement had not been reached, an independent 
iter had been called in. 
t, the experimenter distributed written tran- 
scripts or played an audiotape or videotape of the 
persuasive message (“the law student’s case discus- 
sion”). Afterwards, subjects indicated their agree- 
ment with the message. Then they received a “case 
evaluation” questionnaire containing the remaining 
dependent measures (see below). Finally, subjects 
were debriefed. 
Control sub 
the same but 
T 


s (n =47) were treated essentially 
e their opinions without being ex- 
l to a persuasive communication. 


independent Variables 


_ Message difficulty. The topic employed (a legal 
dispute) afforded the opportunity to manipulate 
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message difficulty without rendering either the easy 
or difficult message implausible. Both versions were 
approximately 955 words long, advocated the com- 
pany’s position in the case, and contained three 
lengthy arguments supporting that position, The 
easy version was composed of sentences approxi- 
mately 20 words in length with one or two clauses 
per sentence. The difficult version was composed of 
sentences approximately 30 words in length with 
three or more clauses per sentence. In addition, the 
difficult version differed from the easy one in terms 
of its more sophisticated vocabulary. For example, in 
the difficult version, the words “apportioned,” “sub- 
ducted,” and “mollify” appeared in place of the 
words “distributed,” “withdrew,” and “change,” 
which were used in the easy version. To illustrate 
further, one passage from the easy version read as 
follows: 


Of particular importance is the fact that the com- 
pany agreed to make the wage increase granted in 
October of 1972 retroactive to September 1972, 
Making the wage increase retroactive cost the 
company over $9,300. 


The corresponding passage in the difficult version 
was: 


One particular benefit which should be underscored 
is that of retroactivity to September, 1972 of the 
wage accretion granted in October of the same 
year, a benefit which the company financed and 
the cost of which exceeded $9,300. 


Aside from differences in sentence structure and level 
of vocabulary used, the content of both versions was 
essentially identical.* 

Confidence. An amateur actor was coached to 
deliver each of the above versions of the message 
using either confident or nonconfident paralanguage 
(i.e., tone of voice) and body language (i.e., ges- 
tures, posture). 

Modality. The actor’s final performances were 
videotaped and used in videotape conditions, whereas 
the audio portions of the tapes were played in audio- 
tape conditions. These messages lasted approximately 
7 minutes, and subjects in written conditions were 
allotted 7 minutes to read through written versions. 


3 The difficulty manipulation should not be consid- 
ered a comprehensibility manipulation because mes- 
sage comprehension should be a joint product of both 
message difficulty and modality. An independent as- 
sessment of the success of the difficulty manipulation 
is virtually impossible because all likely checks should 
also be affected by modality. Thus, while an obvious 
overall check on the manipulation’s effectiveness 
would be a finding of decreased comprehension given 
difficult as compared with easy messages, we expect 
that this effect should primarily result from lowered 
comprehension in audiotape and videotape (but not 
written) conditions. 
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Dependent Variables 


Opinions. Subjects indicated their agreement with 
the statement, “The Victoria Company should be 
required to pay compensation to their union em- 
ployees for the lost Christmas bonus,” on a 15-point 
scale anchored by “definitely agree” and “definitely 
disagree.” Disagreement with this statement implies 
agreement with the position advocated in the mes- 
sage. 

Comprehension. The following four measures of 
message comprehension were employed: (a) a self- 
report of how difficult it was to understand the law 
student’s discussion (15-point scale); (b) recall, 
assessed by having subjects write each of the three 
arguments that the law student had used to support 
his position (an answer was considered correct if, in 
the opinion of two independent raters, r= .89, it 
adequately summarized one of the arguments in the 
message); (c) recognition, assessed by a five-item, 
multiple-choice test on details of the legal dispute ; 
(d) recall, assessed by a six-item, short-answer quiz 
concerning additional details (scored for correctness 
by two raters, r=.91). Also, subjects wrote down 
the overall position taken by the law student. All 
subjects demonstrated satisfactory recall on this 
measure. 

Perception of the source. Subjects rated the source 
(law student) on 15-point bipolar evaluative scales. 
Positive poles were competent, warm, intelligent, ap- 
proachable, interesting, generous, sincere, friendly, 
unbiased, modest, well read, good sense of humor, 
persuasive, and confident. 

Other measures. Subjects indicated, on 15-point 
scales, how distracted they felt from the content of 
the case discussion, as well as how much effort they 
had expended and how pleasant they had found the 
experience of reading (or listening to, or viewing) 
the case discussion. Again on 15-point scales, sub- 
jects rated the extent to which they thought that 
the law student’s language was appropriate for use in 
a community legal clinic and the extent to which 
they believed that he was an appropriate person to 
work in such a clinic. To assess counterarguing, sub- 
jects listed their thoughts and ideas concerning the 
consequences of the position advocated in the mes- 
sage. A statement was scored as a counterargument 
if, in the opinion of two raters, r = .94, it indicated 
a negative consequence of the position advocated or 
constituted an attack on the message’s validity (cf. 
Brock, 1967; Osterhouse & Brock, 1970). 

Subjects also wrote down their interpretations of 
the study. These responses were coded for suspicion. 


RESULTS 


In addition to performing overall signifi- 
cance tests through analyses of variance on 
the dependent measures, the major hypotheses 
were explored by simple effects tests and 
planned comparisons implied by the predic- 
tions detailed earlier. The four comprehension 


SHELLY CHAIKEN AND ALICE H. EAGLY 


measures were treated by multivariate analy- © 
sis of variance. Since subjects’ sex had nc 
nificant effect on opinions and few sign 
effects elsewhere, all reported analyses were 
performed ignoring this variable. 


Manipulation Checks 


Although subjects in confident conditions 
perceived the source as more confident than 
subjects in nonconfident conditions (p < 
.001), confidence did not significantly affect 
opinions and showed few effects on other 
measures. Thus, all subsequent analyses ig- 
nored this variable. The multivariate analysis 
of the comprehension measures indicated sig- 
nificantly poorer message comprehension for 
subjects in difficult compared with easy condi- 
tions, F(4,243) = 7.19, p < .001. All four 
univariate components of the multivariate 
analysis also exhibited this effect (p < .005 
or smaller). 

Opinions 

All treatment groups differed significantly 
from the control group in the direction of 
greater agreement with the message ($ < 
or smaller by Dunnett’s test; cf. Myc 3, 
1972). Opinion change scores (see Table |) 
were formed by subtracting from each sub- 
ject’s opinion the mean of the control group 
(M = 8.27). 

An analysis on these scores yielded a Mo- 
dality x Message Difficulty interaction, F (2, 
246) = 7.89, p < .001. As predicted, when 
easy messages were received, opinion change 
was greater in videotape over audiotape over 
written conditions, simple F(2, 246) = 3.56, 
< .05, although planned comparisons re- 
vealed that only the videotape versus written 
difference was significant (p < .01). The mo- 
dality effect was also significant within diffi- 
cult conditions, simple F (2,246) = 6.41, p < 
.005, and a planned comparison supported the 
prediction of significantly greater opinion 
change in written conditions relative to video- 
tape and audiotape conditions combined 
(p < .01)4 


4A comparison of the written-easy and written- 
difficult cells reveals that relatively (though not 
significantly) greater opinion change occurred when 
subjects received difficult rather than easy written 
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TABLE 1 
DEPENDENT VARIABLES AS A FUNCTION OF MODALITY AND MESSAGE DIFFICULTY 


Message difficulty 


Easy Difficult 
Audio- Video- Audio- Video- 
Variable Written tape tape Written tape tape 
m change 2.94 3.75 4.78 4.73 2.32 3.02 
difficulty 4.76 4.21 4.83 5.31 7.50 7.43 
n ve arguments recalled 2.45 2.21 217 2.29 1.74 1.67 
swer items correct 4.57 3.93 4.45 4.21 3.71 3,36 
ltiple choice items correct 3.55 3.05 3.24 3.10 2.81 2.64 
9.00 9.07 9.50 8.38 6.81 7.38 
4.67 5.69 6.98 6.79 9.21 8.24 
11.33 10.83 10.19 9.60 7.62 7.45 
9.67 10.07 9.76 9.83 7.36 8.21 


e. Higher numbers indicate greater opinion change and message comprehension; greater amounts of perceived difficulty, 
ntness, and distraction; and a greater tendency to judge the communicator as well as his language appropriate for a legal 
Cell n = 42.) 


x mprehension and Pleasantness of written messages would be manifest only 
with diffcult-to-understand messages (see 
Table 1). Thus, within easy conditions the 
simple modality effect was nonsignificant on 
all four comprehension measures and indicates 
that comprehension was essentially identical 
regardless of mode of presentation. Within 
difficult conditions, however, the modality ef- 
fect was significant on three measures: per- 
ceived difficulty (p < .01), arguments recalled 
(p < .005), and short-answer items correct 
(p < .05). Further, planned comparisons re- 
vealed that given difficult messages, subjects 
in written conditions, relative to subjects in 
videotape and audiotape conditions combined, 
reported less difficulty (p< .01), recalled 
more persuasive arguments (p < .01), and 
correctly answered more short-answer items 
(p < .05). 

Regarding the pleasantness ratings (see 
Table 1), subjects receiving easy messages 


In addition to the significant manipulation 
ieck on comprehension, the multivariate 
nalysis yielded a significant modality effect, 

F (8,486) = 2.23, p < .05. This effect was 
significant on three univariate components 
uments recalled, multiple-choice, and 
short-answer scores, p < .05 or smaller) and 
varginal on one (self-reported difficulty, 
13). The modality effect indicates that, 
overall, subjects in written conditions compre- 
hended more of the message than subjects in 
either videotape or audiotape conditions. 
"he Modality X Message Difficulty inter- 
ion was marginally significant in the multi- 
variate analysis, F(8,486) = 1.81, p=.07. 
Tt was significant on one univariate compo- 
nent, self-reported difficulty (p < .05) and 
marginal on a second, number of short-answer 
x, items correct (p = .13). The patterning of the 


Bete 
s R ae Ai bee TEn a wel A perceived their experiences as more pleasant 

oo out nonsigmiicant patterning on te ‘than those receiving difficult messages, F(1, 
eae cian ae is consistent ee 246) = 21.58, p<.001. The Modality x 

| prediction that the superior comprehensibility Message Difficulty interaction was only mar- 
ginally significant on this variable, F(2, 246) 
messages. Obviously, the two messages did represent = 2.24, $ = 11. However, paralleling the 
two unique communications even though the diffi- comprehension findings, the simple effect of 
culty manipulation was designed to create messages modality within easy conditions was nonsig- 


y in ease of comprehension. For this . imple eff 
‘ 5 i e simple effect 
reason and because the substantive empirical ques- nificant, K< 10, youen an p ; 
tion concerned the relative persuasive effectiveness of within difficult conditions was significant 
the various modalities as a function of message diffi- (p < 05). Thus, given easy messages, per- 
culty, the Modality X Message Difficulty interaction w s, z i, 

i; was essentially identical 
was explored by examining differences among mo- ceived pleasantness ` M d 
dality conditions within levels of message diffculty. regardless of modality, but given difficult mes- 


£ ing so 
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sages, pleasantness was greater in written con- 
ditions compared with audiotape and video- 
tape conditions combined (p < .05, planned 
comparison). 


Other Dependent Variables 


Analyses of subjects’ appropriateness ratings 
(see Table 1) revealed that subjects receiving 
easy, compared with difficult, messages rated 
both the communicator and his language as 
more appropriate for a legal clinic setting 
(ps < 001). Also, subjects in written condi- 
tions, relative to subjects in audiotape and 
videotape conditions, viewed the communica- 
tor’s language as significantly more appropri- 
ate (p < .005) and viewed the communicator 
as slightly more appropriate for clinic work 
(p=.07). Finally, the Modality x Message 
Difficulty interaction on the communicator’s 
appropriateness for clinic work (p < .005) 
indicated that given easy messages, appropri- 
ateness ratings did not vary with modality, 
whereas given difficult messages, subjects in 
written conditions viewed the communicator 
as more appropriate for clinic work than did 
subjects in audiotape and videotape conditions. 

On the distraction ratings (see Table 1), 
the modality effect (p < .001) supported the 
expectation that subjects in written condi- 
tions would report less distraction from mes- 
sage content than would subjects in audio- 
tape and videotape conditions. In addition, 
subjects receiving difficult, compared with 
easy, messages reported greater distraction 
(p < 001). 

A factor analysis of the source ratings 
yielded two rotated factors, labeled “profes- 
sionalism/expertise” (e.g., confident, intelli- 
gent, competent) and “personal attractive- 
ness/warmth” (e.g., generous, approachable, 
friendly), which accounted for 24.70% and 
17.40% of the variance, respectively. Factor 

scores, obtained for each subject on each of 
these factors, were treated by analysis of vari- 
ance, Subjects receiving easy messages viewed 
the communicator as more professional and 
expert (p < .05) and more personally attrac- 
tive and warm (p < .05) than did subjects 
receiving difficult messages. Also, subjects in 
written conditions perceived the source as 
more professional and expert than did sub- 
jects in auditotape and videotape conditions 
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(p < 001). Finally, subjects in audiotape “ 
conditions viewed the communicator as less 
attractive and warm than did subjects in 
written and videotape conditions (p < .001). 

Analyses on perceived effort and counter- 
arguing yielded no significant effects. 


Discussion 


Our analysis of the impact of communica- 
tion modality on persuasion suggested taking ; 
into account the effect of modality on recep- 
tion of information as well as on the tendency 
to yield to information that is received. The 
written mode’s superiority in transmitting in- 


formation was expected to be maniiest only 
with difficult material. Thus, the increment 
in persuasiveness stemming from superior 
comprehension in the written mode was ex- 
pected to appear only in the difficult mes- 


sage condition of the experiment. It was 
therefore predicted that messages conveying 
difficult content would be more persuasive if 
written rather than videotaped or audiotaped, 
In contrast, because easy messages should 
minimize modality differences in comprehen- 
sion while maintaining differences in yielding, 
they were predicted to be more persuasive if 
videotaped rather than audiotaped and if 
audiotaped rather than written. The results on 
opinion change supported these predictions. 


Comprehension and Pleasantness as Mediators 
of Persuasion 


Additional support for our analysis can be 
seen in the comprehension findings, which 
were consistent with its presumed mediating 
role in relationship to opinion change. Al- 
though the interaction between modality and 
message difficulty fell short of significance 
(p = .07) in the multivariate anal simple 
effects tests and planned comparisons showed 
that for difficult messages, comprehension was” 
greater in the written than in the video- 
taped and audiotaped modalities, whereas the 
modalities differed little for easy messages. 

Since the comprehension indices presum- 
ably served to measure the reception mediator — 
directly, the fact that we did not obtain as 
strong an interaction effect on message com- 
prehension as we did on opinions deserves 
comment. As McGuire (1969) has noted, a 
recurrent “paradox” in research guided by 
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leat theory and other approaches to atti- 
hange is that an independent variable, 

ia showing the predicted relationship to 
attitude change variable, exhibits 
all, the predicted relationship 
I nal learning or information- 
wiable, even though the hypothe- 
umal effect served as a basis for 
g the distal one. This paradox be- 
somewhat understandable, however, 

n one considers that, like most research 
this area, the present study placed some 
nce on verbal memory (i.e., message re- 
and recognition) in order to document 
mediational role of message reception. As 
been argued (e.g., Eagly, 1974), memory 
‘ovides only a weak indication of reception 
cause verbal memory reflects not only re- 
option, or encoding processes, but also stor- 
se and decoding processes. In this regard, 
t is interesting to note that in the multi- 
variate analysis, the univariate component on 
which the Modality x Message Difficulty ef- 
fect appeared most strongly was subjects’ 
If-reported difficulty in understanding the 


al 
tal 


munication. Perhaps this “subjective” 
in more closely reflected the quality of 
subject reception experiences than did 
our more “objective” recall and recognition 
measures, 


ngs of the pleasantness of 

nessage paralleled the compre- 

dings: With difficult messages the 

written modality tended to engender a more 
‘sant experience than the audiotape and 
videotape modalities, but with easy messages 
the modalities differed little. This parallelism 
supports Eagly’s (1974) argument that the 
decrement in opinion change that follows 
poor comprehension may stem from 
a ne e affective reaction as well as from 
the lesser amount of information received. 
To compare these two mediators, correlational 
analyses were undertaken. These analyses re- 
vealed that information received (as esti- 
mated by a relention index formed by 
summin sach subject’s three message 
scores) and pleasantness 
more highly with opinions 
; TPleasanness = -34) than with 
.21). Further, a multiple re- 
gression analysis, including both informa- 


from 
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tion received and pleasantness as predic- 
tors, revealed that each variable contributed 
significantly in predicting opinion change, 
Brotention = -27, F(1, 249) = 21.23, p < .001; 
Arteasantness = -29, F(1, 249) = 24.11, p< 
001, These findings, as well as the results of 
analyses of covariance controlling opinion 
change separately for comprehension and 
pleasantness, suggested that the amount of 
information received and perceived pleasant- 
ness both contributed to the opinion change 
effects and did so with about equal strength, 

Additional support for the importance of 
affect in contributing to reception effects may 
be provided by the appropriateness ratings. 
Thus, in the difficult audiotape and videotape 
conditions, subjects judged both the communi- 
cator and his language as relatively inappro- 
priate for a legal clinic setting. Perhaps these 
ratings, which correlated significantly with 
pleasantness (ps < .001), reflected annoyance 
with a communicator who was perceived as 
talking in an unnecessarily complicated man- 
ner. However, another explanation of these 
ratings might be that people expect oral 
presentations to be less complex than written 
ones—perhaps because communicators typi- 
cally do adjust the difficulty of their material 
to the limitations imposed by channels of 
transmission. When expectancies are violated, 
as they probably are by audiotaped or video- 
taped presentations conveying very complex 
material, people consider these presentations 
inappropriate. 

The present findings extend the generality 
of the Eagly (1974) research because our 
difficulty manipulation, accomplished through 
variations in vocabulary and sentence struc- 
ture, is probably more representative of the 
ways in which differences in comprehension 
are created in more naturalistic situations 
than were the comprehensibility manipula- 
tions used in her earlier experiments (e.g. 
noise and distortion on an audiotaped mes- 
sage). It should be emphasized, however, that 
our difficulty manipulation was analogous to 
Eagly’s comprehensibility manipulations only 
with respect to the audiotape and videotape 
conditions of the present experiment (see 
Footnote 3). In a Modality X Message Dif- 
ficulty analysis of variance excluding written 
conditions, we obtained, like Eagly, a message 
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difficulty main effect on opinions (~ < .005), 
as well as on all the comprehension measures 
(all ps < .05 or smaller) and on perceived 
pleasantness (p < .001). That the interaction 
between modality and message difficulty, but 
not a message difficulty main effect, obtained 
on opinions when all modality levels were 
considered is due, as noted earlier, to the fact 
that the difficulty manipulation had, as pre- 
dicted, little impact on written subjects’ 
abilities to comprehend the message and, thus, 
little impact on their opinions. 


Modality Differences in Vielding 

Although reception processes can acount for 
the lowered persuasiveness of difficult mes- 
sages in the audiotaped and videotaped com- 
pared with written mode, modality differences 
in yielding must be invoked to explain the 
ordering obtained with easy messages (video 
> audio > written). Two possible explana- 
tions for these yielding differences were 
explored. 

One explanation proposed that nonverbal 
cues, which decrease in number from video- 
taped to audiotaped to written modalities, 
diminish receivers’ critical abilities. Support- 
ing this interpretation, subjects in written 
conditions reported less distraction from mes- 
Sage content than did subjects in audiotape 
conditions who, in turn, reported less distrac- 
tion than did subjects in videotape conditions. 
However, because easy messages were rated 
less distracting than difficult ones, it seems 
that distraction is best viewed as a complex 
variable that may enhance yielding by re- 
ducing the ability to be critical but may also 
interfere with message reception (e.g., Insko 
et al., 1974). 

Though the effect of modality on distrac- 
tion provides some support for the distraction 
mechanism, it should be noted that the ex- 
perimental variables failed to affect counter- 
arguing, a variable which past research (e.g., 
Osterhouse & Brock, 1970) has implicated as 
a mediator of enhanced opinion change under 
distraction conditions. Yet, as Osterhouse and 
Brock argued, it may be necessary for a per- 
suasive message to counter a well-anchored 
attitude in order for counterarguing to occur, 
In contrast, our message presented new in- 
formation to subjects on an unfamiliar topic. 
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Our analysis also suggested that in- 
crease in yielding as modality change: from 
written to audiotape to videotape coul be a 
product of an enhancement of the com. ..uni- 
cator’s image from the addition of nons -rbal 
vocal and visual cues. However, the -urce 
ratings were not more favorable in vide: tape 
and audiotape relative to written conciiions. 
The communicator image mechanism also 
failed to receive support from the ma- 


nipulation of communicator confidence, which 
neither affected opinion change nor mediated 
the effect of modality on opinions.’ In sum- 
mary, with respect to exploring the determi- 
nants of modality differences in yielding, no 
support was obtained for the communicator 
image mechanism, whereas partial support was 
obtained for the distraction mechanism. Be- 
cause the underlying reasons for modality 
differences in yielding remain unclear, future 
research might well address itself to a more 
thorough exploration of this issue. 


Conclusions 


The present research indicates that the 
fects of communication modality on pers: 
sion can be usefully viewed within a fra: 
work that stresses the importance of effective © 
reception of persuasive material. By observ ng 
the hypothesized comprehension effects, in 
addition to the predicted persuasion findings, 
we can make a relatively strong inference re- 
garding the causal role of comprehension. In 
this regard, it is interesting to compare our 
findings with those of Miller, Maruyama, 
Beaber, and Valone (1976). With an audio 
message they obtained no effect of message 
“complexity,” manipulated through changes in 
sentence structure similar to those used in our 
difficulty manipulation. While our findings 
suggest that their complex message should have 
been less persuasive than the simple one, it is 
not possible to interpret their finding confi- 
dently in terms of comprehension mediation 


5 The effects of confidence were examined in an 
analysis that excluded subjects in written conditions 
where confidence was not manipulated. Although the 
confidence effect was nonsignificant, F(1, 160) = 1.45, 
p= .23, the order of the means was consistent with 
the expectation that confidently as compared with 
nonconfidently presented messages would elicit 
greater opinion change (M =3.78 vs. M = 3.16). 


because, unlike our study, no measures of mes- 
sage comprehension were included, If their 
manipulation did not have a major impact on 
comprehension, then no effect on persuasion 
would be expected. Further, the fact that 
message complexity mattered in our study but 
not in the Miller et al. (1976) study may be 
due to the different experimental contexts em- 
ployed. The conditions of our experiment, 
articularly its legal case context, probably 

maximized the tendency for recipients to focus 
attentively on the argumentation in the mes- 
sage and to base their opinions on such con- 
tent cues. In such circumstances, because of 
heightened attention to message content, vari- 
ables such as message complexity and com- 
munication modality may affect comprehen- 
sion and, therefore, persuasion. Under other 
| circumstances, however, recipients may not 
engage in such a cognitively elaborate process 

and may base their opinions on nonmessage 
cues such as communicator characteristics 
(e.g., Mills & Harvey, 1972; Norman, 1976). 
This is, in fact, the view favored by Miller 
et al. in interpreting the effects on persuasion 
of their speed of speech manipulation. In 
these circumstances, because of lack of atten- 
tion to message content, variables such as 
message complexity and communication mo- 
dality may have little effect on message com- 
prehension and therefore little impact on 
persuasion. 

Finally, because our design did not feature 
variation across stimulus conditions by in- 
cluding multiple” topics or renditions of the 
difficulty manipulation, some concern about 
generality is in order, Yet, because variation 
across stimulus conditions was sacrificed in 
favor of charting the causal role of compre- 
hension by developing a complex set of mea- 
sures for a single set of stimulus materials, 
external validity concerns are of a particular 
sort. We would not argue that all easy and 
difficult communications would have effects 
across modalities of the kind we obtained. 
However, given the present results and the 
aged generality they lend to the mediational 
ff l of comprehension in persuasion, we feel 
i atively confident in arguing that any com- 

hunication difficult enough to create a com- 
la prehension deficit in the videotaped and 
f audiotaped compared with written modality 
S 


T . 
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will be more persuasive if written. In the 
absence of such a comprehension deficit, we 
would expect greater persuasion in the vidt 
taped compared with audiotaped and aud 
taped compared with written modality. 
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į Speed of Speech and Persuasion 


Norman Miller, Geoffrey Maruyama, Rex Julian Beaber, and Keith Valone 
i University of Southern California 


{ The relationship between speaking rate and attitude change was investigated 

in two field experiments. Manipulations of speech rate were crossed with (a) 
credibility of the speaker and (b) complexity of the spoken message. The re- 
sults suggest that speech rate functions as a general cue that augments credi- 
bility; rapid speech enhances persuasion. No support was obtained for infor- 
mation-processing interpretations of the effects of a fast speaking rate. The 
increased persuasion produced by fast s 
ruption of effective counterarguing. These findings emphasize the importance 
evaluative factors in the persuasion process at the expense 
of a more rationalistic information-processing view of how man responds when 


of perceptual and 


confronted with an influence attempt. 


Although many of the persuasive communi- 
‘ons received during daily life are orally 
[h pte ‘od, the characteristics of speech that 
affec {guasion are rarely studied. More- 
over, studies that do examine delivery style 
typically treat it as a global variable. For 
instance, Dietrich (1946) studied the dyna- 
mism of a speaker’s style, Bettinghaus (1961) 
assessed the effects of overall speaking effec- 
tiveness by comparing trained and uncoached 
students, and Bowers (1965) examined the 
„effects of extroverted and introverted deliv- 
ery using dramatic arts students to simulate 
style. The persuasive effects of more discrete 
characteristics of oral delivery, such as inten- 
sity, pitch, speed of presentation, or specific 
emotional qualities apparently more rarely 
elicit attention. 

Several considerations suggest that speech 
rate might be an important variable. Texts on 
speech (Allen, Anderson, & Hough, 1968; 
Monroe & Ehninger, 1974) have reported 
considerable individual variation in speaking 
rate with 120-180 words per minute as the 
~ lower and upper limits of normal speech.* It 

§§ a short step to imagine theoretical implica- 
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peech could not be attributed to dis- 


tions of such variation that might contribute 
to differences in persuasion. Yet, even though 
Foulke and Sticht (1969) have most recently 
studied the effects of speed of presentation on 
comprehension, its effects on persuasion re- 
main unexplored. 

McGuire (1969) characterized attitude 
change as a stochastic process that flows 
from attention through comprehension, yield- 
ing, retention, and action. Although rate of 
speaking might theoretically be expected to 
influence any of these processes, it initially 
seems plausible to focus on how it affects com- 
prehension. In their review, Foulke and 
Sticht (1969) showed that when significant 
results are found, most are in the direction 
of lowered comprehension when speed of 
speech is rapid.’ i 

If a fast speaking rate does reduce com- 
prehension, then it should also curtail per- 
suasive power (Eagly, 1974). However, since 
the results of most studies in the comprehen- 
sion literature are not significant, it may 
make more sense to look for the effects of 
speed in some other stage of the persuasive 
process, such as yielding. To focus away from 
comprehension seems particularly sensible, in 


1For example, radio announcers are often noted k 
for their rapid speech rate. Lumley (1933) deter- 
mined the average speech rate of 25 radio speakers 
to be 162 words per minute, with a standard devia- 
tion of 19.40 words per minute. 

2It should be pointed out that a majority of the 
studies have found no significant effects. 
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that the high speeds used in the prior re- 
search equaled or exceeded the upper limits 
of the range of normal speech. Consequently, 
cne might reasonably assume that the speed 
manipulations in these studies were suffi- 
ciently powerful to have shown an effect if 
one does indeed exist. 

How might speed affect the yielding phase 
of persuasion? A fast speaking rate might dis- 
upt any covert attempt to counterargue 
azainst the persuasive content of the message, 
ttereby increasing persuasion. Whereas any 
ffects of speed on comprehension or counter- 


s argument disruption can be conceptualized as 


message factors—characteristics of the mes- 
sage that affect persuasion—speaking rate 
may alternatively act as a source factor, That 

„is, it might provide cues for viewing a fast 
speaker as more knowledgeable, competent, 
and facile. In this case a rapid speaking rate 
constitutes a credibility cue and on this basis 
should likewise enhance persuasion. 

Although the comprehension interpretation 
of speaking rate does make a distinct predic- 
tion—less attitude change when speech is 
rapid—both the counterargument disruption 
and enhanced credibility interpretation make 
the opposite prediction—greater attitude 
change with a rapid speaking rate. How can 
these two hypotheses be separated? First, 
with respect to the counterargument disrup- 
tion hypothesis, if rapid speech does indeed 
disrupt covert counterarguing, then, as im- 
plied by Kiesler and Mathog (1968), the im- 
pact of speaking rate should vary with the 
credibility of the speaker. According to their 
argument, when the message is counteratti- 
tudinal and the speaker possesses substantial 
credibility, counterarguing tends to become 
the most viable mode of resisting influence. 
This should be particularly true if one is 
highly involved in the issue (Osterhouse & 
Brock, 1970). However, counterarguing is or- 
dinarily an elaborate internal response, one 
requiring time and mental energy. A com- 

munication spoken rapidly should reduce the 
available time and increase the difficulty of 
making such covert responses. Consequently, 
increased speed should facilitate attitude 
change. On the other hand, « person listening 
to a speaker who lacks credibility can resist 
influence by using a less elaborate internal 


response; he can simply reject or discount 
what is said. Under these conditions, speaking 
rate should not substantially attenuate ability 
to resist, and consequently, should have little 
impact on persuasion. Although there are few 
direct tests of the role of counterarguing in 
the persuasion process (Miller & Baron, 
1973), the reasoning above does make a clear 
prediction. It suggests that if fast speech does 
disrupt counterarguing, there should be an 


interaction between speaking rate and credi- . 


bility. Alternatively, if speaking rate simply 
functions as a credibility cue, no interaction 
would be predicted. Instead, as previously 
suggested, fast speech should enhance per- 
suasion regardless of the speaker’s credibility. 

The results of two studies that begin to 
delineate the relation between speed of speech 
and persuasion (Beaber & Miller, Note 1) 
suggest that speech rate functions as a cue 
for increased credibility. The first experiment 
of Beaber and Miller investigated rate of 
speaking and credibility of the source in a 
2 X 2 design. In addition to finding the credi- 
bility manipulation to be successful (the high- 
credibility source was seen as more knowl- 
edgeable and more trustworthy), the fast 
speaker was perceived as more knowledgeable 
and more trustworthy. In addition, the com- 
munication was perceived as more complex 
but also clearer and easier to understand 
when presented by a fast speaker. These find- 
ings suggest that a rapid speaker is perceived 
as more competent, since it takes a skilled 
speaker to present complex material clearly. 
However, since credibility was manipulated 
by varying both ability and trustworthiness 
in this first study, the specific component of 
credibility that speed enhanced cannot clearly 
be specified. Certainly, there are situations in 
which high ability and trustworthiness do not 
necessarily occur together. The most obvious 
example is a used car dealer who, despite his 
expertise, is not always viewed as trust- 
worthy and may be seen as even less beliey- 
able if he speaks very rapidly, 

Therefore, in a second experiment Beaber 
and Miller manipulated trustworthiness inde- 
pendently from expertise to investigate 
whether or not the persuasiveness of an un- 
trustworthy communicator would be enhanced 
by rapid speech. In the first study, any effects 


of trustworthiness could have been obscured 
by the difference in expertise of the com- 
municator. Whereas in the first experiment 
the communication advocated “offshore dril- 
ling for oil,’ in the second experiment it 
argued that “The Mazda is the best car value 
around.” The communicator was depicted as 
a used Mazda salesman with something to 
g4 from his persuasiveness (low trust) or as 
. former Mazda salesman who currently 

worked in a bank (high trust). Thus, exper- 
) ise was held constant. This manipulation 
eemed most likely to produce an interaction 
vetween speed and credibility and thereby 
est the generality of speed as a cue for en- 
anced credibility. In this instance, an un- 
rustworthy fast speaker should appear to be 
less believable than an untrustworthy slow 
speaker; contrarily, a trustworthy fast speaker 
should be more believable than a trustworthy 
slow speaker, replicating the outcome of the 
first study. 

Manipulation checks showed that the trust- 
worthy speaker was in fact rated as more 
trustworthy and, furthermore, that the manip- 
ulation was independent of expertise, since 
‘he high- and low-trust speakers were rated 
as equally knowledgeable. As in the first ex- 
« periment, rapid speech enhanced ratings of 

both trust and knowledge. Further paralleling 

the first experiment, measures of persuasion 

also showed the rapid speaker to be more 
persuasive. 

In summary, then, these two experiments 

strongly argue that speaking rate operates as 

a very general and powerful credibility cue. 

The results clearly contradict information- 

processing hypotheses concerning rate of 
' ch. If fast speech had impaired compre- 
J hension, it should have likewise reduced 

rather than augmented persuasion. Alterna- 

s" tively, as argued in the first experiment, if 

fast speech had interfered with covert counter- 

arguing, it should have more strongly facili- 

| tated persuasion when credibility or trust- 
worthiness was high than when it was low. 

If, despite the data of these two experi- 
ments, one is intent on saving the compre- 
hension or counterargument disruption hy- 
potheses, one might argue that the proper 
tests have not been performed. Since both of 
these experiments were laboratory studies, 
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subjects might have construed the situation 
as a test. Consequently, the predicted inter- 
action between credibility and speech rate 
might have failed to receive support because 
the situation demanded extremely high atten- 
tion regardless of the level of the speaker’s 
credibility. Thinking that they might subse- 
quently be tested on some aspect of their re- 
tention or comprehension, subjects might have 
focused more on the content of the arguments 
regardless of the level of credibility assigned 
to the speaker. According to this view, where- 
as faster speech might indeed produce some 
minor enhancement of credibility, its more 
important, potent, pervasive, and normal con- 
sequences are those affecting information 
processing, whether decreasing comprehension 
or alternatively disrupting one’s ability to 
counterargue. Yet, in laboratory experiments, 
these effects cannot be observed; attention 
and vigilance to content is made so asymp- 
totically high by the implicit demands of the 
laboratory that the normally weaker and less 
important credibility enhancement effects re- 
main as the only ones to be observed. 

In terms of these arguments, a field setting 
might provide a more adequate test of the 
information-processing interpretations of 
speaking rate. Since the public accepts on- 
the-street interviewing with little suspicious 
questioning, test expectancies might not op- 
erate when a field experiment is concealed 
behind this type of facade. 


EXERIMENT 1 


Experiment 1 manipulated speech rate and 
speaker credibility in a field setting to deter- 
mine their effect on attitude change and per- 
ceptions of the speaker. Since in the Beaber 
and Miller (Note 1) experiments the trust- 
worthiness dimension of credibility did not 
interact with speech rate but rather func- 
tioned as a cue that unidirectionally enhanced 
credibility, no attempt was made to manipu- 
late orthogonally the competence and trust- 
worthiness dimensions of credibility. 


Method 


Subjects and experimenters. Subjects were 359 per- 
sons living in the Los Angeles area. They were indi- 
vidually approached in a public area by 1 of 14 ex- 
perimenters, each of whom conducted multiple repli- 
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cations of the design.t Experimenters were male and 
female undergraduates enrolled in an upper-division 
psychology class at the University of Southern Cali- 
fornia. 

Procedure. Each experimenter assigned a subject 
to each cell before starting the next replication. The 
order of cells was determined by random assignment 
but was constant across replications and experi- 
menters. All subjects were individually run. They 
were approached in a variety of public settings, such 
as parks, various on-the-street locations, other 
campuses, shopping malls, and door-to-door in vari- 
ous neighborhoods. 

The experimenter approached the prospective sub- 
ject and introduced himself (herself) as an inter- 
viewer from radio station KSC for the program 
People’s Forum, which was described as being con- 
cerned with topics of interest to the listening audi- 
ence. Subjects were told that tapes had been made 
containing listeners’ opinions on various topics. The 
interviewer asked the subject to listen to a tape on 
the current day’s topic (“The danger of drinking 
coffee”) and then to give his reaction to it. If the 
subject agreed, the interview was continued by 
describing the author of the tape and playing it for 
the subject. 

Persuasive communication. The communication 
was approximately 400 words in length. It argued 
that coffee is a hazardous drug; caffeine in large 
doses is a poison and can cause damage to and 
fluttering of the heart, headaches, migraines, cold 
hands, and stomach ulcers; by decreasing appetite 
and by leading to insufficient sleep, coffee lowers 
resistance to disease; by providing momentary en- 
ergy, it hides symptoms of other medical problems; 
it adversely affects teeth and gums; it is addicting 
and causes withdrawal symptoms; and in conclusion, 
like cigarettes, it should be classified by the govern- 
ment as hazardous. 

Credibility. In the low-credibility condition the 
speaker was introduced as a locksmith, whereas in 
the high-credibility condition he was introduced as 
a biochemist. To augment the credibility manipula- 
tion, errors in pronunciation, grammar, and word 
usage were inserted into the speech of the low-credi- 
bility speaker, while retaining the same content and 
meaning. After describing the speaker, the experi- 
menter played either a fast or slow version of the 
recorded message and then asked the subject to 
answer a few questions. 

Speaking rate. The speaker used in this experiment 
spoke at approximately the same rates as did the one 
in the Beaber and Miller (Note 1) experiments, The 
speaking rate in the low-speed condition averaged 
102 words per minute, whereas in the high-speed 
condition it averaged 195 words per minute. These 

rates were achieved by simply instructing the speaker 
to practice delivering the speech as rapidly and 
slowly as possible while controlling his level of en- 
thusiasm and involvement. Tapes were made on 
studio-quality equipment. Overall volume was bal- 
anced across tapes by matching average volumes on 
five sample segments of each tape. Cassette record- 
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ings were reproduced from the mo tapes and 
delivered via cassette recorders. 

Dependent measures. After listenin the tape, 
subjects were asked to indicate c ir agree- 
ment with a series of statements. nent was 
indicated on 10-point Likert scales (m at = 5.5), 
with higher numbers indicating stron zreement, 
The statements concerned global agre with the 
main contention of the communicat perceived 
speaker characteristics (knowledgeabi! perceived 
clarity of the presentation, and the | | coffee- 
drinking habits of the subject. 

Results 

Inspection of the credibility 1 ulation 
check indicated that the high edibility 
source was perceived as more know «Igeable:; 
Ms = 7.93 and 5.19 in the higi id low- 
credibility conditions, respectively (1,13) 
= 89.1, p < .001. This effect was rmined 


by testing the main effect against ‘ 
menter X Condition interaction, 


interaction was significant. This t con- 
servative test (using the significa: nterac- 
tion term) also revealed a significan' ~{fect of 
credibility on perceived understan: g: Ms 
= 8.71 and 8.07 in the high- and low . edibil- 
ity conditions, respectively; F(1,1. = 7.8, 
p < .05. The effect of speech rate o! knowl- 


edgeability approached standard els of 
significance: Ms = 6.93 and 6.12 in 


3 The authors wish to express their sincere appreci- 


ation to Susan Anderson, Evie Baker, Lauri- Brand, 
Pat Brotman, Gayle Caviglia, David Georg:, Med- 
rith Hose, Judi Kammerman, David Mace, Steve 
Mayer, David Quezada, Alan Seims, Francis Troll, 


and Jan VanSchaik for their able assistance in the 
collection of the data. 

4The high-credibility-slow-specch rate was 99.6, 
the high-credibility-fast-speech rate was 194.4, the 
low-credibility-slow-speech rate was 104.8, and the 
low-credibility-fast-speech rate was 195.8 (words 


+ 


per minute). For the two Beaber and Miller (Note | 


1) studies, a rate of about 110 words per minute 


was used in the slow-speech conditions, while a rate | 


of about 190 words per minute was used in the fast- 
speech conditions. Despite the similarity of speech 
rates, the different communicator used in the present 
study should increase the generalizability of the 
results. 

5 Different experimenters produced different mag- 
nitudes of effect with the credibility manipulation. 
Therefore, each effect was tested against its inter- 
action with experimenters. In effect, then, we are 
using a mixed model with experimenters as a random 


factor, testing each fixed effect against its interaction 


with experimenters. 
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and slow-speech conditions, respectively; F(1, 
13) = 4.4, p < .06, again, by the more con- 
servative test. Thus the high-speed speaker 
tended to appear more knowledgeable. 

An analysis of variance on the postcom- 
munication agreement measure (using number 
of cups of coffee drunk daily as a covariate) 
yielded main effects that approached signifi- 
cance when tested with the conservative ex- 
erimenter interaction error term: Ms = 6.13 
ind 5.44 in the high- and low-speed condi- 
ions, respectively; F(1, 13) = 2.80, p < .13; 
Ms = 6.28 and 5.33 in the high- and low- 
credibility conditions, respectively; F(1, 13) 
= 441, p < .06. Thus the high-speed com- 
munication was more persuasive than the low- 
speed communication. The fact that the ex- 
perimenters used in this study lacked much 
experience (which probably accounts in large 
part for the Experimenter Condition inter- 
actions) might well have reduced the strength 
of the effect of speech rate. Further, some re- 
searchers have argued that effects in the 
field are typically weaker than those in 
he laboratory (Hovland, 1959). In addi- 
lion, examination of the tapes revealed 
an instance of a glaring nonfluency (re- 
peated stuttering over one word) in the 
high-credibility—high-speed speech. As shown 
py Miller and Hewgill (1964), nonfluen- 
cies typically lower perceived credibility. 
Consequently, this should have further re- 
duced the effect of the speed manipulation. 
Despite these difficulties, it should be noted 
that the data did not support either the com- 
prehension prediction—less persuasion under 
high speed—or the counterargument predic- 
tion—a Speaking Rate x Credibility interac- 
tion showing a speed effect under high-credi- 
bility conditions but no speed effect in low- 
credibility conditions.’ Instead, they support 
the view that rapid speech serves as a credi- 
bility cue and thereby enhances persuasion. 


EXPERIMENT 2 


Although the credibility hypothesis con- 
tinued to receive general support from Experi- 
ment 1, there may still be special circum- 
stances in which speech rate functions to dis- 
rupt counterarguing. The communications 
used in both Beaber and Miller (Note 1) 
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studies and in Experiment 1 did not provide 
especially novel arguments to the listener, nor 
did they deal with particularly unfamiliar 
topics. In persuasion paradigms that offer 
novel arguments or topics, adequate process- 
ing of message content should become more 
salient. From this standpoint, it can be 
argued that for the topics used in the re- 
ported studies, subjects had counterarguments 
readily available and therefore did not experi- 
ence the high rate of presentation as being 
disruptive. Elaborate processing of the mes- 
sage was not necessary. A more sensitive test 
of the counterargument disruption hypothe- 
sis might require novel communications for 
which simple retrieval of stock counterargu- 


6 All effects are highly significant (p < .01) when 
the standard, and less conservative, within-cell vari- 
ance is used as the error estimate. However, to 
revert to this more typical error estimate means 
that these results cannot be generalized beyond the 
particular set of experimenters used to collect these 
data. This is of course normally true for virtually 
all experiments reported in psychological journals, in 
that few researchers test their effects against the 
variability of different experimenters’ outcomes. 
Most use only a single experimenter. If the results 
were significant using the more conservative error 
estimate (the interaction with experimenters), their 
external validity would be enhanced in that they 
presumably would obtain with any of the experi- 
menters from the larger population of which these 
14 constitute a random sample, 

TIn attempting to defend the counterargument 
explanation, some might argue that people primarily 
use covert counterarguing to resist persuasion when 
they are involved with or have a personal concern 
about the topic. While we have not explicitly ma- 
nipulated involvement, we can indirectly explore 
this notion by attending to an individual difference 
measure of the extent to which our subjects were 
likely to be concerned with the topic. It is reason- 
able to assume that subjects who report being heavy 
coffee drinkers are likely to be more concerned and 
involved with a speech emphasizing the dangers of 
drinking coffee. The results of the analysis of covari- 
ance argue against the view that covert counter- 
arguing is used to resist persuasion primarily when 
one is concerned with the issue. The persuasion ef- 
fects are found even though personal involvement 
(number of cups of coffee drunk per day) was co- 
varied out. If speech rate affects persuasion by inter- 
fering with counterarguing and if counterarguing 
only occurs when the topic is involving, removing 
the effect of involvement via covariance analysis 
should also eliminate any effect of speech rate on 
persuasion, 
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ments would not suffice as a means for re- 
sisting influence. Therefore, for the present 
experiment, a novel topic, “The dangers of 
hydroponically grown vegetables,” was chosen 
to ensure that subjects would be forced to 
process novel arguments. 

Although a more sensitive test of the 
counterargument hypothesis might be created 
by using a more novel communication topic 
and simply replicating one of the earlier stud- 
ies that examined the Speech Rate X Speaker 
Credibility interaction, a more potent criti- 
cism of the initial experiments would still re- 
main unresolved. The predicted Speed x 
Credibility interaction rested solely on the 
hypothesized differential salience of counter- 
arguing as a means of belief defense in high- 
versus low-credibility conditions. Although 
Kiesler and Mathog (1968) provided experi- 
mental support for the assumption that 
counterarguing is the preferred mode of de- 
fense when the speaker possesses high credi- 
bility, alternate interpretations of their data 
are possible (Baron, Baron, & Miller, 1973). 
Tn addition, since a credibility manipulation 
only indirectly affects orientation toward con- 
tent, it is difficult to specify the range over 
which content orientation is varied. Failures 
to produce the predicted interaction may only 
reflect a weak manipulation of orientation 
toward and away from content. 

A more direct test of the counterargument 
hypothesis should use variables more clearly 
related to information processing. Theoreti- 
cally, message complexity should have more 
pronounced effects on the difficulty of pro- 
cessing information. Indeed, highly complex 
messages may force the listener to restrict his 
or her covert activity and focus entirely on 
message comprehension. With complex com- 
munications, processing difficulties should be 
near their ceiling, leaving little room for speed 
effects. However, with simpler messages, an 
increased rate of speech should reduce the 
available time for counterargument produc- 
tion, thereby facilitating persuasion. In sum, 
there should be an interaction of speed and 

message complexity on attitude change. 

To test the interaction predicted by an 
information-processing model, Experiment 2 
manipulated speech rate and message com- 
plexity in a 3 X 2 factorial design using a 
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“novel” topic. Three levels of spea! 
were employed to ensure that the 
speech rate are not curvilinear and xtend 
further the generality of earlier find 


Method 

Subjects, A total of 90 males and femal hosen 
from the mall of a large open-air shoppi enter, 
participated in the study. 

Procedure. Subjects were randomly as d to 
the six cells of the design before replica The 
procedure and facade largely paralleled tha’ of Ex- 
periment 1. The experimenter approached ubjects 
individually and introduced himself as an int: viewer 
from radio station KSC who was interview peo- 
ple on the street for the program People’ wum, 
which was again described as being concer with 
topics of interest to the radio audience, jects 


were told that tapes had been made contai lis- 
teners’ opinions on various topics. The into wer 
asked the subject to listen to a tape on the t of 
the day (“The dangers of hydroponically wn 
vegetables”). If the subject agreed to hear the ipe, 
the interview was continued by describing the au hor 
of the tape and playing the tape for the eet. 
The speaker was ostensibly introduced as Charles 
Travis, a produce manager at Ralph’s Ranchi ` ft 
in Hollywood. The topic of the tape was sen 
because it had been previously determined that ‘ow 
people know what the term hydroponics means, ct 
alone understand the arguments favoring or opposir g 
hydroponic growing methods. Lacking argumenis, 
subjects are unlikely to have a defensible attitude 
regarding the value of hydroponics. 

Persuasive communication, 
was approximately 300 words in length. After dcn- 
ing hydroponics, it mentioned a number of “j0))- 
lems” with hydroponic methods: the need for <c- 
quent tests to ensure proper chemical concentra! ons 
in the growing medium, the small size of crops pro- 
duced, the excessive cost of and trouble with egot- 
ment, algae on crops, and finally, cellulat defects aid 
genetic abnormalities of crops. 

Speaking rate. Fast, moderate, and slow tapes 
were all delivered by the same person (but not the 
speaker used in Experiment 1). The average speech 
rates were 191, 140, and 111 words per minute in 
the fast-, moderate-, and slow-speech condition, 
respectively. The speaker practiced the delivery of 
the speech six times immediately before taping. In 
addition, the slow-speech version was recorded first. 
It was hoped that these measures would reduce 
differences in patterns of inflection between condi- 
tions. 

Message complexity. The content of the two com- 
munications was the same. As described above, it 
argued against the use of hydroponic methods of 
growing vegetables. In the simple communication the 
ideas were expressed in simple (vs. compound) sen- 
tences with few parenthetical or qualifying clauses. 
In the complex communication the very same ideas 
were expressed in compound sentences (formed by 
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combining the simple sentences of the simple com- 
rounication). In addition, the complex message used 

tended qualifying clauses, set in a sequence that 
vould be confusing without careful attention and 
omplete comprehension, 

Dependent measures. After listening to the tape, 
subjects were asked to indicate orally their agree- 
ment with a series of attitude statements. Agreement 
was indicated on a 10-step scale (1-10), with higher 
numbers indicating greater agreement. In addition to 
measuring agreement with the speaker, there were 

testions to measure perceptions of the speaker’s 

telligence, knowledge, and objectivity. 


results 


Since no predictions were made regarding 
main effects for message complexity (i.e., it 
was primarily included to investigate the 
resence of Complexity X Speed interactions), 
\essage complexity was analyzed by a multi- 
variate analysis of variance across all de- 
pendent variables. This analysis indicated no 
differences between complexity conditions. In 
addition, there was no hint of a Message 
Complexity X Speaking Rate interaction, as 
predicted by the counterargument disruption 
hypothesis for measures of persuasion (i.e., 
agreement with speaker), These data argue 
that speech rate does not affect counter- 
arguing. In consonance with this interpreta- 
tion, although Chaiken and Eagly (1976) 
found that complexity decreased persuasion, 
hey found no relation between their com- 
plexity manipulation and counterarguing. 

In the absence of a significant interaction, 
ihe focus should turn to the main effects for 
peed of presentation. A faster rate of speech 
produced more persuasion: F(2,84) = 4.00, 
p< 02; Ms = 4.62, 5.95, and 6.10 for low-, 
moderate-, and high-speed messages, respec- 
‘ively. Furthermore, in consonance with Ex- 
periment 1, faster speech resulted in percep- 
tions of greater intelligence: F(2,84) = 2.31, 
b> <.1; Ms = 5.70, 6.45, and 6.83; greater 
knowledge: F(2, 84) = 2.59, p < .08; Ms = 
6.46, 6.80, and 7.53; and more objectivity: 
F(2,84) = 4.41, p < .0l; Ms = 3.83, 3.93, 
and 5.76. 

Taken together, these results provide strong 
support for the credibility enhancement inter- 
pretation of speaking rate. They offer no sup- 
port for the conuterargument hypothesis and 
essentially replicate the data from the first 
experiment and the Beaber and Miller (Note 


621 


1) studies. In addition, the linear progression 
of means for both the agreement measure and 
the speaker perception measures adds greater 
generality to the earlier studies and argues 
against curvilinear effects within the normal 
range of speech rates. 


GENERAL DISCUSSION 


The data of these field studies support the 
notion that rapid speech functions as a credi- 
bility cue. In this regard, it is noteworthy that 
across several messages and speakers, as well 
as in both laboratory and field settings, rap- 
idly spoken communications were more per- 
suasive than those spoken slowly. Indeed, one 
might be inclined to assert confirmation of a 
new law: “Beware of the fast talker.” How- 
ever, it is important to point out that while 
these studies primarily manipulated speech 
rate and did attempt to control some of the 
obvious variables such as volume, enthusiasm, 
etc., that apparently naturally covary with it, 
in no sense can we claim that our effects are 
entirely or purely attributable to speed of 
speech. There may yet be residual factors, 
naturally co-occurring with rapid speech rates, 
that enhance persuasion. Only with computer- 
assisted construction of experimental materials 
can we hope to isolate speed per se as the 
critical factor. In support of our own interpre- 
tations, however, Smith, Brown, Strong, and 
Rencher (1975) have used computer-assisted 
manipulation of speech rate and have shown 
that greater competence is attributed to those 
who speak at a faster rate. 

It is also interesting that fast speech within 
the normal range apparently does not inter- 
fere enough with reception to disrupt com- 
prehension. While we have tended to empha- 
size the imperviousness of comprehension to 
speech rate, it is worth noting that it was not 
directly assessed in the present studies. Conse- 
quently, an ancillary view is that whereas 
speech rate may well affect comprehension 
(when adequately measured), comprehension 
in and of itself is not very important in the 
persuasion process, 

While we have generally argued that the 
data oppose the counterargument disruption 
hypothesis, one other interpretation warrants 
consideration, namely, the effort justification 
effect, which is derived from dissonance 
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theory. Rapid speech may enhance persuasion 
by increasing the effort required to process 
and comprehend the speech content ade- 
quately, The enhancing effects of effort on 
persuasion are well documented (Osterhouse 
& Brock, 1970; Zimbardo, 1965; Zimbardo & 
Ebbesen, 1970). Baron et al. (1973) argued 
that such effort expenditure need not involve 
physical activity but might simply be a 
psychological or cognitive process. If so, rapid 
speech might produce its effect as a conse- 
quence of the increased effort required to 
comprehend it. 

Another consideration is that the present 
research used paradigms in which the listener 
could not interact with the speaker. The im- 
pact of stylistic variables in social interaction 
paradigms may be quite different. High speech 
rate may reduce opportunities for response in 
a conversation by eliminating pauses. Under 
these circumstances, a listener may find a 
high speech rate frustrating and consequently 
might resist influence. 

Finally, the effect of the manipulation of 
complexity in Experiment 2 warrants addi- 
tional consideration. From the standpoint of 
traditional learning theory approaches to per- 
suasion, the failure to find a main effect of 
less persuasion with a more complex message 
seems puzzling. Indeed, Chaiken and Eagly 
(1976) find empirical support for this 
expectation. Yet, further reflection on and 
comparison of their procedures with our own 
suggests numerous reasons for the discrepancy 
between findings and, further, suggests cir- 
cumstances under which one might expect a 
highly complex message to reduce persuasion. 
Complexity should interfere with persuasion 
in circumstances where comprehension of con- 
tent critically affects the persuasion process. 
The following are circumstances that might 
maximize the importance of comprehension. 


1. The communication topic should disin- 
cline one to rely on expert opinion. Rather, 
one’s opinion should rest largely on one’s own 
values in combination with an understanding 
of the relevant facts. The facts should not be 
too technical but relatively easy for the lis- 
tener to understand and should have a mean- 
ing instrumental to the appropriate implemen- 


tation of one’s values when forminy a final” 
opinion on the topic. 

2. The communication must lengthy, 
The number of facts must be suffi tly sub- 
stantial, detailed, and interrelate o leave 
one puzzled about how one’s value pply to 


the “proper” concluding opinion, one 
does possess a fairly complete und: tanding 
of the facts regarding the issue. 

3. Finally, one’s task should n simply 
require that one offer an opinion nstead, 
stemming from one’s opinion, there © ould be 
action or behavioral consequences, -hich if 
not intended for oneself, should appv ‘o oth- 
ers. If there are no direct and perso] deci- 
sion consequences, the topic of comm: ‘cation’ 
should at least be one that is similar _ those 
for which there were decision conse ences 


in the past. 


When we consider the topic used by “hai- 
ken and Eagly (1976) and compare i! with 
our own, our failure to find complexity efiects 
becomes more understandable, Our ic, they 
danger of hydroponically grown ve;. ‘ables, 
was novel to most respondents, It aijued,) 
largely on the basis of unfamiliar technical 
information, for a conclusion that the listener 
essentially had to accept on faith. Further. its 
length was brief (approximately 300 wo Is). 
On the other hand, their communicatio: the 
“Victoria Company Case,” was precede‘ by 
a handout that provided background ©» for- 
mation concerning a dispute betweer they 
company and its newly organized union em- 
ployees over the fact that the management 
had failed to provide the Christmas bonus 
that the workers had typically received. For 
this preliminary information, presenting both 
union and company views, subjects were given 
approximately 5 minutes to process the infor- 
mation. Then they received the “case discus- 
sion” of a law student who was presumably 
playing the role of the lawyer brought in to # 
arbitrate the case. This “case discussion” 
constituted the persuasive communication and 
was at least three times the length of our own 
persuasive appeal, Finally, it is important to 
note that the subjects were led to believe 
that they were assisting the psychology de- 
partment in evaluating the ability of students 
at a New England law school to argue legal y 
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cases. In other words, their expressed opinion 
would have action consequences for others, 

The procedure used by Chaiken and Eagly 
(1976) to manipulate complexity largely 
paralleled our own. Yet, from the standpoint 
of our preceding arguments, it should be ap- 
parent that adequate information processing 
(comprehension) should play a much more 
central role in determining subjects’ accept- 
ance of the recommended conclusion in their 
case than in our own, In summary, whereas 
our field experiments may reflect general 
tendencies of persons to avoid complex cogni- 
tive effort and form judgments based upon 
simple decision rules (e.g., the speaker seems 
trustworthy vs. untrustworthy, etc.), there 
are other instances in which judgments are 
sufficiently important to ensure that neces- 
sary cognitive effort will be expended (e.g., 
buying a car, being on a jury). In these latter 
instances, there are apparently some specific 
situations (as described above) in which 
complexity does affect persuasion. 

In conclusion, it is important to note that 
the three hypothesized consequences of a 
fast speaking rate explored in this research 
reflect two views of the persuasion process, 
one emphasizing rationality or the importance 
of detailed information processing and the 
other deemphasizing it. The treatment of the 
persuasion process by social psychologists has 
tended to take a rationalistic view of man 
(eg., Festinger & Maccoby, 1964; Hovland, 
Janis, & Kelly, 1953; Jones & Gerard, 1967; 
McGuire, 1969). More recently, Miller (Mil- 
ler, 1968; Miller & Baron, 1973) has argued 
that this view may require modification, with 
greater weight being accorded instead to emo- 
tional and evaluative cognitions—at the ex- 
pense of information, content-oriented cogni- 
tions. 

If the impact of speaking rate on attitude 
change were mediated by either comprehen- 
sion effects or counterargument disruption, 
then a highly rationalistic view of the per- 
suasion process would have been supported. 
However, the findings, which support the 
credibility interpretation of speaking rate, are 
consistent with a less rationalistic view of the 
persuasion process. Indeed, to put it somewhat 
sardonically, it may be irrational to rationally 
scrutinize the plethora of counterattitudinal 
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messages received daily. To the extent that 
one possesses only a limited amount of infor- 
mation-processing time and capacity, such 
scrutiny would disengage the thought pro- 
cesses from the exigencies of daily life. 


REFERENCE NOTE 


1. Beaber, R. J., & Miller, N. The effects of speech 
rate on attitude change. Unpublished manuscript, 
University of Southern California, 1974. 
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Contrast Effects in Impression Formation 


David D. Simpson and Thomas M. Ostrom 
Ohio State University 


Do contrast effects in impression formation occur at the level of peoples’ cogni- 
tive representation? Two experiments were conducted in which this question 
was explored. Following the rating of either positive or negative stimulus per- 
sons (context stimuli), subjects wrote a paragraph describing their impression 
of a standard neutral stimulus person. Upon subsequent scaling, the paragraphs 
were contrasted away from the context stimuli. Because of procedural controls, 
this finding could not be attributed to subjects’ tendency to alter a given re- 
sponse language to provide maximal interstimulus discrimination. It was con- 
cluded that people’s cognitive representations can be influenced by context. 


Numerous studies have established that 
subjective judgments of a stimulus are influ- 
enced by contextual stimuli. A contrast effect 
exists in the absolute judgments of both 
physical and social stimuli in which judg- 
ments of a test stimulus are contrasted away 
from the overall context. Explanations of this 
contrast effect have focused on separating 
perceptual and response-based determinants 
(Krantz & Campbell, 1961; Manis, 1971). 
In the study of person impressions it is not 
strictly correct to speak of perceptual deter- 
minants. It is more useful to inquire whether 
the stored attributes of the stimulus person 
(collectively referred to as the subject’s cog- 
nitive representation of the stimulus person) 
are affected by stimulus context. Response- 
based interpretations refer to semantic proc- 
esses that convert a subjective response to an 
overt response, 

Because most early studies used a uni- 
dimensional but bounded response language 
anchored by verbal labels, it was impossible 
to determine whether the effect was due to 
one or both processes. The difficulty has been 
in ruling out known semantic processes to 
determine whether the cognitive representa- 
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tion is affected by context. For example, 
Campbell, Hunt, and Lewis (1957) had sub- 
jects rate the degree of schizophrenic dis- 
organization of definitions of words on a nine- 
category rating scale. This. rating procedure 
suffered from the response tendency to anchor 
the end categories with the range of stimuli 
presented for judgment and to place an 
equal number of stimuli in each category 
(Anderson, 1975; Parducci, 1965). 

In response to these problems, Campbell, 
Lewis, and Hunt (1958) argued that in order 
to demonstrate perceptual effects, it is neces- 
sary to adopt a response language that is 
(a) absolute (referring to an aspect of the 
stimulus that is  situationally invariant); 
(b) extraexperimentally anchored (not simply 
an arbitrary and transient linguistic phrase) ; 
and (c) unlimited, extensive, or extendible in 
terms of the number of response categories 
available to the subject. It has now been 
shown that contrast effects emerge even 
when response languages satisfy these cri- 
teria. Contrast effects are obtained when pitch 
is judged in terms of a piano keyboard 
(Campbell, Lewis, & Hunt, 1958), when line 
length is judged in inches (Krantz & Camp- 
bell, 1961) and when photograph-derived 
person impressions are written in paragraph 
form and subsequently scaled for favorability 
(Manis & Armstrong, 1971). 

The studies cited above clearly reduce the 
plausibility of a stimulus-range anchoring or 
equal-frequency interpretation of the obtained 
results by demonstrating that the contrast 
effect occurs even when subjects are provided 


625 


626 DAVID D. SIMPSON AND THOMAS M. OSTROM 


with an unbounded response scale. However, 
these studies fail to eliminate yet another ex- 
planation that also follows from response lan- 
guage concerns, namely, a subject’s desire to 
communicate differentiation among the stim- 
uli presented for judgment (Campbell et al., 
1958; Rosenberg & Cohen, 1966). Because 
subjects in many of the above experiments 
judged multiple test stimuli intermixed with 
multiple context stimuli, they may have been 
motivated to convey maximal discriminations 
in their judgments. That is, subjects may 
inhibit the reporting of responses that are 
implicitly evoked but which, if emitted, would 
not haye reflected the differences they per- 
ceive among the various stimuli. This differ- 
entiation tendency is especially a problem 
when only a limited number of response op- 
tions are available to subjects. 

The present research was concerned with 
determining whether contrast effects could be 
obtained when the direct effects of any tend- 
ency to convey maximal discriminations are 
eliminated. Four means were employed to 
achieve this end. First, subjects employed a 
response language capable of expressing ex- 
tremely minute qualitative and quantitative 
distinctions between persons. Impression re- 
sponses were in the form of unstructured 
written paragraphs. Second, the response lan- 
guage used to report judgments of the test 
stimulus was qualitatively different from the 
response language used in judgments of the 
context stimuli. A change in response lan- 
guage rules out the possible explanation of 
vocabulary exhaustion (i.e., avoiding overuse 
of a single response term). Third, the tend- 
ency to differentiate between responses when 
using the test stimulus response language was 
eliminated by presenting only one test stim- 
ulus, thereby requiring only one response. 
Fourth, whereas in some prior research the 
instructions to subjects contained “demands” 
to differentiate (e.g., “Make each description 
clear enough so that someone else would know 
which picture you’ve written about,” Manis 
& Armstrong, 1971; “try to be as discrim- 
inating as possible,” Campbell et al., 1957), 
subjects in the present experiments were not 
sensitized to these kinds of experimenter ex- 
pectations, 


METHOD 


General design. Subjects first rated a numi 
positive or a number of negative context s 
persons. Then they formed an impression oÍ 
tral stimulus person common to both contex: 
ditions, wrote a brief paragraph about this stir 
person, and rated the favorability of their impres 
on an eight-category scale, 

Experiment 1 consisted of a 5X 2 Replication 
Contexts between-subjects design with each subje: 
judging three context persons before judging the tes 
stimulus. Due to the unexpected absence of a reliable 
contrast effect on the rating scale measure of Experi- 
ment 1, Experiment 2 used the same design but em- 
ployed a stronger context manipulation by requiring 
subjects to judge 12 context persons before judging 
the test person. If context affects the cognitive rep- 
resentation, subjects should contrast their impression 
on the test stimulus away from the context for both 
the paragraph and rating dependent measures. 

Procedure. For Experiment 1, 153 students in an 
introductory social psychology class completed a 
short questionnaire introduced as an impression for- 
mation demonstration, Data from nine of these sub- 
jects, evenly distributed between contexts, were elim- 
inated because they used the rating scale incorrectly. 

Subjects in Experiment 2 were 160 introductory 
psychology students who participated in partial 
fulfillment of a research participation requirement. 
These subjects participated in groups of 8-30 indi- 
viduals with all subjects in a given session judging 
the same stimulus replication. Of these, 10 subjects, 
equally distributed between contexts, misused the 
rating scale and were eliminated. 

Context manipulation. Stimulus traits for Experi- 
ment 1 were selected from Edwards’ (Note 1) re- 
scaling of Anderson’s (1968) list by dividing the list 
into three negative intervals (1.37-2.12, 2.13-2.87, 
2.88-3.62), three positive intervals (4.38-5.12, 5.13- 
5.87, 5.88-6.62) and one neutral interval (3.63-4.37) 
where scale values ranged from 1 (very unfavorable) 
to 7 (very favorable). Four traits were randomly 
selected from each interval to provide the seven 
person descriptions needed for each stimulus replica- 
tion. Sampling was done with replacement between 
stimulus replications. To strengthen the context ma- 
nipulation for Experiment 2, subjects were required 
to rate their impressions of 12 context persons each 
composed of either four homogenously positive or 
homogenously negative traits, In this case, four stim- 
ulus persons were sampled from each of the three 
positive and three negative intervals for each stim- 
ulus replication. The same five neutral stimulus per- 
sons as in Experiment 1 were used as test stimuli. 

Order of presentation of context stimuli was held 
constant in each condition. For Experiment 1, all 
subjects rated the second most extreme, the least 
extreme, and the most extreme context stimulu 
persons in that order. For Experiment 2, order of 
presentation was determined randomly, subject t 
two constraints: (a) Among the first three context 
stimuli appeared one from each of the three inter 
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vals; (b) the final two context stimuli were always 
from the most extreme interval, For each of the 
context persons there was an eight-category scale 
ınchored by the labels highly (considerably, mod- 
crately, slightly) favorable to (slightly, moderately, 
considerably) highly unfavorable on which subjects 
were instructed to rate the favorability of their 
impression. 

The test stimulus person immediately followed the 
context stimuli (on the same page in Experiment 1 
and on the following page for Experiment 2). For 
each replication the test stimulus was composed of 
four homogenously neutral trait adjectives (M= 
4.00). The same test stimulus person was used in 
both context conditions within a replication. 

Dependent measures. Instead of being asked to 
rate their overall impressions of the test stimulus on 
the same scale as*had been used in evaluating the 
context persons, subjects were requested to describe 
their impressions of the test stimulus person in their 
own words in a brief paragraph of not more than 
five sentences. When all subjects had completed this 
task, they then turned to the following page and 
rated their impressions on an eight-category scale 
identical to that used in judging the context stimuli.* 

To replicate the scoring procedure of Manis and 
Armstrong (1971) and to ensure complete separation 
between context effects in response production and 
in response evaluation, all paragraphs were evaluated 
by a new group of judges. For each experiment a 
different group of 25 students in introductory psy- 
chology served as judges in partial fulfillment of a 
research participation requirement. From the para- 
graphs of each stimulus replication, five blocks of 
paragraphs were assembled such that wherever pos- 
sible, each block contained an equal number of para- 
graphs from both contexts. Within each block, para- 
graphs appeared in a different random order. These 
five blocks of paragraphs from the five stimulus rep- 
lications were then rotated through a 5X5 Latin 
square to allow five independent judgments of the 
five orders of blocks of paragraphs for each replica- 
tion. Judges rated each paragraph on a 21-point nu- 
merical scale ranging from 0 (dislikes very much) to 
20 (likes very much) to indicate how much they 
thought the paragraph writer liked the stimulus 
person. 


RESULTS 


Paragraph judgments. If context produces 
a contrast effect on the cognitive representa- 
tion, the paragraphs written.in a positive con- 
text should be more negative than those writ- 
ten in a negative context. The average of the 
five independent judgments given each para- 
graph became the basic data for a 5 X 2 X 2 
(Replications X Contexts X Experiments) be- 
tween-subjects analysis of variance. Para- 
graphs written by subjects exposed to a favor- 
able context were judged as significantly less 
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TABLE 1 


Errect or CONTEXT ON SUBJECTS’ PARAGRAPH 
DESCRIPTIONS OF A NEUTRAL STIMULUS PERSON 


Context 


Unfavorable Favorable 


Replication 


Experiment 1 


1 8.53 8.59 

2 11.85 8.84 

3 8.05 7.11 

4 10.90 5.68 

5 7.13 6.69 

M 9.41 7.38 

Experiment 2 

1 9.49 9.79 

2 11.32 7.39 

3 10.30 9.79 

4 7.99 6.29 

5 9.27 5.08 

M 9.67 7.67 

Overall M 9.54 7.53 
Nole. Scores represent the averaged ratings of five inde- 
pendent judges where 0 = dislikes very much and 20 = likes 

very much, 


favorable than those written by subjects ex- 
posed to an unfavorable prior context, F 
(1,274) = 21.15, p < .001. This effect was 
of comparable magnitude for Experiment 1 
and Experiment 2 (interaction F < 1) and 
was significant for each, F(1, 134) = 12.62, 
p<.001; F(1,140) =9.18, p< .003, re- 
spectively (see Table 1). 

Below are two representative examples of 
subjects’ paragraph impressions of an evalu- 
atively neutral stimulus person composed of 
the traits perfectionistic, cunning, prudent, and 
unpredictable after having judged three posi- 
tive or three negative context persons, re- 
spectively. Each was rated close to the mean 
of its content condition. 


Perfectionistic gives me the impression of hard to 
please, hard to get along with, overly anxious, im- 
patient with. nonperfectionist. Cunning—sly, back 
biting, fox-like ; prudent—penny-pincher ; unpredict- 


1A third dependent measure consisting of a sub- 
ject’s comparative judgment of his paragraph im- 
pression with a standard paragraph impression was 
also included, coming between the paragraph and 
rating tasks. Results from this measure were in 
weak support of contrast, combined experiments 
x2(1) = 1.89, 25> p> .10, but is not further re- 
ported here for reasons of brevity. Further infor- 
mation about this measure may be obtained from 


the authors. 
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TABLE 2 


EFFECT OF CONTEXT ON SUBJECTS’ RATINGS 
OF A NEUTRAL STIMULUS PERSON 


Context 
Replication Unfavorable Favorable 

Experiment 1 

1 3.40 3.81 

2 6.27 5.93 

3 4.00 3.57 

4 5.00 3.40 

5 4.13 4.61 

M 4.56 4.26 
Experiment 2 

1 4.13 4.60 

2 5.40 4.27 

3 4.93 4.53 

4 3.87 3.73 

5 4.87 3.53 

M 4.64 4.13 

Overall M 4.60 4.20 


Note, These ratings were based on an 8-category scale where 
1 = highly unfavorable and 8 = highly favorable, 


able—predictable to the point of always doing the 
opposite or against the norm. 

Person #4 impresses me as a highly intelligent 
individual possessed of considerable experience and 
“common sense.” He/she has not been sheltered but 
has had to fend for him/herself, probably for a sub- 
stantial period of time. The person seems ambitious 
and highly motivated, with the ability of sharp 
observation of surroundings. 


Rating scale judgment. Although the ex- 
pected contrast effect was obtained, F( 1, 274) 
= 5.28, p< .02, and was of comparable 
strength in the two experiments (interaction 
F <1), it was not very substantial in magni- 
tude (see Table 2). In fact, it was not sig- 
nificant for Experiment 1 and Experiment a 
F(1, 134) = 1.78, p < .19; F(1, 140) = 3.58, 
p < .06, respectively. 


DISCUSSION 


At what level of information processing 
does the contrast phenomenon occur in im- 
pression formation? Does contrast reflect 
changes in the cognitive representation (Hel- 
son, 1964; Manis, 1967), shifts in response 
selection (Manis & Armstrong, 1971; Schneid- 
erman & Manis, Note 2; Rosenberg & Cohen, 
1966), or some combination of these processes? 

With regard to these questions, previous 
research has reliably demonstrated a contrast 
effect when a rating scale dependent mea- 
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sure was employed. Although probleris of 
interpreting contrast with this respon’: lan- 
guage have been discussed in the intro ion 
of this article, the insensitivity of this »ea- 
sure in the present experiments is surpr ing. 
The simplest explanation of this insensi ity 


is that the rating measure contained ext) ze- 
ous “noise” because it was the last of he 
measures employed. A second explanatic: is 
that the reference scale for “favorabili 
judgments may be much less influenced 
context range manipulations than the jud 
ment scales used in prior research, Favo 
ability judgments are frequently used b 
people outside of the laboratory, and so th 
reference scale may have very strong natura 
anchors. Further, the scale is bipolar with an 
anchor point between the favorable and un- 
favorable ends. The context stimuli were 
selected to all be on one side or the other of 
that mid-scale anchor, and the test stimulus 
was selected to coincide with that anchor. 

In both Experiments 1 and 2 a highly 
reliable contrast effect occurred in subjects’ 
unrestricted paragraph descriptions of their 
impressions. The finding cannot be plausibly 
explained by subjects’ tendency to differenti 
ate between stimuli in their responses, Thes: 
results would seem to contradict the presump 
tions of some authors (Anderson, 1975 
Parducci, 1974) that the contrast effect i 
due largely if not entirely to respons: 
selection processes. Some alteration of th= 
cognitive representation appears to be in- 
volved. Consistent with this conclusion aie 
several recent studies that have looked :i 
physiological reactions of people when pre- 
sented with context-discrepant stimuli, Skin 
conductance increases when subjects hear an 
evaluatively loaded word that is discrepant 
with the preceding context (Grush, Coles, 
Ferguson, & McGee, 1973) and when encoun- 
tering threatening stimuli that are context 
discrepant (Krupat, 1974). There is also sub- 
stantial evidence that memory is better for 
semantic items that are discrepant from a 
preceding context than for items consistent 
with that context (Grush et al., 1973; 
Wickens, 1970). The kinds of response- 
selection processes proposed by others to ex- 
plain contrast phenomena (e.g., stimulus end 
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anchoring, equal frequency, and stimulus dif- 
ferentiation tendencies) seem inappropriate 
to explain these physiological and memory 
changes or the contrast manifested in sub- 
jects’ paragraph responses in the present 
studies. The present studies are most con- 
sistent with the view that peoples’ cognitive 
representations can be directly influenced by 
context. 

It is conceivable that the effect of context 
on paragraphs was mediated by subjects’ 
prior preference scales. Subjects may first 
have labeled their impression on the basis of 
the rating scale categories encountered when 
judging the context stimuli, since those cate- 
gory labels would still be salient. The content 
of their paragraph descriptions may then 
have been derived from their rating labels, a 
process known to occur under certain cir- 
cumstances (Ostrom & Upshaw, 1968). If 
ratings have thus “contaminated” paragraphs, 
there should be a high correlation between 
judges’ paragraph ratings and subjects’ own 
ratings, high enough to explain the relation- 
ship between context and paragraph ratings. 
To test this possibility, an analysis of co- 
variance was conducted on the paragraph 
judgments using subjects’ final ratings as a 
covariate. This analysis is a conservative test 
of the possible contamination, since in fact 
both measures were affected by the context 
manipulation. Although the within-cell cor- 
relation between the two measures is high 
(r = —.59), the robust contrast effect for 
paragraph responses remains, F(1,273) = 
15.75, p < .001, even with subjects’ ratings 
partialed out. 

A literal adaptation level interpretation 
(Helson, 1964) of the paragraph results 
would suggest that subjects’ underlying af- 
fective experience was influenced by context 
and that this in turn guided subjects’ writing 
of their paragraph responses. The plausibility 
of this explanation, however, is also weakened 
by the rating scale results. 
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Twelve hypotheses were proposed that specify the eminence of thinkers to be 
a function of belief structure, zeitgeist relationships, and sociocultural and 


political variables. An archival research 


neously tests individual and 
2,012 thinkers from Occidental 


design was introduced that simulta- 


generational factors. The sample consisted of 
civilization spanning 124 generations from 580 


B.C. to 1900 A.D. The dependent variable was derived from a factor analysis 
of 10 distinct measures, A multiple-regression analysis then indicated that 


Philosophical eminence is a function of: 


(a) breadth, extremism, and con- 


sistency of belief structure; (b) zeitgeist tepresentativeness, precursiveness, and 
modernity; (c) role model availability (but not ideological diversity); (d) 
political fragmentation and political instability (but neither imperial instability 
nor war intensity); and (e) historical proximity to the present, The implica- 
tions of the results and design for further research were briefly discussed, 


Almost no one can claim to be an “edu- 
cated layman” without being able to recog- 
nize the name of the philosopher Descartes, 
yet how many college graduates have even 
heard of Descartes’ contemporary thinker 
Henri de Roy? Discussion of Aristotle is fre- 
quently uttered from the lips of modern in- 
tellectuals, but who ever so much as remi- 
nisces about Aristotle’s colleague Xenocrates? 
In a nutshell, why are some thinkers “illus- 
trious” and others “buried in obscurity?” 
Because fame probably entails both indi- 
vidual and cultural factors, such questions 
should be of intrinsic fascination to students 
of personality and social psychology. And 
now that quasi-experimental designs, multi- 
variate analyses, and archival data collec- 
tion techniques are receiving more attention 
(McGuire, 1973; Simonton, 1975d, 1976a), 
there exists no reason why this curious and 
significant issue cannot be broached, I will 
attempt to do so by specifying a dozen hy- 
potheses that seem worthy of empirical ex- 
ploration. 

The first three hypotheses concern the 
functional relationship between a thinker’s 
eminence and his or her belief structure, 
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Hypothesis 1; Eminence is a positive func- 
tion of philosophical breadth. In other words 
some philosophers are one-issue advocates 
whereas others deal with a fuller spectrum of 
philosophical questions, Are the former gen- 
erally less famous than the latter? The ra- 
tionale for his hypothesis is fairly straight- 
forward. Supposedly one reason why thinkers 
continue to be of interest to posterity is that 
later generations face perennial puzzles that 
earlier thinkers have already attempted to 
fathom. The larger the number of basic ques- 
tions pondered by a particular thinker, th 
greater is the likelihood that posterity wil] 
have the occasion to consult his or her philo 
sophical discussions, and hence the greater th 
fame. 

Hypothesis 2: Eminence is a negative func 
tion of philosophical extremism. Research has 
shown that persons generally prefer and even 
rate as more creative those individuals having 
similar attitudes and beliefs (Byrne & Nelson, 
1965). The present question is whether the 
same principle applies to the determination 
of a person’s fame as a thinker, That is, will 
a philosopher be less appreciated by posteri ty 
if he or she advocates decidedly minority 
opinions? Expressed differently, will the 
thinker who adopts more conciliatory or “mid- 
dle-of-the-road” Philosophical views be more 
likely to have a larger “following” and hence 
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more extensive acclaim by subsequent gen- 
erations? 

Hypothesis 3; Eminence is a positive func- 

on of philosophical consistency. Various cog- 
nitive theories presume that individuals desire 
or prefer “consonance,” “balance,” or “con- 
tiguity” in their own belief structures (Abel- 
son, Aronson, McGuire, Newcomb, Rosen- 
berg, & Tannenbaum, 1968). Supposedly 
thinkers should be all the more expected to 
possess this predilection for rationality. But 
do their intellectual reputations actually rest 
on the logical compatibility of their beliefs? 

The next three hypotheses pertain not to 
the beliefs per se of any thinker but rather 
to the relation of those beliefs to the “spirit 
of the times” or, to use the German term, 
the zeitgeist (for a review see Stover, 1967). 

Hypothesis 4: Eminence is a positive func- 
tion of representativeness. Some thinkers ad- 
here to beliefs that are essentially equivalent 
to those advocated by the vast majority of 
What renders such 
thinkers famous, perhaps, is not so much the 
novelty of their beliefs but rather the com- 
prehensiveness with which they express the 
vevailing mores and intellectual fashions of 
their times (see Boring, 1963, pp. 5-25; 29- 
49), For instance, it has been said of Voltaire 
that his fame as a thinker is due to his being 
“so effectively the spokesman of his age, so 
characteristically its representative, so com- 
pletely its embodiment. . . .” (Redman, 1968, 
p. 40). Does Voltaire typify a more general 
law? 

Hypothesis 5: Eminence is a positive func- 
tion of precursiveness. Sometimes thinkers 
earn recognition from posterity not because 
they epitomize the spirit of their age but 
because they appear to be “ahead of their 
time,” precursors of a future zeitgeist (cf. Al- 
bert, 1975, p. 146). Yet is there a nomothetic 
relationship between precursiveness and en- 
during fame? 

Hypothesis 6: Eminence is a positive func- 
tion of modernity. In a sense this is a special 
case or corollary of the preceding hypothesis. 
That is, eminence may not so much depend 
on being a precursor of the next generation’s 
zeitgeist as on holding beliefs that appear 
“modern” to individuals living in the 20th 
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century. Although one may believe such mo- 
dernity to be a real, stable individual char- 
acteristic, this factor might be more properly 
labeled a bias, since as the zeitgeist changes, 
a thinker’s modernity rating probably changes 
as well. 

The following pair of hypotheses concern 
sociocultural influences. They differ from all 
of the preceding hypotheses in that they 
affirm that some influences on eminence may 
operate during the “developmental” period of 
a thinker’s life. This longitudinal period may 
be defined as the years prior to the 30th birth- 
day (cf. Albert, 1975, pp. 147-148; Simon- 
ton, 1975d). 

Hypothesis 7: Eminence is a positive func- 
tion of role model availability during the 
thinker’s developmental period. A recent 
transhistorical investigation has shown that 
the number of creative individuals in any 
given generation is a positive function of the 
number of individuals in the previous genera- 
tion (Simonton, 1975d). It was at that time 
presumed that the availability of role models 
was responsible, that is, that creative develop- 
ment depends on early identification with 
productive individuals. Here we are formulat- 
ing this aggregate relationship in more indi- 
vidualistic terms; Are the more famous think- 
ers more likely to have grown up in environ- 
ments rich in productive thinkers of the 
previous generation? This question is not 
equivalent to the aggregate version investi- 
gated earlier and may even have a totally 
different answer. 

Hypothesis 8; Eminence is a positive func- 
tion of ideological diversity during the think- 
er’s developmental period. One view of the 
great thinker is that of the “synthesizer” 
(Koestler, 1964, pp. 224-233). Such an indi- 
vidual draws upon the rich experiences en- 
countered early in life—intellectual contro- 
versies and contending schools of thought— 
and integrates them into a mature Weltan- 
schauung that obviates, or at least attempts 
to obviate, all future debate. Thomas Aquinas 
may have achieved something like this for 
Medieval philosophy by boldly attempting to 
reconcile the numerous and divergent ideas 
and doctrines of Aristotle, Ibn Rushd, and 
traditional Christianity. So perhaps ideologi- 
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cal diversity has a beneficial impact on the 
development of thinkers of the greatest mag- 
nitude. 

The concluding four hypotheses all involve 
political factors. As in Hypotheses 7 and 8, 
the following specify events that may have an 
influence, positive or negative, on the devel- 
opmental phase of a thinker’s life. However, 
the last hypothesis departs from this scheme 
by specifying a factor that may impair the 
eminence of a thinker by inhibiting activity 
during the productive years (i.e., between 30 
and 50 years of age; cf. Simonton, 1975d). 

Hypothesis 9: Eminence is a positive func- 
tion of political fragmentation during the 
thinker’s developmental period. “Political 
fragmentation” refers to the number of inde- 
pendent states into which a given civilization, 
in this case Occidental culture, is divided. One 
investigation has indicated this variable to be 
correlated with the number of creators per 
century (Naroll, Benjamin, Fohl, Fried, Hil- 
dreth, & Schaefer, 1971), and another study 
has provided some rationale for concluding 
that political fragmentation acts as a develop- 
mental period influence (Simonton, 1975d). 
However, both studies dealt with creativity 
as a whole and employed aggregate data. The 
present concern is whether growing up in a 
context characterized by a large number of 
sovereign political units is conducive to the 
development of the highest caliber thinkers. 
If this is true, the reason is supposedly the 
same, namely, that political fragmentation 
provides a good indicator of cultural diver- 
sity and such diversity is favorable to creative 
development (Simonton, 1975d; but see 
Simonton, in press-b). 

Hypothesis 10: Eminence is a positive func- 
tion of imperial instability during the think- 
er’s developmental period, Political fragmen- 
tation, it has already been proposed, is not 
always a good indicator of cultural diversity: 
When the number of states is relatively small 
due to the formation of large empire states, 
cultural diversity may nonetheless surface in 
the form of popular revolts, rebellions, and 
revolutions (Simonton, 1975d). Therefore, 

another indicator would be “imperial insta- 
bility,” defined as the number of civil disturb- 
ances divided by political fragmentation. 
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Again, what we wish to determine cre i 


S 


whether this factor positively influe s the 
achieved eminence of individual ikers 
rather than aggregates of creators, has 
been already demonstrated (Sin: ston, 
1975d). 

Hypothesis 11; Eminence is a ne tive 
function of political instability durin: the 
thinker’s developmental period. Unlike ivil 
disturbances that involve turmoil amon ‘he 


populace or between ruler and the rule 0 
litical instability solely involves the g 1 
ing minority, the “power elite,” Coups « 
military revolts, dynastic conflicts, pol 
assassinations, and the like all point | 
fundamentally chaotic political system. Po 
cal instability, moreover, has been showr 
have a detrimental impact on the devel: 
ment of major, but not minor, “discursiy 
creators (Simonton, 1975d). Since this vari 
ble only affects major creators, and sinc 
philosophers are among the discursive group 
of creative disciplines (Simonton, 1975c) 
political instability should negatively influ 
ence philosophical eminence, 

Hypothesis 12: Eminence is a negative 
function of war intensity during the thinker’s 
productive period, Although it is hypothesized 
that warfare during the mature years of a 
thinker’s life will hamper the full realization 
of any creative potential, this prediction is a 
long shot: Even though war is relevant to 
some cultural activities (Simonton, 1976e), 
research has not succeeded thus far in show 
ing war to have any consistent connection 
with creativity (Naroll et al., 1971; Simon- 
ton, 1975d, 1976b, 1976d). The main hope 
here is that these empirical failures are due to 
the use of aggregate rather than individual 
data (for a defense of testing plausible “null 
hypotheses,” see Greenwald, 1975). 

Admittedly, the foregoing hypotheses do 
not always enjoy a secure theoretical basis. 
Although some of the hypotheses are derived 
from past research findings, many of the oth- 
ers simply entail the kind of speculations 
that so frequently appear in discussions on 
the history of ideas, Still, the empirical verifi- 
cation or refutation of such commonplace 
conjectures has inherent scientific value, espe- 
cially since they concern variables that are 
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seldom investigated but may be necessary for 
the formation of a general socio-psychological 
heory (see Triandis, 1975). 


METHOD * 
Analytical Overview 


The above hypotheses raise a question regarding 
the appropriate research design, since they simul- 
taneously involve two different levels of analysis. 
Even though the dependent variable, eminence, is 
clearly an individual characteristic, only the three 
belief structure factors also constitute purely micro- 
level independent variables. Thus Hypotheses 7-12 
quite obviously advance macro-level independent 
variables; for instance, ideological diversity and war 
intensity are properties of generational aggregates, 
not of persons. Even more critically, the extent to 
which a thinker represents his or her zeitgeist is an 
attribute of neither the individual nor the generation 
but rather of the interaction between the two. Now 
archival designs have been introduced for analyzing 
micro-level (Simonton, 1976a) and macro-level 
(Simonton, 1975d) relationships, but what is re- 
quired at present is a design that combines both 
macro- and micro-level analyses into a single multi- 
variate strategy (cf. Scheuch, 1969). The solution 
pursued here is to retain the individual thinker as 
the statistical unit of analysis, yet to operationalize 
three distinct types of variables. First, eminence and 
the three belief structure variables are treated as 
pure individual level constructs. Second, each indi- 
vidual thinker is assigned to a specific time period 
in order to define the sociocultural and political 
context variables that characterize the generational 
level. Third, the factors of representativeness, pre- 
sness, and modernity are all defined by com- 
aring individual beliefs with the generational zeit- 
gceist, thus producing a set of individual-generational 
interface variables. These three types of variables are 
then introduced into a single multiple regression 
equation. The resulting design that simultancously 
surveys such mixed analytical levels is here termed 
an “individual-generational analysis.” 


Sample 


A sample of 2,012 thinkers was taken from those 
individuals listed in the Appendixes to Volume 2 of 
Sorokin’s (1937) Social and Cultural Dynamics. This 
source was primarily chosen because it contains 
measurements of great utility for testing some of 
the above hypotheses. Moreover, except for the fact 
that it is restricted to Occidental civilization, the 
resulting sample is very broad in scope: Over two 
dozen nationalities are represented, spanning a time 
period from 580 B.C. to 1900 A.D. 


Individual Level Variables 


Eminence, For the current investigation an attempt 
was made to operationalize philosophical fame in as 
many different ways as possible by exploiting vari- 
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ous types of sources authored by scholars of varied 
national and professional affiliations. In addition to 
avoiding an arbitrary choice, this approach permits 
us to check whether such diverse operationalizations 
will nonetheless yield congruent estimates of fame. 
Hence, the following 10 eminence measures were 
defined. 

Measure 1 was adopted unaltered from the emi- 
nence ratings estimated by Sorokin and his collab- 
orators for all of the 2,012 thinkers in the present 
sample (Sorokin, 1937, pp. 635-712). These ratings 
were based on a set of objective criteria and were 
quantified on a 1-12 scale (Sorokin, 1937, pp. 16- 
18). 

Measure 2 was adapted from similar eminence 
ratings published by Gray (1958, 1966). He rated 
creative individuals on a 0-4 scale for Graeco-Roman 
civilization (Gray, 1958, pp. 27-30) and a 3-7 scale 
for Medieval and Modern Western civilization (Gray, 
1966, pp. 1396-1417) according to a set of relevant 
criteria (Gray, 1958, p. 25, 1966, pp. 1391-1392). 
These two sets of ratings were rescaled to produce a 
1-5 scale for those individuals in both Gray’s and 
the the present sample. Sampled individuals not 
included in Gray’s study were accorded a zero rating 
for this measure. 

Measure 3 was derived from the index to the 
History of Philosophy by Marias (1967): One point 
was given for each separate page in which a thinker 
was mentioned. This measure ranged from 0 to 67. 

Measure 4 was derived from the index for Socrates 
to Sartre, a history of philosophy by Stumpf (1966): 
One point was given for each separate page on which 
a thinker was mentioned. This measure ranged from 
0 to 92. 

Measure 5 was adapted from the index-glossary to 
Barnes's (1965) An Intellectual and Cultural History 
of the Western World: (a) Two points were awarded 
if the thinker was honored with a brief annotation 
giving basic biographical data, (b) 1 point was 
given if he or she was only listed without elaborat- 
ing remarks, and (c) 0 points were assigned if he or 
she was not listed in the index at all. 

Measure 6 was taken from Kroeber’s (1944) 
Configurations of Culture Growth: (a) One point 
was given if the thinker was mentioned in Chapter 
2 (“Philosophy”) and (b) 0 points, if otherwise. 

Measure 7 was derived from the Treasury of 
Philosophy by Runes (1955); (a) One point was 
awarded if the thinker had a selection included in 
the anthology and (b) 0 points, if excluded. 

Measure 8 was derived from the Great Books of 
the Western World (Hutchins, 1952): (a) Two 
points were assigned if the thinker was actually 
included in the anthology, (b) 1 point was given if 
he or she was at least listed in the “runner-up” bib- 
liography in the second volume of the Great Ideas 


1Space is not sufficient to describe all of the 
methodological details behind this investigation. An 
1l-page supplement is available from the author 
upon request, however. 
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Syntopicon (1952, pp. 1146-1217), and (c) 0 points, 
if otherwise. 

Measure 9 was adapted from the Encyclopedia of 
Philosophy (Edwards, 1967): (a) Two points were 
granted if the thinker had a separate entry under 
his or her name, (b) 1 point was awarded if he or 
she was only listed in the index (and hence discussed 
under some other entry), and (c) 0 points, if other- 
wise. 

Measure 10 was adapted from the Encyclopedia 
Britannica (1974); (a) Two kinds were awarded if 
the thinker was given a separate entry in the Macro- 
paedia, (b) 1 point was assigned if he or she was 
only given an entry in the Micropaedia, and (c) 0 
points, if otherwise. 

A principal-components factor analysis of the 
above 10 measures supports the existence of a single 
“eminence” factor. The unrotated first principal 
component accounts for 50% of the total variance in 
the measures. Hence, despite variations due to dif- 
ferent types of archival operationalizations, measures 
of achieved eminence appear to have a high degree 
of concordance (also see Farnsworth, 1969, chap. 6). 
Since all 10 measures load very highly on the first 
factor, a linear composite was generated by summing 
the unweighted standardized scores. The correlations 
between each measure and the resulting factor com- 
posite range from .61 to .80, with an average of .71 
(all ps < .001). This factor composite, called “emi- 
nence,” comprises the dependent variable for the 
current inquiry, 

Philosophical beliefs. Sorokin and his collaborators 
(Sorokin, 1937, pp. 635-712) have rated all 2,012 
thinkers according to their adherence or advocacy of 
31 positions spread across seven major issues. These 
beliefs are enumerated as follows: Issue 1—skepti- 
cism, empiricism, criticism, rationalism, mysticism, or 
fideism; Zsswe 2—mechanistic materialism, hylozoism, 
mixed materialism-idealism, pluralistic idealism, re- 
ligious pluralistic idealism, or monistic idealism; 

Issue 3—temporalism, temporalism-eternalism, equi- 
librium of both, eternalism-temporalism, or eternal- 
ism; /ssue 4—conceptualism, nominalism, or realism; 
Issue 5—extreme singularistic individualism, singular- 
istic collectivism, singularism-universalism, universal- 
ism-singularism, mystical unity, or universalism; 
Issue 6—determinism or indeterminism; Issue 7— 
ethics of happiness, ethics of principles, or ethics of 
love. 

Since it is not the purpose of the present article 
to analyze directly the content of philosophical be- 
liefs and since detailed definitions are available in 
Sorokin’s work (or see Simonton, 1976c, 1976e), de- 

scriptions are not given in detail here, 

Belief structure. The following three independent 
variables are defined in terms of the above ratings. 


1. The breadth of a particular thinker is defined 
as the number of issues on which he or she has 
taken a positive position. Since every thinker in 
the sample has been credited with advocating at 
least one philosophical belief, this variable ranges 
from 1 to 7. 
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2. The extremism of a thinker was dete ed 
according to two criteria. First, in order a 
belief to be considered extreme, it must be 
that is, it must be advocated by a small minor if 
thinkers (viz., less than 10% of the sample) 

only qualification on this criterion was that 
tions which are rare only because they are 
recent were not considered “extreme” (e.g., Kan! 1 
“criticism” of Issue 1). Second, the belief sh: 
appear to stand at the extreme end of some sor 
continuum whenever such a unidimensional scale 
strongly implied (e.g., Issue 3). The following 
beliefs were judged to satisfy these two restrictior 
skepticism, fideism, mechanistic materialism, monist 
idealism, temporalism, eternalism, conceptualism, ex 
treme singularistic individualism, mystical unity, uni 
versalism, and the ethics of love. A thinker was 
given 1 point for each of these positions held. Sinc: 
Issue 6 had no extreme positions, the resulting ex- 
tremism measure could only range from 0 to 6. 

3. The consistency of a thinker’s belief structure 
was also defined according to a sort of “consensus 
gentium.” Rather than trying to operationalize the 
consistency of two beliefs by fiat—and thus running 
the risk of imposing the prejudices of the research- 
er’s own personality and milieu—the 2,012 thinkers 
were allowed to define their own broad “psycho- 
logic.” This task was accomplished by generating 
dummy (i.e., binary) variables for each of the 31 
positions. A correlation matrix was then calculated 
among all positions except those within the same 
issue. Those between-issue pairs of positions having 
positive product-moment correlations greater than or 
equal to .3163 were designated as “consistent ;” that 
is, two positions were deemed consistent if 10% or 
more of their variance was shared. Given the nomi- 
nal character of these variables along with the ipsa- 
tive nature of the within-issue values, this cutoff 
point is not excessively low. Further, this criterion 
involves correlations whose statistical significance 
extends well beyond the .001 level. At any rate, 24 
pairs of positions qualified as being consistent by 
these standards, and the thinker was given 1 poin! 
for each pair he or she maintained (sce Footnot: 
1). Since there were no negative correlations havin; 
the specified absolute magnitude, the consistency 
score had no negative values added to it. 


Generation Level Variables 


In order to study aggregate level variables, cach 
thinker was assigned to one and only one of 124 possi- 
ble consecutive 20-year periods from 580 B.C. to 1900 
A.D. Basically an individual was assigned to that gen- 
eration in which he or she reached the peak productive 
age, this peak being estimated as the 40th year of age 
(see Simonton, 1975a, 1975c). This generational 
assignment has two methodological consequences. 
First, it is possible to detect whether a given aggre- 
gate variable operates as a developmental or a 
productive period influence on the individual’s emi- 
nence. Thus if the relationship between the eminence 
of any thinker at generation g and a corresponding 
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sociocultural or political variable tends to attain the 
largest absolute value when the latter is also mea- 
sured at generation g, the existence of a productive 
period factor may be surmised. In contrast, if an 
aggregate independent variable maximizes its relation 
with the dependent variable when lagged to genera- 
tion g-1, then the function can be said to be devel- 
opmental. The root of this last inference is that any 
thinker who is 40 years old at generation g is 20 
years old at generation g-1, and consequently sus- 
ceptible to various developmental influences (Simon- 
ton, 1975c, 1975d, 1976e). This paradigm can only 
hold “on the average,” since the 40th birthday of 
separate thinkers actually falls on different years 
within each 20-year interval. Yet with over 2,000 
thinkers tallied into only 124 generations, such dif- 
ferences tend to average out so that the paradigm is 
closely approximated, 

The second methodological repercussion of the 
generational assignment is that it supplies a con- 
venient control variable, namely, the time in which 
the person lived or flourished. Research has shown 
that the time a person lived may be related to other 
substantive variables such that spurious associations 
may result (Simonton, 1975a, 1976a). Consequently, 
the date for the beginning of the 20-year historical 
period to which each thinker is assigned is used 
here as a control variable to guard against poten- 
tial methodological artifacts (see Simonton, 1975d). 

Sociocultural context. The next two variables are 
directly germane to Hypotheses 7 and 8, while the 
third is required for testing Hypotheses 4, 5, and 6. 


1. Since each thinker was assigned to one genera- 
tion, it was an easy next step to count the frequency 
of thinkers per generation. The result is a genera- 
tional time series that can be used as a measure of 
role model availability. For each thinker at genera- 
tion g, there is a corresponding tally of the number 
of thinkers at generation g-1; thus the latter fre- 
quency count is tantamount to a measure of the 
number of thinkers in the previous generation who 
are available for potential admiration and imitation. 
However, since this tabulation possesses a conspic- 
uous exponential growth that may be substantively 
unrealistic (Simonton, 1975d), it was subjected to a 
logarithmic transformation. This log-transformed and 
lagged frequency count is termed here role model 
availability? 

2. Ideological diversity was defined using the 
Appendixes in Sorokin’s (1937, pp. 635-712) work, 
which measure the fluctuations of all 31 positions 
from 580 B.C. to 1920 A.D. by 20-year intervals. 
These tabulations were based not only on the 2,012 
thinkers included in the current sample but also on 
various anonymous movements and events having 
direct relevance. Therefore, a measure of ideological 
diversity was generated by simply counting the 
number of philosophical positions represented in each 
generation. Again, to detect whether this variable 
functions as a developmental influence, the measure 
should simply be lagged one generation. Hence, 
according to Hypothesis 8, the eminence of a thinker 
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at generation g should be correlated with the ideo- 
logical diversity of generation g-1. 

3. Since the Appendixes actually list the represent- 
atives by position and by generation, the zeitgeist 
was defined for each generation as that position for 
each issue having a plurality of representatives. It 
must be stressed that Sorokin’s tables showing the 
percentage of the representatives for each philo- 
sophical position per issue could not be used in the 
present study, since the tabulations are not raw 
“head counts” but rather indices where each indi- 
vidual or movement is weighted according to emi- 
nence (Sorokin, 1937, pp. 16-23). Clearly such 
weighted measures would produce a methodological 
tautology, providing biased support for Hypothesis 4. 


Political context. Political fragmentation, imperial 
instability, political instability, and war intensity are 
all taken unchanged from two earlier investigations 
where they are more precisely operationalized (Sim- 
onton, 1975d, 1976e).8 It suffices to say here that all 
four measures were constructed in 20-year genera- 
tional periods compatible with the requirements of 
the present study. Hence, if Hypotheses 9-12 were 
valid, then the eminence of a thinker at generation g 
should be a positive function of political fragmenta- 
tion and of imperial instability at generation g-1, a 
negative function of political instability at genera- 
tion g-1, and a negative function of war intensity 
at generation g. 


Individual-Generational Interface Variables 


The last three independent variables all compare 
a thinker’s personal beliefs with some contemporary, 
future, or past zeitgeist. 


1. A thinker’s representativeness score was defined 
as 1 point for each individual position that was 
identica] to the zeitgeist position for his or her 
generation. Since there are seven issues, this score 
could range from 0 to 7. 

2. A thinker’s precursiveness essentially consisted 
of two special “representativeness” scores. First, an 
“ahead of time” score was calculated by adding 1 
point for each individual position that was identical 
to the zeitgeist position of the next generation but 
not of his or her own generation. Second, a “behind 
the times” score was calculated by repeating the 
foregoing procedure for the previous generation, The 
precursiveness score was then defined as the ahead 
of time score minus the behind the times score. The 
resulting measure could therefore range from —7 to 
+7. 


2Even though I believe this log-transformation 
to be preferable to the nontransformed data, all of 
the results to follow are unchanged even if the raw 
measure is used instead. 

8In the earlier of the two articles, the political 
measures ended in 1839, but in the second article 
these were extended three generations to 1899 using 
the exact same data sources and coding schemes. 
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3. Because Sorokin’s own tabulations extended 
throughout the entire generation of 1900-1920, the 
modernity measure could be based on a sample of 
thinkers (V = 298) not included in the present sam- 
ple. Therefore, the weighted measures would appear 
as more reasonable indices of the predominant zeit- 
geist in the first score years of the 20th century. 
The following positions had the highest weighted 
scores for each position: empiricism, mixed material- 
ism-idealism, temporalism-eternalism, nominalism, 
universalism-singularism, determinism, and the eth- 
ics of happiness. Hence, a thinker was given 1 point 
for each of these positions he or she advocated, pro- 
ducing a modernity scale ranging from 0 to 7. 


Data Reliability 


To the extent that it is feasible, most archival 
investigations should feature some kind of quantita- 
tive assessment of the reliability of the measures 
used. The factor analysis described above demon- 
strates the reliability of the dependent variable, emi- 
nence. The reliability coefficients for the political 
data have been reported in an earlier article 
(Simonton, 1975d). Thus, the following three items 
need to be discussed here: 


1, Although Sorokin (1937, pp. 272-273) admitted 
that his ratings of philosophical beliefs for each 
thinker are subject to some error, he did not provide 
any quantitative reliability checks. Nor is it possible 
to alleviate completely this deficiency here, since 
not surprisingly, there do not seem to exist any rat- 
ings sufficiently comparable to permit the usual kind 
of reliability calculations. Nonetheless, I have made 
quantitative comparisons between Sorokin’s ratings 
and two others, the first an unpublished survey of 
professors of philosophy and sociology by Adler 
(Note 1) and the second some space-frequency counts 
derived from the Great Ideas Syntopicon (Hutchins, 
1952); the resulting correlations are consistent, de- 
spite the wide divergences in constructs and opera- 
tionalizations (see Footnote 1). Moreover, objective 
calculations of reliability coefficients have been ac- 
complished for Sorokin’s indicators of artistic and 
s ific creativity and of war intensity and internal 
Naroll et al., 1971; Simonton, 1975c, 
!976b, 1976d). Since these comparisons in- 
variabl ld respectable estimated reliabilities, it 
se f vier that Sorokin’s measures are gen- 
erali; Finally, it should also be pointed 


out tha ide commendable efforts to ren- 
der his liable as possible: Besides ob- 
taining the ı of professional philosophers 


‘orokin took the additional 
ese raters blind to the 
(Sorokin, 1937, p. 20). 
Sorokin’s care that the 

to disprove his theory 


(see Simonton 
precaution of | 
hypotheses he wa 
Indeed, it is a testi 
resulting data can b: 
(Simonton, 1976c). 

2. The product-mome 
present measure of role © 
combined and log-transfor: 
of general, religious, and r 


elation between the 
availability and the 
weighted measures 

neous philosophy 
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used in an earlier study (Simonton, 1975c 
(N = 124, p<.001). 

3. Ideological diversity has been shown to >a 
significant contemporaneous relationship wi m- 
perial instability and a significant lagged relati hip 
with political fragmentation (see Simonto in 
press-b). Since both political fragmentation an: |3- 
perial instability are presumed indicators of cu! -al 
diversity and since the latter concept appears y 
similar to ideological diversity, these findings ma: be 
taken as indirect support for the reliability of e 
ideological diversity measure. 


ReEsuLTs 


Before a multiple-regression analysis ca 
be advanced, it is first necessary to verii 
whether Hypotheses 7-12 have correct! 
specified the temporal relationship betwee: 
their respective variables and the dependen‘ 
variable. In other words, does the functional 
relationship with the eminence of a thinker at 
generation g maximize at generation g for 
those variables hypothesized to be productive 
period influences and at generation g-1 for 
those thought to be developmental period in- 
fluences? This verification was achieved by 
comparing contemporaneous and lagged cor- 
relations between eminence and each of the 
aggregate independent variables (cf. Simon- 
ton, 1975d). The outcome dictated two modi- 
fications in the original hypotheses: First, po- 
litical fragmentation has the maximal effect 
when treated as a productive influence, and 
second, war intensity has the maximal effect 
when treated as a developmental influence 
(see Footnote 1). Due to the high autocorre- 
lations common to political indicators (Simon- 
ton, 1976e), especially in the case of politi- 
cal fragmentation (Simonton, 1975d), the 
differences between contemporaneous and 
lagged correlations were not very large, but 
the alterations were affected at any rate. Thus, 
Hypotheses 7, 8, 10, and 11 remain un- 
changed, while Hypotheses 9 and 12 must be 
respecified. 

After making the specification changes dic- 
tated by the above comparisons, eminence 
Was regressed on the 12 substantive variables 
and the methodological control variable, time.* 


*Tt must be noted that the observations are not 
fully independent, since the generational variables 
are replicated for contemporary thinkers. Further, 
the usual procedures for handling statistical de- 


( 
15 -$ 
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‘The results are shown in Table 1. Both stan- 
jardized and unstandardized partial regres- 
n coefficients (i.e., 6’s and 6’s, respectively) 
ve displayed along with the standard errors 
and the corresponding F tests. A multiple- 
regression analysis was used, since it permits 
the estimation of an effect while simultane- 
ously controlling for other effects that may 
produce spurious relationships (Kerlinger & 
Pedhazur, 1973). To illustrate, because philo- 
sophical breadth is a necessary prerequisite 
for high scores on the measures of belief struc- 
ture and zeitgeist relationships, the effects of 
the latter variables upon eminence must be 
estimated in a multivariate framework that, 
in effect, holds breadth “constant.” Hence, 
each partial regression coefficient in Table 1 
assesses the impact of the corresponding varia- 
ble upon eminence after partialing out the 
emaining dozen variables. This statistical con- 
trol function of multiple regression must be 
kept in mind as the following conclusions are 
stated: 


1. The breadth of issues that a thinker dis- 
cusses has a major impact on eventual emi- 
ence, supporting Hypothesis 1. 

2, The second hypothesis is contradicted 
by the results: The advocacy of extreme posi- 
tions has a significant positive association 
with a thinker’s fame. In other words, the 
image of the great philosopher as a serene 
proponent of moderate precepts may be false. 

3. Contrary to the prediction set forth in 
Hypothesis 3, consistency of philosophical 
beliefs is more characteristic of minor think- 
ers than of their more famous colleagues. 
Naturally, the problem here may reside with 
the a posteriori approach to defining consis- 
tency. To be on the safe side, therefore, it 
might be best concluded that major thinkers 
do not tend to associate positions across dif- 
ferent issues in the same manner as do the 
vast majority of intellectuals throughout the 
course of history. 


pendence (e.g., Simonton, 1975b, 1975d, in press-a) 
cannot be applied in this instance. Nonetheless, 
since: the dependent variable is measured on inde- 
pendent cases and since the belief structure and 
zeitgeist variables are as well, the error disturbances 
are not autoregressive and hence the statistical tests 
“should not be affected. 
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TABLE 1 


REGRESSION ANALYSIS: DETERMINANTS OF EMINENCE 
FOR THINKER AT GENERATION g 


Variable B b SE F 
Breadth 1.849 1413 170,930 
Extremism 1.367 .2425 31.67 
Consistency =1.110 1306 71,34% 
Representativeness —.744 1350 30.52 
Precursiveness —4.855 1.8363 6.99%% 
Modernity 1.198 1804 43,12 
Role model 

availability (g — 1) —.649 2850 5.01% 
Ideological diversity 

(g — 1) 1024 0428 +32 
Political fragmenta- 

tion (g) „158 086 0155 31.157% 
Imperial instability 

(i —.042 —.366 2312 2,45 
Political instability 

(g — 1) —.062 —.069 0280 6.01* 
War intensity (g — 1) —.008 —.002 0053 +12 
Time +130 001 0004 1111k 


Note, For the equation F(13, 1998) = 42.59, p < .001, 
17, SE = 6,28, and constant term = —1.765. 


4. The fourth hypothesis faired no better 
than the previous two, for representativeness 
is negatively related to philosophical renown. 
Only the lesser intellectuals in any genera- 
tion adhere wholeheartedly to the platitudes 
and profundities of the zeitgeist. This finding 
is not only substantively intriguing but also 
has methodological repercussions. Historians 
and behavioral scientists frequently assume 
that the more eminent persons of any genera- 
tion best embody, the “spirit of the time” 
(e.g., Boring, 1963, pp. 5-25; McClelland, 
1961, p. 112). But according to the present 
results, the lesser thinkers may actually be 
more indicative of the generation’s true zeit- 
geist. 

5. Yet another prediction is disconfirmed, in 
this case that of Hypothesis 5. Indeed, pre- 
cursiveness has a statistically significant nega- 
tive association with achieved eminence, which 
means that great thinkers are behind their 
times. That is, the beliefs advocated by the 
more illustrious thinker are more similar to 
those predominating in his or her youth than 
to those prevailing in the next generation. 

6. Philosophical eminence does positively 
correlate with the modernity of beliefs, and so 
Hypothesis 6 is confirmed. This result sug- 
gests that this particular modernity effect is 
actually responsible for the belief that famous 
thinkers are ahead of their times. Expressed 
differently, a surreptitious tautology may op- 
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erate as follows: First, there is a tendency to 
oyerrate thinkers who hold modern views, 
and, second, an inclination to attribute their 
fame to the fact that they appear to be in 
advance of their own zeitgeist. 

7. Role model availability has a very per- 
plexing relationship with philosophical fame: 
The association is negative, in contradiction 
to Hypothesis 7. Actually, this result may not 
be as aberrant as first appears. An earlier in- 
vestigation has shown that an unweighted 
measure of philosophical creativity has an 
autocorrelation over twice that for a weighted 
measure (.43 vs, .21) (Simonton, 1974, p. 
166). These measures were generational ag- 
gregates where the unweighted measures had 
all thinkers counted equally, while the 
weighted measures differentially tallied the 
thinkers according to their eminence (Simon- 
ton, 1975c). Hence, the autocorrelations may 
be taken to mean that lesser thinkers require 
role models to a much greater extent than do 
major thinkers, Stated differently, a major 
philosopher can develop in a relative intel- 
lectual vacuum, whereas a minor philosopher 
is much more dependent upon having thinkers 
in the past generation to imitate or criticize 
(cf. Kroeber, 1944, pp. 17-19). 

8. Contrary to Hypothesis 8, ideological 
diversity has no relationship whatsoever with 
the renown of any given thinker, Thus we 
cannot conclude that exposure to a large num- 
ber of contending schools of thought during 
development has any bearing, positive or 
negative, on eventual eminence as a mature 
thinker, 

9. Political fragmentation has a significant 
positive relationship with philosophical ce- 
lebrity. Except for the qualification that the 
influence maximizes at the productive rather 
than the developmental period, this outcome 
is consistent with Hypothesis 9. However, this 
latter qualification casts doubt on our assump- 
tion that the mediating variable is cultural 
diversity, since the latter construct is proba- 
bly only influential during the developmental 
period of a person’s life. Perhaps greater po- 
litical fragmentation means greater intellectual 
freedom and hence more opportunity for a 
philosopher to realize his or her creative po- 
tential. 

10. Imperial instability has no significant 
relation with philosophical eminence, and 
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consequently, Hypothesis 10 must be re = ted, 
Since this variable was also presumed to ‘di- 
cate cultural diversity, the latter variable nay 
have no association with the eminence o: sar- 
ticular individuals but instead may solely ve 


a relationship with the number of thi: ers 
per generation. 

11. Political instability has a signit ont 
negative developmental relationship h 
philosophical eminence, and thus Hypoth. is 
11 is confirmed. Stated another way, the m 
illustrious thinkers are less likely to grow «> 
in times plagued by military revolts, dynas‘ 
conflicts, political assassinations, and i 
like. The basis for this detrimental effect h 
already been suggested: Major philosophic: 
creativity may depend on developing a belie! 
in the world as fundamentally rational or pre 
dictable, yet exposure to events ascribed tc 
political instability may prevent this develop- 
ment from occurring (Simonton, 1975d; cf 
Lefcourt, 1966). 


12. Once again war intensity is not shown : ; 


to be relevant to creativity or eminence; Hy- 
pothesis 12 is empirically unsupportable. 

13. Interestingly, time has a significant 
positive association with eminence even after 


5It may be objected that the generational varia- 
bles lead to ambiguous conclusions regarding their 
effects on achieved eminence. For example, in the 
case of political fragmentation, it may be proposed 
that increases in the number of independent. states 
cause either an increase in the number of major 
thinkers or a decrease in the number of minor 
thinkers. Both possibilities would yield a positive 
correlation between political fragmentation and thc 
average eminence of thinkers at generation g, anc 
therefore it would not seem possible to generaliz 
the effects of that variable on individual eminence 
However, this argument overlooks the fact that the 
regression coefficient for political fragmentation is 
estimated while controlling for role model availabii 
ity. Since the latter variable has an extremely large 
autocorrelation (see Footnote 1), this variable ef- 
fectively controls for any fluctuations in the number 
of thinkers per generation (and the results are 
virtually identical if role model availability is placed 
contemporaneously with the other variables). Since 
the absolute quantity of thinkers Per generation is 
controlled for, the only way political fragmentation 
can have a positive impact on eminence is to increase 
the number of major thinkers at the expense of the 
number of minor thinkers, Hence, the micro-level 
inference is indeed sanctioned by the multiple- 
regression procedure. This same reasoning also applies 
to the other generational variables, such as political 
instability. 
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controlling for modernity (cf. Simonton, 
1976a). This residual bias is probably due to 
he fact that the modernity variable does not 
nclude all of the possible factors that render 
historical individuals similar or dissimilar to 
modern perceptions. 


Before discussing the implications of the 
above findings, it must be stressed that the 
regression equation exhibited in Table 1, 
though highly significant statistically, does 
not explain an overwhelming percentage of the 
variance in philosophical eminence. In point 
of fact, a humbling 78% of the variance re- 
mains after subtracting that attributable to 
the 13 variables. Hence, this investigation can 
only be said to broach the problem (also see 
Simonton, 1976a). 


Discussion 


Does a consistent picture emerge of the 
difference between the famous and the obscure 
thinker? To begin with, modernity and his- 
torical time may be eliminated from the pro- 
file, for both of these variables have little 
relevance for determining stable socio-psycho- 
logical attributes. What remains is an image 
of the eminent thinker as being more likely 
to cover a wide array of philosophical issues, 
to advocate extremist or minority positions 
and to combine these positions in unusual 
ways across various issues, to be concerned 
with the dominant issues of the previous 
rather than the following generation, to be 
less dependent on haying role models avail- 
able during development, and yet to be much 
more vulnerable to political influences, 
whether political instability occurs during the 
‘evelopmental period or political fragmenta- 
tion during the productive period of life. This 
picture can be plausibly connected to the 
typical profile of the creative individual that 
has emerged in contemporary research (see 
Stein, 1969). Certainly the more eminent 
thinker appears to have the characteristic 
independence of mind that is willing to pro- 
pose unpopular beliefs‘ and associations of 
beliefs and to take risks by attempting to 
cope with a large number of problems. This 
nonconforming behavior also seems to require 
more favorable political conditions such as 
political fragmentation in order, perhaps, to 
have more freedom to venture speculations 
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that would not be tolerated in times of auto- 
cratic universal states (Barnett, 1953, pp. 
65-72). That political instability also stifles 
creative development is in keeping with this 
interpretation, as has already been observed. 
Nevertheless, not all of the findings conform 
so neatly with this view. For example, why 
are major philosophers less dependent on role 
models? Is it because only by growing up in 
a relative philosophical vacuum can the indi- 
vidual avoid the excessive imitativeness that 
may cause a confining allegiance with a par- 
ticular philosopher or school? That is, are 
thinkers who grow up in times rich in philo- 
sophical activity more likely to become handi- 
capped by becoming mere “disciples”? And 
why are the more celebrated thinkers more 
prone to have beliefs similar to those of the 
past? Perhaps the major philosopher engages 
neither in preliminary spadework for the fu- 
ture nor in eloquent propaganda for the pres- 
ent, but rather all efforts are directed toward 
consolidating the achievements of the past 
into a unique synthesis having more breadth 
than anything achieved before. However, this 
is mere speculation, and only additional re- 
search can detect the flaws and virtues of the 
foregoing profile. 

A couple of final points should be stated 
appropos of future research. To begin with, 
the present project cannot pretend to be the 
last word, even though the huge sample size 
may appear intimidating. Besides the fact 
that the current variables could be better 
measured and that new variables could al- 
ways be added to the regression equation, 
there are several additional philosophical tra- 
ditions besides the Occidental that could be 
the source of new samples. Chinese, Indian, 
and Islamic civilizations are especially rich in 
this respect (Kroeber, 1944, chap. 2). Fur- 
thermore, the basic paradigm of individual— 
generational analysis might be fruitfully ap- 
plied to other disciplines besides philosophy. 
Genius could be alternatively examined within 
the context of the musical, literary, artistic, 
scientific, and maybe even political zeitgeist. 
Thus, it is the intention of this article to spur 
additional quantitative research on the rela- 
tionship between the “great genius” and the 
“spirit of the times.” 
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REFERENCE NOTE 


1. Adler, F. A quantitative study of the sociology of 
knowledge: Second report. Unpublished manu- 
script, University of Arkansas, 1952. (Available 
through interlibrary loan.) 
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Toward a Normative and Comparative Biocultural 
Social Psychology 
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This article addresses a fundamental cause of the so-called “crisis” of social 
psychology—the misplaced theoretical unit of analysis. The argument begins 
with the observation that much of the social behavior that social psychologists 
attempt to explain is “normative,” in being more characteristic of groups, 
classes, roles, and other sociocultural entities than of individuals observed at 
random. It is then assumed that the causes of normative behavior are dynamic 
properties of the same social system of which the behavior is distinctly char- 
acteristic—properties that may roughly be termed “values and beliefs.” It is 
further argued that the major theories of social psychology consist of processes or 
dynamisms that are conceptually located within the individual. An analysis of 
game theory, equity, dissonance, aggression, and attitude theories illustrates 
their inadequacy as explanations for normative social behavior. Next discussed 
are the strong implications of this analysis for the reorientation of social psy- 
chology with respect to the types of scientific principles it seeks to establish, 
the design of comparative research for the description and causal analysis of 
value and belief systems, and the relationship of social psychology to the other 


social sciences and biology. 


An ever-increasing body of self-criticism in 
the literature of social psychology seems to 
confirm the diagnosis that the field is writhing 
is of confidence” (Elms, 1975). The 
critiques of social psychology have covered 
numerous particulars but can be grouped into 
three categories. First, there is a long litany of 
methodelogical problems in experimental re- 
search, such as experimenter biases (Rosen- 
thal, 1969), demand characteristics (Orne, 
1969), “enlightenment” effects (Gergen, 
1973), the inability to handle circular causal 
processes (McGuire, 1973), and the validity- 
undermining artificiality of the experimental 
situation. Alleged ethical abuses of social psy- 
ogical research constitute a second cate- 
gory of criticism and include the deceptions 
involved in experimental manipulations (Ring, 
1967), especially leading the subjects to be- 
lieve they are harming others (Baumrind, 
1964), and the general exploitativeness 
against powerless subjects (Kelman, 1972). 


A part of this paper was presented at the meet- 
ings of the Society for Experimental Social Psychol- 
ogy at Purdue University, November 1975. 

Requests for reprints should be sent to Albert Pepi- 
tone, Department of Psychology, University of Penn- 
sylvania, Philadelphia, Pennsylvania 19104. 


The third category in the crisis literature is 
pitched on a general methodological or meta- 
theoretical level. For instance, in building its 
data base from experimental situations, social 
psychology tends to ignore the political and 
ideological context of the subject’s behavior. 
Indeed, most research contains no reference at 
all to the real society in which people live 
(Moscovici, 1972). Further, to cite a recent 
and widely read paper, social psychology is 
not a science because its data are not stable 
across time; the best we can do is historical 
description (Gergen, 1973). 

In my view, this last genre of criticism is 
the most interesting and significant because it 
goes to the essential nature of the field and 
raises questions about its ability to produce 
scientifically worthwhile results. But although 
such analyses are insightful and thought pro- 
voking, they do not, in my opinion, come to 
grips with the core problem of social psychol- 
ogy, which has obstructed progress since the 
field was first defined in textbooks nearly 75 
years ago. The core problem, as I see it, has 
two interrelated components: the failure to 
distinguish normative from nonnormative so- 
cial behavior, and the misplaced theoretical 
unit of analysis. 
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This article elaborates the diagnosis of the 
present crisis, examines some of the major 
theories of social psychology to evaluate their 
adequacy for handling normative social be- 
havior, and spells out some suggestions for a 
more comprehensive and valid social psychol- 
ogy, in terms of (a) the theoretical questions 
that need to be pursued concerning the func- 
tioning and origin of normative social be- 
havior, (b) a reoriented research methodology 
required to investigate normative social be- 
havior, and (c) the organizational relations 
that need to be fostered between social psy- 
chology and the social and biological sciences, 


Tue THEORETICAL UNIT or ANALYSIS IN 
SociaL PsycHotocy 


The argument best begins with the general 
observation that the social phenomena of 
interest to social psychologists in the real 
world, as well as the dependent variables in- 
vestigated in the field and laboratory, tend to 
be wholly or partly “normative.” By norma- 
tive is meant that such social behavior is 
more characteristic (e.g., more uniform) of 
some sociocultural collective unit than of indi- 
viduals observed at random. For present pur- 
poses, a sociocultural unit refers to any ethno- 
cultural group, class, or organizational struc- 
ture of roles, status positions, and subgroups. 

Added to the observation that social be- 
havior tends to be normative is the assump- 
tion that the determinants or sources of such 
behavior are properties of, and have their 
locus in, the same collective entity of which 
the behavior is distinctly characteristic, Tn 

other words, underlying normative social be- 
havior are dynamisms, structures, or condi- 
tions that are part of, and generated by, the 
collective system of interdependent individu- 
als or other components. By the same token, 
we may assume that a sufficient explanation 
of normative behavior cannot be derived from 
properties that are conceptually located in 
independent individuals. 
A more specific working assumption is that 
a major source or set of determinants of nor- 
mative behavior are the values and beliefs of 
the collective entity that exhibits the behavior, 
Although there are valid distinctions to be 
made among customs, rules, mores, world 
views, traditions, and the like, it does no 
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£ 
violence to the present argument to resent 
all of these by the twin concepts of va: es and 
beliefs. Conceptually, values and bei fs are 
dynamic properties of collective social tities 
that prescribe, proscribe, or otherwis: con- 
strain the attitudes, thoughts, and con ct of 


those who are psychologically part o! ‘hose < 
collective entities, 


The foregoing observations and assum) ions 
may not be too disagreeable to most cial 
psychologists. After all, most textbooks © fine 
social psychology as the part of psych: ‘ogy 
that deals with the behavior of the indi: ual 
in relation to other individuals or symolic 
representatives of the social environn nt. { 


However, such definitions are vague as to the 
meaning of the term “in relation to” nd 
definitely noncommittal as to the appropr ite 
theoretical unit of analysis for the vast assi t- 
ment of behavior with which social psychol- 
ogy deals. It is, in my view, a metatheoretical 
axiom that the causes of behavior cannot be 
clearly and validly specified unless one defincs 
the contours of the system or unit in whi h 
the causal processes operate. 

While textbooks may obscure the question 
of the theoretical unit of analysis for given 
social behaviors, the major theories of social 
psychology do not. Indeed, they fail alto- 
gether to distinguish normative from non 
normative behavior and are defined exclu 
sively in terms of hypothetical processes that 
operate within the independent individual, 
The crisis in social psychology, as far as the 
utility of its theoretical structure is concerned 
may now be succinctly diagnosed: The theo 
ries and conceptual models that have received 
most attention in scientific research over the 
last two decades cannot adequately explain 
normative social behavior, including at least 
some of the behavior which these theories 
were expressly designed to explain. F 

To look at the matter somewhat differently, 
if the intraindividual theoretical mechanisms 
have any validity at all with respect to the 
dependent variables, they may be correlates 
of the latter, both being effects of value- 
beliefs at the collective level. 

One may propose a number of hypotheses 
as to why social psychology has conspicuously 
failed to make the distinction between norma- 
tive and nonnormative (individual) behavior 
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hypotheses that bear on the history of psy- 
ology and the present organization of social 
vchology in (primarily) psychology depart- 
ents—but there may also be a logical fallacy 
ıt work constrained by research methodology. 
The fallacy is that since we measure the so- 


* cial behavior of individuals, the theoretical 


causes of that behavior must be within the 
individual. 

The foregoing arguments are no more than 
theoretical assertions, which may not be per- 
suasive to social psychologists raised in the 
(highly normative) individualistic tradition. 
Specific references to the limitations of the 

¿ major social psychological theories in the 
matter of dealing with normative social be- 
havior may be more convincing, and it is to 
this enterprise that the discussion now turns. 
\s representative major theories I have chosen 
those which, judging by the frequency of 
journal articles, appear to have, or to have 
had, the most adherents. 


“we 
*Game Theor y 


Reports of game experiments involving co- 
operation, competition, bargaining, and other 

cial behaviors indexed by mutual decision 
making once flooded our journals. The take- 
over of the game paradigm from mathematical 

ame theorists by social psychologists in the 

50s was quick and eager, for it was a 
method par excellence for rigorously repre- 
enting and observing the effects of inter- 
dependent reward and punishment structures. 
Interdependence, of course, is the basic de- 
fining characteristic of social relationships 
and group membership. The use of the payoff 
matrix to specify the kinds of control that 
players have over each other in consequence 
of the interdependence structure was cre- 
atively elaborated by Thibaut and Kelley 
(1959). One may safely say that no concep- 
tual representation of interdependence has 
been so richly descriptive and generative as 
the game matrix. 

Yet from the beginning, even before game 
theory was adopted by social psychologists, 
there was something inadequate about its 
motivational assumptions. Game theorists 
assumed that what basically drove players to 
make the choices offered by a payoff matrix 
was the individual desire to maximize the 
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specified outcomes that theoretically are 
mapped onto his or her underlying preference 
scale. Although, because of risk and uncer- 
tainty, the players did not always make the 
maximizing choice—as in the Prisoner’s Di- 
lemma game—the “pure” motive for the indi- 
vidual was to maximize potential rewards, or 
equivalently, to minimize negative outcomes. 
Even in the matrix artistry of Thibaut and 
Kelley, the assumption that individuals will 
stay in relationships as long as their level of 
satisfaction meets their expected level and is 
higher than it would be from outside alterna- 
tive sources is an assumption about individual 
motivation. The idea of being motivated to 
maximize the differences in reward (i.e., com- 
petitive motivation [Messick & McClintock, 
1968]) increased the realism of the motiva- 
tional analysis, but competitiveness is a 
quintessentially individual motivation even 
though it involves other persons as reference 
points. 

Where game theorists missed the point was 
in failing to recognize that the maximizing 
choices exhibited by game players, whether 
for absolute or relative gain, do not neces- 
sarily reflect competitive or acquisitive mo- 
tives within each of the independent indi- 
vidual players. Rather, the dominant or even 
exclusive motivation may be a collective one 
generated in the players by the state of the 
distribution of rewards at the time of their 
choices. Indeed, experiments (Pepitone, 1971) 
show that if an unequal distribution is per- 
ceived to be “just,” in the sense of being 
proportional to the respective merit of the 
players, the maximizing choices of both occur 
with a frequency so as to maintain the in- 
equality. If, on the other hand, the unequal 
distribution is considered unjust, then both 
players maximize so as to change the distri- 
bution into a more equitable one. In other 
words, decision making that characterizes the 
dyad is controlled by a shared value of jus- 
tice. This is not a claim that there are no 
individual motives in game choices but an 
assertion that a major source of the choices 
lies in the reward distribution of the group 
as interpreted by an ethical value that op- 
erates in a larger sociocultural collective of 
which subjects are a part. 

Actually, the concept of justice as equity 


644 


in terms of relative merit is probably not a 
universal value, and even in the sociocultural 
units where it does exist it is not always de- 
fined in the same way and activated by the 
same conditions. Clearly, there are other cri- 
teria of justice in the distribution of rewards 
and punishments including, for example, effort 
and degree of risk. Furthermore, in contrast 
to these “achieved” criteria of justice, there 
are, even in younger societies like ours, 
“ascribed” criteria. Thus, justice is obtained 
when rewards are proportional to caste or 
class and not at all to ability or effort, which 
may be totally irrelevant. 

In our cross-cultural experiments (Pepi- 
tone et al., 1970) on decision making, Euro- 
pean subjects rejected merit as a justification 
for an unequal reward. They acted as if in- 
equality per se were unjustified, even when it 
was proportional to merit. Their insistence on 
equality was, we were able to infer, based on 
Maoist doctrine with which the atmosphere 
was infused at the time. But in a different 
subcultural region of the same soicety, non- 
student subjects who were not involved in the 
cultural revolution were governed by the 
equity ethic, that is, both subjects in the dyad 
made maximizing choices so as to maintain 
or change the inequality depending on whether 
the latter was based on merit (Sprini, 1968). 
The fact that there are differences among cul- 
tural groups in the subscription to the ethical 
value of justice leads conveniently into a gen- 
eral discussion of equity theory. 


Equity Theory 


In social psychology, equity theory proba- 
bly began with Homans’s (1961) discussion 
of “distributive justice” in the exchange rela- 
tion. He explicitly postulated that justice is 
achieved when individuals are equal in the 
ratios of their net profits to investments. 
However, rather than pursue this normative 
concept of economic justice to its origin in 
the socioeconomic systems of which individu- 
als are a part, Homans developed a set of 
Skinnerian propositions governing the be- 
havior of the individual in the exchange rela- 
tion. The theoretical unit for the analysis of 
exchange relations now was the individual. 
Adams (1965) formulated the equitable ex- 
change more generally and potentially quanti- 
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fiably as an equality of input/outcome ra\ os, 
Adams also elaborated the cognitive and <- 
havioral consequences of inequity. He ti >- 
rized that the “just” correlations betw n 
input and outcome are learned during e 
socialization process, but he did not inqi <e 
where the ethical concept of justice thai ‘s 
learned, comes from. Indeed, the Adams 
periments (e.g., Adams & Jacobsen, 196 
deal with the consequences of an inequi 
generated in individuals who are overpai 
relative to their qualification. Of course, whi 
is really motivating the subjects is a concep 
of what is fair pay, which operates in thei 
culture but not necessarily in that of other: 
Present formulations of equity theory (Wal 
ster, Berscheid, & Walster, 1973), are ever 
more individual oriented; the distributive jus 
tice idea has all but disappeared. In genera 
form, equity acts like the hunger need: When 
the individual harms another, a tension is 
aroused, which the harm-doer then tries to 


reduce in one of a variety of ways. In the \ 


history of equity theorizing in social psychol 
ogy, the following questions have rarely il 
ever been asked: Where does this value come 
from? Is equity found in all cultures and sub- 
cultures? What historical and contemporary 
cultural conditions have created and maintain 
the value? 

In this connection it should be noted that 
the traditional approaches dealing with such 
questions as whether the concept of fairness 
and other ethical notions are learned through 
experience or develop in stages along with the 
growth of basic cognitive functions do no 
lead to an explanation of the origin of suc! 
values. 


Dissonance Theory 


The demise of group dynamics in the fifties 
(Steiner, 1974) left a vacuum into which 
nitive consistency theories flowed at | 
velocity. Because of catchy paradigms and 
some counterintuitive results, dissonance the- 
ory became a major enterprise in social psy- 
chology for more than a decade. There is no 
need to elaborate the point that the unit of 
analysis for dissonance is the individual. As 
with the other cognitive consistency models, 
dissonance is based on interactions among ele- 
ments in the individual mind. Despite the 
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huge volume of research, however, the ques- 
ion can still be raised as to whether the ob- 
rved effects of experimental manipulations 

e reflections of cognitive dynamics in indi- 
viduals or reflections of normative values and 
beliefs that prevail in certain cultural and 
subcultural groups. Thus, although the proto- 
type study of Aronson and Mills (1959) sup- 
ports the dissonance hypothesis that indi- 
viduals who have had to suffer in order to 
reach a goal will (if the degree of suffering 
cannot be reduced by distortion) enhance the 
attractiveness of the goal, it is by no means 
clear that the increased attractiveness of the 
goal is the outcome of a contradiction of cog- 
nitions. Until comparative studies are done 
with different kinds of subjects, it is at least 
equally likely that individuals are being in- 
fluenced by a shared value according to which 
goals involving sacrifice, pain, and suffering 
should be more attractive. 

Certainly, in the cultural domain there are 
numerous beliefs that are relevant to disso- 
nant situations. In religion, suffering and sac- 
rifice are the basic ingredients of martyrdom, 
and according to the Protestant Ethic, hard 
work is the ticket to salvation and afterlife 
happiness. Then there are philosophical beliefs 
about the self and how one should act when 
under adversity, such as “grin and bear it,” 
“keep a stiff upper lip,” and so on. Some of 
these aphorisms plainly suggest that as part 
of presenting the self to oneself or to others 
as unperturbed, courageous, and made of 
tough moral fiber, individuals are likely to see 
the environment as less unattractive than it 
would be without the painful trials and tribu- 
lations. 

It is perhaps because people have such 
beliefs and values concerning the desirable re- 
action to pain and suffering that they can 
readily predict the results of dissonance ex- 
periments for subjects who presumably share 
these values, From this perspective, Bem’s 
(1972) explanation of dissonance results in 
terms of self-perception processes may turn 
out to be totally gratuitous. 

The view that culturally or subculturally 
based beliefs and values account for disso- 
nance effects need not be one hundred percent 
right for cognitive theorists to become uneasy. 
The alternative to dissonance theory that I 
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have suggested is eminently viable so long as 
(a) there are no independent measures of the 
state of dissonance generated by mutually 
contradictory cognitions, and (b) there are no 
cross-cultural data to affirm that dissonance 
arousal and reduction are invariant mecha- 
nisms of the individual human mind. 


Aggression and Punishment 


From the earliest theorizing in psychology, 
aggression has been treated as an instinctual 
or learned response to frustration, More re- 
cently, notably in the work of Bandura 
(1973) and Berkowitz (1962), aggression is 
considered an instrumental or conditioned 
response. Frustration has been de-emphasized. 
But regardless of these theoretical shifts of 
emphasis, it is clear that aggression is pro- 
duced by a state or structure within the inde- 
pendent individual organism. In the research 
conducted by psychologists over the past 35 
years, there is scarcely any reference to nor- 
mative values and beliefs associated with roles 
and cultural groups as to the origin, direc- 
tion, inhibition, and form of aggression. In- 
deed, when in the context of testing the frus- 
tration-aggression theory (Dollard, Doob, 
Miller, Mowrer, & Sears, 1939) there were 
findings that suggested a normative basis for 
aggression, they were interpreted within an 
individualistic framework. 

We are talking about the clear findings of 
Pastore (1952), which showed that if one 
makes the frustration “arbitrary,” people will 
respond to it more aggressively. One widely 
held interpretation (Berkowitz, 1962) was 
that the arbitrary frustration was unexpected. 
An unexpected frustration is stronger because 
it is more difficult to cope with. The Pastore 
effect was also thought to be due to the 
greater inhibitory effect of “reasonable” frus- 
tration. Third, the unexpected frustration was 
considered a cognitive frustration which sup- 
plements the frustration on the level of action. 
Finally, in Pastore’s questionnaire study at 
least, some of the arbitrary frustrations 
seemed to involve malevolence, which prob- 
ably added to the overall level of frustration. 

One interpretation, however, has been con- 
spicuously absent, namely, that in virtually 
all of the arbitrary frustrations there was an 
apparent breach of contract that could be 
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assumed to exist between the subject and the 
agent of frustration. In essence, it could be 
readily assumed that the intensification of 
aggression was the result of an ethical viola- 
tion. 

That a role for normative ethical values in 
the causation of aggression was missed can be 
attributed to the reluctance of theorists in 
social psychology to depart from conceiving 
the individual to be the exclusive unit of 
theoretical analysis. By contrast, common 
everyday observations of aggression in the 
family, in labor disputes, in crime, and in 
political conflicts (including violent revolu- 
tions) clearly reveal a vast moral basis. 

But the role of normative values in the 
causation of aggression is not all; the target, 
inhibition, form, and timing of aggression are 
pervasively imbedded in normative rule struc- 
tures, The following familiar examples will 
suffice to sustain this point: 


1, In our principal culture—that of the 
broad middle class—men are normatively 
restrained from striking women. Of course, it 
occasionally occurs, but by contrast with 
males of the lower socioeconomic classes, it is 
infrequent. 

2. Females are also normatively restrained 
from displaying physical aggression. The re- 
straint reaches down into the lower socio- 
economic classes, although, as in the case of 
males, it is seen there more than in higher 
socioeconomic classes. The restraint for fe- 
males is not only cross-sex but within sex. 
Women aggress physically very little toward 
other women. 

3. There are obvious restraints that hold 
back the aggression of children against adults. 
Studies of child rearing (e.g., Sears, Maccoby, 
& Levin, 1957) show clearly that these re- 
straints are based on values and beliefs that 
parents try to inculcate in their children. 

There are s differences in the amount of 
aggression that is tolerated, at least in the 
category of verbal aggression. But at all levels 
of society physical aggression toward adult 
family members i ictly verboten. 

4. On the othe: _ it is right and proper 
under certain conc: for parents to strike 
their children. Indes ‘Id beating and bru- 
tality are endemic ‘hout our society, 
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although again more prevalent in the low: 
socioeconomic groups. The normative suppor 
for aggression against children is probab! 
part of a larger normative belief concernix 
the rights of holders of private property. 

5. That aggression begets aggression is 
matter of common observation. There are al: 
experimental demonstrations of it (e.g., Mı 
Clelland & Apicella, 1945). For the proposi 
tion to be valid it has to be assumed that the 
responder perceives the initiator’s behavior a: 
aggressive. As Tedeschi (Note 1) has per 
suasively written, the definition of aggression: 
is highly normative. It is a widely share: 
view that if someone strikes back in self-de 
fense, this does not constitute a violation « 
an antiaggression norm; moreover, to some o! 
servers, it may not be considered aggressi 
at all. It is interesting that persons with a; 
gressive designs know this rule and appear i 
arrange for their adversary to strike first, s 
that they can retaliate with impunity and 
sense of righteousness. 

6. There are assorted specific rules restrici 
ing aggression that apply to young males 
mainly in our urban subcultures but in some 
cases far beyond. For instance, “never hit a 
person when he’s down.” Here, the word down 
is interpreted figuratively so the rule applies 
widely. One should not hit a person wearing 
glasses, one should never shoot a person in 
the back, and so on. In other words, fair 
aggression stipulates that the parties to a 
dispute be given a chance to defend them- 
selves. One cannot help but note that these 
rules of the street have their parallel in inter- 
group conflicts and specifically in war. Thus, 
nations at war are not supposed to use gas, 
civilians are not supposed to be harmed, and 
captured enemy soldiers are not supposed tc 
be shot or tortured, and so on. Verbal aggres- 
sion is also under the control of normative 
rules. It is not good form to talk behind an 
other’s back, to bite the hand that feeds, ot 
to badmouth colleagues to strangers. 


With regard to the administration of pun- 
ishment in various contexts, it is obvious that 


vi 
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ethical values exercise a profound influence. ` 


So, for example, it is not fair to judge some- 
one as guilty if that person did not have some 
direct causal, or indirect accessory, responsi- 
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bility for the violation. Thus, it is frequently 
he case that physical or mental conditions 
cat impair rationality of judgment or impulse 

atrol justify a discounting of guilt. Simi- 
uly, if the violation was committed under 
coercion, there is discounting. By the same 
token, even if a person did not intend a vio- 
lation, he or she may be blamed with negli- 
gence, However, the more intended and delib- 
erate the violation, the more guilty the vio- 
lator is judged to be. Finally, it appears to be 
a widespread ethical value that punishment 
for a violation should be proportional to the 
degree of blame for, and the degree of serious- 
ness of, the violation (Pepitone, 1975). 


Attitudes and Attitude Change 


To the casual observer, one of the most 
striking characteristics of political, social, and 
economic attitudes is their normativeness. 
Members of various sociocultural groups tend 
to hold attitudes that are more uniform than 
individuals surveyed at random. As unambigu- 
ous as this point is, social psychologists have 
missed even describing it, much less develop- 
ing its theoretical implications. For example, 
n the study of national and ethnic stereotypes 
almost the entire emphasis has been on the 
subjects’ attributions of laziness, intelligence, 
and the like to these groups, with little or no 
attention to the normativeness of such stereo- 
types for the ethnocultural groups to which 
the subjects belong. 

‘The simple fact is that attitude and Attitude 
change theory in social psychology have been 
based almost exclusively on mechanisms with- 
in the individual. Attitudes are considered 
nal dispositions that develop by a process 
rning. Attitude change is based on the 
incentive value of the persuasive message or 
upon dissonance reduction. 

In a study of “cosmopolitanism,” Clancy 
and Pepitone (Note 2) found large differences 
between Catholic and Jewish undergraduates 
in attitudes toward the following six moral 
issues: war, capital punishment, hunting, abor- 
tion, suicide, and euthanasia. How does the 
dominant attitude theory account for this? 
The attitude development model would essen- 
tially propose that the individuals within 
each ethnic group independently learned simi- 
lar attitudes because of similar reinforcement 


histories, and that the reinforcements were 
different between groups. Although descrip- 
tively plausible, such a formulation begs the 
question of why certain attitude positions (or 
any position, for that matter) are rewarded 
by the socializers of one group and not 
rewarded or even punished by those of an- 
other. Similarly, the theory of changing atti- 
tudes by providing individuals with incentives, 
by itself, says little, if anything, about the 
basis or origin of the substantive incentives. 
On the other hand, if attitudes are normative, 
the appropriate theory of attitude change is 
centered on that (eventually predesignated) 
collective entity in which the norm may have 
originated and is being maintained, Ideally, 
the theory would consist of a specification of 
the conditions underlying the formation and 
change of the norm and would apply, more or 
less, to all individuals—dead, presently alive, 
and not yet born—who are (were, will be) 
part of the collective entity, without the neces- 
sity of any assumptions about incentive or 
dissonance mechanisms in each and every 
individual. 


METHODOLOGICAL IMPLICATIONS 


If social behavior is entirely or partially 
normative and determined by the values and 
beliefs of sociocultural groups and other col- 
lective entities, a change in the research strat- 
ery of social psychology is indicated. To de- 
velop valid and comprehensive theories of 
normative social behavior, three stages of 
research are necessary. First, in order to iden- 
tify the social behvaior that is normative and 
to discover what are the important value and 
belief systems underlying the social behavior 
of scientific interest, it will be essential to 
compare sociocultural collectives with respect 
to such behavior. Cross-cultural, cross-subcul- 
tural, cross-organizational, and cross-temporal 
research will need to become the stock in 
trade of social psychologists instead of the 
occasional research design. Comparative re- 
search which maps and defines various value 
and belief systems is only a descriptive first 
step, however. The second research stage is 
analytic; it involyes a study of the structure 
and functioning of the value and belief sys- 
tems already identified. That is, social psy- 
chologists will need to know something about 
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the organization of value and belief systems, 
what the components are and how they inter- 
act. In the analytic stage, we are also in- 
terested in the particular conditions under 
which the values become prescriptively or 
proscriptively operative and in the sanctions 
that are applied when they are violated. 

It is important to note the crucial role of 
controlled manipulational experimentation in 
the analytic stage. This role arises from the 
fact that values and beliefs do not exert their 
influence on social behavior automatically and 
continuously. Rather, they become relevant 
and activated only under certain stimulus con- 
ditions. For instance, it is probably correct to 
assume that widely prevailing values concern- 
ing the fair distribution of rewards motivate 
and guide the conduct of the people in certain 
cultures and subcultures when the situation 
calls for the distribution of rewards, when 
the existing distribution is unfair, when there 
is a degree and kind of surveillance, and so on. 
Tt seems evident that to learn in a precise way 
about the conditions under which values and 
beliefs are relevant and active, the experi- 
mental manipulation of situations is required. 
In research on normative behavior, however, 
experiments are designed to tap the values and 
beliefs of sociocultural groups rather than in- 
ternal mechanisms of the individual organism. 
This point about the potential and active 
state of values has often entered into discus- 
sions about the determinants of prosocial be- 
havior. Darley and Latané (1970), for in- 
stance, question the usefulness of values in 
understanding helping behavior (or its ab- 
sence) on the grounds that the situational 
rewards and costs affecting the individual ap- 
pear to account sufficiently for the observed 
behavior. While agreeing with these authors 
that an explanatory concept is of no use unless 
it can be demonstrated to have reliable ef- 
fects, I do not agree with the basis for the 
claim that values play no role in helping be- 
havior. There are two major considerations 
here: As just discussed, (a) values are not 
continuously active but are activated only 
under certain situational conditions, and (b) 
even when active, values operate along with 
other dynamics, sometimes competitively. 
Thus, under any given set of circumstances, 
observed behavior need not directly reflect 
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values for this concept to have scio ‘fic 
validity. 
There is an important distinction i> be 


made in the analysis of values and be is, 
particularly as they function in prosocia’ id 
antisocial behavior. In this article it has en 
argued that normative social behavior is <e- 
termined by the prescriptive, proscriptive, or 
constraining influence of values and be! ‘s 
generated and maintained by groups and o't 
collective units. Observations of antisocia! 
havior suggest that values and beliefs can « 

be used merely verbally by individuals 
justify certain conduct. That is, individu 
who violate one set of values may attempt 
excuse themselves by claiming commitme 
and loyalty to another set. Thus, for th 
sands of years, individually and en mas 
people have been slaughtering people and i 
many other ways breaking deeply internalized 
moral codes for, as they see it, some higher 
order or more compelling set of moral impera 
tives. Sometimes, one vaguely senses ihat 
these moral arguments are based upon a sir 
cere ethical calculus; at other times, howev« 
they represent pseudo, merely verbalized con 
pliance with values for self-serving purposes 
Similarly, when some individuals help others 
in need for entirely selfish reasons, such as 
the expectation of reward, they often refer to 
ethical values when explaining the motivation 
for their “altruism.” The general distinction 
between mere compliance with group stan- 
dards and true acceptance of them has been 
made often enough in theoretical discussions. 
The emphasis has been on the individual, on 
the rewards and threats of punishment tha‘ 
determine if there is only compliance or ac 
ceptance as well. It seems to me not unlikely 
however, that the appeal to moral values ti 
justify the violation of values is normative, 2: 
well as an individual defense mechanism. 
Thus, when nations go to war, when ethno 
religious groups try to exterminate each other, 
when whole societies engage in a policy of 
genocide for such stated values as “to pre- 
serve our way of life,” “to destroy the infi- 
dels,” “for God and country,” and “to purify 
our race,” we are observing normative be- 
havior, presumably the prescriptive outcome 
of a world view or self-image operating at 
the collective level, 
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The third stage of inquiry for a social psy- 
hology of normative social behavior is the 
.ost important and most divergent from the 
ay social psychology functions and is or- 
ganized at the present time. It deals with the 
origin of normative values and beliefs. It must 
be acknowledged at once that this formidable 
task cannot be accomplished by social psychol- 
ogy alone. There are numerous families of 
variables that causally relate to the formation 
of value and belief systems, and these are tra- 
ditionally “handled” by the gamut of social 
sciences, biology, and other disciplines. In its 
proximity to individual psychology, social psy- 
chology has access to only one family. It is 
obvious that to play a leading role in the de- 
velopment of scientific knowledge about nor- 
mative social behavior, social psychology must 
be placed in closer working contact with the 
other relevant disciplines than is presently the 
case. Conventional academic interdisciplinary 
arrangements are not necessarily optimal for 
this scientific mission as compared with mixed 
groups, outside as well as in the university, 
established even on ad hoc bases for research 
on a designated problem area (e.g., the origin 
of various religious beliefs, or values about 
work). On the other hand, the traditional 
fixed position of social psychology within uni- 
versity psychology departments has narrowed 
the theoretical focus of social psychologists to 
he independent individual and has fostered a 
diffusion of responsibility for dealing with 
large amounts of social behavior to other 
disciplines with which there is relatively little 
contact.* 

Although the variables that determine the 
form and substance of value-belief systems 
ire numerous, it appears possible to make a 
rough classification of them at least for heu- 
ristic purposes. In each of the following cate- 
gories, I will illustrate in a highly abbrevi- 
ated way the kind of data that need to be 
collected for comprehensive and valid theories 
of normative social behavior. i 


Environments and Economics 


The physical environment in which human 
organisms live and the economic system of 
human societies, determined in part by phys- 
ical resources, appear to influence powerfully 
the values and beliefs that develop in these 
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societies. Ethnographic studies compellingly 
suggest that where chronic drought and land 
infertility have kept populations at bare sub- 
sistence levels, fatalistic and pessimistic world 
views become endemic. A prevailing belief is 
that all resources—psychological as well as 
physical—are in permanent short supply. 
Foster (1965), an anthropologist, has identi- 
fied this belief, or as he calls it, “concept of 
the limited good,” in many peasant popula- 
tions in different parts of the world. In es- 
sence, the distribution of resources (generally 
perceived to be made by external sources) is 
viewed as following a zero-sum game strategy. 
That is, any resource received by others out- 
side the extended family or kinship network 
is felt to be a proportional loss for those who 
observe the other’s “gain.” Another anthro- 
pologist, Davis (1969), has described in detail 
the elaborate rule system concerning honor 
observed in Mediterranean society (see also 
Schneider, 1971). The analysis makes clear 
that this structure of beliefs and values 
underlying the intense and continuous compe- 
tition over the defense and acquisition of 
honor in virtually every area of life has its 
roots in land scarcity and aridity. 

There is some evidence for a “limited good” 
world review in samples drawn from what may 
be classified as a poor society in all sectors 
of the economy. Using a standard experimen- 
tal procedure involving the making of simple 
money bets, L’Armand and Pepitone (1975) 
have shown that students from South India 
who were presented with a series of wagers, 
the outcomes of which would reward another 
peer stranger without any objective loss to 
themselves, displayed in their wagering much 
less of a tendency to “maximize” rewards for 
the other than did comparable American sub- 
jects presented with the same series of bets. 
Of course, we are not certain if the econom- 
ically deprived context in which these subjects 


1M. Rosenbaum has reminded me of an early 
explicit diffusion strategy stated by Miller and Dol- 
lard (1941) in their influential book on social learn- 
ing and imitation. According to these authors, the 
task of the psychologist interested in such social be- 
havior is to specify the principles of learning; the 
task of the social scientist is to specify the social 
and cultural conditions or context in which the 
learning takes place. 


650 


live caused them to act as though they were 
in a zero-sum, purely competitive game, which 
they were not. But consistent with the infer- 
ence that these subjects shared a limited-good 
concept was a significantly positive correla- 
tion between the economic status of the sub- 
ject’s family and his degree of maximizing 
reward for another, 

Concerning the economic basis of beliefs 
and values, there is consistent evidence that 
groups of high economic status tend to de- 
velop political and social orientations that are 
designed to protect their economic advantages. 
This routine observation is generally in ac- 
cord with Marxist theory that control over 
economic resources and the relationship to the 
production process bring about a differentia- 
tion of classes that hold opposing positions 
over a wide range of political and social 
values. However, even when refinements are 
made in the definition of economic status, the 
correlations are not as high as the economic 
determinist position would predict. It is obvi- 
ous that there are sources other than sheer 
economic interest involved in the formation 
of political and social values. For example, 
voting preferences and liberal—conservative 
stands on many political issues are influ- 
enced by ethnic and religious values as 
well as economic status. The question of how 
economic interest combines with other values 
to affect beliefs about political and social 
issues is ripe for social psychological analysis. 


Sociopolitical Conditions 


Beliefs and values appear to evolve from a 
history of social and political conditions ex- 
perienced by groups and other collective enti- 
ties. Even a casual reading of history suggests 
that the set of conditions including oppres- 
sion, slavery, privation, and persecution play 
an important role in the development of reli- 
gious beliefs, Although there are many theo- 
ries concerning the origin and function of 
religion (Evans-Pritchard, 1965), certain con- 
cepts seem to be expressly designed to avoid 
unbearable hardships and suffering imposed on 
people by mean and heartless rulers, owners, 
bosses, and other oppressors. For instance, 
there is the concept of the soul or equivalent 
spiritual and nonphysical part of the human 
Personality. Because it exists for eternity, 
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because of its freedom to move in and «it 
of bodies and places, and because it lack 
substantive identity, it is invulnerable 
attack, insensitive to pain, and indestructi! 
The concept of an afterlife, although div: 
in form, is central to most religions. It is 
anticipation of salvation, serenity, happin« 
and justice that offers relief and vindicati: 
of the oppressed in advance. 

The study of the origin of religious belie: 
is obviously not experimental; it is mor 
akin to the comparative study of anima! 
species from an evolutionary perspective. One 
“collects” the beliefs, classifies them in vari- 
ous ways, and attempts to discern relation- 
ships between the criteria by which they 
are classified and the social and political con- 
ditions dominant in the history of the cultura! 
area in which the beliefs are found, 


Individual Psychology 


Although one source among several, indi- 
vidual psychology no doubt has considerable 
input to the formation of collective values and 
beliefs. When individuals discover that they . 
have needs and fear in common, they are apt 
to make collective arrangements to take care 
of them. Rules, values, traditions, and the like 
represent such arrangements. For instance, 
there are strong indications in the research 
literature that human individuals prefer a 
somewhat orderly and predictable social and 
physical environment, one over which they 
have a degree of control. From an evolution- 
ary point of view, there may well be a need 
for generalized control built into the species, 
for it is eminently safe to assume that indi- 
viduals who have it enjoy advantages in the 
struggle for existence. It is the possibly uni- 
versal need in individuals to control against 
the aggression of others that figures in the 
formation of ethics. That is, when individuals 
in a primal clan or herd became aware of their 
common fears based on the ever-present 
danger of each other’s aggression, the process 
of building moral codes and ultimately statu- 
tory laws was set in motion. As Freud (1952) 
argues in Totem and Taboo, the process of 
establishing a collective morality also involves 
the renunciation of individual instinct, Pre- 
sumably, Lerner’s (1970) concept of a need 
to believe in a “just world” is an expression 
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of a deeper demand for order in individuals 
and a determining element in the construction 
of ethical belief systems at the collective level. 

There are other individual needs that theo- 
retically play a role in collective formations. 
For example, the need for stable perceptual 
frames of reference (Sherif, 1936) has been 
shown to produce norms involving judgments 
of physical stimuli. The need for “social 
reality” (i.e., to validate one’s opinions) has 
been shown to be a basis of opinion uniform- 
ity in the small group (Festinger, 1950). In 
considering the latter as a model for belief 
and value formation in the collective, it is 
well to bear in mind that there is no direction 
inherent in the norm-producing individual 
need for opinion validation and, hence, no 
close accounting for the content and strength 
of the value and belief norms to which the 
need gives rise, That is to say, the collective 
norm is the result of an interactional mix 
of whatever array of unvalidated opinions 
happens to be displayed by individuals when 
they meet for the first time. 


Biology 


If social psychology has neglected the col- 
lective basis of many social behaviors, indi- 
viduals psychology has neglected those sources 
of behavior built into the species through 
evolutionary selection. From its beginning, 
scientific psychology has preferred to investi- 
gate and theorize about physiological proc- 
esses inside the individual. Only in relatively 
recent years has the field of ethology stimu- 
lated interest in the evolutionary basis of 
behavior. Although the evidence from etho- 
logical observations of subhuman animals is 
mainly analogical, there seems little question 
that human social organization—the forma- 
tion of social relationships and structures of 
relationships—has some biological basis. Of 
interest in the present discussion is the role 
of evolutionary biology in the formation of 
beliefs, values, and other collective sources of 
social behavior. Ethological analysis suggests 
that some of these collective forms may have 
developed out of a need to supplement or cor- 
rect built-in biological determinants of social 
behavior. 

An example derives from considering that 
the extended period of sexuality in the human 
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female (as opposed to the restricted period 
of heat in lower primates) is thought to 
have evolved because it helps preserve and 
strengthen male-female bonds and, thereby, 
the integrity of the family unit. But while 
the biological facilitation of sexual relations 
could have this felicitous consequence, it 
clearly does not ensure against multiple, con- 
tinuously competing males surrounding the 
female and her children. Indeed, although 
such male competition may have long run bio- 
logical adyantages for the species, it would 
not seem optimal for constraining continuous 
and long-term protection of children on the 
part of either male or female. The short- 
comings of biological evolution, therefore, 
must be supplemented by values, customs, 
and institutions such as monogamy, marriage, 
courtship, and various taboos against certain 
pair bonds. 

Another example of biological influence on 
collective values and other forms is suggested 
by the biologist, Wilson (1975), who states 
that “reciprocal altruism,” which strongly 
characterizes human social exchange, or rather 
the balance between altruistic and egoistic 
orientations, may be “selected in evolution at 
the level of the tribe or larger collective. 
Similarly, Campbell (1975) has cogently 
argued the possibility that the restraint of 
impulses and other “superego” functions pre- 
scribed by most religions and ethical systems 
could have resulted from the pressure of 
biological evolution. 


CONCLUSION 


From a formal point of view, major theories 
of contemporary social psychology (e.g., game 
theory, dissonance, and similar cognitive 
models, equity theory, attitude theories, and 
theories of aggression) refer essentially to 
dynamic processes that operate within the 
individual independently of others in the so- 
cial environment. This article raises questions 
about the scientific utility of such a theoretical 
orientation, arguing as follows: 


1. Much of the social behavior we observe 
in the real world and laboratory—including 
attitudes toward environment and self, deci- 
sion making, prosocial conduct, aggressive þe- 
havior, and many other dependent variables 
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under active investigation by social psycholo- 
gists—is normative, in being more character- 
istic of definable groups, organizations, and 
other sociocultural collectives than of indi- 
viduals observed at random. 

2. Theoretically, the basic sources of such 
normative social behavior are values, beliefs, 
and other conceptual dynamisms that origi- 
nate in and are maintained by the collective 
entities of which individuals constitute inter- 
dependent parts. 

3. Although the theoretical processes within 
individuals may affect the impact of the in- 
fluences stemming from collective values and 
beliefs (i.e., lead to individual differences in 
behavior and even contribute to the formation 
of values-belief systems), they cannot exclu- 
sively account for these sources of normative 
social behavior. 


The fundamental inadequacy of a purely 
individual psychology to account for a large 
portion of the behavior that social psycholo- 
gists try to explain recommends a reorienta- 
tion in both theory and research methodology, 
as well as in the organizational matrix in 
which social psychology functions. 

Knowledge from three stages of research 
and analysis will be necessary for a redirected 
social psychology: (a) the identification of 
normative social behavior and a description 
of the value-belief systems that are its source 
through a comparison of sociocultural groups 
and other collective entities; (b) analyses of 
the structure and functioning of value-belief 
systems, including experimentally based speci- 
fications of the conditions under which they 
are activated; (c) focused investigations of 
the origin of selected value-belief systems, 

involving the collection and integration of 
knowledge from such areas as the physical 
environment, economics, social and political 
history, individual psychology, and evolu- 
tionary biology. 

In general perspective, social psychology 
belongs in a large biocultural context, in close 
working relationships with other social sci- 
ences and biology. At the same time, our 
discipline needs to become less dependent for 
its theoretical inspiration on general individ- 
ual psychology, whose concepts are inadequate 
for the scientific task adumbrated above. 


ALBERT PEPITONE 
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Previous authors have assumed that compensation and derogation of an in- 
nocent victim are mutually exclusive alternatives. A study was done to test this 
alternativistic model. A total of 136 female undergraduates observed a con- 
federate receiving a series of ostensibly painful shocks under conditions preyi- 
ously found to induce high levels of derogation. Subjects were given a chance to 
compensate the confederate either before or after rating her personality. Con- 
sistent with predictions, subjects who observed suffering compensated sig- 
nificantly more than controls, regardless of whether they had derogated the 
victim before being given the opportunity to help. Although derogation had no 
attenuating effect on helping, a prior opportunity to aid the victim tended to 
diminish derogation of her personality in a manner consistent with prior re- 
sults. Implications for traditional “hydraulic” explanations of reactions to 


suffering are discussed. 


Throughout their history, social psycholo- 
gists have shown a flair for demonstrating 
the human tendency to distort reality in a 
manner designed to minimize personal dis- 
comfort. The tendency to derogate victims, 
presumably to maintain a conception of a 
“just world” (e.g., Lerner, 1971; Lerner & 
Simmons, 1966) would seem to be one such 
defensive mechanism. While recent research 
(Aderman, Brehm, & Katz, 1974; Cialdini, 
Kenrick, & Hoerig, 1976; Godfrey & Lowe, 
1975; Sorrentino & Boutilier, 1974) has indi- 
cated that victim derogation is likely to occur 
under somewhat more restricted circum- 
stances than Lerner and his colleagues have 
implied, it is the case that under some cir- 
cumstances (e.g., when the actor feels at least 
partly implicated in the victim’s suffering) a 

fering person will be devalued. 

Previous research has also shown that 
j ton is unlikely to occur if the observer 
t ome way compensate the victim 
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(Lerner & Simmons, 1966; Mills & Egger, | 
1972), and it has been concluded by those < 
who have reviewed the literature in this are: 
that the processes of compensation and justi 
fication represent alternative means of equity 
restoration (Lerner, 1974; Walster, Berscheid, 
& Walster, 1973). This implies that once 
derogation of a victim has occurred (and 
equity restored, presumably), subsequent op- 
portunities to compensate the victim will be 
declined, As Lerner (1974) states, “Once the 
rejection is accomplished, the observer can 
again rest easy—his world is just, and he 
need not feel impelled to act to re-establish 
justice” (p. 344). In fact, if derogation doc 
serve to re-establish equity, it would again 
produce inequity (in the opposite directior } 
if the observer were to subsequently aid the 
victim. According to this account, then, the 
Prospects for such a victim are dim ones 
indeed, Observers who have no immediate 
opportunity to compensate an innocent viciin 
are likely to derogate the unfortunate in or< 
to restore justice and, as a result, are less 
likely than before to provide compensation, 
But what support is there for a “hydraulic” 
model of equity restoration such that once 
justice is achieved by means of derogation 
there will be a reluctance (or at least no 
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need) to employ the device of compensation? 
While there is evidence that after compensa- 
‘on has occurred derogation becomes less 
wobable (Lerner & Simmons, 1966; Mills 
& Egger, 1972), there appear to be no data 
indicating that devaluation of a victim im- 
pairs subsequent helping. In fact, the only 
available evidence in this regard seems to 
oppose an alternativistic model of derogation 
and compensation as modes of equity restora- 
tion, Lerner and Matthews (1967) found, 
contrary to their expectation, that a condition 
that led to the highest derogation also led 
to more helping than a condition that pro- 
duced no victim derogation. Helping took the 
form of choosing to wait with the “victim” 
who had expressed a desire to have someone 
present with her while she awaited further 
electric shock. The authors attempted to ex- 
plain the apparent benevolence of subjects 
who had already devalued the victim by 
suggesting that “some subjects may have been 
merely curious and others may have been 
hoping to derive some sadistic pleasure from 
the sight of the victim’s suffering” (Lerner 
& Matthews, 1967, p. 324). Examination of 
the reasons given by the majority of Lerner 
and Matthews’ subjects, however, indicated 
that a desire to help the victim was the 
explanation usually given for their choice. 

It is the contention of the present paper 
that the available data give little reason to 
conclude that derogation and helping are 
alternate and incompatible ways of respond- 
ing to a suffering other. The Lerner and 
Matthews findings suggest that derogation of 
victim, even when it does occur, is a rela- 
ely weak device for restoring equity and 
that were a subsequent opportunity for com- 
pensation to arise, it would be utilized re- 
gardless of previous personality evaluations 
made about the victim. An ‘experiment was 
conducted to test this hypothesis and to rule 
out the possible confounding mediators of 
curiosity and sadism suggested by Lerner and 
Matthews for their results. 


METHOD 


Subjects. The subjects were 136 female volunteers 
from introductory psychology sections at Arizona 


State University. The subjects participated as part 
of a course requirement and were run in groups of 
five to eight over a 2-week period. 

Procedure. The experiment was described on the 
sign-up sheets as a study in “the validation of a 
behavior scale.” From information presented there, 
the subjects expected to watch and rate persons 
“under varying conditions of emotional arousal.” 

Subjects were ushered into the experimental room 
by two male experimenters. They were first asked 
to rate “the average female college student” on the 
same set of bipolar adjectives typically used by 
Lerner (Lerner, 1971). Following this, subjects were 
given written instructions concerning the necessity of 
gaining knowledge about emotional reactions people 
have in various situations (based on Lerner & 
Simmons, 1966). The experimenter then explained 
as follows: 


Today we are taking advantage of a human learn- 
ing study which we are conducting to observe 
someone performing in an emotion-arousing situ- 
ation. You will be watching another subject per- 
forming on a learning task; this is going on right 
now down the hall, and we'll be watching it over 
closed-circuit TV. Your job will be to observe 
closely the emotional state of the worker and 
watch for cues which indicate her state of arousal. 
The subject will be working on a rather difficult 
serial learning task. In fact, she was somewhat 
reluctant to participate perhaps because she was 
not too comfortable about being watched over 
closed-circuit TV, among other things. However, 
when we explained that there would be other sub- 
jects who would not be able to get credit unless 
she did it, she finally agreed to do it.t 


Control subjects then observed a videotape in 
which a confederate was attempting to complete a 
“serial learning task” during which a buzzer was 
sounded for each “incorrect response.” Experimental 
subjects saw the same confederate make exactly the 
same mistakes in the same order. However, in this 
condition, she also received what appeared to be an 
extremely painful shock along with the buzzer for 
each mistake. At the completion of this segment the 
confederate was told that the first phase of the 
experiment was completed and that they would 
return in 10 minutes to begin the second half. 

All subjects then filled out an “emotional cues” 
scale, which included a question asking, “How stress- 
ful was the situation this person was exposed to?” 

Following this, half of the subjects (Order RH, or 
rate-then-help) were asked to rate the confederate 
on the bipolar adjective scales. Having done this they 


1This set of instructions was designed to elicit 
high feelings of complicity in subjects and was in 
some ways similar to Lerner and Simmons’s (1966) 
“martyr” condition. It was thus hoped to maximize 
conditions that had previously been shown to pro- 
duce high levels of derogation. 
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were then told the following by the experimenter: 


The subject you are observing obyiously has to 
undergo more effort in her part of the experiment 
than the average subject. We would like to be 
able to compensate her for her effort. In order to 
do this, we need to determine how much she 
should receive, and one good way to do this is to 
get an average decision from subjects who watch 
her about how much they think she should receive. 


Subjects were then handed a slip of paper con- 
taining instructions asking them to indicate (on a 
10-unit scale, ranging from 50¢ to $5) “how much 
compensation you think she should receive for her 
participation.” Subjects were informed that their 
responses would be averaged to determine the sum 
she received. 

For the other half of the subjects (Order HR, or 
help-then-rate) the order of compensating and rating 
the girl was reversed. 


RESULTS 


The subjects were run in groups with each 
group subjected to only one treatment, so all 
data were analyzed using the group rather 
than the individual subject as the unit of 
analysis. This set the number per cell at five. 
The design was a 2 X 2 factorial with the 
factors of Victim Suffering (Suffering vs. No 
Suffering) and Presentation Order of Modes 
for Injustice Reduction (RH ys. HR). 

Manipulation check. In response to the 
question concerning perceived stress of the 
situation to which the confederate was ex- 
posed, an analysis of variance indicated a 
clear main effect for the suffer/no-suffer 
variable in the direction predicted, F(1, 16) 
= 54.24, p < .0001. There was neither a sig- 

nificant effect for order nor any significant 
interaction. 

Victim derogation. These data were gener- 

ated by calculating the difference between 
each subject’s rating of the “average female 


TABLE 1 
VS FOR DEROGATION AND COMPENSATION: 


Suffer No suffer 
RH HR RH HR 
Derogat —8.79 —1.14 4.98 3.12 
Compensation S 
(in dollars)? 451 4.53 3.31 2.70 


Note. RH and HR 
(i.e., the tation or 
a MS. 37. 


2-then-help and help-then-rate 
des for injustice reduction). 
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college student” and her rating of t- con- 
federate. The mean values are disple- 4 in 
Table 1, with more negative scores inc’. «ting 
more derogation. 

Analysis of variance revealed a sut ing/ 
no-suffering main effect, F(1,16)=7. |, p 
< .02. Neither the main effect of orde nor 


the interaction was significant. 

Compensation. Means of the four © di- 
tions are reported in Table 1, Analys’ of 
variance revealed that the suffering condi ion 
led to significantly greater compensation ‘in 
the no-suffering condition, F(1,16) = 11! :/9, 
p < .0001. For the order effect, RH subj. ts 
tended to compensate somewhat more t-n 
HR subjects, although this effect fell shor: of 
significance, F(1,16) = 4.09, p< .06. Thue 
interaction was also significant, F(1,16) = 
4.92, p <.04, 

The important finding is that the subjecis 
who saw suffering compensated more than 
no-suffering controls, even when they had 
derogated before they were given the oppor- 
tunity to compensate [4.51 vs. 331, ¢(8) 
6.00, p < .001]. In fact, the subjects in the 
two suffering conditions compensated to vir- 
tually the same degree [4.51 vs. 4.53, £(8) 
<1, ns.] regardless of whether they had 
already evaluated the sufferer. 

An examination of the relationship between 
derogation and helping within the two high- 
suffering cells further elucidates the relative 
mutual impact of these two variables, When 
rating occurred first, there was a minimal 
correlation between evaluation of the victir 
and subsequent helping (r= .02). In this 
cell, then, where significant derogation did 
occur, ratings of the victim accounted tor 
less than 1% of the variance in subsequent 
helping. On the other hand, when compensa- 
tion occurred first, there was a much stronger 
positive correlation between the two variables 
(r = 81), accounting for 65% of the vari- 
ance. Thus, derogation by our subjects kad 
essentially no effect upon subsequent help- 
ing, whereas prior helping indeed served to 
attenuate devaluation. 


Discussion 


The results of this study would seem to 
challenge the notion that derogation of a vic- 


tim will serve to restore equity and abolish 
he likelihood of subsequent helping. Subjects 

10 were given an opportunity to devalue a 

tim’s personality still availed themselves 

{ an opportunity to compensate her. These 
subjects helped as much as those who had 
not yet rated the victim’s personality and, 
more importantly, did so to a significantly 
greater degree than a control group who did 
not observe suffering. Since the helping re- 
sponse in the present experiment did not in- 
volve subsequent contact with the victim, 
neither “sadism” nor “curiosity” is likely to 
have mediated our results. 

With respect to the derogation scores, our 
data are consistent with the results of previ- 
ous studies that have used similar procedures 
(Cialdini et al., 1976; Lerner & Simmons, 
1966): The perception of suffering led to 
devaluation of the sufferer. In addition, other 
studies (Lerner & Simmons, 1966; Mills & 
Egger, 1972) have found a tendency for 
compensation to reduce subsequent deroga- 
tion of the sufferer. Though our results did 
not show strong support for this finding, the 
trend of the data [—8.79 vs. —1.14, t(8) = 
1.63, p < .15] was consistent with the form 
of the earlier results. 

The present findings lead to a radically 
different interpretation of the meaning of the 
victim derogation phenomenon. While the 
cognitive process of devaluating a suffering 
ther is indeed interesting, such attitudinal 
negativity does not seem to have the pro- 
found behavioral implications assumed by 
previous authors. The traditional hydraulic 
model suggests that an observer of another’s 
suffering, when he has no immediate avenues 
for help, will tend to derogate the victim; 
once equity has been thus restored, he will 
be reluctant to provide aid were the oppor- 
tunity to arise in the future. The model. 
supported in this paper, however, implies 
that even when victim derogation does 
occur, subsequent helping activity will not be 
suppressed. 

The asymmetrical relationship between 
derogation and helping might be less aesthet- 


JUSTIFICATION AND COMPENSATION 


657 


ically pleasing than the prior notion that the 
two were substitute processes, but the present 
data lead to somewhat more consoling conclu- 
sions for innocent victims. While compensa- 
tion of the innocent victim may indeed serve 
to inhibit the tendency to derogate him, the 
converse does not seem to be true. Another’s 
suffering, then, would seem to induce a tend- 
ency to assist that other, independent of any 
prior deprecation of his personal character- 
istics. Contrary to earlier theorizing (Lerner, 
1974; Walster, Berscheid, & Walster, 1973), 
it appears that victim derogation does not, in 
itself, function as an equity restoration device. 
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of Jung’s Theory 
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Jung’s hypothesis that dreams have a compensatory function is clearly stated 
and well-known, yet has not been empirically tested. Three dream reports for 
each of 62 students were rated on 15 personality dimensions and compared 
with scores on the same dimensions as measured by the Edwards Personality 
Preference Schedule and the Adjective Check List (ACL). Six EPPS scales 
and 10 ACL scales correlated significantly with their appropriate dream rating 
scales. However, all significant correlations were positive, a finding that does 


not support Jung but is more in line with the theories of Adler, Fromm, and 
Hall, who regarded dream content as not substantially different from conscious 


functioning. 


Dreams have been of continuing interest to 
psychological researchers since Freud’s (1900/ 
1955) epochal insights, but it was the now 
classical work of Aserinsky and Kleitman 
(1955) that gave impetus to empirical studies 
of dreams. Despite the proliferation of such 
studies, relatively little work is available that 
relates empirical, as opposed to clinical, find- 
ings to accepted theoretical frameworks. Such 
a lack is particularly evident in relation to 
the extensive theoretical writings of Jung on 
dreams (Jung, 1933, 1961; Read, Fordham, 
Adler, & McGuire, 1960). Despite the rich- 
ness of Jungian theory and its applicability to 
a wide range of psychological phenomena, 
there have been very few empirical studies 
addressed to its validity. 

One of the major Jungian themes refers to 
the concept of compensation (Friedman & 
Goldstein, 1964; Hall & Lindzey, 1970), the 
notion of a dynamic equilibrium between 
contrasting elements, so that what is conscious 
is compensated by what is unconscious. Spe- 
cifically, in regard to dreams, Hall and Lind- 
zey stated this Jungian hypothesis clearly and 

succinctly: “Dreams are also compensatory so 
that the dreams of a predominantly extra- 
verted person will have an introverted quality, 
and conversely, the dreams of an introvert 
will tend to be extraverted” (p. 90). This 
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statement is a veridical echo of Jung's 
statement (Read et al., 1960) and agrees 
the writings of other Jungian interpre’ rs 
(e.g., Fordham, 1953; Jacobi, 1959). Suci 
easily verifiable hypothesis has not, to y 
knowledge, been investigated empirically É. 
though some indirectly relevant studies « in 
be found in the literature. 

Sarason (1944) administered Thematic ^ o- 
perception Test (TAT) cards and collec: -d 
dream reports for 25 institutionalized mcn- 
tally retarded girls over an unspecified period 
of time. Although Sarason presented no quan- 
titative data, the three illustrative cases he 
reported showed a substantial degree of simi- 
larity between the themes expressed in the 4 
TAT stories and in the dream reports. 

Gordon (1953) similarly compared T+T 
stories and dream reports for 29 psychia‘-ic 
patients. Although the data collection leaves 
something to be desired, since the dream 
reports for some patients were culled from 
therapist files and for others were obtained 
as verbal reports, Gordon was able to apply 
a need-press system of TAT scoring to both 


the TAT stories and the dream reports. A 4 


correlational analysis for 42 dimensions indi- 
cated 11 significant and positive coefficients. 
Despite this finding, Gordon (1953, p. 245) 
suggested that, at least for certain dimensions, 
a compensatory effect takes place and set 
forth the hypothesis that “one view of the self 
is expressed in dreams,” while “an attempt 
will be made to conceal this view, or to com- 
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vensate for it by emphasizing diametrically 
oposed content” on the TAT. Despite Gor- 
vts explicit use of the word compensate 
| his inclusion of Jung in his references, no 
ention was made of Jungian theory or of 
he compensatory function. 
® In 1963, Rychlak and Brams compared the 
| presence or absence of four major themes in 
| the dream reports of 41 college students with 
their scores on the Minnesota Multiphasic 
Personality Inventory (MMPI) and the Ed- 
wards Personal Preference Schedule (EPPS). 
Students whose dream reports contained many 
affiliative themes tended to be socially re- 
„sponsible, orderly, organized, and somewhat 
introverted. Those whose dream reports con- 
tained fewer affiliative themes scored higher 
on the MMPI Hypomania, Schizophrenia, 
Dominance, and Validity scales and on the 
| EPPS n Dominance scale, suggesting over- 
f ictivity, dominance, and coolness or with- 
| lrawal from interpersonal contacts. Students 
f 


„whose dream reports contained many reward 
| ‘themes dealing with achievement, status, and 
1 recognition were seen as dominating individu- 
Í ils desirous of attaining a position of leader- 
| bip and the attention of others. Students 
whose dream reports reflected anxiety, frus- 
tration, or hostility as a major theme scored 
higher on the MMPI Hypochondriasis scale 
and on the EPPS n Change scale. With some 
vinor exceptions, the results of this study do 
not support the compensatory hypothesis. 
Robbins (1966) compared ratings of dream 
sociations with EPPS scores for a group of 
32 students. For 11 dimensions common to 
both the dream associations and the EPPS, 
three (n Autonomy, n Sex, and n Aggression) 
ed significant and positive coefficients. 

in a rapid eye movement type of experi- 
ment, Brender and Kramer (1967) admin- 

istered six TAT cards to 13 volunteers for a 

duration of one night. The obtained 78 TAT 

stories and 34 dream episodes were rated on 
| 20 need dimensions, and the scores were 


as 


Y iel 


correlated. Significant correlations were found 
for five dimensions: n deference, sentience, 
affiliation, play, and dominance. Only the 
| coefficient for dominance was negative and 
was interpreted as reflecting the tendency for 
individual subjects to express dominance in 
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only one of the two media rather than a 
compensatory effect per se. 

Finally, in a direct test of Jungian hypoth- 
eses, Palmiere (1972) administered the My- 
ers-Briggs Type Indicator to 114 undergrad- 
uate students and selected for further study 
the top and bottom 25 students on the Intro- 
version-Extraversion scale. To these students 
she administered six TAT cards, each ac- 
companied by a short, forced-choice question- 
naire on personality aspects of the TAT 
characters, alternate story lines, and different 
courses of action the characters could take. 
The responses were scored on an a priori 
basis, although no evidence as to the reliabil- 
ity and validity of the questionnaire was 
presented. Palmiere hypothesized that the 
fantasies of extraverts would exhibit more 
repressed introversion, while the fantasies of 
introverts would exhibit predominantly more 
repressed extraversion. This hypothesis was 
not supported; on the contrary, introverted 
students chose significantly more introverted 
than extraverted responses. Thus, the avail- 
able pertinent literature, while not directly 
addressed to the hypothesis of compensation, 
does not support it. 

The present study represents an initial 
attempt to gather direct evidence on the va- 
lidity of Jung’s concept of compensation as 
applied to dreams by means of relating the 
same personality dimensions as measured by 
psychometric instruments and as measured 
by ratings of dream protocols. 


METHOD 
Subjects 


The subjects were 73 college juniors and seniors 
participating in an honors program at a large state 
university. All were white males, ranging in age 
from 18 to 25 and representing 26 different liberal 
arts majors. As part of their program, students were 
requested to keep a dream diary for a period of 2 
weeks. Subsequently, they were administered a bat- 
tery of psychological tests that included the EPPS 
(Edwards, 1959) and the ACL (Gough & Heilbrun, 
1965). In order to protect the students’ anonymity, 
code numbers were used in the collection of test pro- 
tocols. 

The 73 students produced a total of 626 dream 
episodes, with a range of from 0 to 21 dreams per 
subject. Since 62 of the 73 students reported a mini- 
mum of 3 dream episodes each, 3 dream episodes per 
student were randomly selected, and in order to 
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control for differences in dream productivity and 
recall and to avoid computational and design manip- 
ulations associated with ratios and percentages, these 
186 dream episodes were used as the data for 
analyses. 


Instruments 


Because the EPPS and the ACL are well-known 
instruments, a description here is superfluous. Al- 
though they were chosen, in part, because they both 
contain the same 15 need scales, the amount of over- 
lap is minimal. Gough and Heilbrun (1965) reported 
10 of the 15 ACL need scales correlating at or be- 
yond the .01 level with their EPPS counterparts but 
pointed out that in no case were the coefficients 
substantially high. The scales for both instruments 
are labeled need scales, but the extensive validity 
information that has accumulated for both instru- 
ments supports the notion that quite apart from 
their need aspects, they represent meaningful person- 
ality dimensions related to real-life behavior. 


Dream Ratings 


Since a pilot project with psychology graduate 
students had indicated that interrater reliability of 
dream report ratings could be increased substantially 
through prior discussion and practice, an additional 
20 randomly selected dream episodes were used as 
training material for the five judges who eventually 
rated each of the 186 dream episodes on the 15 
dimensions. These five judges were all psychologists 
who had attained 1-9 years of postdoctoral experi- 
ence, 

The 186 dream episodes were individually coded, 
typed, and ordered in random sequence. Then they 
were independently rated on each of the 15 dimen- 


TABLE 1 


RESULTS OF CORRELATIONAL ANALYSIS FOR 15 
Dimensions AS MEASURED BY THE EPPS, THE 


ACL, AND Dream RATINGS 
Correlation Median rsx 

c of dream 

EPPS- ACL- EPPS- reports 

dreams dreams ACL for the 

Dimension 5 raters 
Achievement 3a giek 20% 76 
Deference ae 27e aes 55 
Order —18 29% 33% 61 
Exhibition 07 —09 51% 68 
Autonomy —14 06 38** 66 
Affiliation 36**  —16 26* 81 
Intraception 06 -M 12 53 
Succorance 24 33 gje 57 
Dominance 4i 36 48 72 
Abasement —12 08 35** 63 
Nurturance 15 26*  49*+ 57 
Change 26* 39** 05 73 
Endurance 13 27* = 12 64 
Heterosexuality 31* 4ge*  27* 68 
Aggression 06 37** 18 75 


Note. Decimals and positive signs are omitted. There are a 
total of 186 dream reports. N = 62 students. EPPS = Ed- 
wards Personal Preference Schedule; ACL = Adjective Check 
List. 

*p <.05. 

*p <.01. 
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4 
sions by the five raters and randomized with r -pect 
to subject, order of rating, and dimension rated, «sing 
the following 6-point scale: O (not enough am 
material to rate), 1 (very little of this ‘ced 
present), 2 (below average), 3 (an average am: nt), 
4 (above average), 5 (very strong). 

Each of the 15 dimensions was defined fc the 
rater according to the descriptions given by Ed: «rds 
(1959, p. 11). These descriptions were availa to 
the raters when engaged in this task. For e of 
the 62 students, a dream rating score on eac of 
the 15 dimensions was obtained by adding the r gs 
of all five judges over the three dream epis >s. 
Thus scores could vary from 0 to 75 (a 5 assi od 
by all five judges on all three dreams). Since s en 
dream episodes belonging to 5 subjects reco od 


scores of O (all five judges agreeing that there ıs 
not enough dream material to rate), seven i- 
tional dream episodes for these 5 subjects werc 
lected from the available pool and used as rep 
ments. 

For each of the 62 students, scores on tl 
need dimensions as reflected by the EPPS, the > 1, 
and the dream ratings were correlated. 


RESULTS 


The results of the correlational analysis are 
presented in Table 1. 

The first two columns of Table 1 presen 
the correlation coefficients on the 15 dimen 
sions between EPPS scores and dream rating 
and between ACL scores and dream ratings. 
Of the EPPS scales, the following six corre 
late significantly and positively with th 
dream ratings: Achievement, Deference, Af- 
filiation, Dominance, Change, and Heterosexu- 
ality. Of the ACL scales, 10 similarly corre- 
late significantly and positively with the 
dream ratings: Achievement, Deference, C - 
der, Succorance, Dominance, Nurturan e, 
Change, Endurance, Heterosexuality, and ° g- 
gression, There is an overlap of five sc’ ies 


that correlate significantly with dream vat- 
ings on both the ACL and the EPPS. 

The third column of Table 1 presenis the 
correlations between the EPPS and the ACL; 


11 of the scales correlate significantly, with 
coefficients ranging from +.26 to +.51. 

Since a major concern underlying ratings 
is their reliability, the fourth column of ‘Table 
1 presents the median reliability coefficients 


for the five raters. These coefficients range 


from +.53 to +.81, with four coefficients in 
the .50s, six in the .60s, four in the .70s, and 
one in the .80s. 
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n 
Samy DISCUSSION 
The results clearly do not support the hy- 
thesis of dreams as compensatory aspects 
o psychic functioning. The substantial signifi- 
int findings are all in the positive direction 
; and reflect a communality of themes in both 
re 


+ the dream reports and the results of person- 
ality inventories. Indeed, the results are more 
in line with the position that what occurs in 
dreams is not substantially different from 
what occurs in conscious thought, a position 
that is consonant with the dream theories of 
Adler (1958; Ullman, 1962), Fromm (1957), 
and Hall (1959). That there is a relationship 

, between personality aspects and the content 
and structure of dream reports is a well-estab- 
lished fact, albeit the specifics are a matter of 
controversy. In the present study five dimen- 
sions show significant, if moderate, relation as 
expressed in the EPPS, the ACL, and dream 
reports: achievement, deference, dominance, 
change, and heterosexuality. It is only for the 

„change dimension that scores on the EPPS 
and ACL do not correlate significantly with 
each other, 

For the correlational pattern between the 
EPPS and the ACL, one can echo the words 
of Gough and Heilbrun (1965) that care 
should be exercised in equating the two tests. 
While 11 of the scales correlate significantly 
with their counterparts, the coefficients are 
iot substantially high. Of these 11 scales, 9 

lso correlated significantly in the study cited 
by Gough and Heilbrun. 

As noted earlier, the median reliability co- 
efficients for the five raters are respectable 

though not impressive, particularly since the 

raters were professionally experienced and had 
iadergone some preliminary training on dream 
ratings. Much of the literature cited in the 
introduction seems to have neglected this 
aspect, and few studies have given evidence 
of the reliability of their ratings. Robbins 

(1966) used two analysts to rate 60 dream 

protocols on 12 need areas and reported an 

average phi coefficient of +.63. Bender and 

Kramer (1967) reported average reliabilities 

of +.52 and +.53 across 20 needs as scored 

in the TAT and in dream protocols, with 
only 5 of the 20 needs scored with above- 
chance reliability in both TAT and dream 
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reports. Thus the present correlation coeffi- 
cients are in line with what has been reported 
in the literature. 

To determine whether the degree of reliabil- 
ity influenced the obtained results, a chi- 
square analysis was carried out by tallying 
significant versus nonsignificant dimensions 
as related to greater or lesser reliability (de- 
fined as above or below the median of the 
reliability coefficients), The obtained values, 
corrected for continuity, of .083 for the EPPS 
dimensions and .125 for the ACL are not 
significant, although the small frequencies in 
each cell may make such an analysis aca- 
demic. 

As is the case with the writings of other 
brilliant theorists, Jung’s contributions are 
often nebulous and not easily translatable into 
operational definitions. Yet the notion of 
compensatory aspects of dreams seems to be 
particularly clear and concrete. Judging from 
past endeavors to subject theoretical concepts 
to experimental verification (cf. MacKinnon 
& Dukes’ chapter on repression cited in Post- 
man, 1962), whether the current study is an 
adequate test of Jung’s hypothesis is cer- 
tainly not a closed issue. 

The current results do not support the no- 
tion of compensatory aspects of dreams, but 
further study with different dimensions and 
different populations is warranted. The dimen- 
sion of introversion—extraversion is a particu- 
larly obvious one, and work on this area is in 
progress. A possibility to consider is that the 
hypothesis of compensatory aspects of dreams 
may have little validity when applied to the 
dreams of normal and well-functioning indi- 
viduals but may be quite relevant when ap- 
plied to therapy patients or extreme groups. 
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In a pair of studies, subjects were found to shift their positions on an issue 
while they were expecting to engage an opponent in a discussion of that issue. 
As predicted, it was possible to influence the size and direction of these antic- 
ipatory shifts by manipulating the personal relevance of the discussion topic 
and the timing of the discussion onset. It was also possible to nullify the shifts 
by canceling the expectation of discussion. The results were taken to support 
a general formulation of anticipatory shifts as strategic responses to immediate 
situational pressures rather than genuine changes in attitude, Additionally, it 
was found that the durability of anticipatory change was associated with the 
tendency to engage in cognitive activity supportive of the change. The possi- 
bility is discussed that most “attitude change” studies have not involved atti- 
tude shifts but rather the “clastic shifts” obtained in the present experiments. 


In a study by Cialdini, Levy, Herman, and 
Evenbeck (1973), the expectation of discus- 
sion with a peer on a topic was found to pro- 
juice moderation of position (i.e., movement 
ward the middle of the opinion scale) on 
that topic prior to the discussion. These an- 
ticipatory moderation shifts were seen to be 
i to those typically found in tradi- 
forewarming studies (eg., Cooper & 
3, 1970; McGuire & Millman, 1965; Mc- 
vuire & Papageorgis, 1962). It was further 
lemonstrated that when the discussion expec- 
lation was canceled (i.e., when participants 
ere told that they were controls and thus 
would not have to engage in a discussion 
‘ter all), subjects “snapped back” to their 
initial, premoderation positions on the topic. 
Both the moderation and snap-back aspects 
of the Cialdini et al. (1973) data have been 
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replicated by Hass in studies employing a 
standard forewarning paradigm (Hass, 1975; 
Hass & Mann, 1976). 

Cialdini et al. developed a general formu- 
lation of anticipatory shifts to account for 
these and a variety of previous findings. They 
contended that position changes that occur on 
an issue in anticipation of some kind of inter- 
action with that issue may be seen as stra- 
tegic maneuvers designed to maximize the 
rewards of the impending situation. Hass 
(1975) and Hass and Mann (1976) have 
theorized similarly concerning the tactical 
character of anticipatory opinion changes. 
Such shifts are viewed, in contrast to prior 
work, as quite different from what has tradi- 
tionally been known as attitude change and 
are conceived as having, by themselves, little 
or no effect on true attitude. They are said 
to occur primarily in the interest of situa- 
tional utility and are characterized by their 
elasticity. Much like an elastic band, one’s 
position on an issue may be stretched and 
distorted under situational pressures only to 
snap back to its original form as soon as the 
pressures are released. 

In keeping with the basic assumption of 
their formulation that anticipatory shifts are 
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tactics designed to maximize situational out- delay before the onset of discussi: such * 
comes, Cialdini et al. (1973) maintained that shifts would be less likely to occur, sinc »old- 
position moderation on a topic is not the ing for an extended time an issue tion 
necessary response to an anticipated discus- that is different from one’s true ude 
sion with another person. They argued that might involve considerable cognitive ‘fort 
moderation would occur only when the re- and serves no immediate, situational pi. ose. 
wards of the immediate context favored a Here, a discussant might be expected re- 


moderate issue position. They speculated 
that a person expecting to discuss a topic with 
another would become more moderate on the 
topic prior to the interaction only when the 
topic was of small personal relevance. In such 
a case, an individual’s prime concern in the 
upcoming discussion would not likely be a 
strong presentation of his or her own position 
on the topic. Rather, when the issues at hand 
are of little personal import, a discussant’s 
personal position on the topic becomes less 
salient than the presentation of a defensible 
and/or admirable face. Thus, the most val- 
ued stance in such an instance is not neces- 
sarily the most accurate one but the one that 
allows the discussant to be in a relatively 
invulnerable position and to appear open and 
broad-minded. A moderate issue position is 
seen by Cialdini et al. as an especially fa- 
vored one in these regards. When anticipating 
a discussion on a personally important topic, 
however, a person’s concern for appearances 
should be dwarfed by outcomes connected 
with the topic itself. That is, one would not 
wish to weaken a personal position on an 
issue of strong self-relevance. Cialdini et al. 
(1973) hypothesized that with a highly im- 
portant issue one might well polarize in an- 
ticipation of a discussion on that issue. It 
was one purpose of the present study to test 
the above aspect of the Cialdini et al. argu- 
ment by examining the effects of topic im- 
portance on anticipatory shifts. 

Another way to test the Cialdini et al. con- 
ception of anticipatory moderation shifts as 
operating only to enhance the achievement of 
situational ends would be to arrange a situa- 
tion in which such ends could not be immedi- 
ately attained through a change in issue posi- 
tion, That is, if a person were anticipating 
immediate discussion, tactical position shifts 
would be expected to take place at once so 
as to maximize the receipt of positive out- 
comes in the imminent discussion situation. 
However, if a person were anticipating a long 


frain from anticipatory position change ntil 
a time shortly before the discussion ons 
According to the above analysis, the: he 


Cialdini et al. (1973) model would pi ict 


that the expectation of discussion would o- 
duce moderation of issue position only © +n 
the onset of discussion was to be imm e 
and the topic were personally unimportai to 
the discussant. One final prediction can ve 


derived from the Cialdini et al. formulati 11: 
If anticipatory shifts in the face of discus n 
are truly instrumental in nature, functio 
only to achieve immediate ends and hav 
by themselves little lasting effect on genu 
attitude, then by cancellation of the disc 
sion expectation it should be possible to ma 
the shifts disappear; that is, the issue posi 
tions should snap back to their original forn 
as occurred in Cialdini et al. 

In order to investigate the effects of dif 
ferential outcome contingencies on the op- 
eration of anticipatory position shifts, an 
experiment was conducted employing three in- 
dependent variables—issue to be discussed, 
personal relevance of that issue, and timing of 
the anticipated discussion. The design was a 
2 X 2 2 factorial with four control condi- 
tions. 


DIRIA 


EXPERIMENT 1 
Method 


Subjects. The subjects were 81 students at a g 3 
parochial high school located in the Bronx, > w 
York. The subjects constituted all the member: of 
the school’s senior class who were present on ‘ive 
day of the experiment. 

Procedure. Through the cooperation of school 
officials, a “senior class assembly” was called on ti 
morning of a regular school day. At the ass 
students were addressed by a male experimenter w 
stated that he and his research team were from Co- 
lumbia University and were part of a “national st udy 
designed to find out what American high school stu- 
dents think about a variety of. issues.” He then 
stressed the need for honesty in response to the 
questions to be asked of the subjects and assured 
them of the confidentiality of their responses. At 
this point, he distributed to each subject a three- 


3 


ELASTIC SHIFTS OF ATTITUDE 


Pace booklet that presented the experimental manip- 


é 


Bers aeo 


greom 


_. 


ulations. The first page contained short descriptions 
of the two issues used in the experiment and asked 
subjects to make a dichotomous for-or-against re- 
sponse to each of the following issues: 


Do you think a girl should wait at least a year 
after high school graduation before she marries, 


even if she thinks she knows who she wants to 
marry? 
sac ao 
Would you favor shortening the number of years 
of medical training for doctors, thereby permitting 
the tr of more doctors and lengthening each 
docti ictive practice? 
Yes. d No. 
In addition, below each issue appeared a 5-point 
scale on which subjects were asked to answer “How 
important do you feel this issue is for you?” The 


scales’ cateyories were labeled “very important,” 
“pretty important,” “Somewhat important,” “slightly 
important,” and “not at all important.” 

The second page of the booklet described the 
study as primarily interested in the attitudes of 
high-schoo! siudents on certain issues and the dis- 
cus that result from these issues. All subjects 
informed that they would be assigned to dis- 
the issues from the first booklet page 
with a classmate whose position was opposite to 
their own. Half of the subjects were then instructed, 
via a checkmark on their booklets, to expect to 
discuss only the deferred marriage issue; the other 
subjects were instructed in similar fashion to expect 
to discuss only the shortened medical training issue. 
Finally, the booklet’s second page informed half of 
the subjects that the discussions would take place 
immediately after the booklets were collected; the 
other subjects were informed that the discussions 
would occur when the experimenters returned in a 
week, 

On the last page, subjects were told that before 
the experiment began it would be necessary for 
them to give a more specific indication of opinion 
on the two issues than they had provided on the 
initial booklet page. This was done by marking a 
series of four 7-point semantic-differential scales for 
each topic. 

After all of the booklets had been completed and 
collected, the large body of subjects was divided 
into two smaller groups and taken to separate 
rooms. When each group was settled in its room, an 
experimenter entered and told the subjects that they 
had been assigned to a control group and thus 
would not have to participate in the discussion 
sessions after all. Specifically, he said; 


You people in this room have been randomly 
chosen as control subjects. That means that you 
will not have to perform in the discussion that 
was described in your booklets. Instead of dis- 
cussing an issue, all you have to do is respond on 
a questionnaire to some statements that usually 
come up during a discussion of the issues de- 
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scribed in your booklets. Respond to each state- 
ment by indicating the degree to which you agree 
or disagree with it. [Here the experimenter dis- 
tributed a Likert-type questionnaire consisting of 
eight statements related to the experimental issues. 
Upon completion of the questionnaire, the sub- 
jects were debriefed and dismissed. ] 


Independent variables. The following three inde- 
pendent variables were employed: assigned issue 
(deferred marriage or shortened medical training), 
timing of expected discussion on that issue (immedi- 
ate or 1 week later), and personal relevance of that 
issue to the subject (high or low). Subjects were 
assigned to the levels of the first two of these vari- 
ables at random, However, subjects assigned them- 
selves to the levels of the third variable by virtue of 
their responses to the personal importance question 
on the initial booklet page. A subject who indicated 
that the issue was personally very important or 
pretty important was categorized in the high per- 
sonal relevance condition. A subject who indicated 
that the issue was somewhat, slightly, or not at all 
important was placed in the low personal relevance 
condition. 

Dependent variables. Each subject indicated her 
position on both of the experimental issues on two 
different types of nondichotomous measures during 
the experiment. The first measure was administered 
while she was still expecting to discuss her assigned 
issue. It consisted of a series of four 7-point se- 
mantic-differential scales: harmful-beneficial, wise- 
foolish, bad-good, favorable-unfavorable. The sub- 
jects responded to the second measure after they no 
longer expected the discussion to occur. The measure 
was a Likert opinion index composed of eight state- 
ments; four concerned each experimental issue. Sub- 
jects indicated their level of agreement or disagree- 
ment to each statement on a 7-point scale ranging 
from “strongly agree” to “strongly disagree.” 

Each participant served both as an experimental 
and a control subject in the study, A participant was 
an experimental subject on the issue she was as- 
signed to discuss and was a control subject on the 
issue she was not assigned to discuss. 

Two major dependent measures were used in the 
study. The first was calculated as a difference score 
between a subject’s score on the semantic-differential 
scales and the mean score of the comparable con- 
trol group on those scales. The appropriate control 
group for an experimental subject was composed of 
those subjects who had made a similar initial for or 
against choice (as measured by the booklet’s dichoto- 
mous opinion scale) on the critical issue but did not 
expect to discuss it. The first dependent measure, 
then, represented the amount of position change on 
the assigned issue resulting from the subjects’ dis- 
cussion expectation on that issue. This measure is 
referred to as the anticipation measure. The second 
dependent measure was obtained similarly except it 
was calculated as a difference score between a sub- 
ject’s own score on the Likert questionnaire state- 
ments, administered after subjects were told that 
they would not have to discuss, and the mean score 
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of the comparable control group on those state- 
ments, It represented the amount of position change 
still present after subjects no longer expected to dis- 
cuss the assigned issue. This measure is referred to 
as the canceled anticipation measure. Both change 
scores were calculated such that a moderation shift 
was given a negative sign and a polarization shift 
was given a positive sign. 


Results 


The experimental group means collapsed 
over issue* for the anticipation and canceled 
anticipation measures are presented in Table 1. 
These means represent differences from con- 
trol subjects who, as compared with experi- 
mental subjects, made the same initial for or 
against choice on the critical issue but had 
no discussion expectation regarding it. As 
predicted, only when the discussion was to be 
immediate and it involved an issue of low 
personal relevance did subjects moderate their 
positions in anticipation of discussion, A 
planned comparison testing the low-relevance/ 
immediate-discussion condition against the 
other three conditions on the anticipation 
measure was highly significant, F(1,77) = 
10,24, p < .002. Further, tests of each condi- 
tion’s movement from the control position 
showed significant or marginally significant 
effects in all conditions. That is, a subject's 
expectation of discussion of a personally un- 
important issue produced a more moderate 
position than control subjects when the dis- 
cussion onset was to be immediate, F(1, 77) 
= 3.31, p < .07. However, a subject’s expec- 
tation of discussion produced a more polarized 
position than controls in the other experi- 


TABLE 1 


POLARIZATION AND MODERATION SHIFTS ON THE 
ANTICIPATION AND CANCELED ANTICIPATION 
MEASURES OF EXPERIMENT 1 


Personal relevance 


High Low 
Imme- Imme- 
diate Delayed diate Delayed 
discus- discus- discus- discus- 
Measure sion sion sion sion 
Anticipation® 1.79 1,68 —1.67 1.72 
Canceled 
anticipationb 1,33 322 58 52 
n 20 26 19 16 


Nole, Positive means indicate polarization shifts from the 
control; negative means indicate moderation shifts. 

*MSe = 16.23, 

b MSa = 21,57. 


f 


4 
mental cells: low-relevance/delayed, F(! 11} 
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= 2.89, p <.09; high-relevance/imme: ate, 
F(1,77) = 4.00, p < .05; high-relevanc: /de- 
layed, F(1,77) = 4.54, p < .05. 

An examination of the canceled ani pa- 


tion means provided in Table 1 show as 


expected, that subjects in all conditions t ‘led , 
to snap back to control issue positions iter 
the expectation of discussion on the i ies 


was canceled. None of the canceled anti va- - 
tion measure means was reliably dif nt 
from the control position. However, it ca: be 
seen that one group of subjects, thos in 
the high-relevance/immediate-discussion «ell, 
seemed substantially more resistant t ‘he 
snap-back effect than the other groups. lhis) 
aspect of the data suggests that our mar pu- 
lations may have activated qualitatively dif- 
ferent processes in the anticipatory ufts 
that occurred. 


Discussion 


There are a number of noteworthy as) cts 
of the data of Experiment 1. First, as c- 
dicted, subjects moderated their posit ons 
prior to the discussion of an issue only v en ‘ 
the issue was personally unimportant and he 
discussion onset was to be immediate. I iis 
result lends support to the contention tiat 
the anticipatory moderation shifts of earl >r 
studies. were strategic in nature, occurring n 
the interests of situational utility. That is, 
when the situational contingencies did not 
favor moderation or did not require immedi- = 
ate moderation, no such shifts took place. 

Second, under the conditions that did 10t 
call for moderation shifts, subjects tende to 
polarize in anticipation of interaction oc an — 
issue. This finding is wholly in keeping with 
the outcomes of previous studies (Bro k & 


‘1 Despite our expectation that the marriave issue i 
would have more personal relevance for our s'ibjects 
than the medical issue, it was the case fov both 


issues that approximately 56% of the subjects fell 
into the high-relevance category. The imporiance of 
the medical training issue may have been attribut- 
able to the school’s proximity to a major medical 
center and to the fact that many of the s’ par- 
ents were employed there. In other respects as well, 
the data were similar for the discussion topics; con- 
sequently, we collapsed over the issue factor in all 
analyses. 
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eer Fromkin, 1968; Greenwald, 1969; Jellison & 


` 


tills, 1969) demonstrating that the expecta- 
) of simply stating, in public, one’s own 
ition on a topic causes one to become more 
reme on that topic. One plausible explana- 
on for these polarization shifts is suggested 
by Jellison and Mills (1969), who say of 
their jects that “the commitment to state 
their position publicly could have increased 
icir desire to be certain that their side on 
sue was the correct one” (p. 346). The 
sire for certainty was presumably satisfied 
preliminary polarization shifts. Thus, for 
such subjects the maximization of situational 
outcomes required movement toward a more 
extreme position. 
\ final result of interest concerned the 
ndeney of the anticipatory effects to revert 
vard the control position when the expecta- 
of discussion was canceled. This snap- 
ick effect replicates the findings of the stud- 
ies of Cialdini et al. (1973) and Hass and 
Mann (1976) and again provides support for 
the claim that anticipatory position changes 
are tactical and elastic in character, With the 
cancellation of the discussion expectation, the 
situational pressures producing the prelimi- 
nary shifts were removed, and subjects 
snapped back to their original positions with- 
out residual effects. However, though none of 
the canceled anticipation measure means was 
jgnificantly different from controls, the sub- 
jects in one condition (high-relevance/imme- 
diate-discussion) were decidedly and unex- 
pectedly less susceptible to a snap-back effect 
than were those in the other conditions. 

The tendency of ‘subjects expecting an im- 
mediate discussion on a personally relevant 
ssue to be resistant to a snap-back effect was 
an intriguing one for us. It suggested that 
the manner in which the anticipatory shift 
occurred in such subjects made the initial 
change relatively nonelastic. One possibility is 
that only subjects in the high-relevance/im- 
mediate-discussion cell were motivated to 
undertake cognitive activity in support of 
their position. That is, subjects in each of the 
other cells were expecting a discussion that 
was on a topic of little importance to them, 
that would not take place for a week, or both; 
thus, they may not have been willing to do 
the cognitive work to solidify their position 
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shifts. Greenwald (1968) has proposed a cog- 
nitive response analysis of persuasion sug- 
gesting that persisting persuasion effects are 
a function of one’s cognitive responses to an 
issue or message. That is, genuine shifts to an 
attitude position are said to occur to the ex- 
tent that one engages in cognitive activity 
supportive of that position. It may have been, 
then, that in Experiment 1 a tendency of sub- 
jects in the high-relevance/immediate-discus- 
sion cell to generate and rehearse thoughts 
that supported their side of the issue caused 
their anticipatory polarization shifts to be- 
come nonelastic, real changes in attitude. The 
situational contingencies in the other three 
cells, however, may not have motivated sub- 
jects to do the cognitive work necessary to 
make their position shifts durable; thus, when 
the discussion was canceled and the discus- 
sion-related influences were removed, the 
shifts disappeared as well. 

To test this possibility, a replication and 
extension of Experiment 1 was performed. Ex- 
periment 2 served three major functions. 
First, it provided a test of the above-described 
hypothesis concerning snap-back effects; this 
was done by including as dependent variables 
several measures of cognitive activity taken 
while sub:ects were still expecting discussion. 
Second, it allowed an assessment of the gen- 
erality of the results of Experiment 1 over 
issues and populations; the subjects of Ex- 
periment 2 were college students of both sexes 
rather than high-school girls, and the issues 
employed were different from those of Experi- 
ment 1. Finally, it represented a more meth- 
odologically sound study in that subjects were 
experimentally assigned to levels of the per- 
sonal relevance factor rather than selecting 
themselves for these levels. 


EXPERIMENT 2 
Method 


Subjects. The subjects were 133 Ohio State Uni- 
versity undergraduates who participated in the ex- 
periment as partial fulfillment of the requirements 
of an introductory psychology course. Seniors who. 
had been assigned to a high personal relevance condi- 
tion were omitted from the analyses because, as 
seniors, the issue would not have high relevance for 
them; removal of seniors from the low-relevangey, 
cells produced no differences in the significance lev- 
els of the analyses. Also, subjects who failed to 
respond to key opinion items were eliminated. In_ 
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total, eight subjects were eliminated from the analy- 
ses. 
Procedure. Subjects were run in groups of from 
3 to 11 members and completed a questionnaire 
taken from a randomly ordered stack. The first 
page of the booklet informed the subjects that the 
experimenters were interested in investigating uni- 
versity student reactions to local campus issues. They 
were also told that they would be discussing one of 
the two issues listed with a partner whose position 
on the issue was opposite to their own. Either Issue 
1 or Issue 2 was circled, indicating that this was 
the issue they were to discuss. The first issue con- 
cerned the establishment of required comprehensive 
examinations for seniors at Ohio State University, 
whereas Issue 2 dealt with the division of the Ohio 
State University into separate graduate and under- 
graduate campuses. Issue 1 was circled in half of the 
booklets and Issue 2 in the other half. Subjects were 
told that they would never be involved in a discus- 
sion of the noncircled issue. Half of the booklets 
explained that the discussion would take place within 
a few minutes, and the other half stated that the 
discussion would take place in 1 week. The next page 
contained a short description of Issue 1, the estab- 
lishment of senior comprehensive exams. Subjects 
were also asked to indicate where they stood on the 
issue—for or against. 

In addition, before giving their own opinion on 
the issue, half of the subjects were informed that 
the Faculty Senate had endorsed the proposal calling 
for the establishment of senior comprehensive exams 
and that it would take effect in 1 year. The other 
half of the subjects were told that the plan would 
take effect in 6 years. This page also included a 
7-point scale on which subjects were asked to indi- 
cate the extent to which the issue involved them 
personally. The scale ranged from “no personal in- 
volvement” to “extreme personal involvement.” Page 
3 contained a brief description of Issue 2, identical in 
format to that of Issue 1. Again, subjects were asked 
to indicate their stand on the issue and the extent to 
which the issue involved them personally. 

Next, in a procedure similar to that employed by 
Brock (1967), Greenwald (1968), and Osterhouse 
and Brock (1970), subjects were informed that the 
experimenters were interested in their thoughts on 
Issue 1. The booklets instructed subjects to 


Simply write down the first idea on the topic 
that comes to mind in the first box, the second 
idea that occurs to you in the second box, etc. 
Please put only one idea or thought in a box. 
You might have ideas all on one side of the issue 
or the other, or a mixture of the two. Either case 
is fine. You may IGNORE SPELLING, GRAM- 
MAR, AND PUNCTUATION. You will have 
three minutes to write your ideas. 


Fifteen 7-inch (17.8-cm) horizontal lines, each about 
14 inches (3.8 cm) from the one above, created the 
boxes in which subjects were to write their ideas. 
Within each large rectangular idea box were two 
much smaller square boxes, one on the extreme right- 
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hand side and one on the left. When the 2 «:inutes 
had elapsed, subjects were told to 

indicate in the left hand box by each i you 


recorded, whether your idea was (+) in 
senior comprehensive exams, (—) opr 
senior comprehensive exams, or (0) nc 
favor of nor opposed to senior comp: 
exams. Please place a +, —, or O in e 
blank that you used.? 


When this had been completed, subjects we: ked 
to indicate in the right-hand box by each id: ‘hey $ 
recorded how confident they were in the val of 
the idea, using a 7-point scale on which 1 in. ted 
no confidence and 7 indicated extreme co) ce. 
After all subjects had rated their ideas on i 
an identical thought listing and rating proced vas 
conducted for Issue 2. 

On the last page of the booklet, the subjec cre } 
told that before they were assigned to dis ion 
groups, it would be necessary for them to ead 
more specific indication of opinion on the two ues. | 
This was done by marking a series of four 1) vint 


After all of the booklets had been complet nd 
collected, the group was divided in half. Eac) ub- 
group was then brought to a separate room 4 
discussion assignments could be made. When a 
was settled in its room, an experimenter e 
and told the subjects that they had been ass ed 
to a control group and thus would not hay to | 
participate in a discussion. ; 

At this point, the experimenter distributed a | <- 
ert-type opinion questionnaire consisting of ei at 
statements related to the experimental issues. Si \- 
jects responded to each statement by indicating th ir 
level of agreement or disagreement on an 11-poi: t 
scale. Upon completion of the questionnaire, the 
subjects were dismissed. 

Independent variables. The following three inde- | 
pendent variables were manipulated: assigned issue: a 
time of the expected discussion, and personal rv le~ ~~ 
vance of the issue. The subjects were randomly is- 
signed to one of two issue conditions—senior | m- 
prehensives or separation of campuses, The | ne 
of the expected discussion was manipulated by l .v- 


semantic-differential scales for each topic. | 
| 
t 


ing the subject expect to discuss either within a ‘ew 
minutes of completing the initial booklet or a woek 
later. The personal relevance of the issue to the 
subjects was manipulated by telling them that the 


proposals were to take effect in 1 year (in which 
case they would be personally involved) or in 6 
years (in which case they would not be involved). 

Dependent variables. The anticipation and can- 
celed anticipation measures were constructed in @ 
way identical to that of Experiment 1. Measures of 
cognitive activity occurring while subjects expected 
discussion were constructed from the thoughts that 


2 This procedure, in which subjects rate the valence 
of their own thoughts, was adapted from Cullen as 
presented in Greenwald (1968, p. 157). 
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mr each subject listed and rated on the assigned dis- 
cussion issue. One measure was the number of 
oughts a subject recorded that were either favor- 
‘e toward his or her own position or unfavorable 
yard the opposing position; this was called the 
pportive-thoughts measure. A second measure was 
«© number of thoughts a subject recorded that 
e not supportive thoughts. Finally, a subject’s 
idence in the validity of each recorded thought 
ya red, 
tions. We expected that Experiment 2 would 
chow the same pattern of results as Experiment 1 
n the anticipation and canceled anticipation mea- 
es, The major prediction of this study, however, 
is that the relative lack of a snap-back effect in 
he high-relevance/immediate-discussion cell would be 
related to greater position-supportive cognitive activ- 
ity among subjects in that condition than among 
those in the other conditions. 


7 
tesi 


An examination of the check on the manip- 
vation of the personal-relevance variable 

‘Please indicate the extent to which this 
issue involves you personally”) revealed that 
the manipulation worked for just one of the 
two experimental issues. Only on Issue 1, 
mandatory senior comprehensives, did sub- 
s jects in the high (5.07) and low (2.93) per- 
sonal relevance cells differ on this item (p < 
.001); Issue 2 did not show any such effect 
between high (4.27) and low (3.91) personal 
relevance subjects (p < .54). Because one 
purpose of Experiment 2 was to test the ef- 
fects of a truly experimental manipulation of 
personal relevance, only the data for Issue 1 
were fully analyzed.* 

Table 2 presents the means of the antici- 
pation and canceled anticipation measures of 
Experiment 2. The pattern of those means 
closely replicates that of Experiment 1. As in 
periment 1, only subjects in the low-rele- 
vance/immediate-discussion condition showed 
a mean moderation shift on the anticipation 
measure, F(1, 57) = 3.84, p < .06, and that 
mean was significantly different from the 
combination of the other three, F(1, 57) = 
6.74, p < .02. Again paralleling the results of 
Experiment 1, only subjects in the high-rele- 
vance/immediate-discussion cell were substan- 
tially resistant to a snap-back effect on the 
canceled anticipation measure. A planned com- 
parison of the amount of snap-back in that 
cell as opposed to the other three gave evi- 
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TABLE 2 
POLARIZATION AND MODERATION SHIFTS AND NUMBER 


OF SUPPORTIVE THOUGHTS PER CONDITION IN 
EXPERIMENT 2 


Personal relevance 


Low 


High 
Imme- Imme- 
diate Delayed diate Delayed 
discus- discus- discus- discus- 
Measure sion sion sion sion 
Anticipation® 1,76 1.63 —2.67 86 
Canceled 
anticipation’ 1.48 0.00 =.55 —.58 
Supportive 
thoughts? 3.78 277 2,87 2.27 


n 18 13 15 15 


Positive means for the anticipation and canceled 
tion measures indicate polarization shifts from control; 
neans indicate moderation shifts. 

88. 


dence of the greater resistance, F(1, 57) = 
3.08, p < .05, one-tailed. 

The crucial dependent variables of Experi- 
ment 2 were those measuring cognitive activ- 
ity occurring while subjects still anticipated 
discussion. A planned comparison testing the 
high-relevance/immediate-discussion condition 
against the combination of the other three 
conditions showed a significant multivariate 
effect in the direction of prediction, F(5, 52) 
= 2.33, p < .054. However, only the number 
of supportive thoughts measure showed a re- 
liable univariate effect, F(1, 57) = 6.93, p < 
O11. The other measures of cognitive activ- 
ity did not approach significance. The mean 
number of supportive thoughts per condition 
are displayed in Table 2. Moreover, the mean 
number of supportive thoughts on Issue 1 


3It was possible, however, to perform for Issue 2 
an internal analysis based on the personal relevance 
manipulation check, as had been done for the issues 
of Experiment 1. Although the fit was not perfect, 
the pattern of these data replicated the important 
aspects of Experiment 1. That is, only the low- 
relevance/immediate-discussion cell showed a mod- 
eration tendency on the anticipation measure, and 
only the high-relevance/immediate-discussion cell 
showed a tendency for residual change on the can- 
celed anticipation measure. Neither effect was sig- 
nificant, however. The means for the anticipation 
and canceled anticipation measures, respectively, for 
each condition are as follows; high-relevance/imme- 
diate (2.16 and 1.20); high-relevance/delayed (.32 
and —.44); low-relevance/immediate (—1.06 and 
—.65) 5 low-relevance/delayed (.04 and —.73). 
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recorded by subjects in the high-relevance/ 
immediate-discussion group (3.78) was sig- 
nificantly greater than that of control sub- 
jects (2.95), who did not expect to discuss 
that issue, F(1,79) = 4.86, p < .03; how- 
ever, the combined mean of the other three 
experimental groups (2.64) did not differ 
from controls (F < 1). 


GENERAL Discussion 


It seems clear from the results of Experi- 
ments 1 and 2 that anticipatory positions 
shifts may be either in the direction of mod- 
eration or polarization of initial position. An 
anticipatory moderation change was obtained 
in both experiments when subjects expected 
an immediate discussion on a personally un- 
important topic. Combining the probability 
levels of this change for the two studies via 
the Z transformation procedure suggested by 
Mosteller and Bush (1954) yielded a strongly 
significant effect, Z = 2.57, p < .01. Prelimi- 
nary polarization shifts also occurred in both 
experiments when subjects were expecting to 
discuss a personally involving issue, Again, 
combining probability levels over the two 
experiments resulted in highly reliable effects 
both when subjects expected immediate inter- 
action, Z = 2.38, p < .02, and when they ex- 
pected delayed interaction on the topic, Z = 
2.23, p < .03. Only when subjects were ex- 
pecting a delayed discussion on an issue of 
low personal import were there no reliable 
anticipatory shifts across the two experiments, 
Z= 1.61, p< .15. 

The pattern of these preliminary position 
shifts fits quite nicely with the suggestion 
that such shifts are strategic in character, 
occurring in the service of situational goals. 
As hypothesized, changing the direction and 
salience of situational pressures through the 
manipulation of the importance of the dis- 
cussion issue and the timing of the discussion 
onset produced large differences in the nature 

of shifts occurring in anticipation of the dis- 
cussion. Thus, the anticipation-measure re- 
sults of the present experiments lend support 
to the Cialdini et al. (1973) theorizing con- 
cerning the tactical quality of anticipatory 
position changes. 

A second important aspect of the present 
findings involved the elastic nature of pre- 
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liminary shifts. Cialdini et al. had sho%n that 

position moderation resulting from th» »xpec- i 
tation of discussion disappeared wi > the 

discussion was canceled. Hass and fann — 
(1976) obtained a similar effect in « fore-— 
warning situation when the expectatio of a | 
message was canceled. Both Experin: ts 1 
and 2 replicated these snap-back effec : for — 
moderation shifts and demonstrated thè -uchi f 
effects can occur for anticipatory polari ‘ion t 


changes as well. The elasticity of antici | 
shifts again indicates their strategic ba 
suggests also that they are fundame: ally | 
different from genuine modifications c l 
tude. However, one set of experimenta! | 
ditions made the preliminary position — ‘fts 
relatively nonelastic in both of our st: ies. | 
When subjects expected to engage in a: i 
mediate discussion of a personally rele ant 
issue, there was a tendency for the anti. òa- | 
tory polarization shifts that resulted to be | 
resistant to a snap-back effect when the lis- 
cussion was canceled. When combining 
probability levels of both studies, the : 
differences from control positions on the 
celed anticipation measure for the other | 
experimental conditions never even 
proached significance (no p < .35); howe. er, 
the high-relevance/immediate-discussion con- 
dition did show evidence of residual chanye, 
Z = 1.82, p< .07. Further, Experiment 2 
showed that the stability of the anticipatory 
shifts evidenced by high-relevance /immediate- 
discussion subjects was accompanied by er- 
hanced cognitive activity favorable to th ir 
own side of the issue and unfavorable to ne 
other side (Greenwald, 1968). 


Elastic Shifts and Attitude Change 


In their 1973 article, Cialdini et al. drow a 
distinction between attitude change and the 
elastic shifts obtained in their study, Atti- 
tudes, as traditionally characterized, are seen 
to be relatively durable, consistent tendencies 
to respond evaluatively to some object. over 
contexts and instances, However, the new 
issue positions assumed by subjects in the 
Cialdini et al. study, and in the present ex- 
periments as well, have shown none of the 
stability usually associated with an attitude. 
Instead, they seem momentary, labile, and — 
contingent on the immediate reward context. 
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ELASTIC SHIFTS OF ATTITUDE 


» 
Ti may well be that the concept of elastic 


à 


shifts is relevant to much more of the litera- 
tuve on persuasion than that concerning an- 
tivipatory shifts. That is, much of the ex- 
perimental laboratory literature on attitude 
change may tell us nothing about the manner 
in which attitudes are modified but, rather, 
may inform us only as to when and how 
people shift their positions on relevant issues 

> as to maximize situational outcomes, Evi- 
a uce in this regard can be found in a num- 
F r of sources. For example, Tedeschi, Schlen- 

r, and Bonoma (1971) have argued im- 
pressively that virtually all of the dissonance 
theory rature can be explained in terms of 
a tendency for subjects to manage the im- 

; s that the experimenter forms about 
It is their suggestion that a desire to 

pear consistent in the eyes of an experi- 

uter can account for these results more 
isily than a desire to reduce cognitive dis- 
sonance through attitude change. 

Further, an extensive review by Hovland 
(1959) determined that changes in attitude, 
so often claimed in experimental investiga- 
tions, almost never occur in survey studies 
of such change. A possible reconciliation of 
this seeming discrepancy can be achieved if 
we argue that experimental and survey stud- 
ies of attitude change are examining wholly 
different phenomena. The experimental stud- 
ies are measuring the temporary position 
shifts of their subjects who are reacting to 
immediate situational pressures; the survey 
studies, on the other hand, are tapping the 
nore stable and resistant attitudes of their 
respondents and thus show less influence. 
Finally, various workers in the area of per- 
uasion persistence have commented on the 
inconsistent nature of the literature (e.g., 
Cook & Insko, 1968; Watts & McGuire, 
1964). Some studies have found a complete 
loss of induced “attitude change” over time, 
others have found a partial loss, and still 
others have found no loss. Again, it may be 
poszible to reconcile this apparent discrep- 
ancy by dropping the assumption that each 
of these experiments involved the same phe- 
nomenon—attitude change, Thus, rather than 
suggesting that attitude change persistence 
effects are inconsistent, the literature may 
indicate that distinct kinds of shifts, differ- 


io 
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ing by nature in their durability, were evi- 
denced in the various experiments, In addi- 
tion to the results of their own experiment, 
Cook and Insko pointed out several studies 
of persistence effects that obtained findings 
in keeping with such a possibility and with 
the data of the present article, In each of 
these instances (Cook & Insko, 1968; Green- 
wald, 1968; Newcomb, 1963; Peterson & 
Thurstone, 1933; Watts, 1967; Watts & Mc- 
Guire, 1964), the greatest persistence effects 
were found among those subjects who had 
the greatest opportunity to engage in suppor- 
tive cognitive activity regarding the persua- 
sive message. Thus, it is conceivable that the 
majority of variables and procedures said to 
produce attitude change may have little in- 
fluence on true attitude but, instead, may only 
affect transient position shifts, Perhaps these 
variables and procedures bring about new 
stances that approach the nature of attitudes 
to the extent that they provide the motiva- 
tion and opportunity for cognitive activity 
supportive of the new positions (Greenwald, 
1968). 

We do not wish to imply, however, that 
such a conceptualization of traditional atti- 
tude change research renders it unimportant. 
On the contrary, it is our feeling that the 
great majority of everyday interaction on one 
issue or another involves the strategic shift- 
ing about of one’s position rather than genu- 
ine changes in attitude. Hence, the great part 
of research on “attitude change” is just as 
relevant as ever. However, it is our contention 
that the research may well embody a mis- 
labeling of the phenomenon under investiga- 
tion and that a clearer understanding of the 
phenomenon’s identity will result in a clearer 
understanding of the processes involved in 
its occurrence. 
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This experiment was conducted in an attempt to replicate and delimit the 
reverse placebo effect found by Storms and Nisbett in insomniacs. It was pre- 
dicted that the reverse placebo effect would obtain when the pills’ effects were 
described as affecting arousal (as in Storms and Nisbett’s study) but that a 
direct suggestion effect would result when the pills were described as directly 
affecting sleep onset latency. Results indicated that direct suggestion effects 


obtained regardless of the focus of the instructions. The present study together 
with the findings of Kellogg and Baron call into question the reliability and 


clinical significance of the reverse placebo effect in the treatment of insomnia, 


In a test of the theoretical implications of 

hachter’s (1964) theory of emotions and 
Kelley’s (1967) attribution theory, Storms 
and Nisbett (1970) found that insomniac 
subjects fell asleep faster if they were led to 
misattribute their arousal to the effects of a 
placebo pill taken before bedtime. Thus, when 
subjects were told that their bedtime arousal 
was due to the pill, they fell asleep faster. 
Conversely, when the same placebo pill was 
described as a relaxant, subjects had greater 
difficulty in falling asleep. 

The explanation offered for these somewhat 
unusual results was that on the one hand, if 
subjects were induced to attribute their 
symptoms to an external source (an “arousal” 

ill), they would worry less about them, and 
onsequently, fall asleep faster. On the other 
hand, if the subjects’ arousal remained un- 
fected despite the ingestion of a “relaxa- 
tion” pill, they would infer that their “real” 
arousal level was particularly intense. They 
would, thus, worry more and, consequently, 
take longer to fall asleep. 

Despite the cogency of their analysis (or 


The authors are indebted to Sharon Gurwitz, Janet 
Polivy, Leonard Sushinsky, and Camille Wortman 
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perhaps because of it), Storms and Nisbett 
(1970) were faced with the difficulty of ex- 
plaining the accumulating literature in sup- 
port of the reality of placebo (or suggestion) 
effects (e.g., see review by Shapiro, 1971). 
This literature documents a phenomenon pre- 
cisely the opposite to that demonstrated in 
the Storms and Nisbett (1970) experiment. 
Storms and Nisbett (1970) recognized the in- 
compatibility of their results (and theory) 
with the data on suggestion effects and in an 
effort to resolve the incompatibility developed 
the notion of “symptom familiarity.” Basi- 
cally, they contended that misattribution or 
reverse placebo effects are likely to occur 
when the patient is “quite familiar” (p. 326) 
with the particular symptoms under con- 
sideration. When the patient is familiar with 
the symptoms, he is more likely to recognize 
that the pill is not having the desired effect 
and thus infer that his condition is actually 
worse than he originally thought. If, however, 
the condition is a relatively new or unfamiliar 
one, the patient will lack the internal stan- 
dards of reference necessary to make the com- 
parative judgments that misattribution de- 
mands, In support of their position, Storms 
and Nisbett (1970) interpreted studies by 
Rickels and his colleagues (Rickels, Baumm, 
Raab, Taylor, & Moore, 1965; Rickels & 
Downing, 1967; Rickels, Lipman, & Raab, 
1966) as showing that the effectiveness of 
placebos in alleviating symptoms was con- 
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siderably lower—and perhaps reversed—in 
patients who had long experience with their 
symptoms and prior exposure to medication, 
Strong placebo effects were prominent in pa- 
tients with relatively little familiarity with 
symptoms or medication. Whatever the merits 
of this distinction, Rickels’ studies were spe- 
cifically directed toward anxiety, and no di- 
rect application of the model has been applied 
to insomnia, 

The familiarity hypothesis of Storms and 
Nisbett (1970) is also beset by at least one 
internal contradiction. If patients are familiar 
enough with their symptoms to recognize that 
the pill is not having the originally intended 
effect, it follows that these patients would 
have had enough experience to recognize the 
symptoms as the same ones they have been 
chronically experiencing. These veteran pa- 
tients, it seems, would hardly be the ones to 
begin suddenly to attribute their symptoms 
to a novel pill in the face of a long-standing 
tendency to attribute them to endogenous 
factors. The familiarity hypothesis, then, 
must be considered as unsubstantiated at best. 

Although the theoretical basis for the re- 
verse placebo effect is still in question, there 
is nevertheless an increasing interest in ap- 
plication of the misattribution principle to 
clinical phenomena, Valins and Nisbett 
(1971) have summarized a number of vari- 
ants on the original Storms and Nisbett 
model. Before the reverse placebo effect is 
implemented on a widespread basis, however, 
it is essential to investigate its limiting con- 
ditions: Under what circumstances and in 
which patients can a reverse placebo effect 
be reliably produced? One qualification of 
the phenomenon not directly discussed by 
Storms and Nisbett (1970) is that individuals 
show the reverse placebo effect only to the 
extent that they indulge in intense self-ob- 
servation, The operation of the reverse 
placebo effect requires, at the least, some in- 
trospection with respect to present and past 
symptomatology. Only to the extent that sub- 
jects or patients are led to dwell on their 
symptoms are they likely to engage in the 
chain of reasoning and comparative analysis 
that forms the basis for the reverse placebo 
effect. In the work of Rickels and his col- 
leagues, we may assume that those patients 
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whom Storms and Nisbett characterize as 


most familiar are in fact those most | kely 
to indulge in self-observation; they are cither 
the chronic patients (in whom there may 
well be an element of hypochondria) or ‘sose 
acute patients in whom symptoms are vost 
intense (and thus most subject to con’ vual 
inspection). In the Storms and N. ‘ett 
(1970) study, the purported effects o' the 
pill were couched in terms of internal s tes 


(e.g., “The pill will make you feel ca”) 
rather than in terms of behavioral co se- 
quences (e.g., “The pill will help you slee; `); 
this emphasis on internal state may hav: n- 
couraged the sort of self-observation that ip- 
posedly underlies the reverse placebo vi ct. 
The direct suggestion techniques usually as- 
sociated with the operation of placebo eff.. ts, 
on the other hand, are more typically cow ed 
in terms of behavioral consequences `d, 
thus, are probably much less likely to ir ce 
a state of internal self-scrutiny. 

Accordingly, the present study was des ed 
primarily to test the hypothesis that e- 
verse placebo effect is dependent on : >t- 
ened self-awareness, whereas the direct © z- 
gestion (or placebo) effect obtains when s: d- 
jects or patients are not encouraged to indul; e 
in self-perception. In order to accomplish this 
experimental test, subjects were told eit! er 
that the pills would affect their symptoms in 
some way (indirect suggestion), as in Sto. ns 
and Nisbett (1970), or that the pills would 
have a direct effect on their sleeping beha ior 
(direct suggestion), It was expected that sub- 
jects who received indirect suggestion ins’: \1c- 
tions would show the reverse placebo effect, 
whereas subjects who received direct su 
tion instructions would show the traditior 
placebo effect. 

, Secondarily, the present study was de 
to replicate the Storms and Nisbett (1 
effect (at least in indirect suggest 
jects) while avoiding some of the p 
methodological artifacts present in the origi- 
nal study. Storms and Nisbett’s pretest data 
were beset by substantial differences between 
experimental conditions, leading to the possi- 
bility that regression artifacts might have 
accounted for their results, Storms and Nis- 
bett attributed the pretest differences to high 
variability in the subjects’ reports, perhaps 
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i result of the subjects’ reporting their pre- 
test data in some cases well over 36 hours 
ier the event. Thus, in the present study, 
» enhance the reliability of the present data, 
ubjects filled out both pretest and experi- 
ures immediately upon arising. 
the pretest and experimental 
e separated by a full week (rather 
g made on consecutive nights); this 
‘ration precluded the possibility (present 
the Storms & Nisbett, 1970, study) that 
he degree of insomnia on the pretest nights 
night directly affect how tired subjects were 
on the experimental nights, It was hoped that 
} Iterations would establish the reality 
of the reverse placebo effect, while the addi- 
nal (direct suggestion) experimental groups 

| establish the limits of the effect. 
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METHOD 
Subjects 


Insomniacs were recruited through posters placed 
throughout the campus of Governors State Univer- 
sity and through a brief article that appeared in 
local suburban newspapers requesting insomniacs to 
participate in a research study on dreams. Subjects 
were informed that they would receive $5 for par- 
licipating in the study and that if interested, they 
should contact the Behavioral Studies program at 
the university. Those who wished to participate 
were told that the study involved investigating the 
dream content of people with sleeping problems and 
that they would be asked to record their dreams on 
four different nights in their homes. Appointments 
en made for subjects to pick up materials 
to record their dreams for two baseline nights. 

A total of 50 insomniacs were recruited for the 

vestigation, However, 2 subjects, 1 of whom took 
S pills on the experimental nights and an- 
no did not record data on the appropriate 
mental nights, were eliminated from the sta- 
ul analyses, The final sample of 48 insomniacs 
cousisted of 11 males and 37 females ranging in age 
from 17 to 58, with a mean age of 39.6 years, Ap- 
proximately 52% of the subjects were nonstudent, 
adult insomniacs from neighboring suburban com- 
munities, 
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Procedure 


All interaction between subjects and experimenters 
took place in the university’s Student Health Ser- 
vices Center, This location was chosen to maximize 
the credibility of the placebo manipulations and 
because of its general convenience to the subjects. 
All sessions were individually conducted and lasted 
between 10 and 15 minutes, During the first ses- 
sion, all subjects signed a statement indicating that 
they understood that they might be asked to take 
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one of a list of drugs that included Vitamin C, 
Vitamin E, caffeine, lactose, aspirin, scopolamine, 
nicotine, and antihistamine. The experimenter then 
gave the subject two dream study questionnaires 
on which they were to record various aspects of 
their dreams on Monday and Tuesday nights, In- 
terspersed among the dream-related questions were 
questions concerning how many times the subject 
awakened during the night, when the subject went 
to bed, and at what time the subject fell asleep, 
Subjects were asked to note the precise time at 
which they went to bed on Monday and ‘Tuesday 
evenings but to record the rest of the data the fol- 
lowing mornings within 15 minutes of awakening. 
The experimenters stressed to subjects the impor- 
tance of not taking sleeping pills, alcohol, or medi- 
cation with sleep-related effects during both base- 
line and experimental nights. 

When subjects returned their baseline data to- 
ward the end of the first week, they were requested 
to come back the following Monday for additional 
questionnaires and further instructions. In the in- 
terim subjects were randomly assigned to the five 
experimental conditions. On Monday of the experi- 
mental week, subjects were given two white lactose 
capsules that they were asked to take, one per night, 
15 minutes before going to bed on Monday and 
Tuesday nights. As in Storms and Nisbett (1970), 
the experimenters explained to subjects that the 
purpose of the research was to examine the rela- 
tionship between bodily activity during sleep and 
dream content. Subjects recorded the same informa- 
tion about their dreams and sleep for Monday and 
Tuesday nights of the experimenetal week, as they 
had during the baseline period. They returned on 
either Wednesday or Thursday of the experimental 
week for a final session. At this time subjects com- 
pleted a postexperimental questionnaire that was de- 
signed to assess their perception of the effects of 
the pills, what they thought was in the pills, and 
what other medication, if any, they had taken on 
the four nights of the study, Subjects were then 
debriefed and paid $5 for participating in the study. 


Placebo Manipulations 


Two of the experimental conditions in this study 
were exact replications of Storms and Nisbett’s 
arousal and relaxation manipulations, and thus the 
instructions were taken verbatim from their study. 
In describing the effects of the pills to subjects in 
these conditions, the experimenters either empha- 
sized that the pills would increase their level of 
arousal or would relax them. However, no suggestion 
was given that the pills would directly affect the 
sleep of the subjects in either group. Instructions 
for the arousal manipulation were as follows: 


This drug will increase your bodily activity, It 
works on the sympathetic nervous system which 
is the system which arouses you and sends adren- 
alin through your system, The pill will increase 
your heart rate and it will increase your body tem- 
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perature. You may feel a little like your mind 
is racing. In general, it may arouse you. 


For the relaxation manipulation, subjects were told: 


This drug will lower your bodily activity. It 
works on the parasympathetic nervous system, 
which is the system that relaxes you, It will de- 
crease your body temperature so that you will 
feel a little cooler, And it will calm down your 
mind, In general, it will relax you. 


In addition to the above conditions, two other 
experimental groups were employed, The purpose 
of these conditions was to determine the effect of 
direct sleep-related suggestions accompanying the 
placebo on the reporting of sleeping difficulty on the 
experimental nights. Basically, subjects in these 
groups were told either that the pills would help 
them to fall asleep or that they would keep them 
awake longer than usual. In contrast with the arousal 
and relaxation instructions, no specific information 
concerning the effect of the pills on bodily activity 
was given. For the somnia manipulation, subjects 
were told: 


This drug will generally tend to help you fall 
asleep. Its effects on your body are fairly diffuse, 
but you should have less trouble than usual in 
falling asleep. 


For the insomnia manipulation, subjects were told: 


This drug will generally tend to keep you awake, 
Its effects on your body are fairly diffuse, but 
you should have more trouble than usual in fall- 
ing asleep. 


A control group was included in the experiment 
to assess differences in reported sleep-onset latencies 
from pretest to posttest weeks not attributable to 
the experimental manipulations. As in the other four 
experimental groups, subjects in the control group 
took the placebo pills on the two experimental nights 
but were given instructions that the pills would 
affect their dreaming but not their bodily activity. 
Furthermore, no sleep-related suggestion was in- 
cluded in these instructions. For the control sub- 
jects, the following instructions were given: 


This drug should have an effect on the intensity 
or number of dreams that you have, but should 
not noticeably affect your bodily state or mental 
activity while awake. 


The experimenters repeated all instructions to sub- 
jects twice to ensure clarity and understanding of 
the effects of the pills. However, when subjects asked 
questions about the pills that were extraneous to 
the instructions they had received, the experimenters 
explained that they could not offer any further in- 
formation. The control, relaxation, and insomnia 
groups each contained 10 subjects, while the arousal 
and somnia groups each contained 9. 

One question that is frequently raised about at- 
tempted replications is whether the design had suf- 


R. R. BOOTZIN, C. P. HERMAN, AND P. NICASSIO 


ficient power to detect the original effect. 1: Storms $ 


and Nisbett’s (1970) obtained means and  ariance 
are assumed to represent the population val: s, then 
the probability of obtaining a significant mi tribu- 
tion effect (p<.05) with the present sar size 
is 81. 

Dependent Variables | 


Sleep, The main dependent variable » the 


amount of time that subjects took to fall as p on 
the two baseline and two experimental nigh Fol- $ 
lowing Storms and Nisbett (1970), this varia was 
indirectly measured by subtracting the time hich 


subjects reported going to bed from the at 
which they reported falling asleep. Subjects v 
asked to record the sleep onset latencies « 
since such attention to sleep may have revea. the 


true intent of the study to the subjects, Th only 
other sleep variable measured each night the 
study was the number of awakenings that ects 
reported. As a further measure of sleep distu: «nce, 
subjects also reported the extent of their in» nnia 
on 7-point scales for each of the nights of the ‘udy 
on the postexperimental questionnaire. 

Arousal. The degree of general arousal ex} ence 
by subjects on each of the four nights wa: nea- 
sured by asking them on the postexperimenta! ues- | 
tionnaire to rate (on 7-point bipolar scales) how t 
warm or cold they felt before falling aslee, and 
the extent to which their mind raced while ling 
asleep. A specific check on the degree of è sal 3 
attributable to the pills was made by asking ub- 


jects to record on the postexperimental questio: 
the extent to which they felt that the pills ai ted 
how warm or cold they felt, how much their 
raced, and how alert they felt, all on 7-point sles. 


RESULTS 
It had been predicted that subjects in the 
indirect suggestion conditions (arousal and 


relaxation) would show the reverse placebo | 
effect, thus replicating the Storms and Nis) ett 
(1970) findings. That is, subjects in he 
arousal condition would show a decrease in 
time to fall asleep, whereas subjects in ihe 
relaxation condition would show an increase 
in time to fall asleep. Subjects in the direct — 
suggestion conditions (somnia and insomnia) | 
were expected to conform to the traditional . 
placebo effect; that is, subjects in the somnia 
condition would fall asleep sooner, while sub- 
jects in the insomnia condition would take 
longer to fall asleep. 

If this pattern of results were to emerge, 
the statistical expectation would be for a 
significant directness of instructions (direct 
vs. indirect) X direction of instructions (im- — 
provement vs. worsening) interaction. Analy- 
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40 
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Control 
= 


Arousal 


Somnia 


Relaxation 


Experimental 


Ficurr 1, Mean time to fall asleep on pretest and experimental nights, 


sis of covariance of time to fall asleep for 
the experimental nights (with time to fall 
asleep on the pretest nights as the covariate) 
evealed no significant interaction (F < 1). 
n contrast, there was a significant main ef- 
iect, F(1, 33) = 4.04, p = .053, of direction 
ot instructions. As can be seen in Figure 1, 
subjects in the relaxation and somnia condi- 
tions fell asleep faster on experimental nights 
than subjects in the arousal and insomnia 
conditions, 

Additional analyses of covariance includ- 
ing the control subjects indicated that the 
main effect of the previous analysis was pri- 
marily due to improvement by the relaxation 
and somnia subjects. They differed signifi- 
cantly from control subjects, F (1, 26) = 4.19, 
p = .051, whereas the combined arousal and 
insomnia subjects did not (F < 1, see Table 
1). Additionally, within-group ¢ tests were 


calculated for each of the individual treat- 
ment conditions and for the combined groups 
that composed the main effect of direction of 
instructions, As can be seen in Table 1, the 
combined relaxation and somnia subjects and 
the relaxation subjects alone improved sig- 
nificantly. The arousal, insomnia, and control 


TABLE 1 


Winn ‘TREATMENT CHANGE FOR Mean TME TO 
FALL ASLEEP 


Condition n Change t 
Arousal (A) 9 —2.78 —.18 
Insomnia (I) 10 8.00 86 
A+I 19 2.89 32 
Relaxation (R) 10 — 23.60 —2.46** 
Somnia (S) 9 — 18.33 —1.41* 
R+S 19 —21.11 —2,72** 
Control 10 10.90 88 
Note. Time is recorded in minutes, 

*p <.20. 
*p <.05. 
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subjects did not change significantly from 
pretest to posttest. 

The results of the present study clearly 
failed to replicate the reverse placebo effect 
found by Storms and Nisbett (1970). In fact, 
a significant placebo effect was obtained even 
from the treatment condition that was an 
exact replication of the Storms and Nisbett 
relaxation condition, Storms and Nisbett, 
however, indicated that their experimental 
effects “had occurred only for subjects who 
had believed the pill descriptions” (p. 323). 
It is possible, then, that the absence of re- 
verse placebo effects is a reflection of dis- 
belief on the part of the subjects. Accordingly, 
subjects were classified on the basis of the 
postexperimental questionnaire into those 
who had accepted the pill descriptions (be- 
lievers) and those who had not accepted the 
descriptions and who had not attributed any 
aspect of their experimental reaction to the 
pills (nonbelievers). There were four non- 
believers in the arousal and insomnia groups; 
elimination of their data, however, only 
served to bolster the traditional placebo ef- 
fect. As can be seen in Table 2, believers had 
considerably greater trouble in falling asleep 
than did nonbelievers, though the effect fell 
short of statistical significance. A similar di- 
vision of relaxation and somnia subjects into 
believers and nonbelievers had no effect on 
the strong placebo effect in these subjects. 


Discussion 


The present experiment, which was origi- 
nally designed to replicate and delimit the 
reverse placebo effect found by Storms and 
Nisbett (1970) failed to replicate the reverse 
placebo effect in any respect, The major 


TABLE 2 


MEAN TME TO FALL ASLEEP FOR BELIEVERS AND 
NONBELIEVERS 


Experi- 


Condition n Pretest mental Change £ 
Arousal f 
insomnia 
Bellevers 15 60.17 69.17 9.00 1.35* 
Nonbelievers 4 77.50 57.50 20,00 
Relaxation 
somnia Š 
Believers 4 6464 45.28 —19.36 -37 
Nonbelievers 5 73.50 47.50  —26.00 


Note. Time is recorded in minutes. 
* o < 20. 
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finding of the study was the dire 
tion effect whereby subjects who 


structed that the pill would help then: sleep 
or relax fell asleep faster than dic jects 
who were told that the pills woul: srouse 
them or interfere with sleep. The ex) tation 


that the subject’s heightened self-av -eness 


would provide the critical preconditio: or the * 
reverse placebo effect was clearly :scon- 
firmed; the attempt to foster self-pe: ption; 
of internal events through reference © the 
pill’s symptomatic effects was insuffi nt to” 
impair the direct suggestion effect. 

The discrepancy between the result: í the 
present study and the results of Stor; and 
Nisbett’s (1970) study demands a con lera- 
tion of the methodological differences h: veen 
the two studies. Procedurally, the p ‘sent 
study corrected some of the difficulties the 
Storms and Nisbett study, and regressio: irti- 
facts did not cloud the interpretation the 
results. Perhaps a more likely explana: 1 of | 
the discrepancy, however, stems from t} dif- 
ferent subject populations in the two st lies. 
Storms and Nisbett used Yale undergra:’ ates” 
as subjects in their study; the present dy É 
recruited subjects from the communit as” 


well as from a primarily urban, comm ter, 
student population. The subjects in the ; “es- 
ent study were considerably older inan 
Storms and Nisbett’s subjects, and furti er- 
more, we may speculate that they were iess 
disposed than Yale undergraduates to tne. 
sort of cognitive and attributional exerti: ns | 
that presumably underlie the reverse plo 
effect. It may well be that misattribution -he- 

nomena are more prevalent among h, er- | 


reflective college students. This explai. tion 


for the discrepancy between the two stuilies, 
however, has difficulty in accounting “or a 
recent failure by Kellogg and Baron (i975) 
to find evidence of the reverse placebo ef- 
fect among students recruited from « state 
university. 


Kellogg and Baron (1975) also attempted 
to replicate Storms and Nisbett’s (1970) 
findings, and their results complement those 
of the present study. They found that sub- 
jects in an arousal condition took significantly 
longer to fall asleep. Although both Kellogg 
and Baron and the present study disconfirm 
Storms and Nisbett’s findings, Kellogg and 
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m found a significant worsening of in- 
nja in their arousal condition, whereas the 
esent study did not, However, it should be 
ed that the subjects in Kellogg and 
oms study, like those in Storms and Nis- 
tudy, did not exhibit as severe a de- 
nia as did the subjects in the 

pecscal study. Our subjects initially took 63 
minutes to fall asleep, while Storms and Nis- 
betts and Kellogg and Baron’s subjects took 
nd 45 minutes, respectively, to fall asleep 
fore the pill manipulations, It seems rea- 
onable to conclude that the relatively high 
initial insomnia in the present study (quite 
ible to Monroe’s, 1967, average of 59 
; for “poor” sleepers in his survey) 
luded significant worsening of insomnia. 
- and Baron, unfortunately, did not 
ude the equivalent of the present relaxa- 

m or somnia conditions in their study, so 

ie present results demonstrating placebo- 
induced direct improvement in this context 
are unique. 

The present study, like Kellogg and Baron’s 
(1975) study, did not simply fail to repli- 
cate the reverse placebo effect found by 
Storms and Nisbett (1970); rather, both 
found a significant direct suggestion effect. 
These studies call into question the current 
enthusiasm for clinical applications of the re- 
verse placebo effect. It seems clear that at 
best the effect is elusive and not under strict 
experimental control. The implications for 
treatment of insomnia, of course, must there- 
fore be cautionary. Unless and until the re- 
verse placebo effect can be reliably induced, 
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misattribution therapy seems as likely to ex- 
acerbate the symptoms as to reduce them, 
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The Recognition and Elimination of Inconsistencies Among 


Syllogistically Related Beliefs: Some New Light 
on the “Socratic Effect” 


Marilyn Henninger and Robert S. Wyer, Jis 
University of Illinois at Urbana-Champaign 


A series of experiments investigated (a) the criteria that subjects use to judge 
the inconsistency of beliefs and (b) the manner in which inconsistencies among 
subjects’ own beliefs are resolved once these beliefs are made salient to them. 
In one experiment, subjects estimated the inconsistency of a hypothetical per- 
son’s responses to a series of syllogistically related beliefs. The syllogistic model 
of cognitive functioning proposed by McGuire and extended by Wyer ac- 
curately predicted these estimates. In the second two experiments, subjects re- 
ported their own beliefs in syllogistically related propositions during two ses- 
sions spaced a week apart. The inconsistency among these beliefs was typically 
less in the second session than in the first, providing evidence for a “Socratic 
effect.” However, when beliefs were initially reported in the order one might 
typically consider them in a syllogistic inference task, their inconsistency was 
low in the first session and did not change appreciably over time. This and 
other findings suggested that the Socratic effect is due largely, but not entirely, 
to the resolution of inconsistencies among beliefs in the course of reporting them 
in the questionnaire rather than during the time interval between experimental 
sessions. Additional data suggested that not all of a set of syllogistically related 
beliefs need be reported during the first session in order for an increased con- 
sistency among them to occur. Implications of results for the general validity of 
the McGuire-Wyer syllogistic model, and for cognitive reorganization processes 


in general, are discussed. 


When an individual reports syllogistically 
related beliefs twice, a week apart, these be- 
liefs are typically more internally consistent 
the second time than they were the first. This 
phenomenon, which is typically referred to as 
the “Socratic effect” (McGuire, 1960), is 
well-established (McGuire, 1960; Rosen & 
Wyer, 1972; Wyer, 1974b; Wyer & Goldberg, 
1970). The effect generalizes over beliefs that 
vary in their importance to subjects and in 
their evaluative implications (Wyer, 1974b). 
Moreover, it does not appear to be an artifact 
of compliance with experimental demand char- 
acteristics, increased familiarity with the gen- 
eral type of questionnaire used, or regression 
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effects (Rosen & Wyer, 1972; Wyer, 1974b) 
The effect has been commonly interpreted a: 
an indication that persons actively reorganize 
inconsistent cognitions once these cognitions 
have been made salient by reporting them in 
close temporal contiguity. 

A rigorous investigation of the Socratic 
effect requires a precise quantitative index © 
the inconsistency of a set of syllogistical! 
related beliefs. Such an index is suggested | y 
the syllogistic model initially proposed 'y 
McGuire (1960) and extended by Wer 
(1970; Wyer & Goldberg, 1970). This m del 
has the form: 


Ps = PaP nja + (1 — Pa) Pria (1) 


where A and B are two propositions, P, and 
Pp are an individual’s beliefs (defined ix units 
of probability) that A and B are true, re- 
spectively, and Pz, and Pya» are his condi- 
tional beliefs that B is true if A is and is not 
true, respectively. The two products compris- 
ing the right side of the equation may be 
interpreted as beliefs in gwo mutually ex- 
{ 


COGNITIVE INCONSISTENCY AND THE “SOCRATIC EFFECT” 


Sry msive sets of premises, (A; if A, then B) and 


t A; if not A, then B), each of which, if 
, implies the conclusion B. This equation 
urately describes the functional relations 
ng the beliefs to which it pertains, and 
nant n which information bearing 
‘iefs comprising the right side of 
(i.e., beliefs in premises) com- 
t the belief in the conclusion B 
1976). Moreover, the equa- 
derivatives generate accurate 
descriptions of the relations 
beliefs (Wyer, 1970, 1973, 
» a summary, see Wyer, 1974a, 


t ititatis 


ation 1 is a valid normative index of 
ver in which individuals typically or- 
llogistically related beliefs, the in- 
cy (Inc) among such beliefs should 
cted by the absolute difference be- 
» right and left sides of this equa- 
it is. 


| Pp-(PaPaja+U—Pa)Paja)|. (2) 


if the four beliefs comprising this equa- 
i ure reported in a questionnaire at two 
( ent times, a Socratic effect would be 
i) vated by a decrease in the magnitude of 
ih absolute difference from the first time to 
th second. In fact, this appears to be the 
ci (Rosen & Wyer, 1972; Wyer, 1974b; 
W or & Goldberg, 1970). 

'espite the abundant evidence for the So- 
cratic effect, however, the validity of several 
assumptions underlying its interpretation re- 
main unverified. The present article reports 
three experiments that bear upon certain of 
these assumptions. The first experiment in- 
vestigated the extent to which inconsistency 
as defined by Equation 2 is an accurate re- 
flection of subjects’ actual perceptions of the 
inconsistency among related cognitions. The 
second experiment was concerned with the 
extent to which inconsistencies are resolved 
during the course of reporting beliefs on the 
questionnaire rather than during the interval 
between experimental sessions. The third ex- 
periment investigated the number and type of 
beliefs that must be made salient in order for 
the resolution of inconsistencies to occur. 
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EXPERIMENT 1 


If Equation 1 is a valid quantitative de- 
scription of the manner in which an individual 
organizes syllogistically related beliefs, a per- 
son should be able to recognize as inconsistent 
a set of beliefs that are not related in the 
manner implied by this equation, More pre- 
cisely, a person’s estimates of the inconsis- 
tency of such beliefs should be predictable 
from Equation 2. The present experiment in- 
vestigated this possibility. 


Method 


Subjects were 60 introductory psychology students 
who participated in the study to fulfill a course re- 
quirement. Judges were told that the purpose of the 
study was to determine how people judge the con- 
sistency or incorsistency of other persons’ beliefs. 
They then completed a 38-item questionnaire, in 
which each item consisted of the following four 
statements: 


a. A person belongs to group A. 

b. If a person belongs to A, he has attribute X. 

c. If a person does not belong to A, he has at- 
tribute X. 

d. A person has attribute X. 


Each statement was followed by an 11-point scale 
from O (not at all likely) to 10 (extremely likely). 
A value along each scale was circled to represent a 
hypothetical subject’s belief in the statement pre- 
ceding it. The values assigned to the four statements 
in each set, divided by 10, were assumed to corre- 
spond directly to Pa, Paya, Pajar, and Py», respectively, 
as defined in Equation 1. Judges were told to assume 
that the four beliefs in each set had been reported 
by a hypothetical subject in a previous experiment. 
and to estimate the consistency of these beliefs by 
placing a check along a scale from 0 (extremely in- 
consistent) to 10 (extremely consistent). These judg- 
ments were subsequently reverse scored so that higher 
numbers reflected greater perceived inconsistency. 

Stimulus items were constructed by combining 
all 36 possible combinations of two levels of Pa 
(denoted by circling the numbers 2 and 8 along the 
scale), three levels of Paya (denoted by 2, 5, and 8), 
and two levels of Ps (denoted by 3 and 7). Different 
values of Ps were selected to ensure that some 
inconsistency existed in all sets of statements and to 
provide a more diverse set of stimulus values. The 
theoretical inconsistency of beliefs (Inc), calculated 
on the basis of Equation 2, ranged from .02 to .50 
over the 36 test stimuli. In addition to the test 
stimuli, two practice stimuli were constructed, one 
in which the given set of beliefs was maximally 
consistent (Inc=0, as defined by Equation 2) and 
a second in which the four beliefs were highly in- 
consistent (Inc=.72). 
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Two questionnaire forms were constructed, each 
of which was completed by one half of the partici- 
pating subjects. While both forms presented the 
practice stimuli first, the order of presentation of 
the 36 test stimuli was reversed on the second 
form. 


Results and Discussion 


Mean estimates of inconsistency for the 36 
test items were correlated .935 (p < .01) with 
predicted values based upon Equation 2. Pre- 
dicted and obtained estimates of inconsistency 
are shown as a function of experimental vari- 
ables in Figures 1 and 2, respectively. The 
predicted effects of the two conditional be- 
liefs (Ppya and Pga) differ greatly at each 
combination of beliefs in A and B (P4 and 
Pg). However, the obtained pattern of effects 
generally corresponds closely to the predicted 
pattern. There is one major exception to this 
tule. That is, when P4 and Pp differ consid- 
erably in magnitude and the two conditional 
beliefs are both moderate (i.e., .5), judgments 
of inconsistency were greater than predicted 
by Equation 2. 

These conclusions are supported statisti- 
cally by an analysis of variance of judgments 
as a function of the four manipulated beliefs. 
The results of this analysis, shown in Table 1, 
indicate that 8 of the 15 main effects and 
interactions involving these variables were 
significant. Since Equation 2 generates point 


TABLE 1 
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estimates of the inconsistency of combi- 
nation of component beliefs (P4, Z 1, Paya, 
and Pg), the proportion of the to ums of 
squares theoretically accounted y each 
main effect and interaction can be culated 
and compared with the proportion ums of 
squares actually accounted for by se ef- 
fects. Data relevant to these compar s, also 
shown in Table 1, indicate considers > simi- 
larity between the obtained and licted 
proportions, 

The primary discrepancy from prec! ion is 
reflected in the P4 X Pg interaction, While 
this interaction should theoretically hav been 
zero, it was significant in the presen: data. 
This result is attributable to the fa: that 1 
inconsistency was judged to be greate: vhen 


Pa and Pp were dissimilar (M = .505) ‘han 
when they were similar (M = .484). nese 
data suggest that subjects tend to us the 
difference between P4 and Pp as a bas for 


inferring inconsistency, independently o- the 
values of the two conditional beliefs. 
Supplementary analyses. A somewhat lif- 
ferent test of the fit of Equation 2 may . lso 
be performed. The nine predicted values of 
inconsistency are the same within each com i- 
nation of the manipulated values of Py «ud 
Ppr. Therefore, a “dummy” variable with x ne 
levels corresponding to these predicted vaines 
may be constructed. An analysis of variance 


ANALYSIS OF VARIANCE SUMMARY OF INCONSISTENCY JUDGMENTS AND THEORETICAL AND 
OBTAINED PROPORTIONS OF SUMS OF SQUARES ACCOUNTED FOR, EXPERIMENT 1 


% sums of squares 
ccounted fo. 

Source af MS dferror Mseror F Theoretical Actual 

Pa 1 .0267 59 0379 71 a 
Para 2 ‘0601 118 “0405 1.48 "A 3 
Pajar 2 :008 118 ‘041 20 6 0 
Po 1 167 59 “048 3.47 0 5 
a X Pasa 2 1238 118 ‘044 5.45* gee 1.3 
Pa X Payar 2 ‘056 118 ‘034 1.66 3 3 
Pax Pa 1 1535 59 ‘Oat 13.11% 0 15 
Para X Pow 4 587 236 ‘052 11.28" 40 64 
Bara X Pa 2 6.399 118 ‘073 87.75" 29.5 35.2 
Pai X Po 2 5.719 118 100 57.42* 295 31.4 
a X Poa X Paw 4 <010 236 ‘034 30 a 
Pa x Paa X Pa 2 2.326 118 064 36.56" 169 12.8 
Pa X Paw X Pa 2 1.464 118 063 23.07 169 8.0 
Beia X Pog X Pa 4 058 236 047 1.23 4 6 
a X Paa X Paar XPa 4 129 236 ‘030 425+ eae 


*p <.001. 
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1 judgments may then be performed 


as a Ívoction of this dummy variable, P4 and 
Pp. |! ‘ye model were a perfect predictor of 
subj estimates of inconsistency, such an 


anal would yield only a main effect of the 


dun ariable. All other main effects and 
inte. ‘ons should be nonsignificant. In fact, 
an a is of the data revealed only a main 
effec the dummy variable, F(8,472) = 
48.5 .001, an interaction of P4 with Pz, 
and nteraction of the dummy variable 
with F (8,473) = 2.63, p < .05. The first 
of t} predicted interactions has already 
been ussed; the second is small and not 
read \terpretable, Moreover, the 8 de- 
grees freedom associated with the main 
effect the dummy variable accounted for 
93.4" the total sums of squares, while the 
remait 27 degrees of freedom (correspond- 
ing t other main effects and interactions) 
accou: «| for only 6.6% of the variance, with 
no sir effect contributing more than 2%. 


Co red in their totality, the results re- 


porte: ove support the hypothesis that 
Equa 2 describes the inconsistency that 
sub je verceive among syllogistically related 
belie. \lthough a few factors not taken into 
accou: by this equation also contribute to 
incon: tency judgments, these contributions 


are sll in magnitude. It therefore seems 
reason le to assume that if a subject per- 
sonal!, holds a set of beliefs that are not or- 
ganized in the manner implied by Equation 1, 


and if he is made aware of these beliefs in 
combi 


ition, he is apt to regard them as in- 
tent. Moreover, an increase in the ac- 
of this equation in describing a sub- 
liefs in a set of syllogistically related 
propositions may reflect a decrease in the in- 
consistency of these beliefs as the subject 
himself would define it. This does not neces- 
sarily mean, of course, that such decreases 
are the result of a conscious attempt by sub- 
jects to eliminate inconsistencies; these de- 
creases could be due to other factors. The 
next two studies are concerned with when and 
why such changes occur. 


EXPERIMENT 2 


Tn a typical study of the Socratic effect, 
subjects are asked to report their beliefs in 


syllogistically related propositions during two 
sessions a week or so apart. It is usually 
assumed that the subjects have formed beliefs 
in these propositions before participating in 
the experiment and that they recall these 
beliefs from memory in order to complete the 
questionnaire administered during the first 
session. Thus, the inconsistency of the beliefs 
reported in this session is assumed to have 
existed beforehand; this inconsistency is only 
made salient to subjects as a result of report- 
ing their beliefs in temporal proximity. How- 
ever, this may not always be the case. Sup- 
pose a subject is asked to report his belief in 
the proposition “caffeine destroys nerve cells.” 
He may never before have considered this 
possibility and its implications until the first 
time he encounters the proposition in the 
questionnaire. In such an event, his estimate 
of the likelihood that the proposition is true 
would not reflect his previously formed belief 
in its validity but rather would reflect a newly 
formed belief, based either upon other rele- 
vant information he has received in the past 
or simply upon his intuitive impression of its 
plausibility. 

This possibility has important implications 
for an understanding of the Socratic effect. In 
most previous research on this effect, the 
propositions to be considered by subjects have 
been ordered unsystematically in the question- 
naires, so that beliefs in conclusions are re- 
ported sometimes before and sometimes after 
beliefs in the premises. The order in which 
these beliefs are reported could have a sub- 
stantial effect upon the apparent inconsistency 
of beliefs reported in the first session and 
upon the change in this inconsistency over 
sessions. This order effect may be demon- 
strated by the following syllogism: 


Caffeine destroys nerve cells (A). y 
If caffeine destroys nerve cells, drinking coffee is 


dangerous (if A, then B). 
Drinking coffee is dangerous (B). 


Suppose a subject has never considered the 
premises of this syllogism before he is asked 
to do so in the questionnaire but that he 
regards them as plausible, and therefore as 
likely to be true. If he first reports his beliefs 
in these premises and then is asked to evalu- 
ate the proposition that drinking coffee is 
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dangerous, he may use these beliefs as “infor- 
mation” in inferring the validity of this prop- 
osition. Consequently, the set of beliefs he 
reports should appear fairly consistent. How- 
ever, if the subject is asked to consider the 
likelihood that drinking coffee is dangerous 
before he has considered the premises, he 
would not have the “information” (his be- 
liefs in the premises) available to him, and so 
he may judge this conclusion as unlikely to 
be true. Such a judgment would be incon- 
sistent with the beliefs in the premises he 
reports later on in the questionnaire, How- 
ever, note that in both cases, the subject’s 
beliefs may appear consistent in the second 
session of the experiment when his beliefs in 
the premises, formed in the first session, can 
be recalled and taken into account in judging 
the conclusion. This line of reasoning predicts 
that the inconsistency among syllogistically 
related beliefs will be less in the initial ad- 
ministration of the questionnaire, and that 
the apparent change in inconsistency over 
time will also be less, when beliefs are re- 
ported in syllogistic order during completion 
of the first questionnaire than when they are 
not. While some data relevant to this hy- 
pothesis were reported by Holt and Watts 
(1969), a less precise index of inconsistency 
was used than that based upon Equation 2, 
and the results of the study were inconclusive. 
The present experiment pursued the impli- 
cations of the analysis described above. In 
doing so, two factors not taken into account 
by Holt and Watts (1969) were considered: 
the plausibility of 4 (inferred from the mag- 
nitude of beliefs in A, or P4) and the rele- 
vance of beliefs in A to beliefs in B (indi- 
cated by the difference between the condi- 
tional belief that B is true if A is true and the 
conditional belief that B is true if A is not 
true, or the quantity Ppy4 — Prya’). The ef- 
fects described above were expected to be less 
likely to occur when A was implausible, since 
its implications should be less apt to be taken 
into account in evaluating the conclusion B. 
(For instance, if in the above example, the 
subject believed it to be very unlikely that 
caffeine destroys nerve cells, a consideration 
of this belief should not appreciably affect 
the conclusion, regardless of whether it is 
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reported before or after the conc! ‘on is 
considered.) Moreover, cognitive r niza- 
tion was expected to be less likely n A 
was irrelevant to B than when it was vant, 
Method 

Subjects were 72 introductory psychology dents 
who participated in the experiment to fulfill ourse 
requirement, Each subject completed a item 
questionnaire in each of two sessions a wei jart, 
Four questionnaire items pertained to eac i 24 
pairs of statements (A and B). Beliefs in the ‘tems 
were assumed to correspond to Pa, Paja, P: and 
Pn as defined in Equations 1 and 2. Of the airs 
of statements, 6 represented each of four co ina- 
tions of plausibility of A (high or low), an sle- 
vance of A to B (high or low). The order in ich 
the four belief items pertaining to each p of 
statements occurred in the questionnaire wa- also 
varied in a manner to be described. 

Selection of stimulus items. To select sets of lief 
statements varying in the plausibility of A an the 


relevance of A to B, normative data were co ed 
from 45 subjects other than those who partici; ‘ed 
in the main experiment. These data pertained 1 60 
different pairs of statements (A and B) on a va: «ty 
of topics. Along a scale from 0 (not at all likely to 
10 (extremely likely), 15 subjects estimated the | <e- 
lihood that statement A in each pair was true (. \), 
while 15 other subjects estimated both the likelih: od 
that B was true if A was true (Paa) and the lik: li- 
hood that B was true if A was not true (Psx). “ne 
remaining 15 subjects were asked to consider e: h 
pair of statements (A and B) and to estimate how 
frequently they had previously thought of A in re \- 
tion to B. These latter data enabled pairs of stai- 
ments to be selected that had not often been cc 
sidered in combination, so that beliefs in thie 
statements were less likely to have been consistent v 
organized prior to the experiment. 

Based upon these data, six pairs of statements 
were selected to represent each of four combinations 
of two levels of plausibility (P4) and two levels of 
relevance (Paa — Paya). Responses were divided by 
10 to convert them into units of probability. Aver- 
aged over judges and statement pairs, the values of 
Pa under high- and low-plausibility conditions were 
-75 and .32, respectively, when A was highly rele- 
vant to B, and .71 and .28, respectively, when A was 
of low relevance to B. The mean difference between 
Pays and Psw under high- and low-relevance condi- 
tions was .38 and .11, respectively, when the plausi- 
bility of A was high, and .44 and +12, respectively, 
when the plausibility of A was low. The following 
example shows four statement pairs each representing 
a combination of plausibility and relevance: 


1. High plausibility/high relevance: 
(A) Trucks carrying heavy cargo destroy high- 
way paving. 
(B) The weight limits on truck cargo should be 
decreased. 
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2. iigh plausibility/low relevance: 
) American factory workers are bored. 
R) American industry is producing substan- 
dard merchandise. 
t ow plausibility/high relevance: 
) Most milk contains radioactive material. 
!) Drinking milk is an unhealthy practice. 
4 w plausibility /low relevance: 
13 There will be no national boundaries by 
the year 2000. 
War will be a thing of the past by the end 
of this century. 


I ition of questionnaire forms. Four belief 


iten taining to each of the 24 pairs of state- 
me re constructed. A belief in one item (e.g., 
al carrying heavy cargo destroy highway 
par was assumed to correspond to Pa; a belief 
in d (“If trucks carrying heavy cargo destroy 
hig! paving, the weight limits on truck cargo 
she decreased”), to Paya; a belief in a third 
(“E trucks carrying heavy cargo do not destroy 
high vaving, the weight limits on truck cargo 
shot decreased”), to Paya; and a belief in the 
fou rhe weight limits on truck cargo should be 
dec ), to Px. 

tionnaire forms were then constructed. 
Wil h form, items were arranged so that the 
four is pertaining to each pair of statements 
repr ng a given combination of plausibility and 
rele were reported in a different order, specif- 
icali Po, Paya, Paar, Pa; (b) Pa, Pa, Psj, 
Ps Puja, Paias, Pa, Pa; (d) Pa, Po, Paya, 


Pr ) Pays, Pojar, Pa, Pu; and (i) Pa, Paya, 

The order of belief items pertaining to 
tatement pairs representing each plausibil- 
ity ınce combination was counterbalanced over 


the tionnaire forms in a Latin square design 
so tl beliefs pertaining to each of the six pairs 
Were cported once in each of the six orders. Within 
the |) ils implied by these orderings, the 96 items 
in i uestionnaire were distributed unsystemat- 
ically, with the restriction that belief items pertain- 
ing to the same pair of statements were separated 
by at icast 6 items pertaining to other pairs. 

Tweive subjects completed each of the six ques- 
tionnaire forms during the first session of the experi- 
ment. Within each group of 12, 2 subjects completed 
each of the six forms in the second session. Thus, 


the order in which beliefs were reported during 
Session 2 was controlled within each combination of 
plausit relevance, and the order of reporting 
beliefs during Session 1. 

Presentation of materials. Subjects were told that 
the purpose of the experiment was to assess the 
opinions of college students on a number of con- 
temporary issues. They were then given the appro- 
Priate questionnaire, with instructions to read each 
statement carefully and to estimate the likelihood 
that it was true by placing a check along an 11-point 
scale ranging from O (not at all likely) to 10 (ex- 
tremely likely). After completing the questionnaire, 
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subjects were dismissed. When they returned for the 
second session 1 week later, they completed the 
questionnaire again, with no further instructions, 


Results 


Each subject’s responses were divided by 
10, thus permitting the inconsistency of his 
beliefs in each set of related propositions to 
be calculated according to Equation 2.* Pool- 
ing over questionnaire forms, these scores 
were then analyzed as a function of time 
(Session 1 vs. Session 2), the six orders in 
which beliefs were reported in the question- 
naire administered during Session 1, the rele- 
vance of A to B (high vs. low), and the 
plausibility of A (high vs. low). 

Overall effects of time and relevance. Aver- 
aging over experimental conditions, incon- 
sistency was less in Session 2 (M = 112) 
than in Session 1 (M = .145), F(1,71) = 
50.03, p < .01, indicating an overall Socratic 
effect. Inconsistency was generally higher 
when A was highly relevant to B (M = .144) 
than when it was not (M =.113), F(1, 71) 
= 44.97, p < .01. This difference is undoubt- 
edly attributable to the greater similarity of 
Pp, Prya, and Prya, when A’s relevance to B 
is low. Under such conditions, the inconsist- 
ency of the beliefs comprising Equation 1 
would be low, regardless of the value of P4. 
However, contrary to expectations, no inter- 
actions of relevance with time or order were 


1The use of subjects’ responses to obtain a 
quantitative index of inconsistency requires the as- 
sumption that these responses are accurate representa- 
tions of their beliefs along a subjective scale of 
probability, This assumption is undoubtedly invalid 
in the strict sense. However, previous applications 
of the model (Wyer, 1970, 1975a) have shown that 
subjects’ estimates of Ps can be predicted with con- 
siderable accuracy by applying Equation 1 to their 
estimates of Pa, Paya, and Ppa along a scale similar 
to that used in the present experiment. (Specifically, 
when both actual estimates of Px and estimates pre- 
dicted by Equation 1 are averaged over subjects, the 
standard error of the difference between these esti- 
mates is typically less than .05, without requiring 
the introduction of ad hoc curve-fitting parameters). 
This accuracy not only supports the assumption that 
Equation 1 is a good approximation of subjects’ cog- 
nitive behavior in the situations to which it is ap- 
plied, but supports the assumption that the response 
scale along which these beliefs are reported is ap- 
proximately ratio. 
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TABLE 2 


MEAN INCONSISTENCY IN Sessions | AND 2 AS A FUNCTION OF THE PLAUSIBILITY OF A à 
ORDER OF REPORTING BELIEFS IN SESSION 1, EXPERIMENT 2 


High plausibility 


Low plausibility 


Order Session 1 Session 2 Change Session 1 Session2 
Pa, Psia, Para, Pa 184 140 —.044 157 093 - 
ioe Pa, Pria, Paya 197 135 —.062 -126 ALL = 
Pia, Peis, Pp, Pa 157 -107 —.050 A114 -107 + 
a, Pg, Pria, Payar 156 .099 —.057 .092 101 - 
Paya, Paya, Pa, Pe 166 124 —.042 134 .099 
Pa, Paya, Peat, Pp 132 124 —.008 -126 -107 


reliable (p> .10). Thus, the magnitude of 
the Socratic effect did not depend upon the 
relevance of beliefs in A to beliefs in B. 

Contingencies of order and plausibility. The 
Socratic effect was expected to be pronounced 
only when (a) A was plausible and (b) be- 
liefs in premises were reported before beliefs 
in the conclusion. Results bearing upon this 
hypothesis are shown in Table 2. The Socratic 
effect was significantly greater when A was 
plausible (mean decrease in inconsistency 
over sessions = .044) than when A was not 
plausible (mean decrease = .022), F(1, 71) 
= 6.19, p < .01. However, the Time x Plau- 
sibility X Order interaction was also signif- 
cant, F(5,355) = 3.08, p < .01. Simple ef- 
fects analyses indicated that when A was 
plausible, a significant Socratic effect occurred 
except when beliefs in Session 1 were reported 
in the order P4, Psa, Paya, Pr. Specifically, 
inconsistency decreased very little over ses- 
sions (from .132 to .124) in the latter condi- 
tion but decreased substantially over sessions 
(from .171 to .121) in the other five order 
conditions, 

In contrast, when the plausibility of A was 
low, a significant Socratic effect occurred only 
when beliefs were reported in Session 1 in 
the order Py, Paya, Para, Pa, decreasing from 
-157 to .093 over sessions. In the remaining 
five order conditions, inconsistency was ini- 
tially low (M = .118) and did not decrease 
substantially in the second session (M = 
-106). In all cases, differences in the magni- 
tude of the Socratic effect under different 
order and plausibility conditions appear pri- 
marily due to differences in inconsistency in 
Session 1; inconsistency in the second session 


antes «Sts 


was low, regardless of the order or the olau- 
sibility of A. 

These results are therefore consisten: with - 
expectations, with two exceptions. Fi: i, a 
strong Socratic effect occurred under `gh- 
plausibility conditions when beliefs we: re- 
ported in the order Prya, Paya’, Pa, Pp. his 
suggests that reporting beliefs in premise be- 
fore beliefs in conclusions is not a suffi ent 
condition for inconsistencies to be reso ed 
in the course of reporting beliefs during he 


first session, Rather, beliefs in A must be c- 
ported before beliefs in the condition ts. 
Second, a strong and significant Socratic efi < 
occurred even when A was not plausible wh n 
beliefs were initially reported in the ord:r 
Px, Ppa, Pepa, Pa. This effect did not occur 
when the conditionals were reported after P. 
The reason for this is unclear, 

Effect of order of reporting beliefs in 
Session 2. Supplementary analyses were per- 
formed on inconsistency as a function of 
time, plausibility, relevance, and the order ir: 
which beliefs were reported in the questior 
naire administered during Session 2, No main 
effects or interactions involving Session 2 
order were reliable (p> 10). Thus, tie 
elimination of inconsistencies appears to take 
place either prior to the second session (€.2., 
in the course of reporting beliefs in Session 1) 
or in the course of reporting beliefs during 
the second session when the beliefs recalled 
during Session 1 can be taken into account re- 
gardless of whether they have been previously 
reported in the second questionnaire, 


Discussion 


This study suggests that a major portion 
of the cognitive reorganization underlying the 
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Socratic effect may actually occur in the pro- 
cess of completing the questionnaires rather 
then during the time interval between ses- 
sio's. When a subject’s beliefs in premises are 
mae salient to him by asking him to report 
th: » beliefs in the order he would use them 
to cake syllogistic inferences, he appears to 
se beliefs as a basis for judging the 
ion, as he would use information about 
in a syllogistic inference task (cf. 
W: <, 1975a, 1976). However, when the con- 


clu on is presented first in the questionnaire, 
the bject does not have the opportunity to 
co irr his beliefs in the premises when 
juc ny the conclusion’s validity. Therefore, 
the fects of beliefs in the premises upon 
inf’ vices of Pg are not apparent until the 
se session. Also as anticipated, the magni- 
tu these effects depends upon the plau- 
sibi of A, and thus upon the likelihood 
th: s considered in evaluating the validity 
of 

ver, the data reported here cannot be 
con ely accounted for by a tendency for 
sul to infer beliefs in the conclusion from 
bel in the premises during the course of 
con »ting the questionnaire. The evidence of 
Bis tic effect under low-plausibility condi- 
tiov hen beliefs were reported in the order 
Pr, ‘sja Peyar Pa, and also under high- 
pla ility conditions when beliefs were re- 
poricd in the order Pga, Pgya’, Pa, Pe, sug- 
gests that additional factors must be taken 
inte account. Unfortunately, the nature of: 
these factors is presently unclear. It is also 


curious that the tendency to eliminate incon- 
sistencies did not depend significantly upon 
the relevance of A to B. One would intui- 
tively expect subjects to reorganize their be- 
liefs more often when these beliefs are highly 
relevant to one another than when they are 
not. However, no evidence of this was found. 
Despite these ambiguities, the major implica- 
tion of the study seems clear: Many of the 
apparent inconsistencies among beliefs are 
eliminated in the course of reporting them in 
each experimental session, and they are not 
eliminated between sessions as a result of 
intervening cognitive reorganization. 


Bb. 


Another question of relevance to an under- 
standing of the Socratic effect concerns the 
number and type of beliefs that must be made 
salient to a subject in order to stimulate 
cognitive reorganization. The answer to this 
question is likely to depend upon whether the 
implications of A for B have been considered 
by subjects before they are required to do so 
in the course of responding to questionnaire 
items. Some propositions are so obviously 
associated that calling a subject’s attention 
to one is apt to make him think of the other, 
even when the latter is not explicitly men- 
tioned. In such instances, asking the subject 
to report one belief (A) may stimulate him 
to revise his belief in B when he reports it 
subsequently, even if the implications of A 
for B are not expressly called to his attention. 
In other‘ cases, however, the subject has 
rarely, if ever, considered the implications of 
A for B. In these instances, calling a subject’s 
attention to his belief in A may not lead him 
to revise his belief in B unless the implica- 
tions of A for B are made salient to him as 
well. The present study investigated these 
possibilities. 


EXPERIMENT 3 


Method 


Subjects. Subjects were 64 introductory psychology 
students who participated in the study to fulfill a 
course requirement. These subjects were subdivided 
into eight groups of 8 persons each. The experiment 
was conducted in two sessions a week apart. In 
Session 1, each group of subjects completed a differ- 
ent 24-item questionnaire under instructions similar 
to those used in Experiment 2. In each case, the 24 
items were a subset of 64 that had been generated 
by constructing the four required statements (per- 
taining to Pa, Paya, Puja, and Pr) from 16 pairs 
of propositions (A and B). The nature of this subset 
is described presently. After completing the form in 
Session 1, subjects were dismissed. When they re- 
turned for Session 2, they completed a longer 
questionnaire consisting of all 64 items. 

Construction of stimulus materials. Of the 16 pairs 
of statements selected, 8 (Hi-Hs) were expected to 
have high association of salience; that is, the impli- 
cations of A for B were expected to be made salient 
to subjects simply by asking them to consider A. 
For example, one of these pairs was “Travel by rail 
is slow and inconvenient” (A) and “Few people use 
trains for transportation” (B); another was “Tech- 
nical schools are flooding the job market with com- 
puter programs” (4) and “There will soon be little 
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TABLE 3 
MEAN INCONSISTENCY IN SESSION 2, EXPERIMENT 3 


Salience of association 
between A and B 


Beliefs reported in Session 1 High Low 
None 194 156 
Pa only -161 A127 
Pp only 158 168 
Both P4 and Pg 149 128 
Psa only 150 133 
Ppa: only 140 186 
Both Pg;4 and Pyar 124 A191 
All four beliefs 

(Pa, Pr, Peja, Poia) 181 110 


demand for computer programmers” (B). The re- 
maining 8 pairs (Li-Ls) were assumed to have low 
association salience; that is, the implications of A 
for B were expected to be unlikely to be considered 
by subjects unless their attention was called to these 
implications. For example, one such pair was “Abor- 
tions will soon become easily accessible to everyone” 
(A) and “Child beating will soon become less com- 
mon” (B); a second was “The number of forest 
fires caused by discarded matches and cigarette butts 
is increasing” (A) and “Cigarette smoking may even- 
tually cause a lumber shortage” (B). Assumed dif- 
ferences in association salience were validated in part 
on the basis of pilot data (obtained in Experiment 2) 
concerning the frequency with which subjects re- 
ported thinking about the relation of A to B, and 
in part on the basis of the frequency with which 
additional pilot subjects actually generated associates 
(B) to stimulus statements (A). 

The eight 24-item questionnaires were constructed 
from these items. Within each questionnaire form, 
the number and type of beliefs pertaining to each 
pair of statements were systematically varied. Specif- 
ically, in one questionnaire, items pertaining to the 
following beliefs were included for each of the 16 
pairs described above: 


Hı and Ia: 
and Ls: 
ind La: 
I id Ls: 
Ls: 
Le: 

wa Paya 
i list no items included 


Pa, Puja, Pajsr, Pa 
Pa, Pa 


ı stionnaires were similar except 
irs of statements used in each 
is described above. These pairs 
cally over the eight forms using 
n, so that pooled over subjects 
wms, each of the 16 pairs of 
the same number of times under 

¿f items were ordered unsystem- 

ı form, 
omprising the questionnaire admin- 


The other se 
for the partic 
of the eight c 
were varied sy 
a Latin square 
and questionna 
Statements was t 
each condition. 
atically within « 
The 64 items 
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istered during the second session of xperiment 7 


were also ordered unsystematically, : premises 
occurring sometimes before and sometii. after the 
conclusion. The same final questionnair: is admin- 
istered in all conditions, regardless o/ ‘he initial 
questionnaire form. Therefore, pooled c all eight 
initial forms, the order of reporting | s in the 
second session was not confounded with number 
and type of beliefs made salient durius the first 
session. 

Subjects’ responses to items in both s ons were 
made along a scale from O (not at all li ) to 10 
(extremely likely) and were divided by prior to 
analysis, permitting the inconsistency of t beliefs 
to be estimated by applying Equation 2, 

Results and Discussion 

Beliefs in all four of the propositi s per- 
taining to 2 of the 16 statement pai . were 
reported by each subject during Se on 1. 
The inconsistency of these beliefs ws- com- 
puted according to Equation 2; th was 
compared with the inconsistency of ‘hese 


same beliefs when reported during the © cond 
session. Averaged over the eight questio: vaire 
forms, inconsistency was less in Sess n 2 


(M = 146) than in Session 1 (M=. 81), 
F(1, 63) = 3.35, p < .10. This differenc: was 
not contingent upon the prior associatic \ of 


A and B (p> .10). Although this differ: ce 
did not reach standard criteria for sig: ifi- 
cance, it was similar in magnitude to that 
obtained in Experiment 2 and in previous 
studies of the Socratic effect (Rosen & 
Wyer, 1972; Wyer, 1974b). 

The inconsistency of beliefs reported in 
Session 2 was then analyzed as a function of 
association salience and the number and tpe 
of beliefs reported in Session 1. This anvily- 
sis revealed a significant interaction of as oci- 


ation salience and the beliefs report: in 
Session 1, F(7, 441) = 2.75, p < .01. ata 
relevant to this interaction are sho. in 
Table 3. To diagnose the factors tha’ con- 
tribute to this interaction, two nonorthogonal 


subanalyses were performed. One y 
only to the effects of reporting either 
of the unconditional beliefs (P4 and Pp) in 
Session 1; the second pertained only to the 
effects of reporting either or both condi- 
tional beliefs (P/a and Prya). The results 
of these analyses are discussed in turn. 
Effects of reporting Pa and Py. inconsist- 
ency was analyzed as a function of whether 
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or poi Pa was reported in the first session, 
whejher or not Pg was reported in the first 
sess, and association salience. Only the 
effect of reporting Pa in Session 1 was sig- 
nifisat, F(1,63) = 4.58, p<.05. Specif- 
ica!!, inconsistency in Session 2 was less 
whe» /, had been reported in the first ses- 
sion W = .141) than when it had not (M 
E, However, this difference did not 
de} upon either the a priori salience of 
the ‘iation between A and B or whether 
the ¥ in the conclusion (B) had been 
rep in Session 1 (p > .10). Apparently, 
onci ibject’s belief in A is made salient to 
hin reporting it in the first session, its 
imp ons for B are recognized when he is 
aske » report his belief in B during the 
seco ession, regardless of whether he has 
cons d these implications beforehand. In 
inte: ng this finding, however, it is im- 
port: to note that a parallel process in- 
volvi beliefs in the conclusion (B) did not 
occu: «hat is, making beliefs in B salient in 
the session had no effect upon the in- 
cons’ «cy of beliefs reported in the second 


($ ). Thus making beliefs in premises 


salie ‘ads subjects to judge conclusions in 
am: er consistent with these beliefs, while 
mak the belief in the conclusion salient 
doe necessarily affect beliefs in the 
corre onding premises. 

Ey ots of reporting Pg, and Py;a:. In the 
secon. subanalysis, inconsistency was analyzed 
as a ‘inction of whether Psa was reported 
in Scion 1, whether Pg/4- was reported in 
this session, and association salience. It 
seemed likely that if a subject reports his 


in A’s implications for B, he would 
y consider his unconditional beliefs in 
B as well. This may produce a de- 
crease in the inconsistency of the beliefs he 
reports in the second session. However, while 
reporting the belief that A implied B (Pp/s) 
in the first session slightly decreased the in- 
consistency of beliefs reported in the second 
(from .169 to .150), this decrease was not 
reliable, F(1,63) = 2.16, p> .10. On the 
other hand, a significant interaction occurred 
between association salience and whether or 
not beliefs that not-A implied B (Py;a’) were 
reported in Session 1, F(1,63) = 7.51, p< 
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-01, When the a priori salience of the relation 
between 4 and B was high, beliefs reported 
in Session 2 were less inconsistent when 
Prya had been reported during Session 1 
(M = .132) than when it had not (M = .172). 
However, when the association between A and 
B was less likely to have been thought of 
by subjects without its being called to their 
attention, reporting Payas in Session 1 actu- 
ally increased the inconsistency of the beliefs 
reported in Session 2 (M = .188) relative to 
conditions in which P/a. had not previously 
been reported (M = 145). This latter in- 
crease is provocative. Perhaps when the 
salience of A’s implications for B is low, a 
subject who reports his belief that B is true if 
A is not true is stimulated to consider a 
variety of beliefs other than those involving 
A that have implications for B, and the in- 
consistency of these beliefs may, in fact, 
decrease. However, in the course of this re- 
organization, beliefs involving A and its impli- 
cations for B may tend to be ignored. As a 
result, the inconsistency of these latter beliefs 
may increase. 


CONCLUSIONS 


The series of studies reported here have 
several important implications for cognitive 
organization processes in general, and for the 
syllogistic model proposed by McGuire and 
Wyer in particular. The indication in Experi- 
ment 1 that subjects’ estimates of inconsis- 
tency are functionally related to values of 
Pa, Pr, Puja, and Prya in the manner implied 
by Equation 2, in combination with earlier 
evidence that Equation 1 accurately describes 
syllogistic inference processes (Wyer, 1970, 
1975a, 1976), provides substantial support 
for the validity of the model proposed by 
McGuire and Wyer as a description of cogni- 
tive functioning. Moreover, these data sug- 
gest that if a subject’s own beliefs are not 
related in the manner implied by Equation 1, 
and if these beliefs are called to his atten- 
tion, he will regard them as inconsistent. The 
assumption that this is the case underlies the 
use of the model as a description of cognitive 
reorganization processes in general and of 
the Socratic effect in particular. 

On the other hand, the results of Experi- 
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ment 2 suggest that many of the changes in 
beliefs that occur in order to eliminate cogni- 
tive inconsistencies take place in the course 
of reporting these beliefs and are not the re- 
sult of subsequent cognitive restructuring in 
the time interval after the beliefs have been 
made salient. Indeed, the elimination of in- 
consistencies may in large part be a result of 
syllogistic inference processes in which a sub- 
ject’s beliefs in the premises of a syllogism, 
once they are made salient to him, serve 
as “information” upon which he bases his 
belief in the conclusion, much as he would if 
new information from external sources were 
presented (Wyer, 1970, 195b, 1976). 

While evidence was obtained that beliefs in 
conclusions are inferred from beliefs in prem- 
ises after these premises have been made 
salient, there was little evidence that beliefs 
in premises are inferred from beliefs in the 
conclusion. For example, in Experiment 2, the 
inconsistency of beliefs was initially high when 
conclusions were evaluated before premises 
but was low when premises were evaluated 
first. Moreover, in Experiment 3, making be- 
liefs in the conclusion salient in Session 1 did 
not appreciably increase the consistency of 
beliefs reported in Session 2. Nevertheless, it 
would be premature to conclude that these 
reciprocal changes do not occur at all. Nor 
should one conclude that some cognitive re- 
organization does not occur over time, as well 
as in the course of completing the initial 
questionnaire. As noted previously, not all of 
the decreases in inconsistency over sessions 
observed in Experiment 2 can be accounted 
for simply on the basis of the assumption that 
subjects are inferring beliefs in the conclu- 
sion from beliefs in premises. Some portion 
of the decrease in cognitive inconsistency 
over time may conceivably be the result of 
processes of cognitive reorganization similar 
to those initially postulated by McGuire 
(1960, 1968). Procedures for isolating the 

relative contributions of “reorganization” and 
“inference” processes and for investigating 
the factors that affect these contributions 
remain to be developed. 

Although the present research suggests 
that the cognitive reorganization of inconsist- 
ent beliefs occur primarily during the course 
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of reporting those beliefs, the theoretic»! and 
practical importance of the Socratic nv ‘hod 


of inducing belief change remains. Val: able 
insight into the dynamics of cognitive anc- 
tioning may be gained through conside tion 
of the potential change that may be ef’ ¿ted 
in an individual’s beliefs by merely © ling 
his attention to other related beliefs he lds, 
The importance of continued research i: his 
area is further suggested by Salancik’s m- 
onstration of the manner in which th- sa- 
lience of previously formed cognitions ay 
alter subjects’ reports of related beliefs nd 


attitudes (Salancik, 1974; Salancik & Co ‘er, 
Note 1; Salancik & Conway, 1975). Co se- 
quently, the continued application of Eù :a- 
tion 1 to the investigation of similar | 1e- 
nomena may prove worthwhile. 


REFERENCE NOTE 


1. Salancik, G. R., & Calder, B. J. A non-predi o- 
sitional information analysis of attitude ex; s- 
sions. Unpublished manuscript, University of 
Illinois, 1974. 
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Use of Consensus Information in Causal Attributi -s as a 
Function of Temporal Presentation and Availa! ity 
of Direct Information 


Nina S. Feldman, E. Tory Higgins, Marylie Karlovac, and Diane N, © -ble 
Princeton University 


This study investigated the influence of two factors on the extent to which «: 
observer would use consensus information in making causal attributions for a 

actor’s choice behaviors. Undergraduate subjects were shown videotaped 
vignettes of a person choosing a favored item from an array of items and of 
four other persons either agreeing (high consensus) or disagreeing (low con 

sensus) with the choice. The two factors of interest were (a) availability of 
direct information (subjects either saw the array of items from which the 
actor chose or they did not) and (b) temporal presentation (consensus in- 
formation was either presented simultaneously or successively). The results 
showed that the impact of consensus information was greater when the informa- 
tion was presented successively than when it was presented simultaneously and 
for subjects who did not have direct information than for those who did. The 
results are discussed in terms of role-taking and information-processing variables 
and in terms of generality of the Kelley attribution model. 


This study was designed to explore the role 
of consensus information in the formation of 
causal attributions. Kelley (1967, 1973) pro- 
posed a three-dimensional model of causal at- 
tributions to explain how the individual seeks 
to answer the question “Why?” The covaria- 
tion of entities (providing distinctiveness in- 
formation), persons (consensus information) , 
and time (consistency information) was pos- 
ited as the basis for person versus entity 
attribution judgments. In exploring Kelley’s 
formulation, McArthur (1972) found strong 
support for the use of distinctiveness and 
consistency information but only weak sup- 
port for consensus as a critical factor in form- 
ing attributions for an actor’s behavior. More 
recently, Nisbett, Borgida, Crandall, and 
Reed (in press) have suggested that con- 
sensus information has little effect on the 
attributions people make either about them- 
selves or others. They propose that consensus 


The resarch was supported by a Biomedical Sup- 
port Grant and by National Institute of Mental 
Health Grant MH 27198-01 to D. N. Ruble. Por- 
tions of this article were presented at the Eastern 
Psychological Association meeting, New York, April 
1976. 

Requests for reprints should be sent to Diane N. 
Ruble, Department of Psychology, Princeton Uni- 
versity, Princeton, New Jersey 08540. 
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information is necessarily abstract an: 
when compared with direct target ca: 
mation or personal sensory impression: 
The present study sought to comp: e th 
effects of consensus in a situation wher sub- 
jects asked to make causal attributior 5 for 
an actor’s behavior were dependent sole\y o 
the experimental manipulation of consensu: 
information versus a situation where subjec 
had access to their own sensory impressiv 
of the situation in addition to experim: at 
consensus information. All subjects 
shown videotaped vignettes and were 


to decide why a target actor chose a rtain 
object from an array of possible choio -~ Fo 
all subjects, consensus was varied ov: high- 
consensus (everyone else in the groun «greed 
with the target actor’s choice) aii low 
consensus (no one else in the group agreed 


with the target actors choice) conditions 
Subjects in the direct information condition, 
however, also saw the actual array of items 
from which the target actor chose. 


1The terms high consensus and low consensus are 
somewhat inaccurate representations of the actu 
consensus conditions. That is, the conditions portray 
either complete agreement or complete disagreement 
with an actor’s choice. Nevertheless, it was decide 
to use this terminology in order to facilitate com- 
parisons with previous literature. 


CONSENSUS AND CAUSAL ATTRIBUTIONS 


arget 

po: Tt was predicted that subjects dependent 
jubjecsolely on manipulated consensus information 
‘olvenWould use this information as expected from 
$: acKelley’s model: High consensus would lead to 
ity sentity or external attributions of causality, 


4 


peen aS the actor’s choice would be seen as con- 
tion’ tingent on something about the specific item 
/tion chosen; low consensus would lead to more 
| exp person or internal attributions, as the actor 
| tiny would be seen as the chief causal agent in 
‘was the choice behavior. It was also predicted 
f exp that the impact of consensus information 
| pers would be significantly weakened for subjects 
N with direct information about the stimulus 
of array, as they would rely on their own 
. self impressions of the situation, 
nif The temporal presentation of the consensus 
cau information (successive vs. simultaneous) was 
ti also manipulated as an independent variable. 
f st. Previous studies have suggested that such 
du! information-processing variables may have a 
ney substantial impact on subjects’ attributions, 
tary: such as findings of order effects in the impres- 
to Þa sion formation literature (Jones & Goethals, 
omat 1971), Although Kelley’s (1967) model does 
dios not specify the temporal order in which con- 
ate) sensus information is to be presented, all pre- 
Doth ‘vious tests of the model have used a simul- 
rie! on $, z 
| taneous mode. However, in everyday life, 
k individuals often obtain consensus informa- 
(O tion successively (e.g., one is more likely to 
Chear several persons praising an individual's 
3 targe' vork at different times than simultaneously). 
© Thus, it seemed reasonable that a serial or 
successive presentation of consensus informa- 
tion might have a different effect on subjects’ 
perceptions of the situation than a simul- 
taneous presentation. The possible effect of 
this variable on the use of consensus infor- 
mation was examined by having the other 
actors in the group indicate agreement (or 
disagreement) with the target actor’s choice 
either all at once (simultaneous) or one at a 
time (successive). 


METHOD 
Subjects 


A total of 60 undergraduates were paid to partici- 
pate in an individual session. There were 20 males 
and 20 females in the direct information condition, 
while 10 males and 10 females were in the no-direct- 
information condition.* 


TORRITA ee EP eee MUN a i 
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Procedure 


Videotapes were made of five actors sitting at a 
table. One person’s back was to the camera, while 
two males and two females sat facing this target 
actor and the camera, Different actors were used 
in every segment. Each subject was told that he/she 
would be shown several videotapes of a person 
choosing a favorite item from an array of pictured 
items and would be asked to decide why the actor 
liked that item best. Due to the extreme simplicity 
of the experimental stimuli, the authors were afraid 
that the subjects might take the experiment too 
lightly. Therefore, subjects were asked to consider 
seriously all of the information presented before 
making their final judgments, (Postexperimental 
questioning revealed that subjects had complied with 
this request.) Each subject saw four videotapes of 
a target actor who examined an array of four pic- 
tures and responded to the question “Which one 
do you like best?” by clearly pointing to an item. 
The actual content of the pictures was not visible 
to the subject from the videotape. In order to mini- 
mize the variation in time lengths across conditions, 
a stopwatch was used in filming the vignettes; each 
videotape vignette lasted approximately 30 seconds, 

Immediately after the target actor chose an item, 
the four other actors in the group responded to the 
question “Who else likes that one best?” by either 
all raising their hands, nodding, and smiling (high 
consensus) or by negatively shaking their heads and 
frowning (low consensus), Each subject saw two 
high-consensus and two low-consensus videotapes 
with the order of high- and low-consensus video- 
tapes being counterbalanced across subjects, Half of 
the subjects saw videotapes in which the actors all 
responded in unison (simultaneous consensus) ; half 
saw each actor respond individually as the camera 
panned in and away (successive consensus). In the 
direct information condition, subjects were shown 
before each vignette the same stimulus array that 
the actors examined, Subjects in the no-direct~ 
information condition were not shown these picture 
arrays, 

The stimulus arrays were composed of pictures of 
animals (beaver, deer, zebra, possum), colors (blue, 
green, red, purple), cartoons (Mickey Mouse, Bugs 
Bunny, Dumbo, Pinocchio), and cars (blue sedan, 
red compact, white Mustang, and red sports car). 
Thus, there was a reasonably broad range of arrays, 
and each picture was well differentiated from the 


2The unequal number of subjects in the two 
conditions is due to the fact that additional subjects 
were included in the direct information condition so 
that developmental comparisons could be made in a 
subsequent study, It should also be noted that addi- 
tional conditions were part of the design of the 
developmental study. These are not described here, 
since they are not relevant to the focus of this 
article and since there were no significant effects of 
these variables in the adult data. 
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others within an array, Pilot testing had been done 
with a larger stimulus array to eliminate any pic- 
tures that received first- or last-choice ratings from 
over 40% of the subjects. Thus, the pictures were 
chosen to exclude extremes in value so that it would 
be believable to a subject that any one of the array 
could have been chosen as best liked and that others 
could agree or disagree. 

After each vignette, subjects were asked to con- 
sider the information presented and to respond to 
the question “Does his/her (the target actor’s) choice 
tell you more about him/her or more about the 
object on the card?” They were then asked “How 
much?” which was rated on a 3-point scale of “little 
bit, pretty much, a lot.” This resulted in a scale 
ranging from —3 (very internal) to +3 (very exter- 
nal). Thus, the design of the study was a 2 X 2 X 2 
factorial, with two between-subjects variables (direct 
information and temporal presentation) and one 
within-subjects variable (consensus). 


RESULTS 


A Direct Information X Temporal Presen- 
tation X Consensus repeated measures analy- 
sis of variance was performed on the attribu- 
tion data. The means for each condition are 
shown in Figure 1. A significant main effect 

= for consensus, F(1, 56) = 39.42, p< .001, 
revealed that subjects’ attributions were in 
P the direction predicted by Kelley’s model. 
High consensus led to substantially stronger 
entity attributions, and low consensus re- 
sulted in increased person attributions. How- 
ever, these effects must be viewed in the 
context of significant two- and three-way 
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Ficure 1. Mean attributions as a function of 
information condition, temporal presentation, and 
consensus, 
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interactions. A Direct Information X Con- 
sensus interaction, F(1,56) = 27.29, p< 
.001, indicated that as predicted, manipulated 
consensus information had little effect or 
direct-information subjects; while the attribu- 
tions of no-direct-information subjects, who 
had not been exposed to the stimuli, varied as 
expected with consensus information. There 
was also a significant Temporal Presentation 
X Consensus interaction, F(1, 56) = 11.58, p 
< .01, which indicated that the effect of con- 
sensus was much stronger when the informa- 
tion was presented successively than when it 
was presented simultaneously. Finally, there 
was a significant Direct Information X 
Temporal Presentation X Consensus interac- « 
tion, F (1, 56) = 12.42, p < .001. As shown in > 
Figure 1, successive presentation of informa- 
tion enhanced the predicted effect of consen- | 
sus for the nodirect-information group only. 

In fact, analyses within the presentation order 
factor revealed that the Direct Information X ~ù 
Consensus interaction was significant only in | 
the successive presentation condition, F(1, 
28) = 32.93, p < .001. This interaction did 
not reach significance in the simultaneous 
presentation condition, F(1, 28) = 1.76, p >4 


$ 


1b- 


.10, though there was a main effect of con- Nor 

sensus in this condition, F(1,28) = 4.90, 

p< 05. i 
DISCUSSION ores 


The results of the present study indicate, ° 
as predicted, that subjects asked to make 
causal attributions for an actor’s behavior, 
who were previously given only consensus in- 
formation, used this information in the man- 
ner expected by Kelley’s (1967) model of 
attribution. However, also as predicted, sub- 
jects who also had direct information avail- 
able from their own personal experience with 
the stimulus array did not use consensus in- 
formation in forming causal attributions. 

A review of the previous literature along 
with the present findings suggests that con- 
sensus information has little or no effect when 
subjects have either direct information about 
the target person or direct information about 
the external stimuli involved in the situation, 
both of which allow subjects to use their own 
reactions or imagined reactions (i.e., role tak- 
ing) as a basis for inferences concerning the 


arget person's behavior. Direct information 
sabout the target person might be available to 
subjects either through their own actual in- 
ment in the situation (i.e., the subject is 
actual target person) or through person- 
sketches of the target person that have 
given to the subjects. Direct informa- 
‘bout the external stimuli of the attribu- 
ituation might be available from an 
it description of the experimental set- 
id procedure to which the target person 
‘posed or from the subjects’ own actual 
re to the stimuli with which the target 
was involved. 
bett et al, (in press) found no effects 
‘sensus information in their series of 
ception studies, where consensus ma- 
ions were designed to affect subjects’ 
attributions toward their own emo- 
x physical behavior. Since, in these 
subjects were the actual actors pro- 
the behavior to be evaluated, they 
ily had direct information about the 
erson (i.e., themselves) upon which 
an assessment of the actions. Kahne- 
«d Tversky (1973) found no effects of 
islike information (i.e., category base 
a study where subjects were given 
\tegory base rat# information and a 


rsonality description of the target 

Here again, the lack of impact of 

us” information could be due to sub- 

ceiving direct information about the 

erson. Thus, they had the opportunity 

‘he role of the target person and could 

? toeir evaluation of the situation upon 
the information obtained in this way. 

f Th studies by Miller, Gillen, Schenker, and 

Radlove (1973) and by Nisbett and Borgida 

TA no effects of consensus information 

ere found when subjects had direct informa- 

tion about the external stimuli with which the 

target person had been involved, In these 

» Studies, previously conducted psychology ex- 

periments were described to subjects, half of 

| Whom were also told the expected results of 

each experiment (consensus manipulation). 

Thus, subjects in these studies were given 

enough information about the experimental 

Procedure (i.e., about the situation or exter- 

mal stimuli) that they could use role-taking 

Procedures to infer the teasons for the target 
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person’s behavior on the basis of what they 
assumed their own reactions would be in the 
experiment. The results of the present study 
may also be interpreted in this manner. Con- 
sensus information had no effect on subjects’ 
attributions in the condition where their di- 
rect experience with the external stimuli (ie, 
viewing the array of pictures) provided 
enough information to allow the subject to 
take the role of the target person. 

Another major result of the present study 
was that the temporal presentation of con- 
sensus information was found to affect signif- 
cantly the use of consensus information, but 
only for subjects who had no direct informa- 
tion. That is, there was greater use of con- 
sensus information with successive than with 
simultaneous presentation of the consensus 
information. This difference between simul- 
taneous and successive presentations could be 
due to a tendency for subjects to process 
simultaneous information as a single bit of 
information and successive information as 
independent bits of information. Thus, there 
would be “more” consensus information 
processed by subjects in the successive pre- 
sentation condition, The attributions of sub- 
jects with no direct information might there- 
fore be expected to reflect the greater impact 
of successive consensus information, as they 
have no other information to rely on, while 
subjects with direct information may not be 
affected by any consensus information, 
whether simultaneous or successive, given that 
other information is also available to them. 

Although the pattern of results in the pre- 
sent study and the findings of the above stud- 
ies support the hypothesis that subjects with 
no direct information available will use con- 
sensus information in forming their attribu- 
tions, other research does not appear to sup- 
port this conclusion. Studies by McArthur 
(1972, 1976) and by Orvis, Cunningham, 
and Kelley (1975), in which subjects were 
given combinations of consensus, distinctive- 
ness, and/or consistency cues but no direct 
information, found extremely weak effects of 
consensus information on subjects’ attribu- 
tions. As Ruble and Feltlman (in press) 
pointed out, however, the order of informa- 
tional cues in both of these studies was not 
varied, and the possible disadvantage to con- 
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sensus information from always appearing 
first may have led to an inaccurate assessment 
of its importance. Additionally, in both the 
Orvis et al. and McArthur studies, the con- 
sensus information was presented simultane- 
ously (e.g., “Almost everyone [no one] com- 
pliments Barry’s work”), the condition where 
the present study found a significantly weaker 
impact of consensus information. It is there- 
fore not clear to what extent these studies 
contradict our hypothesis. 

In summary, findings from previous re- 
search and the present study suggest that the 
use of consensus information in attributional 
processes depends, in part, on how the infor- 
mation is presented and on whether other 
relevant information is available through 
either direct experience or role taking. This 
conclusion has possible implications for the 
generality of the Kelley (1967, 1973) model. 
That is, attributions may depend on the con- 
text and process with which information is 
received by an individual, as well as the spe- 
cific combination and content of the informa- 
tion. The findings may also have implications 
concerning the effects of consensus informa- 
tion in the everyday world. For example, con- 
sensus as a vehicle for influencing attitudes 
or choice behavior may be greatest in situa- 
tions that involve an entity about which the 
individual has little firsthand knowledge, such 
as whether or not to see a particular movie. 
In contrast, if an individual does have direct 
information (e.g., personal observations of 
presidential candidates), consensus informa- 
tion, such as election polls, may have little 
impact on choice behavior. 
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Out of the Laboratory and into Wonderland: A Critique of the 
Kinder, Smith, and Gerard Adventure with Perspective Theory 


~ Harry S. Upshaw 
University of Illinois at Chicago Circle 


The Kinder, Smith, and Gerard study of attitude labeling was reviewed with 
reference to the perspective theory of attitudes. Because of serious methodo- 
logical problems in the operational definitions of basic terms, I rejected the 
authors’ claim that their data disconfirmed the theory. Hence, the assumption 
that perspective theory applies outside the laboratory as well as inside is 


retained, 


I: Through the Looking Glass Lewis Carroll 
cres the character Humpty-Dumpty who 


A 


asse «l control over words by using terms in 
whi ver way suited him, Upon reading the 
Kin Smith, and Gerard (1976) paper, I 
hav uneasy feeling that in their efforts to 
takı “perspective theory of attitudes” out 
of to laboratory, they may have taken us 
thro the looking glass into an operational 
won and in which basic theoretical con- 
struc ore given wholly arbitrary meaning. As 
a co: equence of considerable ambiguity in 
the © :ceptual structure of the study, it is 
diffic ewn to relate to perspective theory. 
It is ‘ly not an adequate basis for revising 
that y 

A |sic idea in my early work with Ostrom 
(Ostron & Upshaw, 1968; Upshaw, Ostrom, 
& Wa i, 1970; Upshaw, Note 1) is that the 
data ittitude research are often judgments 
of a pondent’s position, rather than the 


positio: itself. Measures of attitude, such as 
self-ra:ings, seemed obviously to be judgments 
and, us such, to be subject to context and 
other effects that have been studied in the 
psychology of judgment (Upshaw, 1974, 
1975). ‘The concept of judgment, however, im- 
plies an underlying stimulus. In some cases, 
we reasoned that stimulus (i.e., the content) 
is likely lo be the particular position that the 
Tespondent has chosen to endorse on a con- 
troversial issue. Hence, a person might decide 
to make a monetary contribution in some 


Upshaw, Department of Psychology, University of 
Ilinois at Chicago Circle, Chicago, Ilinois 60680. 


amount for charity, If he or she is called upon 
to judge the generosity of the contribution, 
the rating will reflect not only the size of the 
contribution but his or her judgmental cri- 
teria as well. Perspective, defined as the sub- 
jective stimulus range, has been shown to be 
a major correlate of these judgmental criteria 
(Upshaw, 1969a, 1970). An important aspect 
of our conception is that the criteria that are 
employed in rating one’s own position are the 
same as those employed in judging the posi- 
tions of others. Every self-rating is, therefore, 
implicitly or explicitly an element in an entire 
distribution of judgments, 

At least two features of any distribution of 
judgments need to be explained: the central 
tendency and the dispersion, Both of these 
judgmental parameters have been related to 
the respondent’s perspective, For judgments 
of a common set of stimuli embedded in dif- 
ferent contexts, central tendency has been 
assumed to be an inverse function of the 
average content values of the end anchors 
which define perspective, and the dispersion 
has been assumed to be an inverse function 
of the difference between the two perspective 
end anchors (Ostrom & Upshaw, 1968, p. 
221). The theory, then, serves to relate four 
items of information: the upper perspective 
anchor stimulus, the lower perspective anchor 
stimulus, a rating, and a corresponding con- 
tent position! From knowledge of any three of 
its terms, the fourth is determined. 


The Content Variable 


In its present form the theory is impor- 
tantly concerned with the respondent’s chosen 
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position on a content dimension, although it 
is not concerned with the exact nature of that 
dimension, There are very many considera- 
tions that a person might take into account in 
arriving at a self-rating. As an example, let 
us consider the liberal-conservative rating di- 
mension with which Kinder et al. (1976) 
dealt. In their normal activities people may or 
may not expend much of their effort formu- 
lating a judgment of how liberal or conserva- 
tive they or others are, but when called upon 
to do so, most people seem willing and able to 
perform the task. A demand to make judg- 
ments of liberalism-—conservatism effectively 
imposes a single rating dimension on all who 
participate without a corresponding imposition 
of a common content dimension. Most likely 
in circumstances of this sort, people concoct 
single content dimensions from an assortment 
of component dimensions according to a com- 
position rule that is, in principle, idiosyncratic. 
Hence, one person might base his or her self- 
rating on the liberal-conservative scale pri- 
marily upon his or her stand on humanitarian 
issues, whereas another might emphasize eco- 
nomic issues more heavily than social issues. 
In fact, it may be that a person’s cultural 
milieu is a major determinant of the composi- 
tion rules of the content dimension. Further- 
more, it seems probable that a person’s own 
position on a particular content dimension is 
predictable from knowledge of the composition 
rules that define that dimension. For example, 
a person who weights economic issues very 
highly in defining liberalism-conservatism 
seems likely (in contemporary America, at 
least) to view himself or herself as more con- 
servative than liberal. These questions con- 
cerning the composition of the content dimen- 
sion have been evaded in our laboratory 
research by virtue of our having established a 
sufficiently simple cognitive setting to support 
the assumption that respondents conceived 
the content dimension in the same way yet 
differed in their positions along it. Clearly, if 
the theory is to be applied to a group of per- 
sons, rather than to a single individual, it will 
be meaningful only if the people share a com- 
mon content dimension, whatever its composi- 
tion. 
The perspective theory is not different from 
other theories in requiring highly specialized 
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conditions for testing it. Because it is a theory 
of the judgmental scale that one uses in mak- 
ing ratings of positions along a content dimen- 
sion, its principal precondition is that 2l} re- 
spondents base their ratings on the same 
content dimension. In the Kinder al, 
(1976) study, the precondition was *° ost 
certainly not met. It was not even as: sed, 
Instead, the investigators, rather than ach 
respondent, undertook the formulation «the — 
content variable. Two content scales we de- 
vised, one a so-called “attitude” scale a). the 
other a “behavior scale.” In both in» ces 
the criterion for scale construction was . ‘zh, 
positive correlation with the self ng. 
Hence, a single set of composition ru ‘or 
each of two dimensions was determin: for 
all respondents, with absolutely no e ce 
that any respondent employed those co osi- 
tion rules in his or her conception of th: on- 
tent dimension, on the basis of which > ‘ngs 
were made, In effect, the composition ru’ - for 
these dimensions that the investigators con- 
trived are nothing more than the regr ‘sion 
equations for predicting self-rating fro : re- 
sponses to various survey items. It is d` icult 
to imagine how an empirical prediction qua- 
tion derived for a group of respondents ‘ould 
acquire psychological meaning in this c ntext 
for any single individual, 

Even if we accepted the hypothesis f a 


single content dimension for all responc -nts 
and one of the regression equations as a le- 
scription of its structure, we would still be 
faced with an insurmountable methodolog’cal 
problem in the way in which Kinder et al. 


(1976) operationally defined content pos’! ‘on. 
To appreciate the problem let us disting ish, 
as Thurstone (1947) did, between statistical 
and experimental independence. Statistical in- 


dependence implies a zero correlation |: ween 
a pair of variables in a particular pop ation. 


1The concept of the content dimension implies a 
single ordering of positions which has a one-to-one 
correspondence with the ordering of the same posi- i 
tions on the rating dimension. By definition there 
cannot be two content dimensions. On the other hand, 
there can be an infinite number of prediction equa- 
tions for a set of self-ratings, depending upon the 
particular variables that are included and the 
make-up of the group of respondents from whom f 
data are collected, 


e 
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Two variables are experimentally independent 
‘if they are experimentally distinct, so that 
one is vot derived from the other by a con- 
straj cither of the experimental situation or 
of t! computations” (Thurstone, 1947, p. 
63). irly, experimental independence must 
obtai before statistical independence can be 
teste. ‘cinder et al. operationally defined con- 
tent neans of the regression of rating on a 
set c variables that were considered com- 
pone: of content. According to this defini- 
tion crson’s content score is derived from 
his o rating by a constraint of the com- 
puta’ »., thereby violating the condition of 
exper > ntal independence that is required 
for : iningful test of statistical indepen- 
denc 

St rrelation between rating and the 
would -= content measure was imposed by the 
com) onal procedure which defined the 
latter bsequently, the content variable, 
which ¿d no reliable variance except that 
share: h the rating variable, was held con- 
stant ı partial correlation of perspective 
with : z, That procedure served to remove 
from ) perspective and rating all variance 
shared th content. However, because of the 
comp: ional constraint, we know that con- 
tent s d no variance with perspective that 
it did also share with rating, The partial 
corre] | procedure, then, merely removed 
some iion of the rating variance. The 
numer... of the formula by which the criti- 
cal pariial correlation is computed is rpr— 
fro se, where P, R, and C represent perspec- 
tive, ing, and content, respectively. In 
order for the partial correlation to have an 
algebraic sign opposite to that of the zero- 
order correlation, the product reo rro would 
have to be larger than rper. This condition 
would obtain only if rpo and/or rro were rela- 
tively very large, We have already argued that 


the value of rro is spurious because of the 
constraints in the measurement of content. 
Since both rpo and rpp reflect the shared vari- 
ance of perspective and rating, the difference 
in their values might be due to the smaller 
error component in C than in R (which was 
a necessary consequence of defining C as the 
Predictor of R). Only if the shared variance 
of P and C relative to the artifactually small 
error in C was large could rpo be larger than 
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rer. Since either rpo or rro must be larger 
than rpg in order for a negative partial corre- 
lation to exist between perspective and rating 
with content partialed, the criterion by which 
perspective theory was to have been judged 
to fit the data could have been attained only 
by spurious means. 

A great deal of devilment resulted from the 
exceedingly arbitrary operational definition 
that Kinder et al. gave to the content vari- 
able. Unfortunately, that was not their only 
arbitrariness, The concept of perspective was 
treated at least as badly, 


The Perspective Variable 


As indicated earlier, the term perspective 
refers to the effective stimulus range that a 
judge takes into account at the time of render- 
ing a judgment. As Ostrom and I have applied 
the concept to attitudes, it refers to two 
points on the content dimension. In the 
Kinder et al. (1976) study, perspective was 
operationally defined in terms of one point, 
rather than two as is required by theory, On 
the basis of some undisclosed rationale, they 
equated perspective with home milieu (some 
average of the mother’s and father’s political 
orientations) or the school milieu (the re- 
spondent’s university major), It is probably 
correct that a respondent’s perspective on 
major issues derives in some way from his or 
her home and social milieus. However, it is 
equally compelling that one’s content posi- 
tion as well as the composition of his or her 
content dimension also derive from the same 
experiences. This being the case, home and 
school milieus are no better operational defi- 
nitions of perspective than they are of con- 
tent, which means that they are inadequate 
for either. 

To claim that home or school milieu corre- 
sponds to a particular perspective implies that 
those people within a specified milieu grouping 
approached the task of self-rating by invoking 
a characteristic range of positions on the con- 
tent dimension, Furthermore, in order to de- 
rive the predictions that Kinder et al. (1976) 
sought to test in their study, it must be as- 
sumed either that the respondents from more 
liberal milieus adopted perspectives that ex- 
tend further in the liberal direction on the 
content dimension than is true of respondents 
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from conservative milieus or, alternatively, 
that their perspectives extend less far in the 
conservative direction. Either alternative 
would result in the contrast effect that the 
authors seemed to believe the theory required. 
It is not clear, however, which of the alterna- 
tives they imagined to hold. At several points 
they seemed to suggest that conservative re- 
spondents hold a perspective that is relatively 
more extreme in the conservative direction 
than that of liberal respondents and that 
liberals hold a similarly more liberal perspec- 
tive. It is not obvious why this assumption 
should be made. Granting the unlikely hy- 
pothesis of a common content scale for all 
respondents, we can surely imagine that lib- 
eral and conservative milieus differ in salience 
of positions other than those that are en- 
dorsed. The better informed a group of re- 
spondents is concerning the range of possible 
positions on an issue, the wider the perspec- 
tives are that members of that group are 
likely to hold. It may also be the case that 
conservatives and/or liberals attribute content 
positions to their opponents which are more 
extreme than those which are actually en- 
dorsed. There are few data bearing on these 
questions, but what few there are suggests 
that the relationship between content position 
and perspective is not simple. 

Scaling statements written to indicate re- 
spondents’ perspectives on the social position 
of the Negro in America, Ostrom (1966) dis- 
covered that black respondents apparently 
held narrower perspectives than did whites, In 
a subsequent study (Ostrom & Upshaw, 
1970), data were presented which suggested 
that the typical white perspective had ex- 
panded in the pro-Negro direction over a 35- 
year period from 1929 to 1964. The data of 
both studies suggested that by 1962, the white 
perspective was anchored at a more extreme 
pro-Negro position than was the black per- 
spective, which had remained essentially un- 
changed throughout the 35-year interval. One 


consequence of the differential shift in per- 
spective was a reversal during that period of 
an earlier finding whereby white judges had 
rated a typical itude statement as more 


lack judges, The direc- 
‘p appeared to be de- 
> of the judge but by 


pro-Negro than 
tion of that relat 
termined not by the 
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the perspective that was typical of the racial 
groups at the time of judgment. 

It should perhaps be emphasized that per- 
spective is basically an informational con ept. 
This fact implies that a major determinant of 
perspective is the information relevant i0 a 
particular issue to which an individual i. ex- 
posed, Common sense as well as experini ntal 
data (Brown, 1953; Upshaw, 1969b; \ site, 


Alter, Snow, & Thorne, 1968) sugges! ‘hat 
mere exposure is not a sufficient deter: sant 
of the values of the upper and lower i ints 
which define perspective. 

The quantification of perspective : res 
two values which are theoretically inc cen- 
dent. Because the determinants of thes: wo 
values are at present ill-understood is 


simply unsatisfactory in a test of persp iive 
theory to represent two of its terms iv a 
single indicator. The arbitrariness of the | er- 


ational definition is heightened by the act 
that the person’s milieu is as good (or as ad) 
an indicator of every other term in the |. ory 
as it is of perspective. The authors ga. no 
supporting evidence for the assumption nat 
people within one of their milieu classifica’ ns 
are relatively homogeneous with respec’ to 
perspective, nor did they try to assess ‘ay 
characteristic differences between group in 


the upper or lower perspective anchors. 

It would be very gratifying, indeed, to 
know that one’s psychological work has he}; od 
to illuminate some problem in the real wo: d. 
It is that eventuality that provides the inc: a 
tive for many of us to continue our pursuits 
with little immediate reward. However, it is 
surely true that the real world is more compii- 
cated than the laboratory world. This greater 
complexity implies the need for at least as 
much care in defining and measuring variables 
outside the laboratory as is employed within 
it. Poor research practices cannot be justified 
on the ground that the worldly setting makes 
it difficult to devise better ones. Given the 
problems as discussed above relating to real- 
world manifestations of content and perspec- 


tive, some ingenious solutions are required to: 


represent perspective theory in a complex 
setting, The study by Kinder et al. (1976) 
unfortunately did not produce ingenious solu- 
tions. In fact, the operational meanings that 
were given the major terms of the theory were 
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inding that was reported as being rele- 
to the theory. To revise the theory as the 
suggest on the basis of these unin- 


etable data would be tantamount to re- 


ishing the goal of a psychology of the 
orld in favor of a psychology of wonder- 
here, like Humpty-Dumpty, we all de- 
ir terms to suit ourselves, 


REFERENCE NOTE 


iw, H. S. Opinion change vs. reference scale 
e in the study of attitudes. Paper presented 

72nd annual convention of the American 
logical Association, Los Angeles, September 


REFERENCES 


). R. Stimulus similarity and the anchoring 
stive scales, American Journal of Psychol- 
3, 66, 199-214, 

R., Smith, T., & Gerard, H. B, The atti- 
ling process outside of the laboratory. 
of Personality and Social Psychology, 1976, 
t91. 

M. Perspective as an intervening construct 

:lgment of attitude statements. Journal of 

y and Social Psychology, 1966, 3, 135-144. 
M., & Upshaw, H. S. Psychological per- 
ind attitude change. In A, G. Greenwald, 
ock, & T. M. Ostrom (Eds.), Psychological 
ns of attitudes. New York: Academic 


so arbitrary that I, for one, refuse to accept Ostrom, T. M., & Upshaw, H. S. Race differences in 


the judgment of attitude statements over a thirty- 
five year period, Journal of Personality, 1970, 38, 
235-248, 

Thurstone, L. L. Multiple factor analysis. Chicago: 
University of Chicago Press, 1947, 

Upshaw, H. S. The personal reference scale: An ap- 
proach to social judgment. In L. Berkowitz (Ed.), 
Advances in experimental social psychology (Vol. 
4). New York: Academic Press, 1969, (a) 

Upshaw, H. S. Stimulus range and the judgmental 
unit, Journal of Experimental Social Psychology, 
1969, 5, 1-11. (b) 

Upshaw, H. S. The effect of unit size on the range 
of the reference scale. Journal of Experimental 
Social Psychology, 1970, 6, 129-139. 

Upshaw, H. S. Personality and social effects in judg- 
ment. In E. C. Carterette & M, P, Friedman (Eds,), 
Handbook of perception (Vol. II). New York: 
Academic Press, 1974. 

Upshaw, H. S. Judgment and decision processes in 
the formation and change of social attitudes. In 
M. F. Kaplan & S. Schwartz (Eds.), Human judg- 
ment and decision processes. New York: Academic 
Press, 1975. 

Upshaw, H. S., Ostrom, T, M., & Ward, C. D, Con- 
tent versus self-rating in attitude research, Journal 
of Experimental Social Psychology, 1970, 6, 272- 
279, 

White, B. J,, Alter, R. D., Snow, M. E., & Thorne, 
D. E. Use of instructions and hypnosis to mini- 
mize anchor effects. Journal of Experimental Psy- 
chology, 1968, 77, 415-421. 


(Received February 5, 1976) 


Journal of Personality and Social Psychology 
1976, Vol. 34, No. 4, 704-707 


Placing Perspective Research in Perspective: Reply to Upshaw 


Tom Smith and Harold B. Gerard 
University of California, Los Angeles 


Donald R. Kinder 
Yale University 


We consider Upshaw’s criticism of our recent use of the perspective model of 
attitudes to be ill-considered. We answer his specific criticisms and consider a 
more important issue that he failed to raise. 


Upshaw (1976) has criticized our recent 
(Kinder, Smith, & Gerard, 1976) use of the 
perspective model of attitudes (cf. Ostrom 
& Upshaw, 1968). In specifying what he con- 
siders to be “serious methodological prob- 
lems” in our operational definitions of the 
model’s basic terms, he has reiterated the sec- 
ond of two overriding concerns with which 
we began that investigation—that our anal- 
ysis be correct. Unhappily, he has neglected 
to raise the more fundamental, companion 
issue—the extent to which the investigation 
is appropriate. Considered together, these is- 
sues provide a proper perspective from which 
to evaluate our research. We take this oppor- 
tunity to respond to Upshaw’s specific criti- 
cisms of our analysis, Additionally, we again 
consider the implications of this second issue 
within the context of our embattled study. 


Was the Analysis Correct? 


The perspective model of attitudes holds 
that there is a consistent dynamic interrela- 
tionship of three elements: the content of an 
attitude; its rating on an evaluative dimen- 
sion; and the perspective, defined as the 
available range of content alternatives, from 
which it is rated (see Ostrom & Upshaw, 
1968, for the most complete description of 
the model). Upshaw’s criticisms of our article 
are as follows: (a) that our content dimen- 
sion is invalid with regard to the rating 
variable, (b) that the content variable was 
constructed so as to be spuriously related to 
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rating, and (c) that we lack an objec! ely 
controlled perspective dimension in th ab- 
oratory sense of control. 

In criticizing our conception of cor ont, 
Upshaw argues that the person makes a ; Jg- 
ment of liberalism-conservatism by C= 
tively composing a single content po. 'on 
from an assortment of issue-relevant ele. ats 
that might underlie such a judgment. H nas 
stressed that the reconstruction of this | tio- 
syncratic “composition rule” was beyon: the 
scope of our population-level analysis. 

The worth of any content construct i de- 
termined by the extent to which the ‘ue 
“facts” that underlie each person’s re ng 


judgment can be accurately assessed. Ve 
can control the presentation of potential {.. ts 
in the “cognitively simple setting” (to vse 
Upshaw’s description) of the laboratory, or 
we can seek to representatively sample avd 
measure those potential facts as we did in tive 
field. In neither situation can we ascertain 
the specific idiosyncratic composition rule 
that specifies how a person uses which facts 
in his/her judgment rating. The seeming lack 
of confounding variables in a laboratory situ- 
ation encourages us to assume that the single 
content base made available by the researcher 
is the one used by each subject, but this is 
not necessarily so. In reality, by sampling 
from a universe of potential facts, the re- 
searcher is more likely to account for any 
given population’s multiple idiosyncratic con- 
tent bases. 

We dealt with this issue in our study by 
pursuing an empirical question at the popu- 
lation level. Ours was a straightforward dis- 
covery process to determine the general na- 
ture of this composition rule. In our field 


a 


se 


PLACING PERSPECTIVE RESEARCH IN PERSPECTIVE 


ve looked at 35 attitude questions that 
to have wide-ranging liberal-con- 
servaiive implications for our students in that 


“mov cnt” of their lives. The questions were 
consi. ucted with the help of a large panel 
of us crgraduates (a point we did not men- 
tion the article). Our regression analysis 
allow us to specify most effectively the 
rela hip among these attitude questions in 
our p ulation, using the liberal-conservative 
self. g as a criterion, In this way we were 
able count for over half of the rating 
varia variance, The intent of our re- 
gres: analysis was to approximate as 
close : possible individuals’ composition 
rules the content dimension. We were able 
to id fy those 18 content elements (atti- 
tude tions) that together correlated .72 
with \iberal-conservative rating. Upshaw’s 
criticis- implies that he would accept nothing 
less t perfect correspondence. In the past, 
he hə- assumed perfect correspondence in 
his ow: «boratory tests of the model; he has 
mista! in unasked question for an affirma- 
tive answer, 

Mo pecifically, the content-invalidity 
critici uggests that elements that underlie 
and de» mine the real content on which each 
respo: bases his/her liberal-conservative 
self-ra: are missing from our contrived 
conter mension—that our content scale 
does : ompletely tap all of the attitude 
areas contribute to a determination of 
self-ra for each individual. Were this 
problem ‘o exist to the extent of substantially 
altering ‘he direction of our relationships, 
each individual would have to reply with the 
ideologically opposite response on the “miss- 
ing” attitude questions. Such an alternative 
explanation of our results seems farfetched. 

That we be expected to measure all under- 


lying content is an unfair and impossible 
Stricture to meet in any context. Our intent 
Was to adequately sample the range of atti- 
tudes that underlies our students’ liberalism— 
Conservatism. Our 18 attitude questions seem 
to us to be more than adequate. Considering 
additional content elements would not change 
the relative position of any given respondent 
on the existing dimension. Additional infor- 
Mation would be redundant. In this way, too, 
We consider our “behavior” content dimension 
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to be another sampling of that broad class 
of content-relevant facts that contribute to 
a liberal-conservative self-rating, In any one 
respondent’s head there can be, of course, 
only one effective content position at any 
given moment, but that content position can 
be measured in a variety of indirect ways by 
sampling elements from some set of attitudes, 
all of which are highly correlated with it. 

Upshaw’s alternative explanation of our 
results is an unlikely possibility on intuitive 
grounds. More importantly, his argument can- 
not account for the specificity of our findings 
—in particular, the differential availability of 
the “radical” label as explained in the article. 

Upshaw has also argued that we manufac- 
tured a spurious correlation between the con- 
tent and rating variables, this being an ad- 
ditional reason for doubting our negative 
results. In objecting to the constraint we im- 
posed on the formation of the content di- 
mension by maximizing its correlation with 
the rating variable, Upshaw confuses our 
construct validity check with something like 
an experimental intervention, Although the 
validity of the content dimension is demon- 
strated by its tight correspondence with the 
ratings of liberalism-conservatism, the prob- 
lem of spuriousness exists independent of this 
and all such analyses. As we maximize the 
amount of variance shared by content and 
rating, we merely concretize as a “scale” 
that relationship that already exists in the 
data. Spuriousness suggests that some third 
unmeasured variable actually causes both 
content and rating to vary in such a manner 
that the positive correlation was obtained, 
This is not the case. In fact, however, the 
existence of a substantial degree of correla- 
tion between all three aspects of the perspec- 
tive model is the key to the perspective anal- 
ysis—the perspective approach to attitudes 
hinges on just that kind of multicollinearity. 
A basic tenet of the model is that perspective 
acts to moderate the dependence of rating on 
content. Perspective is the third “spurious” 
variable operative in both the model and our 
analysis. 4 

Upshaw states that in the partial corre- 
lation numerator, rer — ”reořro (where P is 
perspective, R rating, and C content), either 
fpo or Tro would have to be relatively large 
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TABLE 1 


MEAN AND STANDARD DEVIATION VALUES OF CONTENT 
SCORES PER PERSPECTIVE LEVEL 


Attitude Behavior 
Milieu content content 
Home 
Very liberal 
M —.33 1.87 
SD -13 -69 
Moderately liberal 
M —.09 2.00 
SD 70 7 
Moderately conservative 
M alt 2.03 
SD 67 75 
Very conservative 
M 19 2.40 
SD -10 84 
School 
Social sciences 
M —.15 1.89 
SD 73 69 
Nonsocial sciences 
M 09 2.17 
SD -10 80 


Note. In all cases, a higher mean score indicates a more 
conservative content. 


in order for our analyses to confirm the per- 
spective prediction. Were this the case, it can 
readily be seen that our “artificially” created, 
spuriously large correlation between rating 
and content (rro) would operate in favor of 
the perspective prediction, Any spurious re- 
lationship scaled into our content variable 
would work against the conclusion we reached 
with our original analysis, 

Upshaw has raised an important point with 
his third criticism; He claims that we failed 
to objectively assess the ways in which our 
different “milieus,” which we offered as our 
operational versions of perspective, provided 
different labeling continua with differing end- 
points, Specifically, Upshaw’s interpretation 
is that the perspective “theory” requires the 
explicit specification of two endpoints. He 
wants us to prove objectively that our desig- 
nated milieus represent ideologically different 
but structurally similar label-availability 
bases. There is no way we can have the type 
of objective control in a field situation neces- 
sary for objectively measuring such a con- 
cept. We were aware of this limitation and 

were willing to settle for less than perfection. 
The real world is not the two-dimensional 
heaven for which Upshaw seems to pine. 
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An objective evaluation of perspective 
would involve some actual measurement, of 
the range of content positions available in 
any context. Intuitively, we feel tl our 
milieus reflect common knowledge divis -ns of 
the real world of our respondents into mean- 
ingfully different ideological backgrounds. Of 
course, there is no way we can includ- such 


a needed “milieu evaluation” into a qu: stion- 
naire because any such response in the con- 
text would be subject to the very san: per- 
spective bias we sought to test. Altern vely, 
we certainly were in no position, in th- case 
of “home milieu,” to go door to door ti ugh- 
out the country to collect such rang» data 
from our students’ mothers and fathers, 

Yet, we feel we have offered some evi ence 
as to the ideologically variant and ‘` ruc- 
turally invariant nature of both of our ~ ‘lieu 
operationalizations. We supported the lid- 
ity of the perspective measures by citin; two 
population parameters (Kinder et al., | 176; 


Footnote 1, pp. 483-484), not just oi , as 
Upshaw states. Looking across the dif rent 
levels of milieu for each version of pe! .pec- 
tive, in all cases we see that the mean score 
for each content variable was more conserva- 
tive as the milieu was expected to be nore 
conservative. And in all cases the standard 
deviation across milieu levels does not vary 
unreasonably (see Table 1). From these pa- 
rameters we are able to recover the approxi- 
mate effective endpoints of the range of 
available content positions. Thus, our per- 
spectives meet the assumptions of the model. 


Was the Investigation Appropriate? 


Upshaw’s comments serve to remind us that 
ours was not a laboratory test of the model, 
and he suggests that if we were to have been 
fair to the model, we should have rigidly 
adhered to the conditions of the laboratory. 

Precision should be the issue, not fairness. 
Ostrom and Upshaw (1968) have offered us, 
a model of attitudes that, theoretically, should’ 
deal with attitudes and not 15% of attitudes 
in 5% of contexts, Their perspective model 
of attitudes is a potentially important tool 
for social psychologists. It is not an easy 
task to deal with social psychological reality 
in terms of an essentially psychophysical 
model, but Upshaw and his colleagues arè 
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PLACING PERSPECTIVE RESEARCH IN PERSPECTIVE 


ng that effort (cf. Upshaw, 1974). 
sstion now becomes one of the model’s 


How can we use this psychophys- 
lel in contexts where attitudes (the 
social psychological content) natu- 


cur? This is the question we pursued. 


g within the necessary contraints of 


estion of utility, we were precise in 


itionalizations, 
ok the laboratory as a point of de- 
ind wondered how the model would 
ihe field. Our context was a situation 
the model should apply. Our findings 
‘hat it does not . . . quite, Yet, we 
laim that the model is useless, but 
at it did not work effectively in our 
We did not suggest that the model 
loned but suggested rather that it 
led to accommodate a class of addi- 
riables that inevitably confound real- 
ationalizations of the perspective 
If the model does not fit the situa- 
ealt with, where does it fit? Of what 
n it ever be if it must remain in 
itory—if it must be protected from 


plexity inherent in everyday reality? 


vould deny that a person’s perspec- 


derates her/his self-label. But label 


ity also affects self-label. In any 
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given instance, the label applied will be the 
result of some resolution between these two 
influences. 

Upshaw has specified rigid methodological 
canons with which he will always be able to 
sit back and shoot holes in others’ research. 
Given this stance, the model is inviolate and 
will never be disconfirmable or even quali- 
fiable. To indulge Upshaw’s obsessive de- 
mands regarding the use of the perspective 
model is to do a disservice to its potential, 
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Cognitive Tuning and the Attribution of Causality 
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Two experiments were conducted to investigate the effects of cognitive-tuning 
sets upon the attribution of causality. In the first study, subjects were given a 
set either to receive or to transmit information about a videotaped event that 
involved a teacher testing a learner and giving him shocks for incorrect re- 
sponses, Also, the severity of the outcome of the event (the learner’s distress) 
and the expected source/recipient of the information were manipulated. It was 
found that relative to expectant receivers, the more serious the outcome, the 
more causality was attributed by expectant transmitters to both the teacher and 
the learner for the outcome. Unexpectedly, the source/recipient variable had no 
effect upon attributions, In a second study, subjects again were given either a 
transmission or a reception set and then observed a videotape of the same learn- 
ing situation employed in the first experiment; all subjects saw a moderately 
serious outcome, The other independent variable was the extent to which the 
teacher’s behavior of giving shocks violated his purported attitude about giving 
shock. As predicted, when the teacher’s behavior was in violation of his at- 
titude, expectant transmitters attributed more causality to the teacher for the 
outcome than did expectant receivers. The results for these two studies are 
discussed in terms of expectant transmitter’s perceived responsibility to develop 
a clear, easily communicable interpretation of causality for events that involve a 


violation of expectancy. 


A relatively neglected facet of attributional 
research is the role that cognitive sets may 
play in the guidance of an individual in ob- 
serving and interpreting events. The present 
research was designed to examine the effects 
of situationally induced cognitive-tuning sets 
upon the attribution of causality. 

Based in part upon Lewin’s (1951) analy- 
sis of psychological life space, Zajonc (1960) 
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defined cognitive tuning in terms of an in i- 
vidual’s principal role in the communicat) n 
process, either to transmit or to receive in! °- 
mation. Zajonc proposed that this cogniti >- 
tuning variable influences the flexibility of :n 
individual’s cognitive structure that, in tu'n, 
may mediate impressions formed about : n- 
other person, According to this conception, an 
individual tuned to transmit information 
should have a relatively fixed and polarized 
cognitive structure; this individual should 
tend to exclude or minimize contradictory 
information. On the other hand, it is assumed 
that when an individual is set to receive oth- 
ers’ impressions, he should tend to keep his 
cognitive structure more open and flexible; 
specifically, he should tend to perceive as 
related the contradictions between cognitions. 
Presumably, under transmission tuning, polar- 
ization occurs in the interests of communi- 
cating a clear picture of an individual. Enter- 
taining contradictory strands of information 
would not contribute to the achievement of 
an unambiguous impression. However, under 
a reception set, an individual should not have 
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ssure to reject divergent strands of 
tion about another person. 


this p< 
inforn 


In # cord with Zajonc’s formulation, Cohen 
(1961) found that individuals expecting to 
trans: an Impression were more prone to 
supp! contradictory information and ex- 
hibite- more polarized impressions than did 
expec receivers. Also, Leventhal (1962) 
found iat expectant transmitters developed 
more plified and unified impressions than 
did « tant receivers. Finally, Brock and 
From (1968) found that expectant trans- 
mitte: chose to listen more to information 
supp of their initial impression of an- 
other on than did expectant receivers. 

In of this previous work on the cog- 
nitive- uing variable, certain theoretical 
links | ween cognitive tuning and the attri- 
butio: causality may be proposed. In par- 
ticula seems possible that persons expect- 
ing t ısmit information about an event to 
others ay feel the need to develop a clear, 
defini interpretation of the event. Con- 
versely ‘hose expecting to receive informa- 
tion } not have such a strong need for 
clar hey may have a more relaxed orien- 


tation 


ward the interpretative activity. 
Thus, © general, it might be expected that in 
order ccommodate a clear, communicable 
interpre ition, expectant transmitters will 
develo;, »ronouncedly high attributions of 
causali to plausible causal factors. On the 
other i: id, expectant receivers might be ex- 
pected ‘> make rather moderate attributions 
to plausible causes. 
There may be a number of elements in 
the communication process and in the situa- 


tion surrounding this process that have quali- 
fying effects upon this proposed relationship 
between cognitive tuning and attributed caus- 
ality. The first experiment reported here con- 
cerns how attributions might be affected by 
the interaction of cognitive tuning with the 
extremity of the outcome of an event and the 
expertise of the recipient to whom informa- 
tion would be communicated or the source 
from whom it would be received. 

In some recent research (Chaikin & Dar- 
ley, 1973; Harvey, Harris, & Barnes, 1975), 
it has been found that behavior that leads to 
telatively extreme negative outcomes produces 
high person (actor) attribution by observers. 
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These studies involved behavioral situations 
in which the actor was a highly plausible 
causal agent. We assumed that an extreme 
event may enhance attributions of causality 
to plausible factors by expectant transmitters 
as compared to expectant receivers. 

With respect to the variable of expertise of 
the recipient or source of information, some 
researchers who initially probed cognitive- 
tuning ideas (e.g., Cohen, 1961; Leventhal, 
1962) suggested that a high-expertise recipi- 
ent might serve to minimize the tendency of 
expectant transmitters to develop polarized 
impressions. Some evidence reported by Feld- 
man (Note 1) tends to support this sugges- 
tion. Although the rationale is not entirely 
clear, persons expecting to transmit an im- 
pression to a well-informed other might think 
that the other held a more qualified and di- 
versified view of the stimulus person and 
hence should be cautious in formulating an 
impression. In an attributional context, we 
might expect less pronounced attributions by 
expectant transmitters when they have to 
communicate interpretations to high-expert 
others. Thus, the expected general tendency 
for transmitters to make higher attributions 
to plausible factors than receivers would be 
anticipated mainly when they were expecting 
to deliver a communication to a relatively 
low-expert other. 

The general hypothesis in the first experi- 
ment was that expectant transmitters would 
make higher attributions of causality than 
expectant receivers to the most plausible 
causal factors in the situation, especially when 
an extreme outcome occurred and the com- 
munication was to be transmitted to a low- 
expert recipient. In order to make our predic- 
tions more specific, a brief description of the 
experimental situation in the first experiment 
must be provided. Tuning set and expertise of 
recipient/source were manipulated and, sub- 
sequently, subjects saw a videotape of a learn- 
ing experiment in which a teacher shocked a 
learner when incorrect responses were made. 
Severity of outcome was manipulated via a 
very high or a moderate distress reaction on 
the part of the learner to these shocks. This 
reaction was the issue toward which the attri- 
bution questions were directed. 

We assumed that the most salient causal 
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agent in the situation in terms of amount and 
type of behavior was the teacher but, also, 
that the learner’s own reaction to the shock 
might take on a highly plausible causal status. 
It was anticipated that the experimenter, who 
merely read instructions and stayed in the 
background, would be a relatively minor 
causal agent. Our specific predictions were 
that expectant transmitters would make 
higher attributions of causality for the learn- 
er’s reaction to the teacher (and possibly also 
to the learner) than would expectant receiv- 
ers, especially when an extreme outcome 0c- 
curred and the communication was to be 
transmitted to a low-expert recipient; no dif- 
ferences between transmitters and receivers 
were predicted for a measure of attribution 
to the experimenter. 


EXPERIMENT 1 


Method 
] Subjects 


The subjects were 96 male college students who 
participated in order to satisfy an introductory 
psychology course requirement at Ohio State Uni- 
versity, They were randomly assigned to conditions 
with 12 subjects per condition. 


Procedure 


The two subjects run in each experimental session 
were randomly assigned to one of two identical 
experimental rooms, each of which contained a 
closed-circuit television monitor with a diagram of 
a “psychological distress meter.” 

Subjects were told that their task was to observe 

very carefully a videotape of an actual experiment 
concerning learning and psychological distress that 
had recently been conducted, One half of the sub- 
jects were led to expect that they would be trans- 
mitting their personal description and interpretation 
of the events that took place on the tape to another 
person on the research team (transmission-tuning 
set). The other half of the subjects were led to 
expect that they would receive a description and 
interpretation of the taped events from another 
person on the research team (reception-tuning set). 
Also, half of the subjects from each cognitive-tuning 
set condition were informed that the person they 
would be meeting was a Clinical psychologist em- 
ployed at the Ohio State University Medical School, 
who was described to the subjects as being very 
experienced in the analysis of interpersonal events 
(high expertise). The other half of the subjects from 
each tuning-set condition were told that the person 
they would be meeting was a college sophomore who 
was considering clini vchology as his academic 
major and was particularly interested in observa- 
tions of interpersonal behavior (low expertise). 
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The videotape observed by subjects consisted of 
an $}-minute staged verbal-learning exp ent, Th 
the opening scene, the male experimenter t two 
male subjects. The word-association learni task 
was explained, and a drawing was held assign 
randomly the two subjects to the roles o! ‘cacher 
and learner. The subject, who was assigned 
of teacher, was asked to await further in 
while the subject assigned the Jearner’s rol 
corted to another room. 


In the second scene, the experimenter ained 
that the learner’s task was to learn 15 wo airs, 
During the learning phase, the teacher wo read 
the list twice over an intercom system. Dur test 
trials the teacher would supply the first wor’ each 
pair and the learner would reply with th rrect 
paired associate. Incorrect responses would ilt in 
the administration of a 4-second shock to th- «rner 
by the teacher. The experimenter then attac l the 
bogus electrodes to the Jearner’s wrist and fir and 
left the learner’s room, 

In the third scene, the teacher was show in a 
side view seated at a table with a micropho , re- 
cording sheets, and a distress meter at a mera 
range of approximately 10 feet (3.048 m). 1. reply 
to the experimenter’s question, the teacher in cated 
that he had overheard all of the instruction given 
to the learner on the open intercom. The peri- 
menter switched off the intercom and explai’ ‘| the 


following additional aspect of the experim: Ėt, of 


which the learner was not aware: The cl sodes 
attached to the learner’s wrist and finger wi. also 
connected to a galvanic skin response met that 
indicated the degree of psychological distress peri- 
enced by the learner during the experiment was 
explained to the teacher that another purpos: the 
experiment was to examine the relationship |) ween 
the learner’s performance on the task and the ount 
of distress he experienced, as indicated by the eter. 
Part of the teacher’s task was to record the level 
of distress exhibited by the learner during th- test 
trials. After the teacher indicated his underst: ding 
of the instructions, the experimenter positioned him- 


self 3 feet (91.84 cm) to the teacher's left and ree 
quested the teacher to begin the learning trials 

After the teacher went through the list twi 
testing trials were begun. At this point, a 
consisting of a close-up view of the psyche 
distress meter (a 15-point scale with O labeied as 
“low tension” and 15 labeled as “high tension”) 
appeared in the lower right-hand corner on the 
monitor in each subject’s room. The learner az swered 
incorrectly for 8 of the 15 pairs, receiving a shock 
from the teacher for each error. The distress meter 
indicated that gradually increasing levels of tension 
were experienced by the learner with each succeed- 
ing error and accompanying administration of shock, 
with a mean level of distress of 8 on the 15-point 
scale for half of the subjects (moderate distress) 
and 14 for the other half (extreme distress) - 

After completion of the tape, subjects were taken 
to a second experimenter, who was uninformed as t0 
the assigned conditions of the subjects. Subjects 
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were i lually administered a questionnaire that 


contai four manipulation checks: (a) the sub- 
jects’ ation of the time they had spent develop- 
ing ax rpretation so that another person would 
hear i estimation of the time spent thinking 
about another person’s interpretation, (c) 
perce the amount of expertise in the inter- 
preta vi interpersonal events possessed by the 
persor ih whom the subject would be meeting, 
and erception of the amount of psychological 
distr wn by the learner during test trials. The 
deper measures were the subjects’ perception of 
the to which the learner’s psychological reac- 
tion used by (a) the experimenter’s behavior, 
(b) cher’s behavior, and (c) the learner’s per- 
sona iis sensitivity to stressful circumstances). 
Subj ınswered each question on a 20-point scale, 
with amounts indicating high levels of each 
parti able. 

Su rently, subjects were debriefed thoroughly 
and « sed, In summary, tuning set (transmission 
or r ion), expertise of recipient/source (high or 
low | extremity of outcome (moderate or ex- 
tren cre manipulated in a 2X 2X2 between- 
subj Jesign. 


Results 


I of the manipulation checks and de- 
pen measures were analyzed in a 2 X 2 
x2 alysis of variance. The Tukey honestly 


sign cant difference (HSD) test (Winer, 
197: was utilized for a posteriori mean com- 
par s 

Man: bulation Checks 


Turing set. Transmitters indicated that 
they spent more time developing an interpre- 
tation (M = 11.97) than did receivers (M = 
6.29), F(1,88) = 33.53, p<.001. Also, 
transmitters reported that they spent less time 
thinking about hearing another person’s in- 
terpretation (M = 2.44) than did receivers 
(M = 12.04), F(1,88) = 142.57, p < .0001. 
The data for these two questions provide evi- 
dence that the tuning-set manipulation was 
effective. 

Expertise of recipient/source. Subjects who 
thought that they would be meeting a clinical 
psychologist (high expertise) indicated that 
they felt this person had more expertise (M 
= 17.42) than did subjects who thought that 
they would be meeting a college sophomore 
(low expertise, M = 11.62), F(1,44) = 
83.18, p < .0001. These data indicate that 
the manipulation of expertise of recipient/ 
source was successful. 


AND ATTRIBUTION 711 

Extremity of outcome. Subjects in the ex- 
treme-outcome conditions reported more dis- 
tress on the part of the learner (M = 17.6) 
than did subjects in the moderate-outcome 
conditions (M = 10.5), F(1, 88) = 132.28, p 
< .0001. These results indicate that the ex- 
tremity-of-outcome manipulation was effec- 
tive. 


Dependent Measures 


Attribution to teacher. Contrary to predic- 
tion, no effects were found for expertise of 
recipient/source when subjects were asked to 
what extent the teacher’s behavior was the 
cause of the learner’s psychological reaction, 
However, a Tuning Set X Extremity of Out- 
come interaction was obtained, (1,88) = 
7.42, p < .01. As shown in Table 1, transmit- 
ters in the extreme-outcome conditions rated 
the teacher’s behavior as more the cause than 
did transmitters in the moderate-outcome 
conditions (ps < .01) or receivers in the ex- 
treme-outcome conditions (ps < .01), collap- 
sing across expertise conditions. No other 
mean comparisons relevant to this interaction 
were significant. This interaction indicates 
that expectant transmitters made higher at- 
tributions to the teacher than did expectant 
receivers when an extreme outcome occurred. 
Main effects for tuning set, #(1, 88) = 14.95, 
p <.01, and extremity of outcome, F(1, 88) 
= 17.89, p<.01, must be interpreted in 
light of this interaction. The extremity main 
effect showed that the more extreme the out- 
come, the higher were the attributions made 
to the teacher. The tuning-set main effect 
indicated that expectant transmitters attrib- 
uted more causality to the teacher than did 
expectant receivers. 

Altribution to learner. Contrary to predic- 
tion, no effects for the expertise of recipient/ 
source achieved conventional significance lev- 
els for attribution to the learner’s personality. 
However, transmitters in the extreme-out- 
come conditions made higher attributions of 
causality to the learner’s personality than did 
transmitters when a moderate outcome 0c- 
curred (ps < .01) or receivers when an ex- 
treme outcome occurred (ps < .05), collap- 
sing across expertise conditions. This pattern- 
ing resulted in a Tuning Set x Extremity of 
Outcome interaction, /(1,88) = 4.64, p< 
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TABLE 1 


MEANS FOR THE MEASURES OF ATTRIBUTED CAUSALITY (EXPERIMENT 1) 


Condition 


High expert source/recipient 


Low expert source/recipient 


Measure Transmitter Receiver Transmitter Receiver 
i Extreme outcome 
Attribution to teacher 13.91, 9.66s 15.914 9.585 
Attribution to learner 17.00, 12.80» 14.58, 11.92), 
Attribution to experimenter 10.42, 9.08. 9.50. 8.335 
Moderate outcome 
Attribution to teacher 9.005 8.58), 9.66 8.25), 
Attribution to learner 10.30, 10.02), 11.25, 11.21, 
Attribution to experimenter 742a 9.425 6.83a 5.50, 
Nole, The higher the number, the greater the attribution for each measure (20-point scales). Means not sharing com sub- 
scripts are significantly different at the .05 level by the Tukey HSD test (Winer, 1971), 
.05. This interaction indicates that expectant We had expected a greater difference nan 
transmitters made higher attributions than was found between transmitters and rec: vers 
did expectant receivers when an extreme out- in the moderate-outcome conditions. It ; ght 
ome occurred, The means are presented in be argued that an extreme outcome c sti- 
‘able 1. Main effects for tuning set, F(1,88) tuted a necessary condition for expe ant 
= 5.12, p <.05, and extremity of outcome, transmitters and receivers to differ in neir 
F(1,88) = 18.58, p< .01, must be viewed attributions of causality. Extreme outc nes 


in terms of this interaction. These data are 
similar to those for attributions to teacher. 
Attribution to experimenter. No significant 
effects were obtained for this dependent mea- 
sure, The means are presented in Table 1. 


Discussion 


The results of the first experiment provide 
no support for the predicted interaction 
among tuning set, extremity of outcome, and 
expertise of recipient/source upon attribu- 
tions. The failure of this predicted interac- 
tion may be traced directly to the fact that 
despite the apparently effective manipulation 
of the expertise variable, it did not influence 
attributions. It seems possible that expectant 
transmitters simply did not place any special 
emphasis on their assignment of communicat- 
ing an interpretation to a clinical psycholo- 

gist. 

The significant interactions showed that 
transmitters attributed more causality to both 
the teacher and the learner than did receivers 
when an extreme outcome occurred. Thus, the 
learner’s personality was just as plausible as 
a causal factor to subjects as was the teach- 
er’s behavior. 


represent eventualities that quite ofte: are 
not anticipated. It is conceivable that i: ‘he 
videotaped situation witnessed by sub cts, 


the learner’s reaction was relatively | ex- 
pected. As such, this reaction may have : im- 
ulated expectant transmitters to try tc de- 


velop a more definitive interpretation oi the 
event, and this effort may have been trons- 
lated into higher attributions of causalit, to 
plausible factors in the situation. Implied in 
this interpretation is the idea that the bu: den 
of transmission of information is felt *.ost 
acutely when noteworthy, unlikely, or iow- 
probability events occur. 

In a refinement study, 34 subjects were 
shown either the moderate- or the extreme- 
outcome tape and asked to rate the extent to 
which they expected the learner’s distress 
reaction. As would be anticipated based upon 
the above argument, subjects who viewed the 
extreme-outcome tape rated the outcome as 
more unexpected than did subjects who viewed 
the moderate-outcome tape (p < .001). 

If we assume that a violation of expectancy 
about a behavioral event is somewhat similar 
to contradictory information about a stimulus 
person, the foregoing interpretation for our 


* attrib 
impre 
that 


ion data has a parallel in research on 
mn formation. Cohen (1961) found 
vectant transmitters formed more po- 


larize. impressions of a stimulus person when 
extre: contradictions or inconsistencies were 
inch ımong the characteristics ascribed to 
the ;» ¿on than when only moderate contra- 
dicti were included. 
EXPERIMENT 2 

A nd experiment was conducted to pro- 
vide iore precise test of the argument that 
a vio tion of expectancy regarding a behav- 
iora it will trigger attributional divergen- 
cies ‘ween expectant transmitters and re- 
cei The same videotaped situation (but 
only version involving a moderately dis- 
tres: yutcome) was employed in the second 
expe nt; the violation specifically con- 
cern ‘he degree to which the teacher’s act 
of s} king the learner was discrepant with 
info ion about the teacher’s attitude 
towe causing a person distress in the in- 
tere.’ of advancing psychological knowledge. 
In i). attribution literature, this behavioral 
expe ancy variable is similar to Jones, Davis, 
and . ergen’s (1961) variable of the extent 
to \ ‘ch a stimulus person’s behavior is in 
role <- out of role in a particular situation. In 
their now classic study, Jones et al. found 
that cut-of-role behavior, which was incon- 
sistent with behavior called for by the situa- 
tion, was more conducive to confident, ex- 
treme judgments about the stimulus person 
than was in-role behavior. 

In eriment 2 we assumed that since the 


violation of expectancy related so directly to 
the teacher’s behavior, the teacher would con- 
stitute by far the most plausible cause for the 
learner’s reaction to the situation. Hence, the 
hypothesis in Experiment 2 was that expec- 
tant transmitters would make greater attribu- 
tions of causality to the teacher than would 
expectant receivers but that this effect would 
occur only when the teacher’s behavior was 
unexpected. 


Method 
Subjects 
The subjects were 48 male college stuents who par- 


ticipated in order to satisfy an introductory psychol- 
ogy course requirement at Ohio State University. 
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They were randomly assigned to conditions with 12 
subjects per condition, 


Procedure 


The setting and task were the same as in the first 
experiment. Subjects were assigned to one of two 
cognitive-tuning sets (transmission or reception) and 
then were shown the videotape from Experiment 1 
that involved the moderate outcome. After the 
scene in which the electrodes were attached and the 
learner was given his instructions, the tape was 
halted by the experimenter, It was explained to the 
subjects that since a scene from the tape was missing, 
the experimenter would describe the events to them. 
The subject was told that on the tape the teacher 
was asked by the experimenter to sign a consent 
form regarding participation in the experiment. The 
teacher also was asked to indicate on a 15-point 
scale, anchored by 0 (a small amount) and 15 (a 
large amount), the amount of psychological distress 
he considered reasonable for persons to withstand so 
that the objectives of scientific exploration of human 
behavior might be attained. One half of the subjects 
from each tuning-set condition received a copy of 
the questionnaire on which the teacher had indicated 
that a moderate amount of distress (8 on the 15- 
point scale) was reasonable. The other half of the 
subjects received a questionnaire on which the teacher 
had indicated that a very small amount of distress 
(2 on the 15-point scale) was reasonable. 

Subjects were given 90 seconds to read the consent 
form and questionnaire. The subject then saw the 
learning and testing phases of the videotape, Since 
the learner’s distress level averaged approximately 8 
on this particular tape, half of the subjects saw the 
teacher behave in accordance with his attitude (no 
violation of expectancy). The other half of the sub- 
jects saw the teacher behave contrary to his atti- 
tude in that he had indicated that it was reasonable 
for persons to withstand a small amount of distress 
(2 units on the 15-point scale) but then had con- 
tinued the task when the Jearner’s distress rose well 
beyond this level (violation of expectancy). 

After completion of the tape, subjects were taken 
to a second experimenter, who was uninformed as 
to the assigned conditions of the subjects. Subjects 
were individually administered a questionnaire. The 
questionnaire contained four manipulation checks: 
(a) the subjects’ estimation of the time they had 
spent developing an interpretation so that another 
person could hear it, (b) estimation of the time 
spent thinking about hearing another person’s in- 
terpretation, (c) perception of the amount of psycho- 
logical distress shown by the learner during the test 
trials, and (d) expectation of the extent to which the 
teacher would give shock such that the learner would 
experience the level of distress observed. The de- 
pendent measures were the subjects’ perception of 
the extent to which the learner’s psychological reac- 
tion was caused by (a) the experimenter’s behavior, 
(b) the teacher’s behavior, and (c) the learner’s per- 
sonality (his sensitivity to stressful circumstances). 


\ 
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Subjects answered each question on a 20-point scale 
with high numbers indicating large amounts. 
Subsequently, subjects were debriefed thoroughly 
and dismissed. In summary, tuning set (transmission 
or reception) and behavioral expectancy (violation 
of behavioral expectancy or no violation) were 
manipulated in a 2X 2 between-subjects design. 


Results 


Each of the manipulation checks and de- 
pendent measures was analyzed in a 2x2 
analysis of variance. The Tukey HSD test 
(Winer, 1971) was used for a posteriori mean 
comparisons. 


Manipulation Checks 


Tuning set. Transmitters spent more time 
thinking about developing an interpretation 
of what was happening (M = 9.92) than did 
subjects given a reception set (M = 3.58), 
F(A, 44) = 22.61, p < .0001. Subjects given 
a reception set spent more time thinking about 
hearing another person’s interpretation (M = 
8.50) than did subjects given a transmission 
set (M = 3.25), F(1, 44) = 24.05, p < .0001. 
These data indicate that tuning set was suc- 

= cessfully manipulated. 

Distress. No significant differences were 
obtained in the analysis of the question ask- 
ing how much psychological distress the 
learner showed during the test trials. This 
finding was expected since all subjects saw 
the moderate-distress tape. Further, the over- 
all mean rating of distress (M = 10.08) was 
about the same as the overall mean rating 
for the moderate-distress tape in the first ex- 
periment (M = 10.5). 

Behavioral expectancy. The subjects for 
whom the teacher behaved in line with ex- 
pectancy saw the behavior as more expected 
(M = 11.58) than did subjects for whom the 
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teacher behaved contrary to expectancy (M 
= 6.62), F(1,44) = 13.22, p < .001. These 


data indicate that behavioral expectancy was 
successfully manipulated. 
Dependent Measures 

Attribution to teacher. As expected, © «ns- 
mitters, whose expectations for the teac er’s 
behavior were violated, made higher at bu- 
tions to the teacher than did receivers i the 


same condition (p< .05) or transn) 
whose expectations were not violated < 
.05). No other comparisons were signifi 
This pattern of data resulted in a Tunit 
X Behavioral Expectancy interaction, K 
44) = 4.26, p < .05. The means are pres: 
in Table 2. Main effects for tuning set, 1, 
44) = 4.53, p < .04, and behavioral e» >c- 
tancy, F(1,44) = 20.04, p < .0002, mu: 
interpreted in terms of the significant inte ic- 
tion. The main effect for tuning set sho- e 
that transmitters made greater attribut ns 
to the teacher than did receivers. The in 
effect for behavioral expectancy showed at 
there was greater attribution to the tes er 
when a violation of expectancy occurred | an 
when no violation occurred. 

Attribution to experimenter. There wer 10 
significant effects for attribution to the `x- 
perimenter. The means for this measure re 
presented in Table 2. 

Attribution to learner. There were no ig- 
nificant effects for attribution to the lea: 
The means for this measure are presente“ in 
Table 2. 


Discussion 


The results of Experiment 2 provide 
port for the prediction that when the te 


TABLE 2 


MEANS FOR THE MEASURES OF ATTRIBUTED CAUSALITY (EXPERIMENT 2) 


Condition 
\ "Violation of expectancy No violation of expectancy 
Measure Transmitter Receiver Transmitter Receiver 
Attribution to teacher 16.33, 9.83, 7.75 7.66», 
Attribution to learner 10.58, 11,08, 13.66, 13.00, 
Attribution to experimenter 10.75, 8.91, 9.254 7Aly 
Note. The higher the number, the greater the attribution for each measure (20-point scales). Means not sharing Pennie subscr ipts 


are significantly different at the .05 level by the Tukey HSD test (Winer, 1971). 


nam y ome 


eS a 
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er’s behavior was unexpected, persons who 
expe: ‘ed to transmit information would make 
gre -r attributions of causality to the teach- 
er’s \ctions in the setting than would expec- 
tan’ eceivers. The interpretation that events 
tha’ ‘olate expectancies trigger attributional 
dive encies between persons holding these 
diffe ot sets would seem to accommodate the 
dat: n the first experiment that, as shown 


in i refinement study, indirectly involved 


a vy’ «tion of expectancy when an extreme 
oute ne occurred as well as the data in 
Exp ‘ment 2 that involved a more direct 
viol 1 of expectancy. 

I najor contribution of this research 
wot seem to be its demonstration that 
unc ‘rtain conditions cognitive tuning may 
affe he attribution of causality. However, 
furt york is necessary to test directly the 
infe e that in some situations transmitters 
may rceive a responsibility to develop a 
clea ‘erpretation of an event that then 
gets islated into a rather high attribution 
of « lity to plausible causal agents. Also, 
and re generally, early theoretical work 


on cognitive-tuning variable (Cohen, 


1961 ajonc, 1960) stressed a number of 
cogn structural features (e.g., complex- 
ity ar- unity) presumed to be associated with 
differ ‘jal tuning sets in impression forma- 
tion xtending the tuning conceptual base 
to em ace attributional phenomena, we have 
not (/t with these various facets of cogni- 
tive icture in any substantial fashion. It 
seems iikely that a response format that al- 
lowed ‘he individual to report his interpreta- 
tion a verbal, relatively nonconstrained 
fashion might facilitate a more precise and 


comprehensive examination of cognitive struc- 
ture. Also, based upon this research, we can- 
not determine whether the attributional di- 
vergence between expectant transmitters and 
receivers might involve selective encoding or 
decoding of information. However, the present 
work does seem to have provided a founda- 
tion for further explorations and refinements 
and to have provided evidence in the virtu- 
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ally neglected area of the effects of situa- 
tionally induced cognitive set on attributions. 

An important practical implication can be 
drawn from the present research. People who 
have the responsibility of reporting and inter- 
preting events to others (e.g., newscasters 
and investigative reporters) should be wary 
lest they unreasonably overinterpret causal- 
ity to certain factors for relatively improb- 
able occurrences. The strong recognition that 
one is operating under a transmisison set 
that is best primed when noteworthy events 
occur may serve to instill more caution into 
one’s causal analysis. 


REFERENCE NOTE 


1. Feldman, S. The effect of cognitive tuning upon 
acceptance of discrepant information, Paper pre- 
sented at the meeting of the Eastern Psycholog- 
ical Association, Philadelphia, April 7-8, 1961. 
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Relationship of Preoperative Fear, Type of Coping, and 


Information Received About Surgery to 
Recovery from Surgery 


A. Marilyn Sime 
University of Minnesota 


This study investigated the relationship of preoperative level of fear, extent of 
information seeking (coping), and amount of information received about sur- 
gery to recovery from surgery. It was hypothesized that a curvilinear relation- 
ship would be found between level of preoperative fear and recovery and be- 
tween extent of information seeking and recovery; it was also hypothesized 
that amount of preoperative information obtained would interact with extent of 
information seeking. The subjects were 57 female patients between the ages of 
18 and 68 who were scheduled for abdominal surgery. The recovery measures 
consisted of a self-rating of postoperative negative affect (fear, depression, and 
anger), number of postoperative analgesics and sedatives received, and total 
number of days to discharge. No curvilinear relationships were found between 
preoperative level of fear or type of coping and recovery from surgery. Results 
showed a linear relationship between level of preoperative fear and recovery, 
with the least favorable recovery associated with higher levels of preoperative 
fear. A significant interaction was found between level of preoperative fear and 
amount of preoperative information, with high-fear subjects who reported 
little preoperative information experiencing the least favorable recovery period. 
The findings are discussed in terms of the parallel response model proposed 


by Leventhal. 


In recent years considerable interest and 
research have been directed toward the study 
of (a) the relationship of preoperative fear 
or anxiety to recovery from surgery (Abram 
& Gill, 1961; Cohen & Lazarus, 1973; Giller, 
1962; Janis, 1958; Johnson, Leventhal, & 
Dabbs, 1971; Wolfer & Davis, 1970), (b) 
the relationship of coping to recovery from 
surgery (Andrew, 1970; Cohen & Lazarus, 
1973; Janis, 1958; Johnson et al., 1971; 
Walseth, 1968), and (c) the relationship of 
preoperative information or instruction to re- 
covery from surgery (Andrew, 1970; Egbert, 
Battit, Welch, & Bartlett, 1964; Lindeman, 
1972), Because the experience of major sur- 

gery is generally viewed as a naturalistic 
stress event, factors investigated relative to 
recovery are those that have theoretical 


This study was supported in part by a Public 
Health Service fellowship (SFO4-NU-27, 222) from 
the Department of Health, Education and Welfare, 
Division of Nursing. 

Requests for reprints should be sent to A. Marilyn 
Sime, School of Nursing, University of Minnesota, 
3313 Powell Hall, Minneapolis, Minnesota 55455. 


importance to some concept of psycholog zal 
stress. 

This study examined the relationship of 
preoperative level of fear, type of coping, -nd 
amount of preoperative information ai) ut 
surgery to recovery from surgery. The y- 
pothesized relationships between these viri- 
ables and recovery were based on the theo- 
retical model of human reaction to stress jo- 
posed by Janis and Leventhal (1965, 1968). 
According to the model, external signs of 
threat arouse “reflective fear.” Magnitude of 
threat is assumed to determine the level of — 
reflective fear evoked and subsequent behav- 
ioral reactions. Behavioral reactions include 
expressed affect and self-exposure to informa- 
tion. Persons experiencing mild reflective fear 
deny threat and engage in no self-exposure to 4 
infomation. Persons experiencing moderate 
rt ive fear express low levels of emotional 
tension and engage in selective self-exposure 
to information. Those experiencing strong 
reflective fear express high levels of emo- 
tional tension and engage in unselective self- 
exposure to information. 
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One of the main implications of the model 


is tha’ a curvilinear relationship is expected 
betwee level of fear stimulation and the 
proba’ ity of adaptive defenses; that is, 
optim. | coping would be expected at some 
inter) Jiate level where fear arousal is 
neithe“ ioo weak nor too strong. It is with 
moder fear arousal that an individual is 
most ly to engage in the “work of worry- 
ing,” work of worrying is conceived by 
Janis ı form of inner preparation that 
make: sible effective actions at the time 
of stro- encounter. The person who has re- 
maine.’ \nworried cannot complete the work 
of wo rg. At the time of stress encounter, 
he is ble to ward off intense fright and 
experi s feelings of increased vulnerability 
resulti ı distrust and anger with author- 
ities w =n he has expected to protect him. 
The pen who has experienced high levels 
of antic atory fear also would not complete 
the wor. of worrying. At the time of stress 
encoun he has no tension-reducing re- 
sponse utilize and, consequently, con- 
tinues ‘perience high levels of anxiety and 
feeling: vulnerability. 


Studies that examined the relationship be- 
tween operative anxiety and recovery from 
surgery ow mixed results, In two separate 
studies, “anis (1958) found a curvilinear 
relations ‘> between anticipatory fear and 
postoper».ive emotional adjustment. Cohen 
and Laz«*us (1973) found only subjects high 
in seli-roted anxiety scoring significantly 
higher on postoperative “psychological reac- 
tions,” oth Giller (1962) and Wolfer and 
Davis 70) found a weak but reliable 
linear relationship between preoperative fear 
and a variety of recovery measures. Johnson 
et al. (1971) found preoperative fear linearly 
related to postoperative emotionality but 
unrelated to speed of recovery or doses of 
analgesics. 

Such mixed results are no doubt due in 
Part to differences in the definition and mea- 
surement of fear or anxiety. Some investi- 
gators have studied trait anxiety, whereas 
Others have focused on situational anxiety. In 
addition, surgery-specific anxiety has been 
Measured by observer ratings and self-ratings. 
Some investigations suggest that patient self- 
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assessment of situational anxiety is a better 
predictor of recovery than observer ratings 
(Cohen & Lazarus, 1973; Wolfer & Davis, 
1970). Another important issue involves the 
criteria selected to measure postoperative 
recovery. A wide variety of measures and 
indices have been used in surgery studies, 
The way in which recovery is defined will no 
doubt influence the relationships that will be 
found with preoperative variables. 

A second area of investigation has been the 
relationship of preoperative coping to recov- 
ery from surgery. Several different coping 
processes have been studied ranging from 
stable dispositions or traits to active coping 
with specific stressors. Studies of coping traits 
along an avoidance-sensitization dimension 
(Andrew, 1970; Cohen & Lazarus, 1973; 
Walseth, 1968) generally do not show a con- 
sistent or statistically reliable relationship to 
recovery from surgery. Active coping pro- 
cesses, specifically the seeking and utilizing of 
information as a coping strategy (Cohen & 
Lazarus, 1973; Hamburg & Adams, 1967; 
Janis, 1958), seem to be better predictors of 
reactions to stress events and to surgery. 

Two studies have shown a relationship be- 
tween postoperative recovery and information 
seeking or self-exposure to information as a 
coping strategy (Cohen & Lazarus, 1973; 
Janis, 1958). However, both investigators 
confounded coping with level of preoperative 
fear. 

The Janis-Leventhal (1968) model does 
not make completely explicit the way in 
which self-exposure to information is related 
to recovery. Self-exposure to information is 
one component of the prestress coping behav- 
ior described in the model. Avoiding informa- 
tion and unselectively exposing the self to 
information are both described as ineffective 
coping measures. The first coping style makes 
it unlikely that the work of worry will be 
accomplished, and the latter increases the 
possibility of obtaining nonrelevant and 
nonhelpful “scare” information. A moderate 
amount of information seeking may facilitate 
the work of worrying by ensuring sufficient 
data to stimulate worry without involvement 
in inappropriate or nonuseful data. 

The general theoretical position on the rela- 
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tionship between preoperative preparation 
and recovery from surgery is that such prepa- 
ration will contribute to the patient’s psycho- 
logical welfare following surgery as well as 
hastening physiological recovery. Some in- 
vestigators have used a combination of in- 
structions and reassurances (Egbert et al., 
1964), sensory information (Johnson, 1973; 
Johnson & Leventhal, 1974; Johnson, Mor- 
rissey, & Leventhal, 1973), and instructions 
on danger-controlling behaviors (Johnson & 
Leventhal, 1974) and have found all to be 
associated with improved recovery or in- 
creased compliance during stress encounter. 
Other investigators in clinical (Andrew, 1970; 
Keller, 1965) and laboratory (Epstein, 1973; 
Epstein & Clarke, 1970; Epstein & Roupen- 
idn, 1970) studies have not found informa- 
tion associated with reduced reactivity to 
stress. Such diversity of findings may reflect 
differences in effects for different types of 
information, 

The role of information in the Janis- 
Leventhal model is mainly that of evoking a 
particular level of reflective fear. However, 
it could be hypothesized that preoperative 
information would interact with the coping 
process of information seeking. Those en- 
gaged in extensive information seeking prob- 
ably would not benefit from added informa- 
tion, whereas information should be useful to 
the coping processes of those engaged in 
moderate information seeking and should 
stimulate the work of worrying in those not 
seeking information. 


METHOD + 
Sample 


The patients studied were 57 women admitted for 
abdominal surgery. All were native-born Caucasians 
between the ages of 18 and 68 (M =38.3, SD = 
14.6) who were scheduled for surgery that required 
an opening into the abdominal wall. Twenty-seven 
patients had gynecological surgery (hysterectomy, 


oophorectomy, or laparoscopy). Surgery on the re- 
maining 30 patients included cholecystectomy, colec- 
tomy, spl y, herniorrhaphy, and cystotomy. 
The pat irawn from five surgical services 
(consisting of her of staff surgeons, residents, 
and interns) ursing stations. 


Instruments 
developed by the inves- 
ating measure completed 


The four sc 
tigator. Each sca 
by the subject. 
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Preoperative fear scale. Level of preoperative fear 
was determined by the subject’s self-rating of the 


extent of her concern, anxiety, or worry abeut the 
impending surgery and recovery period. Th= scale 
consisted of eight items, each containing 5 de- 
scriptive statements relating to degree and frecuency 
of fear and the extent to which surgery ws per- 
ceived as potentially dangerous and uncomi- ‘able, 
Quantitative data were derived by assigning < ight 
(0 to 4) to each descriptive statement and su- ning 
across items. The possible range of scores 5s ‘rom 
0 to 32 with high scores representing hish ‘ovels 
of fear. 

Preoperative coping scale. The preoperative ving 
scale assessed the extent to which patients ired 
and sought information about impending ery. 
Extent of information seeking was determ by 
the patient indicating the number of topics vout 
which she sought information and whethe: she 
wanted certain kinds of information, Topics re- 
sented in the items are those frequently ked 
about by patients. Some examples of topic: are 
location and size of incision, length of the ra- 
tion, expected amount of pain, and type ¢ st- 
operative treatment. 

The scale consisted of 15 items. Quantitative lata 
were derived by assigning a plus or minus to -ach 
item and summing across items. Possible ra: of 
scores for the scale is —15 to 15, with me 
minus scores representing limited information <ek- 
ing and extreme plus scores extensive infor ion 
seeking, 

Preoperative information scale. Amount ot ~re- 
operative information was measured in two vas 


the number of events about which the patien: ve- 
ported receiving information preoperatively, and ‘he 
accuracy of her expectations of postoperative evi 18, 
The areas measured preoperatively (10 items sco ed 
plus or minus) included whether the patient hid 
received information about subjects such as t:e 
incision, medication, anesthetic, and pain, The arcas 
measured postoperatively (three items scored 0 to 4) 


included the number of events the patient had been 


told about and the number of events that came as 
a surprise to the patient. 

The scale consisted of 13 items. The possible range 
of scores was from O to 22, with high scores 1 
senting the receiving of much information. 

Postoperative negative affect scale. The postopera- 
tive negative affect scale was developed to assess 
patients’ postoperative experiences of fear, anger, and 
depression. The scale consisted of nine s, each 
containing five statements representing increasing 
degree or frequency of experiences with fear, anger, 
or depression. Scores for each item were weighted 
from 0 to 4, with 4 representing frequent experiences 
with negative affect. The total score represents the 
sum of these weights and has a possible range of 
0 to 36. 


1In the original study, an alternative division of 
subjects was employed and additional recoyery mea- 
sures were studied (Sime, 1973). 
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In ¿dition to the postoperative negative affect 


cry Measures 


scale other recovery measures were used: days 
to d ge, number of analgesics, and number 
of se“ atives. 

D: to discharge was defined as the number of 
hosp ys in the postoperative period. The day 
of ŝu was counted as one day; the day of dis- 
charg as not counted. The physician determined 
the c ji discharge. 

‘Ar algesic was any pain-relieving medication, 
oral injected, ordered by the physician. Number 
of ar sics was counted on the basis of the number 
of si istrations of the ordered dose during the 
peric er surgery. The nursing staff decided when 
to ac ister an analgesic (provided it was not more 
freq than a physician-specified time period), 
gene in response to patient request. 


Se cs were defined as any soporific agent 


orde vy the physician for the purpose of aiding 
sleey imber of sedatives was computed on the 
basi: the number of administrations of the 
orde lose during the period after surgery. Seda- 


tives generally given by the nursing staff in 


resp¢ o a request by the patient. 
Proc e 

Pa names were selected by serial order from 
the s il surgery schedule. Each patient was seen 
in he pital room on two separate occasions. In 
the j ìl session, which took place on the day 
befor rgery, the preoperative questionnaire was 
admi ed. The general format and content of the 
quest ires were shown to the patient. After the 
paties mnsented to participate in the study, the 
questi ire instructions were reviewed and the pa- 
tient vermitted as much time as needed to com- 
plete ask. The second session took place after 
the p was ambulatory and awaiting discharge. 
At th ime the postoperative questionnaire was 
adminsicred. As with the preoperative question- 
naire, instructions were reviewed and assistance 
was g when needed. 

Data from the hospital record were collected after 
the patient was discharged. These data consisted of 
the n r of administrations of analgesics and 
sedatives after surgery, the number of recovery days, 
type of s », and location and type of incision. 


These data were collected by the present author and 
an assistant. Data from 12 records were collected 
independently by both investigators and compared 
for similarity. There was no disagreement between 
the two investigators; therefore, adequate reliability 
in interpreting and recording from the record was 
assumed, although the reliability and validity of the 
entries was unknown, Data from each of the remain- 
ing 46 records were collected by one of the investi- 
gators. At this time, neither investigator knew the 
Scores the patient had obtained on any preoperative 
Measures. 

Patients were classified three ways according to 
(a) level of preoperative fear, (b) type of coping, 
and (c) amount of preoperative information. 
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Level of preoperative fear was determined by 
dividing the sample into three more or less equal 
groups on the basis of scores on the preoperative 
fear scale. High-, moderate-, and low-fear classifica- 
tions consisted of subjects scoring in the intervals of 
17-24, 13-16, and 0-12, respectively (M = 14.09, 
SD = 4.83). 

Type of coping was determined by dividing the 
sample into three more or less equal groups on the 
basis of scores on the preoperative coping scale, 
Extensive, moderate, and limited information seekers 
were those in the 6 to 15, ~1 to 5, and —15 to —2 
intervals, respectively (M = 1.23, SD = 6.48). 

Classification according to amount of preoperative 
information was determined by scores on the pre- 
operative information scale, Subjects were divided 
into two fairly equal groups, Those scoring between 
O and 13 were considered to have received little 
preoperative information, whereas those scoring be- 
tween 14 and 22 were viewed as receiving much 
(M = 13.42, SD = 4,06). 

The preoperative fear, coping, and information 
groups were examined for differences in age, educa- 
tion, marital status, previous number of operations, 
type of surgery, and type and location of incision. 
No significant differences were found. 


RESULTS 


A major interest in this study was to 
examine any possible interaction between 
preoperative coping and information with re- 
spect to recovery measures. Classification ac- 
cording to level of fear was found not to be 
related to classification according to type of 
coping (Pearson r= .13, n = 57, ns), so it 
was decided to explore any interaction that 
may occur between the preoperative fear and 
information groups as well. Therefore, two 
two-way analyses of variance (unweighted 
means) were performed for each recovery 
measure, using Fear X Information for the 
first and Coping X Information for the sec- 
ond. If the interaction test was significant, 
simple ż tests were run using the error term 
from the analysis of variance. 

Analyses of variance of the Coping X In- 
formation groups showed no significant main 
effects or interactions with respect to any 
of the recovery measures. For purposes of 
graphic display of the findings, T scores were 
computed on each recovery measure; scores 
above the mean (M = 50) are in the direc- 
tion of a less favorable recovery period. 
Figure 1 shows that there were no curvilinear 
relationships between type of coping and these 
recovery measures. 

Figure 2 shows that a linear rather than a 
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Ficure 1. T scores of recovery measures for groups 
differing in type of coping. 


curvilinear relationship obtained between 
level of fear and recovery. High levels of 
preoperative fear were associated with a 
longer recovery period, a greater use of anal- 
gesics and sedatives, and higher levels of 
postoperative negative affect. Analyses of 
variance of the Fear X Information groups 
showed significant main effects for fear levels 
on postoperative negative affect, F(2,51) = 
3.21, p = .04, number of analgesics, F (2, 51) 
= 4.09, p = .02, and number of sedatives, 
F(2,51) = 7.19, p = .002. Main effects for 
amount of preoperative information were not 
significant. 

The analyses of variance of the Fear X In- 
formation groups showed significant inter- 
actions on the following three recovery mea- 
sures: number of analgesics, F (2, 51) = 3.96, 
p< .05; number of sedatives, F(2,51) = 
6.91, p < .001; and days to discharge, F 
(2, 51) = 5.72, p < .001. Figures 3, 4, and 5 
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show the nature of these interactions. i 
case, high-fear subjects who received 
information preoperatively experience 
poorest recovery. Further analysis show the 
differences between high-fear subject: 
had received much information and thos 
had received little information to be sto ‘sti- 
cally significant for number of analg. ics, 
t(51) = 2.67, p < .01; number of seda: ves, 
#(51) = 3.86, p < .001; and days to Jis- 


charge, #(51) = 2.59, p< .01. It was lso 
found that moderate-fear subjects whe ad 
received much information required a si) ifi- 
cantly longer hospitalization period an 


moderate-fear subjects who had rec: ed 
little information, ¢(51) = 2.14, p < .02 

As mentioned earlier, the analysis of ~ ri- 
ance on postoperative negative affect shi ed 
a significant main effect with subjects | zh 
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Fıcure 2. T scores of recovery measures for groups 
differing in level of fear. (A = significant interaction 
with amount of information; B = significant main 
effect.) 


RELATIONSHIP OF PREOPERATIVE FEAR 


on precisrative fear level scoring high on 
postop ve affect, F(2, 51) =3.21, p = .04. 
Post h sts (Scheffé’s test) showed signifi- 
cant ci neces in negative affect between 
each of preoperative fear. The post- 
operati negative affect score, however, 
showed 9 interaction between preoperative 
fear ai) nformation. Thus, while information 
impro; performance of high-fear patients 
on thr ecovery measures, it had no affect 
on pos rative emotional state, 

As t be expected, extent of informa- 
tion < 1g was positively correlated with 
amou::t of information obtained (r= .41, 
a 5 ¢ 01). 

Discussion 

The | , ,otheses that both preoperative level 
of fear ond extent of information seeking 
would . >w a curvilinear relationship to re- 
covery m surgery were not supported. No 
relation. ip was found between extent of in- 
format’. seeking and the recovery measures. 
A linee clationship was found between pre- 
operat’ fear and recovery, with the low- 

; fear s jects scoring more favorably than 
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Ficure 3, Mean number of analgesics at each level 
of fear for subjects who had received much and 
little information. 
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Ficure 4. Mean number of sedatives at each level 
of fear for subjects who had received much and 
little information. 


either the high- or moderate-fear subjects on 
all recovery measures. The hypothesis that an 
increased amount of preoperative information 
would benefit the moderate and limited infor- 
mation seekers more than extensive informa- 
tion seekers was also not supported. Infor- 
mation interacted with level of fear rather 
than type of coping and benefited most 
those who experienced high levels of fear 
preoperatively, 

The results of this study, taken with the 
findings reported by a number of other in- 
vestigators, suggest that the theoretical con- 
structs in the Janis-Leventhal model may be 
inadequate for the analysis of human behavior 
in the stress situation of impending surgery. 
At least, the patient-reported level of pre- 
operative fear used in this study may not be 
related to recovery in the same way as 
“reflective fear.” Other investigators, how- 


ever, have used a number of measures of 
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Ficure 5. Mean number of days to discharge at 
each level of fear for subjects who had received 
much and little information. 


preoperative fear. Cohen and Lazarus (1973) 
used observer ratings and self-reports, Wolfer 
and Davis (1970) employed an adjective 
checklist, and Johnson et al. (1971) used an 
adjective checklist and a modification of the 
Taylor Manifest Anxiety Scale. None of these 
investigations have supported the curvilinear 
hypothesis. 

It might be argued that the linear rela- 
tionship found between preoperative level of 
fear and recovery is a function of a variable, 
such as a personality characteristic, which 
had not been identified. The higher fear pa- 
tients could be viewed as dispositionally 
more sensitive to threat and stress stimuli 
than the lower fear patients. Patients experi- 
encing high levels of fear preoperatively, 
therefore, would experience a more disrup- 
tive recovery period. Postulating a personal- 
ity predisposition could explain the postoper- 
ative negative affect scores; however, it does 
not explain the significant interaction found 
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between level of fear and amount of informa- 
tion. Given a postulated dispositional sensi- 
tivity to stress, it is unclear why an inc‘eased 
amount of information would be assoc ‘ated 
in the high-fear group with fewer medic ‘ions 
and shorter hospitalization. 

The findings are consistent with the varal- 
lel response model proposed by Lev: ‘thal 
(1970). Leventhal has suggested thai mo- 
tions and adaptive behavioral respons: are 
independent reactions to an individua! ap- 
praisal of stress stimuli. Under conditi s of 
threat, an individual will engage in be^- vior 
to reduce danger irrespective of his emo onal 
state. Johnson et al. (1971) found su: re- 
sponses to be independent in a study o! sur- 
gical patients. In this present study, vell- 
informed subjects who experienced high |. vels 
of preoperative fear received fewer anal.’ sics 
and sedatives and were hospitalized a sù- “ter 
period of time than their poorly info. ned 
counterparts; however, both groups rep ted 


comparable postoperative states of neg ‘ive 
affect. The postoperative emotional ste, 
therefore, seemed to be independent of © ose 
behaviors associated with the administr 
of medication and the date of discharge 

The results of this study would have | <en 


on 


more amenable to explanation had the pre ise 


nature of the preoperative information nca- 
sure been known. Subjects reported fron: a 
selected list those events about which they 
had received information and whether their 
expectations of postoperative events had been 
accurate. The nature of the information 
actually received is unknown, In a series of 
studies investigating the effect of instruction, 
the Johnson—Leventhal group consistently has 
found that instruction on the sensations to be 
experienced reduces distress during threaten- 
ing procedures (Johnson, 1972, 1973; Johnson 
& Leventhal, 1974; Johnson et al., 1973) and 
that instructions on danger-controlling behav- 
iors when coupled with sensation informa- 
tion contributes to reduced distress and in- 
creased compliance with a noxious diagnostic 
procedure (Johnson & Leventhal, 1974). 

In this study, the informed patients who 
experienced high levels of fear preoperatively 
continued to experience emotional distress 
postoperatively. They perhaps received little, 
if any, sensation information. The nature of 


OO ey ee 


the information received was more likely de- 


scription the procedural events that would 
occur ar irumental behaviors expected. In- 
formal o!l vations of the interactions between 
staff and ent support this supposition. Staff 
tended í form patients about such things 
as the of surgical procedure planned, 
postoper: ve deep breathing and coughing 
exercise ! awakening in the postanesthetic 
recover; om. The only sensation informa- 


tion ge lly given was that the patient 


could e incisional pain. A possible expla- 
nation he recovery measures received by 
the high r group is that those who received 
much p erative information received the 
kind oí ormation that, despite the high 
levels »stoperative negative affect, en- 
abled tl to participate in their care in such 
a way t} hey influenced the length of their 
stay anc ount of medication received. 
Curio in the moderate-fear group an 
increase. «mount of preoperative informa- 
tion was «sociated with a significantly longer 
period o \ospitalization and a slight increase 
in amoi of medication. It is unclear why 
increase: information in patients who experi- 
enced x: oderate levels of preoperative fear 
would associated with a lengthened stay, 
whereas increased information in high-fear 
patients led to a shorter stay. Do well- 
informed, moderately aroused patients seek 
to maiiiain contact with health-care pro- 
viders? Perhaps moderately aroused patients 
find the postoperative recovery period less 
Stressful than highly aroused patients and 
| therefore use the information they have re- 


ceived to lengthen rather than shorten the 
| recovery period. 
| Whatever the interpretation, these data 
Support the independence of postoperative 
emotional reactions and those behaviors re- 
lated to drug administration and length of 
hospitalization. These data also suggest that 
the preoperative information received bene- 
fited the high-fear group and may have 
disadvantaged the moderate-fear group. 

In this study, the active seeking of informa- 
tion and the obtaining of information per se 
did not lead to an improved recovery period. 
However, extent of information seeking cor- 
telated at a rather low magnitude (.41) with 
amount of information received. The method 


iii ial 
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used in this study to measure information 
seeking may not have accurately separated 
patients on this variable. Self-ratings may 
be less accurate than observer ratings. In 
further investigations concerned with this 
variable, efforts should be made to use an 
objective measure that does not rely on the 
recall ability of the patient. A possibility 
would be to tally the number of questions 
asked by the patient. 

This study suggests that interaction effects 
may need to be identified in order to cor- 
rectly interpret the relationship of preoper- 
ative instruction to recovery from surgery. 
Studies in this area, therefore, should further 
explore interaction effects of preoperative 
variables, particularly those that can be ma- 
nipulated by clinical practitioners. Specif- 
ically, the results of this study suggest further 
investigation of the relationship between level 
of preoperative fear, amount of preoperative 
information, and recovery from surgery. Al- 
though the results of this study suggest that 
high-fear subjects may benefit most from pre- 
operative information, more definitive data 
are needed. Experimental manipulation of 
type and amount of information among pa- 
tients with differing levels of preoperative 
fear is required before any firm conclusions 
can be drawn. 
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Soci: Facilitation of Word Associations: Further Questions 


Thomas O. Blank, Ilene Staff, and Phillip Shaver 
Columbia University 


‘he purpose of this study was to explore the effects of the presence of an ob- 
er on the latency and commonness of word-association responses. Previous 
rch on these effects had been inconsistent and inconclusive. Each of 40 
subjects responded to a list of words while in the presence of an ob- 
er and while alone; half were given an explanation for the presence or 
nce of the observer and half were not. Subjects scored significantly higher 
a measure of how common their responses were and gave fewer unique 
esponses when observed; there were no significant effects on latency. Further 
nalyses indicated that the observer conditions primarily affected responses to 
vords for which there was little consensus of choice of response word and did 
his by reducing the number of idiosyncratic responses, These results point to 
inadequacies in the drive theory and suggest that a more “cognitive” explana- 


The ef) s of an audience on human per- 
formance ` «ve been studied repeatedly by so- 
cial psych: ‘ogists ever since Triplett’s (1898) 
experimen . in the 1890s, but there was no 
satisfacto. explanation of the diverse findings 
until Zaj published his well-known theo- 
retical p: er in 1965. According to Zajonc 
(1965), h facilitated and debilitated per- 
formanc:. -ould be explained in terms of Hull- 
Spence dive theory if it were assumed that 
the presece of an audience raises a subject’s 
drive levi, which in turn results in the 
enhancenint of dominant responses. Thus, 

| simple ov well-learned (dominant) responses 
are facilitated by the presence of an observer, 
but complex, creative, or poorly learned (non- 
ominani) responses are impeded. Zajonc 
Claimed that his hypothesis was consistent 


With most of the early literature on social 
facilitation, and it has been supported re- 
Peatedly by studies performed since 1965 (see 
Cottrell, 1972, for a review). 


The authors are grateful to Robert Zajonc and 
Margaret Matlin for information concerning their 
| *Xperiment and for comments on an earlier draft of 
this Paper. This research was supported by National 
Institute of Mental Health Grant 1R03-MH-25543 
to Phillip Shaver, 

Requests for reprints should be sent to Phillip 
$ aver, who is now at the Psychology Department, 
a York University, New York City, New York 
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ion, incorporating the concept of self-presentation, is appropriate. 


Most of the recent studies have been based 
on one of two paradigms: (a) pseudorecogni- 
tion, in which subjects guess which one of 
several words varying in familiarity has ap- 
peared briefly in a tachistoscope (e.g., Henchy 
& Glass, 1968; Zajonc & Sales, 1966), and 
(b) task performance, in which subjects work 
on a task that can be scored for correctness, 
such as traversing simple or complex mazes 
(e.g, Hunt & Hillery, 1973). For some 
reason, less attention has been devoted to a 
third paradigm employed by early investi- 
gators of social facilitation, namely, word 
association. Only one recent study (Matlin & 
Zajonc, 1968) has used the word-association 
technique.’ This is unfortunate both because 
verbal behavior is such an important aspect of 
human behavior and because free associations 
could, in principle, provide important clues 
about the process of social facilitation in hu- 
mans that are unavailable in studies based on 
task performance. Both latency and a mea- 
sure of dominance—commonness of a response 
or mean commonness of a set of responses— 


1Two related studies deserve mention. Good 


(1973) followed up an issue raised by ae ‘a 
Zajonc but did not compare observer ae alone 
conditions. Kljaic (1972) published a relate ore 
in a book published in Yugoslavia; unfortunal ely, 
only an abstract is available in English. He ae 
ently found shorter latencies and moe ae i ja 
sponses with a single observer than alone, bu 
opposite was true for an attentive public. 
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can be obtained and analyzed. Also, word- 
association responses have been studied ex- 
tensively on their own, and those studies pro- 
vide a wealth of information concerning the 
relationship of word-association responses to 
anxiety, external stress, and other possible 
manipulations of drive level. Moreover, close 
examination of both the earlier word-associa- 
tion experiments and the study by Matlin and 
Zajonc reveals a number of unsolved problems. 

First, not all of the early studies support 
Zajonc’s drive theory, and many that do pro- 
vide only weak support. Allport (1920, 1924) 
performed a series of chain-association experi- 
ments in which subjects, working either alone 
or in a group, wrote down a string of associa- 
tions to several stimulus words. Allport con- 
cluded that more associates were produced in 
the group condition; he also noticed that 
“personal” (idiosyncratic) associations were 
more common in the alone condition, whereas 
what he called “free rising” and context- 
relevant associations were more common in 
the group. (Free-rising associations were in- 
directly but not idiosyncratically related to 
the stimulus words.) According to Allport 
(1920, pp. 167-168), the group stimulates 
productivity and 


takes [the subject] “out of himself” . . . [directing] 
his ideas toward outside objects, and... to the 
actual presence of others. In the group we... 
become objective rather than egocentric, present 
rather than retrospective. 


Although Allport’s conclusions may have been 
correct, his evidence of increased productivity 
in the group was rather weak. In Experiment 
IV, for example, there was no difference be- 
tween alone and group conditions in the aver- 
age number of associations, and the average 
gain of the two subjects who did better when 
alone was twice as large as the group gain 
evidenced by the other six subjects. In Experi- 
ment IT, the one most often cited as evidence 
for increased productivity in the group condi- 
tion, the difference was only 3.3 associations 
(63.6 for the group and 60.3 for the alone 
condition), and this may have been due to 
practice effects; extra trials were included in 
the group condition.” 

Ekdahl (1929) conducted an extensive 
series of studies in which subjects, working 


T. O. BLANK, I. STAFF, AND P. SHAVER 


either alone or in the presence of an experi- 
menter, gave associates to visually presented 
words. Contrary to Allport’s cv clusion, 
Ekdahl obtained an increase in respon- speed 
when subjects worked alone. In a : oup of 
100 students, for example, 62 had sh- ter re- 
sponse latencies when alone (mean d: ‘erence 
of 20.95 seconds), whereas only had 
shorter latencies when observed (mes: ‘liffer- 
ence of 15.35 seconds). Ekdahl rey ied a 
wealth of introspective comments in cating 
increased task difficulty, nervousn: and 
evaluation apprehension in the presen: of the 
experimenter. Many subjects also m. Joned 
withholding associations that might ha. been 
considered strange. This is consistent » :h the 
conclusion of Wapner and Alper (19 who 


found increased response latencies 

phrase-completion task when subjects 

in the presence of an audience. Wap: 
Alper attributed the increased latency 
extra processing step—search for “sa 
socially acceptable responses. This mig 
account for the reduction in personal « 
tions observed in Allport’s (1920) 

condition, 


Finally, Dashiell (1930, 1935) repor) 4 in- 
creased speed (more output per unit ti.) in 
a serial association task when two obs vers 
were present but a marked reduction in output 
(compared with an alone condition) hen 


subjects worked in noncompetitive grot 

On balance, then, the early evidenco for 
social facilitation of word associations is quite 
mixed. Some studies support the facilitation 
hypothesis and some contradict it; ie of 
the studies is statistically powerful, an 
eral (e.g., Allport, 1920; Ekdahl, 1929) 
gest that more than drive may be neede 
explain some of the results. 

The more recent experiment by Matlin and 
Zajone (1968) appears at first sight to be 
much less equivocal. Response latencies were 
measured from a tape recording, and response 
dominance was defined empirically in terms 


sug- 
d to 


2 See Williamson (1926) for a critical examination 
of Allport’s methodology. Some of W: iamson’s 
analyses are inaccurate and apparently based on 4 
misreading of Allport’s original data. However, some 
of his insights are cogent and point up methodo- 
logical and, procedural difficulties in Allport’s studies: 


of both the ”2lermo and Jenkins (1964) asso- 


ciation norms «nd a measure of response com- 
monness bas on all associates given within 
the study i The contaminating effects of 
competition non in earlier studies, were 
eliminated ; »bserver was not billed as a 
competitor, > participated not as another 
subject bu neone “interested in word 
association: study yielded a highly sig- 
nificant red in word-association latency 
in the obs: yndition and two significant 
interaction: ting that when the order 
of conditie vas “observer” followed by 
“alone” (wi ubjects), more common and 
less unique uses were given in the pres- 
ence of th erver. Matlin and Zajonc 
(1968) int: l all of their results as evi- 
dence for i > theory of social facilita- 
tion: Incre rive enhanced the emission 
of domin: espenses, which resulted in 
decreased | latency. 

A close: ra questions, however. 


First, the 
port for th: 


ondition that produced sup- 
vnance hypothesis was one in 


which subj ve associations in the pres- 
ence of an ver who then, without expla- 
nation, got id walked out at the break 
halfway t the experiment, The result, 
slower late : and less common associates 
after the vers departure, might have 
been due to « feeling on the part of the sub- 
ject that his responses were not interesting 
enough to the observer. 

Second, Matlin and Zajonc used measures of 
Tesponse dominance that were especially sensi- 
live to uniqueness; response frequencies were 
Converted to ranks, and analyses were per- 
formed on ranks instead of directly on 


“the frequen Since this had the effect of 
giving added weight to unique responses and 
obscuring the actual frequency of the more 
Common responses, Matlin and Zajonc may 
hot have been justified in interpreting their 
Tesults in terms of enhancement of common 
Tesponses. 

Third, Matlin and Zajonc measured re- 
Sponse latencies with a stopwatch, rounding 
to the nearest fifth of a second. Since many 

tencies are less than a second, more accurate 
Measurement would be desirable, 

Finally, if this powerful audience effect is 
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reliable, one must wonder why there is no hint 
of the countervailing tendency, discussed by 
Allport (1920, 1924), Ekdahl (1929), and 
Wapner and Alper (1952) to slow down, 
censor responses, and make a desirable impres- 
sion. The size of Matlin and Zajonc’s latency 
effect was very large compared with what 
might have been expected given the earlier 
studies of this kind. 

For several reasons, then, we considered it 
important to conduct a study similar to 
Matlin and Zajonc’s.* Ours differed from 
theirs in the following ways: (a) A more accu- 
rate measure of response latency was em- 
ployed; (b) the explanation for the observer’s 
departure halfway through some conditions 
was varied to see whether this could account 
for Matlin and Zajonc’s interactions; (c) ad- 
ditional measures of response dominance were 
constructed and analyzed; (d) self-report 
measures of emotional state were included; 
and (e) the observer was a female rather 
than a male. The first four differences were 
designed to provide additional information. 
The last was mainly a matter of convenience 
(social facilitation theories have so far not 
made predictions about the effects of the ob- 
server’s sex). 


METHOD 
Subjects 


Forty-one male students at Columbia University 
were recruited through an advertisement in the 
school newspaper and were paid $1.50 each for par- 4 
ticipating in the experiment. Data for one subject | 
had to be discarded because he failed to follow 
instructions (often he responded with definitions and 
sentences rather than words). All analyses were 
performed on the remaining 40 subjects. 


Materials 


As in the Matlin and Zajonc (1963) experiment, 3 
stimulus words were taken from the Palermo-J' enkins — 
(1964) list of associations. Words with eit! 
were eliminated. In an effort to shorten the experi- à 
ment and reduce the possibility of subject moen 4 
only half of the remaining 184 words ils ee 
These 92 words were selected as follows: i te 
words were ranked according to ages ae e 
most common response given in the mals ci ee 
student norms and then distributed over four 


lso learned 
study was completed, we al 1 
we aioe Ole 1) that he and a student failed 
to replicate the original Jatency findings. 


ee ee ee) ee TE 


728 


lists in a counterbalanced fashion. The two sublists 
with the closest summed frequencies were designated 
List A and List B. The order of the words within 
each list was determined by randomly ordering the 
ranks; the same order was used for each list. One of 
the two stimulus tapes used in the experiment con- 
sisted of List A followed by List B and the other 
of List B followed by List A. The tapes began with 
a short summary of the instructions and three prac- 
tice words. All words, practice and experimental, 
were presented at the rate of one every 10 seconds. 


Design 


The design was a 2 X 2 X 2 X 2 with repeated mea- 
sures on one factor; each subject worked both alone 
and in the presence of an observer. The between- 
subjects variables were order of observer conditions 
(observer first or alone first), the order in which 
the two lists were presented (AB or BA), and the 
nature of the explanation given to the subject con- 
cerning the observer. As regards the explanation, half 
of the subjects were told nothing, as in Matlin and 
Zajonc’s experiment, and half were told that there 
would be an observer who would be present for 
only half of the experiment due to her busy schedule. 
Thus, the latter subjects were prepared for the 
appearance and sometimes eventual disappearance of 
the observer before beginning to respond to the 
word lists. 


Procedure 


Each subject was greeted by a male experimenter, 
brought to the experimental room, and seated at one 
end of an L-shaped table. In front of him were two 
reel-to-reel tape recorders; a speaker was attached to 
one and a microphone to the other. 

The experimenter began the instructions with a 
very brief statement about word associations, explain- 
ing that they had often been studied by linguists and 
psychologists. This was followed by an explanation 
of the task. Subjects were told they would hear a 
list of words on one tape recorder and should “say 
the first word you think of.” They were told that 
the other recorder would record the stimulus and 
their response, that there would be two parts of the 
experiment with a break in between, and that in 
each segment there would be approximately 50 words, 
1 every 10 seconds. 

The only other instructions concerned the observer 
and differed somewhat from condition to condition. 
In the two explanation conditions, the experimenter, 
after explaining the task, added, “By the way, a stu- 
dent interested in word associations would like to 
observe your responses, but she doesn’t have time to 
be here for the whole session.” For the observer- 
first condition, the experimenter continued, “I think 

she’s around now so I’ll have her come in; she will 
stay until the break.” For the alone-first condition, 
he said, “I don’t think she’s around now, so I'll have 
her come in at the break.” For the observer-first/no- 
explanation condition, immediately before beginning 
the tape recorders, the experimenter merely said that 
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a student interested in word associations would like 
to observe. For the alone-first/no-explanation condi- 
tion, the same thing was said but not until aite» the 
break; no mention of the observer was made © the 
original instructions. For all conditions, the :: beri- 
menter added, “She'll be sitting in the corner. here, 


so you don’t have to pay any attention to `er.” 
The experimenter always asked for the subject ver- 
mission to be observed and in every case obtain. ` it. 

The actual experiment began when the experin. ster 
started the tapes, After preliminaries (instruc ons 
and practice) there were 8 minutes of stimulus yds, 


a 3-minute break, and another 8 minutes of v ds, 
During the break the subject was told to rem in 
the room; the experimenter asked about prob ms 
and questions, postponing answers to subst ve 
questions until the end of the experiment. 

While in the room, the observer sat at the o 0- 


site end of the table about 6-7 feet (182.88-2) 36 
cm) from the subject, facing in a direction per; on- 
dicular to the subject’s direct line of sight. Thus, he 
observer was in the periphery of the subject’s v ‘on 
but not in a position to make direct eye con! ct. 
While there she did not initiate any convers on 
and responded to any of the subject’s questions «th 
polite but short answers. On the few occasions <en 
a subject asked about a particular stimulus v <d, 
the observer replied that she had not hear it 
clearly; if asked to confirm what the subject tho ht 
he heard, she did so when he was correct, V en 
leaving, either at the break or at the end of he 
second session, the observer simply thanked he 
subject and walked out. 

At the end of the second session, the experims. ‘er 
returned, shut off the tape recorders, and gave ùne 
subject a questionnaire concerning (a) relaxation or 
nervousness before, after, and during each hal of 
the experiment, (b) the general pleasantness of © 1¢ 
situation, and (c) perceived effects of the obser» :'s 
presence. When the subject finished the quest on- 
naire, the experimenter returned, answered any tes- 
tions, and then thanked, debriefed, and paid the 
subject. 


Measurement of Response Latencies 


Tape recordings of each session were played hack 
at half speed by a coder who was blind to the sub- 
ject’s condition. He started a push-button electronic 
millisecond timer as the stimulus word ended and 
stopped it at the beginning of the response, That 
elapsed time was then halved to return it io real 
time. A 1-month test-retest reliability check on a 
sample of 92 stimulus—response pairs for cach of 
five subjects yielded a reliability coefficient above 
-99. The difference between the two timings was 
usually Jess than one tenth of a second. 


Measurement of Response Dominance 


Three indices related to response dominance were 
constructed in the following manner, 

Response commonness. A subject was given a score 
for each word equal to the frequency of his response 
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across all 4 jects, regardless of condition. Thus, 
a high num’) ı this measure indicated a common 
response; “'! dicated a unique response, and words 


given as r an equal number of times received 
the same We call this score, or the mean 
derived fre esponse commonness. This mea- 
sure, which implest and most direct index of 
response do has also been used by Good 
(1973) in } to the indirect measure used by 
Matlin anc ic (1968). Commonness values 
ranged fron 27 g 

Response nonness. This measure was the 
same as ti measure of response dominance 
used by M j jonc (1968). The most com- 
mon respon cored “1,” the second most com- 
*mon “2,” a In case of ties, each response 
was assign rage of the ranks between the 
next highes lowest nontied words; unique 
responses \ ı the mean of all unique response 
values. (Fo r information see Matlin & Za- 
jonc, 1968, 5.) It should be noted that higher 
numbers on -> measure indicate more uncommon 
responses, nse uncommonness scores ranged 
from 1 to 

Uniquene This is simply the number of idio- 
syncratic onses given by a particular subject. 
Uniqueness = ores ranged from 3 to 43. 
Measuren:: at of Stimulus Commonality 

Stimulu ords were separated into three cate- 
gories accor ‘ing to the likelihood of their evoking a 
common onse. The high-commonality category 
included that produced a particular response 
for at leas of 40 subjects. For example, “buying” 
elicited “ vg” 27 of 40 times. The low-common- 
ality cates included words that produced a par- 
ticular resjonse for, at most, 7 of 40 subjects. For 
example, the most common response to “music” was 
“Snstrume: but this was given by only 3 subjects. 


The medi commonality category included all the 
remaining ords, These specific categories were 
chosen be they followed natural breakpoints 
in the fr , distribution and yielded roughly 
equal grou; 


RESULTS 


The analyses to be reported were performed 
on each subject’s mean or median scores with- 


in the observer and within the alone condi- 
tions. 


Response Latencies 


The latency data were analyzed in three 
different ways. Matlin and Zajonc (1968) had 
analyzed median latency of alone and ob- 
Server conditions; therefore, our first analysis, 
an Observer X Explanation X Condition Or- 
der X List Order analysis of variance, was 
Performed on the median latency scores for 
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observer and alone conditions. There were no 
significant effects; the average median laten- 
cies were 1.57 seconds in the observer condi- 
tion and 1.64 seconds in the alone condition. 

We also performed a 2 X 2X 2 X 2 analy- 
sis of variance (Observer X Explanation X 
Condition Order X List Order) on the mean 
latencies for each condition. The results paral- 
leled those for the median latency: The F 
ratio for observer was .98, and no other effects 
approached significance. The means were 1.90 
and 1.85 seconds for the alone and observer 
conditions. 

Finally, we looked at within-subjects differ- 
ences in variability (there were no significant 
between-subjects variability differences be- 
tween alone and observer conditions). Rather 
than performing the statistical test for the 
entire group, we used F ratios based on ob- 
server variance for each subject. A slight ma- 
jority of the subjects showed more latency 
variance in the observer condition. 


Response Dominance 


Response commonness. A 2X2X2X2 
analysis of variance was performed on the 
response-commonness scores, and the follow- 
ing significant result was obtained: a Condi- 
tion Order X Observer interaction, F(1,32) 
= 5.43, p < .05. This effect was due primarily 
to subjects in the observer-first condition, who 
showed considerably more common responses 
in the observer condition; the effect was 
strongest for subjects who heard List B first. 
There were no significant main effects. The 
mean response-commonness score for the ob- 
server condition was 6.01 and for the alone 
condition, 5.86; for the observer-first condi- 


4This uniqueness score is not equivalent to the 
score Matlin and Zajonc called uniqueness. Theirs 
was the total number of responses given by a subject 
that were not found in the Palermo-Jenkins norms 
(i.e., by a standard external to the responses E o 
subjects in the experiment). Because Mat in an 
Zajonc reported that the findings on this measure 
were close to those from their commonness pe! 
and because the Palermo-Jenkins norms ee = 
tained in a time and place quite far remove: i 
the one in which our experiment was done, we i 
not perform a similar analysis. Rather, oat PENRE 
ness score is entirely a comparison internal to the 


responses given in the present experiment. 
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tion the means were 6.51 with an observer 
and 5.76 alone, 

Response uncommonness. The response- 
uncommonness analysis of variance yielded a 
significant three-way interaction similar to the 
two-way interaction for the response-common- 
ness measure, but stronger for one list order 
than the other (Condition Order x List Order 
X Observer, F(1,32) = 5.13, p < .05). More 
importantly, it also yielded a highly signifi- 
cant main effect for observer, #(1,32) = 7.63, 
p < .01. There were no other significant main 
or interaction effects. The means were 7.53 for 
the alone condition and 6.88 for the observer 

condition. 

Uniqueness. Uniqueness scores analyzed in 
a2 2 2 X 2 analysis of variance produced 
the same pattern of results obtained for the 
uncommonness measure: a significant main 
effect of observer, F(1,36) = 6.04, p < .05, 
and an Order X Observer interaction similar 
to the uncommonness three-way interaction, 
F(1,36) = 2.75, p< .10. Mean uniqueness 
was 19.60 alone and 17.45 observed. 

Stimulus commonality. Stimulus words were 
divided into high-, medium-, and low-common- 
ality groups, and a 2x 2 3 analysis of 
variance (Condition Order X Observer x 
Stimulus Commonality) was performed on 
response latencies. The only significant main 
effect was for commonality, F(2, 76) = 17.13, 
Ż < .001. High-commonality words were re- 
sponded to more quickly. An interesting Ob- 
server X Condition interaction was suggested: 
Whereas responses were slightly faster in the 
observer condition both overall (1.88-1.83 
seconds), and on high-commonality words 
(1.77-1.69 seconds), stimuli of low common- 
ality were responded to more slowly in the 
observer condition (2.04-2.10 seconds). This 
interaction was not statistically significant, 

however, F (2, 76 1.80, p < .17. 

Finally, we did analyses using only the most 
common stimulus-response pairs. These pairs 
should most cle: show drive effects if 
there are any at al! he means for each 
of the five most i ulus—response 


pairs (n = 23-27) ‘ly shorter 
response latencies in ob é ‘ons, none 
of the differences approache nce (the 

n effect 


F ratio for the largest obs 
was 1.18; all others were les 
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Questionnaire Data 


Subjects were asked to report on a 7-point 
scale their level of nervousness or relax::ion 
“during the first half of the experiment” snd 
“during the second half of the experiment.” 
These responses were then recoded, depent ing 
on condition order, into “alone” and 
served” categories. A 2 X 2 X 2 (Observer x 
Explanation x Condition Order) analysis of 
variance on the relaxation report yielded =g- 
nificant main effects for explanation, F(1, 6) 
= 4.40, p < .05, and Observer, F(1,36) = 
12.02, p < .005, and a Condition Order > j= 
server interaction, F(1,36) = 4.33, p< 5 
The Explanation X Condition Order inte 
tion approached significance, F (1,36) = 3 ?, 
$ < .075. Subjects in all explanation «id 
order conditions reported themselves tọ ve 
more relaxed when they were alone than w’ 
they were observed. The largest difference of 
means was in the observer-first group; thse 
subjects reported themselyes significantly iss 
relaxed when observed than when alone. ‘I 
were also less relaxed when observed than ‘ne 


} 


other subjects. Subjects who had not boon 
i 


given an explanation for the observer report- 
themselves as significantly more relaxed before 
the start of the experiment, #(38) = 2.71, # 
.05, in the alone condition, t(38) = 2.53, p 
.05, and after the experiment 4(38) = 2.( 
p<.05. 


Dominance or Uniqueness? 


As mentioned in the introduction, Matlin 
and Zajonc (1968) used a measure of response 
dominance—here called response uncommon- 
ness—that, in effect, gave special weight io 
unique responses. This left open the possibil- 
ity that suppression of unique responses in the 
observer condition was more important than 
“enhancement of dominant responses.” In fact, 
in the present study the pattern of results for 
the Matlin-Zajonc measure paralleled the pat- 
tern for the uniqueness measure more closely 
than it paralleled the pattern for response 
commonness, the most direct measure of re- 
sponse dominance. This is reminiscent of All- 
port’s (1920, 1924) results. Instead of dis- 
cussing commonness or dominance, he re- 
ported that subjects in the alone condition 
gave more “personal” or idiosyncratic re- 


sponses, 
ther, each 
scores, ti 
ously ( 


rder to explore this matter fur- 
ubject was assigned five pairs of 
ueness scores mentioned previ- 
r ione and observer conditions) plus 
ií frequency-based numbers for 
mmonness (2-3, 4-7, 8-19, 20, 

se scores indicate how many 
ich level of commonness a sub- 
e alone and in the observer con- 
| displays the mean frequencies 
ness category. The alone and 
; in each category were com- 
‘lated ¢ tests. Only in the case 
ponses was the alone-observer 
nificant, £(39) = 2.54, p < .01. 
rtually no difference in the most 
egory, Where Matlin and Zajonc 


a simila: 
each ley 
and abo: r} 
respons 


for each 
observes 


would } expected the greatest difference. 
Discussion 

The present experiment was conducted in 
order to explore the effects of an observer on 
the laten: y and response dominance of word 
associati Earlier research had obtained 
both lon: = and shorter latencies in the pres- 
ence of observer, The most recent experi- 
Ment & these lines (Matlin & Zajonc, 
1968) ‘ined markedly shorter latencies in 
the presence of an observer but raised more 
question ihan it answered. 

One such question concerns the unexplained 
but potentially embarrassing departure of the 
Observer halfway through the experiment. In 
the present study, even though the explana- 
tion variable significantly affected subjects’ 
self-reporied relaxation, it did not affect any 
of the b oral measures. 


A more important question concerns re- 
Sponse latencies as indices of drive, In the 
Present study, contrary to what would have 
been expected on the basis of Matlin and 
Zajonc’s results, presence of the observer had 
no effect on response latencies, even though 
Very precise measurements were obtained. 
These results are compatible with the history 
Of research on social facilitation of word asso- 
ciations prior to Matlin and Zajonc’s study, 
perhaps indicating that their results were due 
to chance or to some unidentified variations in 
Procedure. 

_ As in Matlin and Zajonc’s experiment, sub- 
Jects in the present experiment had higher 
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TABLE 1 


MEAN FREQUENCY or UNIQUE, UNCOMMON, 
MODERATELY UNCOMMON, COMMON, AND VERY 
COMMON RESPONSES IN ALONE AND OBSERVED 
CONDITIONS 


Observer condition 


Type of response Alone Observed 
Unique (1)# 19.65 17.45 
Uncommon (2-3) 7.83 8.63 
Moderately uncommon (4-7) 5.43 6.18 
Common (8-19) 8.98 9.40 
Very common (20-27) 3.90 3.78 


„ Nole. Numbers in parentheses indicate the range of response 
frequencies for the stimulus words in each category. 
*1(39) = 2.54, p <.01. 


mean commonness scores and gave fewer 
unique responses while being observed. Matlin 
and Zajonc interpreted this pattern in their 
experiment as support for the notion that 
drive increases the likelihood that dominant 
responses will be emitted. Another interpreta- 
tion, consistent with findings obtained by All- 
port (1924), Ekdahl (1929), and Wapner 
and Alper (1952), is that subjects avoid per- 
sonal, strange, or unique responses in the 
presence of an unfamiliar observer, Two of 
the three response-content measures used in 
the present study—uniqueness and uncom- 
monness—were sensitive to suppression of 
unique responses. They also proved to be 
more sensitive to the observer manipulation 
than was response commonness, the third con- 
tent measure and the one closest to what 
Matlin and Zajonc meant by “response domi- 
nance.” Moreover, as shown in Table 1, the 
observer had the greatest effect on the number 
of unique responses given by a subject. Rather 
than emitting more common (dominant) re- 
sponses when observed, subjects seem to be 
avoiding strange or idiosyncratic responses. 
This is quite different from the picture pre- 
sented by Matlin and Zajonc. 

Finally, subjects reported being less relaxed 
in the observer condition. Although this might 
be taken as a sign of drive, it can also be 
interpreted in terms of the subjects’ concern 
about making a favorable impression on the 


observer. 


5 These commonness categories reflect naturally 
occurring breaks in the frequency distribution. 
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Taken together, our results provide little, if 
any, support for drive theory; instead, they 
point up the need for a more social, cognitive 
explanation. The lines along which such an 
explanation could be developed have been sug- 
gested by several recent authors. Jones and 
Gerard (1967) base their explanation on the 
concept of conformity. Duval and Wicklund 
(1972) speak of socially induced “self-aware- 
ness.” Ironically, these explanations have more 
in common with the early theorizing of All- 
port and Ekdahl than they do with Hullian 
drive theory. A complete, coherent version of 
this impression-management explanation re- 
mains to be formulated. 

Even though the general nature of the “‘so- 
cial” component of social facilitation is fairly 
clear, recent writers have neglected the spe- 
cific cognitive processes that might mediate 
social facilitation effects. Drive theory’s pur- 
ported ability to do this, in terms of drive and 
response dominance, was one reason for the 
field’s favorable response to Zajonc’s 1965 
paper. In our opinion, recent information- 
processing models can serve this purpose bet- 
ter than drive theory. In such a model, an 
arousal component that accounts for the gen- 
eral facilitation of responses under high 
arousal conditions and a test or criterion com- 
ponent that could cause slowdowns can both 
be conceptualized as independent mechanisms 
within a single cognitive framework. A full 
adaptation of such an information-processing 
model to social facilitation is beyond the 
scope of this paper, but we can say that the 
experiment reported here and other recent 
studies (Blank, 1976) can be fruitfully con- 
ceptualized in terms of social and informa- 
tion-processing concepts that would be difficult 
if not impossible to translate into drive termi- 
nology. 

Word-association tasks should be very use- 
ful in evaluating * explanations of social 
facilitation. Sp rd lists could be con- 
structed, for exam] ʻo reveal the precise 
nature of censorii impression-manage- 
ment in certain situ “nfortunately, the 
commonly used Pal kins (1964) or 
Kent-—Rosanoff (1910) ts are not very 
suitable for such purpose the intention 
behind available lists o! able” words 
(e.g., Bodin & Geer, 19 uith, Hahn, 
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& Lieberman, 1968) is too obvious ʻo sub- 
jects. It seems worthwhile to constet new 
ones because the word-association task having 
no clear-cut correct answers and wing 
simultaneous measurement of latency com- 
monness, and semantic relations, co‘, as 
Allport believed, reveal the complex p! cesses 


involved in social facilitation. 


REFERENCE NOTE 


1. Zajonc, R. B. Personal communication, 
1975. 
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Frequency Perception of Successes as a Function of Results 


Previously Obtained by Others and by Oneself 


Robert Vreven and Joseph R. Nuttin 
University of Leuven, Louvain, Belgium 


The frequency perception of the outcome stimuli, “right” and “wrong,” fol- 
lowing each response in a series of tasks was investigated. The perceptual 
judgment about the number of rights and wrongs obtained was requested at 
the end of the task. The first four experiments were concerned with the effect 
of the number of successes obtained by a partner on a subject’s perception of 
the frequency of personal successes obtained in a simultaneous or immediately 
following task. It was found that females tended to compare their outcomes 
with those of a socius (contrast effect), whereas no stable pattern of results 
emerged for males. Experiment 5 studied the effect of previous personal success 
or failure on frequency perception of subsequent outcomes. The perceived 
frequency of later successes was assimilated to the number of successes in 
previous similar tasks and contrasted with outcomes in previous dissimilar 
tasks. The final experiment investigated the impact of a first series of two 
mutually exclusive neutral events on frequency perception of similar events in 
a subsequent series. An assimilation effect was found when both series were 
performed by the subject, whereas observation of the series of a socius had no 
effect. Results are discussed in terms of the effect of early information on sub- 
jects’ future expectancies and the role of competition and comparison processes. 


It is generally accepted that behavioral out- 
comes constitute the main factor in behavioral 
change. Conceptualized in terms of reward 
and punishment, positive and negative rein- 
forcement, etc., outcomes are considered to 
regulate the learning process. Due to an anti- 
‘cognitive trend in learning theory, perception 
of reward and punishment has long been ne- 
glected or overlooked, For the most part, per- 
ceptual studies have been concerned with the 
initiating stimulus side of the behavioral pro- 
cess. Just as the informative aspect of reward 
and punishment is taken into account, the way 
in which subjects perceive and recall their 
outcomes in serial tasks should not be over- 
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looked; it is essential to know how sub) ts 
process the information given by a series of 
outcome stimuli. In fact, “the behavior o a 
person who felt that his performance ` as 
largely unsuccessful would be expected to 
change more upon repetition of the task tin 
that of one who felt he was, overall, succ: s- 
ful” (Nuttin & Greenwald, 1968, p. 22; e 
also d’Ydewalle & Eelen, 1975, p. 434; Jans- 
sens & Nuttin, 1976; Underwood, 1969). 

On the other hand, a person’s perception of 
himself is largely based on the outcomes of 
his activity, that is, the extent to which he 
succeeds or fails in his performances. Some 
people tend to perceive their failures as (he 
dominant feature of their activity, whercas 
others seem to overlook their unsuccessful out- 
comes and feel overly optimistic about them- 
selves. Since a person’s behavior mainly con- 
sists of a series of acts and outcomes, it seems 
feasible to study his tendency to overempha- 
size either success or failure in quantitative 
terms, that is, in terms of frequency percep- 
tion of “right” and “wrong” stimuli in a serial 
task. It is obvious that studies concerned with 
the perceptual level of feedback and “knowl- 
edge of results” are to be distinguished from 
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investigations about subjects’ evaluations or 
mere estimations of results obtained. Finally, 
the preset research also attempts to relate to 
the rato: recently developed area of event 
perception and, in particular, to frequency 
percepti:. of stimuli and behavioral events 
(see, a ng others, Eckert & Kanak, 1974; 
Ekstra Wallace, & Underwood, 1966; Er- 
lick, 1941). 

The eriments reported in the present 
article ©» part of a research program on fre- 
quency -crception of the two main types of 
a pers behavioral outcomes, namely, suc- 
cesses ard failures in terms of “rights” and 
“wrongs ’ obtained in a serial task. Together 
with ai) ‘her recently published report (Jans- 
sens & Nuttin, 1976), this article attempts 
to syni. size our latest findings in this area. 
With t purpose in mind, a few data that 
have » been previously published in English 
(Lietac. 1966) are briefly summarized along 
with th most recent work. 

Earl xperiments have shown that percep- 
tual in. cessions of the relative number of 
successe and failures obtained in a serial 
task a: influenced by personality factors as 
well « social and situational conditions 
(D’An im & Nuttin, 1972; Janssens & Nut- 
tin, 16 Nuttin & Abreu, in press; Nuttin 
& Gree wald, 1968, pp. 15-31; see also Poi- 
tou, 19°’, 1968), The variables studied in the 
presen! cxperiments are related to the social 
and tenporal context in which successful and 
unsuccessful activities are performed. Gen- 


erally, when a person performs in a given sit- 
uation, he perceives other people as succeed- 
ing or {ailing in tasks similar to his own. We 
are concerned with the extent to which bril- 
liant or poor performances of other people in- 
fluence the perception of one’s own successes 
and failures obtained in similar tasks. It ap- 
Pears that social comparison is an important 
Component in a person’s perception of him- 
Self. The first four experiments are designed 
to determine whether this assumption is cor- 
rect, In a second main section of the present 
article we attempt to determine the extent to 
Which previous success obtained by the sub- 
Ject himself has an impact on his frequency 
Perception of success in subsequent similar or 
different activities. Finally, we attempt to 
explore briefly what differences, if any, char- 
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acterize frequency perception of affectively 
loaded events, such as behavioral outcomes, 
as compared with frequency perception of 
neutral events. 


Tue SOCIAL Context: THE INFLUENCE oF 
RESULTS OBTAINED BY OTHERS 


The first series of experiments investigated 
the impact of a socius’ level of performance 
(i.e., the relative number of his successes) on 
a subject’s frequency perception of his own 
successful outcomes obtained in similar tasks, 
In these experiments, the socius was always 
of the same sex and age as the subject, be- 
longed to the same social group (students), 
and was previously unknown to him. 

In the basic experimental setting, subjects 
have to perform individually a series of short 
tasks; immediately after each response, the 
experimenter provides predesigned feedback 
by enunciating “right” or “wrong.” The ex- 
periment is divided into two phases, In the 
first stage a socius (stooge) performs either 
very successfully or very unsuccessfully on 
the tasks. After observing this perfermance, 
the naive subject responds to a similar series 
of tasks; in this second phase, the subject 
always obtains an equal number of successes 
and failures. Finally, the subject, who has not 
received any preliminary instruction, is re- 
quested to indicate his perceptual judgment 
on the number of successes and failures he 
just obtained. The subject’s perceptual judg- 
ment is the dependent variable in each of the 
following experiments. 

With respect to perceptual judgment and its 
relation to others’ performance, two hypothe- 
ses have been tentatively formulated. It 
seemed plausible to predict that subjects, after 
having attended to a highly successful per- 
formance of a socius, would develop a high 
level of expectation and, afterwards, be struck 
by the difference in their own performance in 
which only 50% successes were obtained, 
Hence, a kind of contrast effect could be 
expected.? A similar effect in the opposite di- 


1 It should be noted that this kind of contrast on 


i hat different 
he basis of active comparison is somew! 
fo the so-called structural contrast impression that 


automatically emerges from a series of homogeneous 
items with one or two heterogencous elements, These 
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rection was predicted for subjects who had 
observed a socius’ very unsuccessful perfor- 
mance. At the same time, however, it could be 
expected that the socius’ highly successful per- 
formance would arouse competition in the 
subjects and activate a tendency not to feel 
inferior in order to maintain self-esteem. In 
this instance, this tendency could act to re- 
duce perceptually as much as possible the 
existing differences between subject and socius. 
Therefore, an assimilation effect could be pre- 
dicted. 

Since both of these hypothesized effects 
seemed equally predictable from existing the- 
ories, the difference between a high and a low 
performance of a socius in frequency percep- 
tion of own outcomes was approached at the 
outset as an empirical problem. Therefore, the 
first experiment, which was conducted in our 
laboratory (Lietaer, 1966), was of a rather 
exploratory nature. 


Experiment 1 
Method 


Subjects. Subjects were 60 male and 60 female 
university students who volunteered to participate 
in the experiment. The subjects were randomly as- 
signed to the different conditions with an equal num- 
ber of males and females in each condition. 

Procedure. Two individuals participated together 
in the experiment; one of them was actually a stooge, 
the other was the naive subject. Subjects were in- 
structed that they had to perform a task, called a 
realistic estimation task (RET); it consisted of a 
series of 22 photographs successively projected on a 
screen for a duration of 4 seconds, Each item repre- 
sented a certain number of objects of the same kind 
(trees, animals, bicycles, etc.), Each subject was re- 
quested to make a realistic estimation of the approxi- 
mate number of objects in each photograph. His re- 
sponse was given in terms of multiples of five units 
(e.g., 35 or 60, not 37 or 61). The actual number was 
s between 25 and 95. The response was said to 
ht” when the subject’s estimate was as good 
an approximation of the real number as an average 
subject was able to make in previous experiments. If 
not, the response was pronounced “wrong.” The 
importance of the test was emphasized in such a way 
that each outcome, immediately following each re- 
sponse, was perceived and experienced by the subject 
as a real failure or success. In fact, successes and 
failures were experimentally induced according to a 


iff of course, from 


two contrast effects are very d i 
in the field of 


the well-known contrast phen 
visual sense data. 
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preestablished scheme. Instruction: were given jp 

find it natural 

he socius and 

. In the first 

5 performance 
D 


such a way that the subject wou 
first to attend to the performance 
afterwards to perform a similar 
phase of the experiment, the sto: 
was either high or low. In the 

successes and 6 failures were obtaii 
in the low condition, 6 successes an 
experienced. After observing this 
naive subject was invited to respon: 
Jar series of 22 items, whereas th 


condition, 16 
vy the socius; 
» failures were 
ormance, the 
unother simi- 
noge left the 


room. In this second stage of th veriment, the 
subject always obtained 11 successe 1 11 failures; 
that is, he perceived a series of out stimuli con- 
sisting of 11 rights and 11 wrongs indom order, 
Immediately following the task anc hout having 
received any prior instruction, the act was re- 
quested to make a final realistic tual judg- 
ment. He was told that 22 photogra vere shown. 
His perceptual judgment of the nur of successes 
and failures obtained was the depe! variable of 
the experiment. The process implic his percep: | 
tual judgment is termed incidental perception; 
it is agreed upon that short-term mv: impression 
formation, and other processes pla ole in this 
type of perceptual judgment. It shoi lso be real- 
ized that incidental event perceptio! ngs to the 
main perceptual processes in daily lii 
Results and Discussion 

Results are summarized in Tai > 1, where 
the data show an unexpected di rence be 
tween male and female subjects two-way 
analysis of variance was carried The in- 
teraction between high and low formance 
level of socius and sex of subjects was highly 
significant, 7(1,116) = 8.21, p )05. Fur- 
ther analysis indicates that after «itending t0 
the performance of a very unsuccessful per 
son (low condition), females tended to de- 


velop an overoptimistic impression of the 
number of successes personally obtained, 
whereas after they observed a socius’ high 
performance (high condition), personal failure 
tended to dominate their perception. The dii- 
ference between both conditions was not 
clearly significant, however, #(116) = 1.94 
.10 > p > .05. Thus, for females a contrast 
effect obtained. Males, on the contrary: 
showed an assimilation effect: Their frequency 
perception of personal successes was signifi- 
cantly higher after attending a socius’ pe™ 
formance in the high condition than in the low 
condition, ¢(116) = 2.07, p < .05. In accord- 
ance with the two hypotheses described above: 
it is assumed that male subjects reacted in 4 
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more com: -<titive way, whereas the females 
simply cor »ared themselves and their out- 
comes wil ‘hose of their socius; they were 
struck by difference between both, which 
js assume > be the basis of the contrast 
effect.* 

Followi his exploratory investigation, 
several e nents, using different kinds of 
tasks, we signed to check the results ob- 
tained, In imarizing, we can say that the 
clear-cut 1 rssimilation effect as opposed to 
the femal: trast effect was not always con- 
firmed; b« iles and females often showed 
some typi ontrast effect (D’Amorim & 
Nuttin, 1 One difference, however, was 
constantly irmed: The contrast effect in 
females wi iways higher than in males; 
when it w ound in males, the effect was 
slight. As i) he unstable assimilation effect in 
males, alte’ ating with a slight contrast ef- 
fect, it seer. plausible to conceive of it as the 
result of co icting competition and compari- 
son tende: es. In conclusion, then, sex of 
subjects r ns as an important variable in 
predicting effect of a socius’ level of per- 
formance : 1e perceptual impression of per- 


sonal out 
designed ¢ 
effects for 


ves. Experiments 2 and 3 were 
ved some light on the differential 


Experiment 2 


The as 
Was that the 
Subjects c 


Self-confid 
jects in ; 


nption leading to Experiment 2 
stronger contrast effect in female 
id be explained by a difference in 
nce between male and female sub- 
rforming the type of estimation 
tasks used in Experiment 1. Subjects who lack 
self-confidence with regard to the task to be 
Performed are likely to be less concerned with 
equaling the results obtained by a fellow stu- 
dent; they would tend to compare their out- 
comes with those of the socius (noncompeti- 
tive social comparison process) and would be 
Struck by the difference. These subjects would 
Perceive their own outcomes as contrasting 
With those of a socius. Subjects who feel more 
self-confident, on the contrary, would be likely 
to compete with a socius and would tend to 
educe, as much as possible, the existing dif- 
ference between their own outcomes and those 
fa socius (assimilation). Experiment 2 (Lie- 
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TABLE 1 


MEAN NUMBER OF PERCEIVED PERSONAL SUC 
EXPERIMENT 1 


Socius’ per- ees nae! 
formance Male Female 
High 11.83 10.66 
Low 10.76 11.66 


taer, 1966) was designed to test these hy- 
potheses. 


Method 


Subjects. Subjects were 288 volunteering first-year 
students, 144 females and 144 males between 18 and 
20 years of age. There were 16 male and 16 female 
subjects in each condition. 

Procedure. In general, Experiment 2 closely fol- 
lowed the procedural steps of Experiment 1, with 
only a few exceptions. Since stronger competitive 
tendencies in males were presumed to be responsible 
for the differential effect between the sexes, experi- 
mental conditions were designed to either stimulate 
or discourage competitive tendencies. To this effect, 
a preliminary stage was introduced into the experi- 
mental procedure as has been previously described. 
After receiving the instructions and before attending 
to the performance of a socius, the subjects were 
submitted to an initial trial with six items of a simi- 


*In connection with the same two hypotheses and 
the level of expectation mentioned in the introductory 
section, the following differential effect should be 
mentioned. In several experiments it was found that 
subjects who either attended to a socius’ successful 
performance in the high condition or who them- 
selves had performed very successfully in a previous 
task expressed a higher level of expectation than 
subjects who witnessed a socius’ unsuccessful perfor- 
mance in the low condition or who had not been 
successful themselves. In both cases the level of ex- 
pectation expressed refers to the number of successes 
that were to be obtained by the subjects themselves 
in a forthcoming serial task of the same kind, These 
results are comparable to those obtained for “goal 
discrepancy scores” in the classical level of aspiration 
research. However, when the subjects’ perceptions of 
the number of successes subsequently obtained were 
measured, results differed according to the origin of 
the higher expectations: A high level of ieee 
based on previous personal success was followed by 
an overestimation of the number of successes actually 
obtained (assimilation effect), whereas high anea 
tion based on a socius’ success tended to be kelara 
by a contrast effect in the subjects Tet 
ceptions of their own successes. Within eac et 
two groups, however, subjects pe LERA the 
highest levels of expectation tended to perce! 


highest frequencies of success, 
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lar task. As in Experiment 1, each response was 
immediately followed by the experimenter’s feedback, 
“right” or “wrong.” The subject’s initial trials were 
manipulated so that they obtained, respectively, 1, 
3, and 5 successes out of 6 responses (for the failure, 
neutral, and success conditions, respectively), Im- 
mediately following this initial trial, subjects of each 
of the three conditions mentioned attended to the 
socius’ performance in one of three conditions: high, 
medium, or low (with 17, 11, and 5 successes, respec- 
tively, out of 22 responses). Finally, each subject 
in each of the 3 X 3 conditions made his own per- 
formance in which he always obtained a 50-50 
success-failure outcome. The same experimental de- 
sign held for male and female subjects (18 cells 
altogether). As to the dependent variable, the fre- 
quency perception of successes and failures was mea- 
sured in terms of perceptual judgments immediately 
following the serial task, as described for Experiment 
1, It was assumed that after initial failure, subjects 
would not feel like competing with the socius; on 
the other hand, it was expected that initial success 
would stimulate subjects into feeling more self-confi- 
dent with regard to their ability to compete with 
the socius, as males seemed to do in Experiment 1. 


Results 


A three-way analysis of variance was per- 
formed on the data. The three main effects 
were the preliminary experience, the socius’ 
level of performance, and sex of subjects. Re- 
sults are shown in Table 2. 

Tf we limit our comments to the main points 
and discard the intermediate conditions yield- 
ing intermediate results, we should mention 
the following findings: 


1. The main effect of initial experience was 
found to be highly significant, F(2,270) = 
14.157, p < .005. As shown in Table 2, suc- 
cessful outcomes obtained in the main part of 


TABLE 2 


MEAN NuMBER OF PERCEIVED PERSONAL Successes, 
I 


PERIMENT 2 


Socius’ performance 
Preliminary 


experience High Medium Tow 
Failure 
Male 9.81 10.38 11.12 
Female 9.38 10.38 10.44 
Neutral 
Male 11.31 38 11.38 
Female 9,88 38 11.56 
Sues 12.38 5 11.31 
Female 10.88 12.31 
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the experiment were “overperceive:'”’ after the 
initial experience of success, wi seas they 
were “underperceived” after the ini ia] experi- 
ence of failure. 

2. An additional significant mais -ect was 
found for performance level of so xs, F(2, 
270) = 3.437, p < .05. In gener: impres- 
sions of own successes were higher the low 
condition than in the high condit n (con- 
trast). There was a Performance evel of 
Socius X Sex of Subjects interacti.i, F(2, 
270) = 2.642, .10 > p > .05. Furthe:. :nalysis 
indicates that performance of sociu had a 
significant effect for females, F(2, 70) = 
6.034, p < .005, but not for males, F- ?, 270) 
<1. 

3. The area of principal interest, amely, 
the Initial Experience X Performance vf So- 
cius interaction, did not yield a sig: ‘ficant 
effect, F(4,270) <1. From examina ‘on of 
Table 2, it appears that for females, t:> pre- 
liminary success manipulation did no- suc- 
ceed in producing a competitive per ptual 
distortion (assimilation). Regardless « what 


the females’ initial experience had be: :, the 
difference between themselves and the ocius 
dominated their outcome perception (con- 
trast). 

As for male subjects, their outcome percep- 
tion remained rather constant across the dif- 
ferent levels of the socius’ performance, with 
two important exceptions. Table 2 sh ws a 
remarkable difference between the effec: of a 
high performance of the socius in the :nitial 
failure condition as compared with the initial 
success condition. Both of these conditions 
are rather extreme, the former serving to dis- 
courage competitive tendencies, the latter serv- 
ing to stimulate them. After the subject has 
experienced initial personal failure, the suc- 
cessful performance of the socius is perceived 
by the subject as a sharp contrast. ‘This out- 
standing difference dominates the subject's 
perception of his subsequent outcomes; hence, 
the subject underestimates the number of his 
Successes in his real task, On the contrary, 
initial personal success and a high perfor- 
mance of the socius arouses competition in the 
subject. In this condition, the subject over- 
estimates his own number of successes in order 
to reduce perceptually as much as possible the 
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existing di‘ rence between his own and the 
socius’ performance. 

Summari: «s, Experiment 2 shows that the 
jnitial succ: — variable does not succeed in 
producing : = competitive perception effect 
in female jects, whereas initial failure 
makes mai subjects perceive in a more 
“feminine s competitive) way! 

Experiment 3 

Another othesis about sex differences in 
this area \ ormulated in terms of the so- 
cial compa process and within the frame 
if Witkin’s itkin, Lewis, Hertzman, Mach- 
ver, Meis & Wapner, 1954) studies of 
ield-depen versus field-independent per- 
eption. | -dependent subjects, as de- 
cribed by itkin, Goodenough, and Karp 
(1967), ter io use the social situation in 
yhich they themselves to define attri- 
nites of the lf, just as they rely on the sur- 
ounding visual field in their perception of 
timulus objects within it. It was hypothe- 
ized, thers . that field-dependent subjects 
vould com} their own outcomes with those 
f a socius ‘-ontrast), whereas field-indepen- 
lent subjeci- would be less influenced by the 
cius’ leve! oi performance. 
lethod 
As a measui of field dependence, Witkin’s (Witkin 
tal, 1954) ıc |-and-frame test (RFT) was admin- 
tered to a -up of 30 male and 30 female sub- 
els. Only Seios 3 of the RFT was used in the 
tesent expe , since data have suggested that 
ores for Series 3 of the RFT may be substituted 
t total RF scores with no loss in validity. The 
‘ore for each subject was the mean absolute error 
| degrees from the true vertical position for the 
tht trials of the series. 

Following this test, the same subjects participated 


l our stand: 


i RET experiment, described in Ex- 
itiment 1, ording to a previously established 
heme, 50% of both male and female subjects at- 
Mded to the performance of a socius in the high 
ndition and 50% in the low condition. 


sults and Discussion 


In the RET experiment, male as well as fe- 
ale subjects perceived the relative frequency 
Successes and failures in contrast with the 
imber of successes obtained by the socius. 
contrast was stronger in females than in 
ales, as was found in several other experi- 
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ments mentioned above. The whole group of 
subjects was then divided on the basis of the 
median deviation on Witkin’s RFT into field- 
dependent and field-independent individuals. 
The effect of the socius’ high and low per- 
formance conditions on the subjects’ frequency 
perception of their own successes was calcu- 
lated for each subgroup. Although no signifi- 
cant difference between male and female sub- 
jects was found in Witkin’s RFT, a majority 
of females was found in the field-dependent 
group (19 girls vs. 11 boys) and a majority 
of males in the field-independent group (19 
boys vs. 11 girls). As to the frequency per- 
ception of the successes personally obtained, 
a contrast effect was found in the field-de- 
pendent group, as shown in Table 3. In fact, 
field-dependent subjects tended to perceive 
more personal successes in the socius’ low con- 
dition and less in the high condition. The dif- 
ference between the two conditions was sig- 
nificant, (28) = 2.80, p < .01. In contrast, 
no significant difference between the two con- 
ditions was found for subjects in the field- 
independent group, (28) = .15, p > .10. 

These results seem to indicate that the non- 
competitive social comparison process, men- 
tioned above, is more likely to occur in field- 
dependent subjects; this process is assumed to 
underlie the contrast effect. Since females are 
found to be more numerous among the field- 
dependent group, it is plausible that field de- 
pendency is at least partially responsible for 
the sex differences found in the frequency per- 
ception of behavior outcomes. 

The importance of the social comparison 
process just mentioned in producing the con- 
trast effect is investigated in a final experi- 
ment on the influence of a socius’ performance 
on frequency perception of personal successes. 


TABLE 3 


MEAN NUMBER OF PERCEIVED PERSONAL SUCCESSES, 
EXPERIMENT 3 


Group 
Socius’ per- Field depen- Floja jadepen 
formance dent 
i 9,38" 10.07" 
nd 11.29» 10,19" 
Te ee a 
an = 16. 
bn = 14, 


“| 
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Experiment 4 


In our previous experiments concerning the 
influence of successes obtained by a socius on 
the subjects’ frequency perception of personal 
successes, the socius’ performance preceded 
the subjects’ personal task (successive presen- 
tation of the two tasks). In the present experi- 
ment, however, socius and subject perform 
simultaneously in such a way that for each 
item of the serial task, the subject perceives 
his own outcome in immediate juxtaposition 
with the outcome of the socius. It is assumed 
that in this condition, a subject is practically 
unable to escape comparing his outcomes with 
those of the socius. At the same time, the dif- 
ference in performance between the subject 
and the socius is perceptually so obvious that 
competition is discouraged; assimilating his 
personal successes with those of the socius 
would exceed the limits imposed on perceptual 
distortion, Therefore, it is predicted that in 
the simultaneous condition as compared with 
the successive condition, the socius’ level of 
performance (the relative number of his suc- 
cesses) will have a stronger impact, in the 
direction of contrast, on the subject’s percep- 
tion of personal successes. 


Method 


Subjects. Subjects for the experiment were 60 male 
and 60 female first-year university students, ran- 
domly assigned to the different conditions with an 
equal number of males and females in each condition. 

Procedure. The experimental comparison of the 
two conditions just mentioned was realized as fol- 
lows. In the simultaneous condition, the RET was 
performed by socius and subject working together 
(i.e., simultaneously). Both sat in open boxes next to 
each other in the experimental room and were re- 
quested to give their personal responses to each item 
by pushing buttons on a keyboard in front of them. 


TABLE 4 


MEAN NUMBER OF PERCEIVED PERSONAL SUCCESSES 
PERIMENT 4 


Socius’ performance 


foanditian, High Low 
Simultaneous 
Male 10.40 11.87 
Female 9.53 11.47 
Successive 
Male 1 10.47 
Female 10.53 
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Responses were marked on a display 
experimenter and were immediately í< 
auditory stimuli, “right” and “wron: 
by the experimenter and directed to 
the subject, respectively, and in imme 
to each other, as was explained in t 
orally given to the subjects, In the sy 
tion the experiment ran in two pha 
performance preceding the subject's p: 
described in the standard procedure 
1. In the simultaneous condition as 
successive condition, two subconditions 
a high condition in which the sociu 
successes and 6 failures and a low cond 
reversed proportions. As for the subje: 
11 successes and 11 failures in all con. 


front of the 


Results and Discussion 


The mean numbers of success 
by the subjects in their own pert: 
shown in Table 4. The data were 
a three-way analysis of variance, 
performance of the socius, perfo 
tions (simultaneous vs. successi\ 
of subjects as the main effects. 

As expected, a significant Soc 
Performance X Performing Condi! 
tion was obtained, F(1, 112) 
.025. Further analysis indicates i 
perceived their own successes in ¢ 
those of the socius with whon 
coacting (the simultaneous con: 
contrast was clear-cut for m 
#(112) = 2.15, p < .025, as wei 
males, #(112) = 2.83, p < .005 
cessive condition, on the contrar 
results as those obtained in F< eriment 
were found again, that is, a contris 
female subjects and an assimilati: 
males. However, neither of these 
significant (both ps > .10). 

The results obtained confirm our predig 


competitive social comparison in producing h 
contrast effect. By giving two subjects 
sponse-by-response feedback about 
performance (simultaneous condition), © 
tinuous comparison is unavoidable, and 
difference in level of, performance beet 
perceptually overevident. Hence, the subj 
competitive tendency to assimilate pê 

successes to those of a successful socius (4 
dency assumed to be dominant in males: 
found in several experiments) is strongl 
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^ noncompetitive social comparison, 


courage 
emphas the subject’s difference in out- 
comes, > more “natural” and resistant to 
change emales (in this kind of perfor- 
mance) ; this type of comparison is 
obtaine xtent that social competitive- 
ness is i ed. 

TEMP< ÌRDER IN OBTAINING SUCCESS 

AND FAILURE 

Anot! najor problem to be investigated 
is whet eviously obtaining a high pro- 
portion rsonal successes (or failures) in 
either : lar or dissimilar task has any ef- 
fect on irequency perception of a series of 
success ‘ntil now, we have studied the 
impact ı high proportion of successes or 
failures © \\ained by someone else, namely, by 
a socius :n the presence of the subject. It is 


expected ihat the repercussion of previously 
experienced successes on present perception 
will differ according to whether these previous 


successe were obtained by the subject him- 
self or y someone else. In earlier research 
(Nuttin & Greenwald, 1968, chap. 2), the 
proble: was already touched upon in the fol- 
lowing way. Subjects, who were divided into 
Groups A and B, were given a 50-50 propor- 


tion oi failures and successes for a series of 
items -s a whole, For Group A a high propor- 
tion ot successes was obtained in the begin- 
ning ci the task, whereas for Group B there 
was a high proportion of failures among the 
first s It follows, then, that in the re- 
maini rt of the series relatively more fail- 
ures (for Group A) or successes (for Group 
B) had to be given in order to obtain the 
global 50-50 proportion. In another experi- 
Ment, the initial part of the experiment, with 
either many successes or many failures, was 
Separated from the main task, in which a 
50-50 proportion of successes and failures was 
given. It was expected that after receiving 


m 


initial success, subjects would have high ex 


Pectancies and subsequently would be struck 
by the high number of failures actually ob- 
tained in the second part of the series. Hence, 
the perceptual impression of failures would 
be dominant (contrast effect). This prediction 
Was not confirmed. On the contrary, subjects 
who had received a high number of successes 
M either the first or the preliminary part of a 
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series overemphasized the number of their 
successes despite the relatively unsuccessful 
second part of the series. In other words, the 
initial impression was dominant. Moreover, 
earlier research has shown that personality 
variables, conceptualized as “optimism” and 
“pessimism,” that is, an elated versus a de- 
pressive self-concept, have an impact on the 
frequency perception of success and failure. It 
was hypothesized, then, that behavioral out- 
comes are perceived in the context of either a 
previously established self-concept with regard 
to performance in general (the so-called ego 
level) or a more specific expectation in a 
particular case (level of aspiration or expecta- 
tion). In fact, ego level and level of aspiration 
are always, to a certain extent, the result of 
previous outcomes. Conversely, it is assumed 
that subsequent outcomes are perceived in the 
context of a preexisting self-percept and level 
of expectation. It is in the light of this hy- 
pothesis that our prediction with regard to the 
following experiment is to be understood. 


Experiment 5 


If our expectation hypothesis is correct, the 
impact of previous success (or failure) should 
depend on whether inferences can be drawn 
from results obtained in the first task with 
regard to probable outcomes in the second, 
that is, whether expectations can be made on 
the basis of the previous outcomes. When sub- 
jects are administered two tasks in such a way 
that they are perceived as measuring the 
same ability, a high performance in the first 
task will allow subjects to expect many suc- 
cesses in the second task. Hence, the number 
of successes perceived in the second task is 
predicted to exceed the actually obtained 
50-50 proportion. A similar effect in the op- 
posite direction is predicted for subjects who 
have received a high number of failures in the 
first task (assimilation). Different results 
would obviously be expected if the two tasks 
referred to two different abilities. In this in- 
stance, subjects would not tend to make ex- 
pectations on the basis of previous results; 
they would simply be struck by the perceptual 
difference in the frequency of successes ob- 
tained in the two dissimilar tasks. Hence, a 
contrast effect would develop. Experiment 5 
was designed to test this prediction. 
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Method 


Subjects. Subjects for the experiment were 40 male 
and 40 female first-year university students. They 
were randomly assigned to the different conditions 
with an equal number of male and female students 
in each condition, 

Procedure. Subjects were individually and suc- 

cessively presented with two series of 22 problems. 
Each response was immediately followed by the 
experimenter’s feedback, “right” or “wrong.” Two 
different tasks were used: The RET described in 
Experiment 1, and a spatial representation task 
(SRT). In the SRT, subjects were presented a series 
of geometrical forms projected on a screen for a 
duration of 5 seconds each. Subjects had to choose 
from among four alternative forms the one that 
would be obtained by rotating the projected figure 
first upside down and then 90° to the left. Each of the 
two serial tasks (RET and SRT) were composed of 
two equivalent halves of 22 items each. A 2X 2X2 
factorial design was used. On the basis of some sex 
differences found in the previous experiments, one 
additional differential variable, sex of subjects, was 
included in the design, The first experimental vari- 
able was the nature of the initial task (either 22 RET 
or 22 SRT items); the second variable was the na- 
ture of the subsequent task used to induce the sub- 
ject’s second series of sucesses and failures (either 22 
RET or 22 SRT items). Thus, each subject was 
presented with two tasks that could be either of the 
same (2 X 22 RET or 2X22 SRT items) or of a 
different kind (either 22 RET followed by 22 SRT 
items or 22 SRT followed by 22 RET items). In 
the former case, the two series of items were pre- 
sented as two equivalent halves of the same test; in 
the latter, the two series were said to be measures of 
two different abilities. In all conditions, the second 
series was presented to the subject after a 1-minute 
interval and was clearly distinguished from the first 
one. The third experimental variable was the relative 
number of successes and failures obtained in the first 
serial task, namely, either 16 successes and 6 failures 
(high condition) or 6 successes and 16 failures (low 
condition). The ratio of successes to failures ex- 
perimentally induced in the second serial task was 
50-50 in all conditions. 


TABLE 5 


MEAN NUMBER OF PERCEIVED SUCCESSES IN THE 
SECOND Task, EXPERIMENT 5 


Performance second task 


Performance first Realistic esti- 


i Spatial repre- 
task mation 


sentation 


Realistic estimation 


High 12.60 9.20 

Low 8.70 11.30 
Spatial representation 

High 9.40 11.90 

Low 11.80 9.20 


ROBERT VREVEN AND JOSEPH R. NUTTIN 


The dependent variable was the sub 
tual impression of the number o 
“wrongs” obtained in the second se: 
function of the different conditions of t! 
Therefore, immediately following the 
subjects were requested to perform wl 
a final test of accurate perception: 
that the second task (i.e., the items 
the interval) was composed of 22 it 
were to indicate the perceived numbe 
and failures that they obtained in this | 


ct’s percep. 
“rights” and 


Results and Discussion 


Table 5 lists the mean number o? vercei 
successes in the second series as a ‘. iction 0 
the various conditions. In order implif 
the table, results are given for mi and fi 
male subjects combined. An interest: v findin, 
was the absence of any sex differen in thi 
experiment as opposed to those in v.\\/ch pres 
vious successes and failures were ol ned by 
a socius. The data were processed means 
of a four-way analysis of variance, © jth na 
ture of the first task, nature of th second 
task, subject’s performance level in ne first 
task, and sex of subjects as the mai» ‘actors, 

A highly significant interaction w: found 
between the nature of the first task, {> num 
ber of successes obtained in that tex, and 
the nature of the second task, FO 64) = 


41.48, p < .001. When the first and th 
series of problems are of the same n: é 
majority of successes (high condition) of 
failures (low condition) in the first series pro- 
duces the impression of a majority of suc 
cesses or failures, respectively, in the second 
series (assimilation). When the two series are 
of a different nature, results are reversed} 
that is, a certain kind of contrast is 
enced: A higher frequency of successes is 
experienced in the second series when per 
formance in the first task was low; on thé 
contrary, a lower frequency of successes i$ 
perceived when the initial performance was 
high. 

These predicted findings confirm the hy- 
pothesis that feedback received during a first 
performance tends to provide the subject with: 
a certain preconception about his ability and 
level of performance with regard to a similar 
subsequent task. This assumption was als 
confirmed in another experiment (Vreven; 
1974, pp. 14-35), which showed that the 
expectation scores for a second similar series ) 


of items are good approximations of the pro- 


portion of successes actually obtained in a 
first jes. Thus, perceptual impressions 
about ihe number of subsequently obtained 
success:s follow the lines of these expectancies 
in the same way as mentioned above with 
| regard io the preestablished self-concept (see 
also Jones & Goethals, 1972; Jones, Goethals, 
Kenniiston, & Severance, 1972; Jones, Rock, 
Shaver, Goethals, & Ward, 1968). As for the 
condition with two dissimilar tasks, although 
subjec\: may develop positive feelings on the 
basis o: their successes in the first task, this 
affective climate does not seem to “spread” 
| (cf. Jonssens & Nuttin, 1976) to the sub- 
sequer! performance as long as the cognitive 
conditions of similarity and the possibility of 
infere is lacking. On the contrary, by 
being «vare of the different kind of perfor- 
mance ‘nd abilities involved, subjects seem in- 
clined compare their outcomes in the two 
fields. ` is by this very comparison that the 
| differe!» between the two series of outcomes 
becom:. the striking feature and tends to 
produc a kind of perceptual contrast. 
Fregus vcy PERCEPTION or NEUTRAL EVENTS 
The -vents whose frequency perception has 
been s\idied until now are ego-involved out- 
comes of subject’s behavior. Successful and 
Unsucc ssful outcomes obtained in performing 
a task «re considered more or less emotionally 
loaded events indeed. The problem to be in- 
} Vestigaied now is whether personality factors, 
as wel! as social and contextual variables 
Whose influence has been shown in the 
Previously reported experiments, do have an 
impact on frequency perception of neutral 
events, 


Several years ago, a tentative study in the 
area of neutral events was carried out with a 
series of task items composed of simple addi- 
tions and subtractions (Nuttin & Greenwald, 
1968, pp. 22-25). After finishing this 20-item 
| task, each subject was requested to give his 
Perceptual judgment as to the relative num- 
ber of additions and subtractions performed. 
It was assumed that additions and subtrac- 
tions were two equivalent and neutral com- 
Ponents of an easy task in which no out- 
comes were given. The order of succession in 


FREQUENCY PERCEPTION OF SUCCESSES 


743 


which the 10 additions and 10 subtractions 
were presented was reversed for the two 
equivalent groups (A and B) into which the 
26 female subjects were divided. The only 
tentative result obtained in this early experi- 
ment showed that Group A perceived a pre- 
dominance of additions, whereas Group B 
overperceived the number of subtractions, 
This effect was tentatively attributed to a 
structural factor: For Group A subjects three 
out of the four items in the beginning and 
at the end of the series were additions, where- 
as Group B subjects started and ended with 
three subtractions out of four items. In this 
respect, the results confirm the well-known 
temporal order effects in impression forma- 
tion. In the present case, however, it is not 
clear whether the obtained effect is one of 
primacy or of recency: The earliest as well 
as the most recent events favor the same im- 
pression. 

The problem studied in our final experi- 
ment is related to our investigation about 
the impact of a previous series of successes 
on the frequency perception of successes in a 
subsequent task. As in the outcome percep- 
tion experiments, a distinction is made be- 
tween a condition in which the first series’ is 
performed by the subject himself and a condi- 
tion in which the first series is performed by 
a socius in the presence of a naive subject. In 
the present experiment, however, both con- 
ditions are compared directly within a single 
design. 


Experiment 6 
Method 


Subjects. Subjects were 56 male and 56 female 
first-year university students who volunteered for 
the experiment. In all conditions there was an equal 
number of male and female subjects. The socius, 
who was of the same sex as subjects, was introduced 
to the subjects as a student from another depart- 
ment. 

Procedure. The task used was similar to the SRT 
described in Experiment 5. In the present experi- 
ment, however, each item was immediately piece 
by the experimenter’s instruction, “left” or ‘right, 

which informed the subject that he had to visualize 
the projected form rotated 45° to the left or to ne 
right, respectively. No outcome was given a ah e 
subject’s response to each item. The auditory s' ia 
uli, “left” and “right,” were the two neutral Gik 

whose frequency perception was measured ee the 
dependent variable; they were considered equivalent 
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to the outcome stimuli, “right” and “wrong,” used 
in our previous experiments. Subjects were instructed 
that because a rather large number of geometrical 
forms would be presented, the series would be di- 
vided into two equivalent halves with a short inter- 
val in between the two parts. The experimental 
design was a 2 X 2 factorial scheme; one differential 
variable, sex of subjects, was included in the design 
for obvious reasons. The first experimental variable 
was the performer, either the socius or the subject, 
of the first series of 22 items. In the self condition, 
subjects performed themselves; in the socius condi- 
tion, subjects attended and observed the socius’ 
performance in the first series. The second series of 
items was always performed by the subject. The 
second experimental variable was the proportion of 
“left” and “right” instructions given by the experi- 
menter in the first series of items. In the left con- 
dition, the instruction, “left,” was given 18 times 
as opposed to 4 times, “right”; in the right condi- 
tion the proportion was reversed. In the second 
series of items, subjects had to represent 11 times 
the form as rotated to the left and 11 times to the 
right. At the end of the two series, subjects were 
requested to give a perceptual judgment on the 
number of “left” and “right” instructions received 
during the second series (i.e., after the interval). 


Results 


Table 6 lists the mean judgment of the 
number of “left” instructions in the second 
series in the various conditions, The data 
were analyzed in a three-factor design; the 
main factors were performer conditions, first 
series, and sex of subjects. The main effect of 
the first series was found to be significant, 
F (1,104) = 5.47, p < .05. Perceptual impres- 
sions of the number of “lefts” in the second 
series are higher in the left condition as com- 
pared with the right condition. The interac- 
tion between the first series and self versus 
socius performer conditions yielded no clearly 
significant effect, F(1,104) = 3.13, .10 > p 
> 05. However, a separate analysis for the 


TABLE 6 


MEAN NUMBER OF PERCEIVED LEFTS IN THE SECOND 
SERIES, EXPERIMENT 6 


Sex 
Sah ena Male Female 
Self 
Left 12.28 12.35 
Right 10.85 10.71 
Sorat 11.64 11.64 
Right 11.35 11.49 


EE Eee 
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self and socius conditions indicate: that the 
effect of the first series was signi: ant only 
in the self condition, #(104) = 2.91 » < 01, 
and not in the socius condition, ¢(1°) = ,39, 
p> 10. No differences were foun between 
male and female subjects. 
Discussion 

The absence of any effect in the s ‘us con- 
dition of the present experiment is sharp 
contrast with the marked impact socius’ 
performances in the outcome perce) ion ex- 


periments. As mentioned in the discussion of 


the previous experiments, observing the 
achievements of a very successful or unsuc- 
cessful socius in tasks that are als- to be 
performed by the subject is assumed > acti- 
vate subject’s tendency to compare own 
outcomes with those of his socius. Thi- social 
comparison process is hypothesized to ų \derlie 
the contrast effect found in less com, «titive 
and field-dependent subjects; a more ¢ xpeti- 
tive kind of comparison with the sociu- tends 
to produce a certain amount of assim) ition, 
as found for male subjects in some © :peri- 
ments. Neither kind of comparison is | -ly to 
be made when subjects are not invo!.d in 
the events perceived. In this respect then, 
frequency perception of neutral events seems 
to diverge from the results obtained x: the 
frequency perception of behavioral out: omes. 

On the other hand, neutral event ; rcep- 
tion and outcome perception yield thy same 
result, namely, an assimilation effect, when | 
the preceding performance is done by the 


subject, that is, in the self condition of the 

present experiment and in conditions with two 

series of a similar task in Experiment 5. In 

the present experiment, after completing the 
first half of their task in which a large major- 
ity of left (or right) stimuli were presented, | 
subjects obviously maintained a perceptual | 
impression of this majority and implicitly 
expected the second half to proceed along the 
same lines. This is a purely cognitive, that is, 
a non-ego-involved type of expectation, func- 
tioning as a kind of hypothesis about events 
to come and as a frame of reference for the 
perceptual judgment to be made. This effect 
may be compared with the well-known an- 
choring effect of previous perceptions (¢.8+ 
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Bieri, “ins, Briar, Leaman, Miller, & Tri- 
podi, 1 55; Helson & Kozaki, 1968). In the 
socius condition, this process was opposed by 
a shar erceptual segregation between the 
two ser hich is the result of the fact that 
the ser vere given to different individuals. 

The ely cognitive and prognostic expec- 
tation o be distinguished from the ego- 
involyc ype of expectancy, which was men- 
tioned he Results section of Experiment 

5. Bec of the affective nature of behay- 

ioral « nes, subjects are more committed 

to the tial expectancy; such commitment 
is an cavor to preserve the ego’s self- 
consist, This dynamic variable is as- 
sumed se Operative when the subject as- 
similat ater outcomes to a previously 
establis «| conception of his own capability, 
which h. . been inferred from earlier successes 
and fai! vs in a particular task. 
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Communication Feedback and Duration as Determinants of 


Accuracy, Confidence, and Differentiation in 
Interpersonal Perception 


Richard S. Powell and Edgar C. O'Neal 
Newcomb College, Tulane University 


Tn an attempt to investigate certain implications of an interpersonal hypothesis- 
testing framework, reciprocal and nonreciprocal communications were opera- 
tionalized in terms of whether or not they provided feedback from the other 
person. It was hypothesized that as the duration of the communication in- 
creases, there is a greater increment when such communication is reciprocal 
rather than nonreciprocal in (a) the accuracy with which one perceives another, 
(b) the confidence one has in those perceptions, and (c) the differentiation ex- 
hibited in such perceptions. Female subjects were randomly assigned to groups 
in which they experienced either 5 or 15 minutes of either reciprocal or non- 
reciprocal communication from one of two female targets. Several measures 
were obtained of the extent to which the subjects demonstrated accuracy, con- 
fidence, and differentiation in their ratings of the target. As a check on the 
validity of the hypothesis-testing concept, the accuracy, confidence, and differ- 
entiation measures were correlated with the number of questions and comments 
addressed toward the target by subjects in the reciprocal conditions. The results 
consistently confirmed the experimental hypotheses and indicated that the hy- 
pothesis-testing concept has some construct validity, It was concluded that the 
feedback provided by reciprocal communication is an important determinant of 


accuracy, confidence, and differentiation in interpersonal perception. 


In Mead’s (1934) theory of social interac- 
tion, there are two major types or modes of 
communication, which might be termed recip- 
rocal and nonreciprocal communication. Re- 
ciprocal communication occurs when actors A 
and B respond to one another’s responses 
interdependently. In contrast, nonreciprocal 
communication occurs when A responds to B 
but not vice versa. Examples of reciprocal 
communication include most face-to-face 
interactions; examples of nonreciprocal com- 
munication include “one-sided conversations” 
and transmissions over the various mass me- 
dia. More recently, Jones and Gerard (1967, 
pp. 505-513) have suggested a similar dis- 
tinction between “mutual interactions” and 
“asymmetrical interactions.” These modes of 
communication can also be distinguished in 
that reciprocal or mutual communication in- 
volves interpersonal jeedback—each communi- 
cator has access to the other’s reactions to his 

Requests for reprints should be sent to Richard 
S. Powell, Department of Psychology, Newcomb 
College, Tulane University, New Orleans, Louisiana 
70118. 


initial responses—whereas nonreciproc:! or 
asymmetrical communication does not. i ves- 
tigators have not always clearly distingu shed 
between these modes, often making th: as- 
sumption that nonreciprocal communic:iion, 
other factors being equal, would lead to the 
same results as would reciprocal communica- 
tion. There is, however, reason to doubt the 
validity of such an assumption, Mehrabian 
and Reed (1968), for example, summarized 
evidence that communication which provides 
feedback from the other allows for more ac- 
curate translation of the communicated mes- 
sage than does communication without inter- 
personal feedback. The purpose of the present 
experiment is to demonstrate that reciprocal 
and nonreciprocal communication also have 
differential effects on three other equally im- 
portant dependent variables: (a) the accu- 
racy with which one perceives another, (b) 
the confidence one has in those perceptions, 
and (c) the differentiation exhibited in such 
perceptions. 

An important function of interpersonal 
feedback is that it provides the means for 
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interpersonal Aypothesis testing. That is, the 
perce:vcr may test his inferences (“hypothe- 
ses”) 2bout the other by initiating a response 
towar the other and then interpreting the 
other reaction to that response. The other’s 
feed! can either confirm or disconfirm the 
percei: `s hypothesis. The confirmation or 
discon ation of the hypothesis, in turn, 
genera new hypotheses. Each confirmed 
(disco: | rmed) hypothesis is capable of elimi- 
natin ternatives among the perceiver’s in- 
terpre’ ‘ions of the other’s behavior and in- 
ferenc’» about his dispositions. For example, 
the po: civer may suspect that the other pos- 
sesses trait A. He therefore tests an appro- 
priate \ypothesis, which is subsequently con- 
firme’ oy the other's feedback. Knowing now 
that i^e other possesses A, the perceiver ac- 
cordi formulates and tests a new hypothe- 
sis, T: © confirmation of this hypothesis leads 
the p- -ceiver to conclude that the other pos- 
sesses ^. but not B. In this manner, the more 


ses tested, the more alternatives elimi- 
)n the other hand, if no feedback is 
to the perceiver, it may be quite 


hype 
nated 


avail2>le 

difñcuii for him to test his hypotheses. In 
summary, it is expected that the presence or 
absen: of interpersonal feedback, and there- 
fore occurrence or nonoccurrence of hy- 
pothe-.s confirmation, will have differential 
effects on accuracy, confidence, and differen- 
tiatio:. Moreover, this difference should in- 
crease as the duration of the communication 
increases, since the number of hypotheses 
the perceiver can formulate, test, and confirm 


is limited by the amount of time available to 
him. Specifically, these predicted effects can 
be stated as follows. 

To the extent that validated information is 
also accurate information, hypothesis testing 
allows the perceiver to construct a more-or- 
less veridical picture of the other. At the 
Same time, since hypothesis testing can also 
lead to disconfirmation, the perceiver can 
Correct his interpretive errors, thereby pre- 
Venting further misleading inferences. In 
brief, the more alternatives eliminated, 
whether by confirmation or disconfirmation, 
the greater the accuracy attained. But if feed- 
back is not available, it may be difficult for 
the perceiver to test his hypotheses regard- 
less of the duration of the communication. 
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The first hypothesis of the present study can 
thus be stated as follows: As duration in- 
creases, reciprocal communication produces a 
greater increment in person perception accu- 
racy than does nonreciprocal communication, 
Indirect correlational support for this hy- 
pothesis is provided in two studies, both of 
which dealt with only reciprocal communica- 
tion situations. Brewer and Brewer (1968) 
found a positive correlation between the num- 
ber of statements exchanged by subjects in 
dyads and accuracy. Similarly, Stryker (1962) 
found a direct relationship between frequency 
of contact and accuracy. 

The presence or absence of interpersonal 
feedback should also affect the perceiver’s 
confidence in his perceptions of the other. 
Drawing from information theory (see Att- 
neave, 1959, pp, 6-8; Newtson, 1973), it 
might be argued that each additional unit of 
information produces an increment in cer- 
tainty to the extent that such information 
succeeds in eliminating alternatives, The pres- 
ent position argues that information which 
has been validated is more effective in elimi- 
nating alternatives than is nonvalidated in- 
formation. The second hypothesis may be 
stated as follows: As duration increases, re- 
ciprocal communication produces a greater 
increment in confidence than does nonrecipro- 
cal communication. Hence, the same process— 
the elimination of alternatives via hypothe- 
sis testing—that is proposed to produce 
greater accuracy should also produce greater 
confidence, 

Finally, interpersonal feedback should af- 
fect the degree of cognitive complexity (Bieri, 
1961) or differentiation exhibited in one’s per- 
ceptions of the other. The perceiver’s initial 
hypotheses probably begin at a simple, gen- 
eral, or stereotypical level. As each new hy- 
pothesis is confirmed, a different set of possi- 
ble alternatives is generated. These new in- 
ferences are in turn investigated by testing 
new hypotheses. The net result of this ex- 
ponentially expanding process is greater dif- 
ferentiation in the perceiver’s impressions. If 
the communication does not provide feed- 
back, however, the perceiver’s alternatives 
should not progress as rapidly beyond the 
initial level. Thus, the third hypothesis 
emerges: As duration increases, reciprocal 
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communication produces a greater increment 
in differentiation than does nonreciprocal 
communication. Some support for this hy- 
pothesis has been provided by Koltuy (1962). 
Arguing that differentiation should be repre- 
sented by smaller intrajudge correlations be- 
tween trait ratings and by greater variance on 
such ratings, Koltuy found that differentia- 
tion thus measured was greater for familiar 
targets than for unfamiliar ones. Similar find- 
ings have been reported by Crockett (1965) 
and Miller (1969). On the other hand, Eke- 
hammar and Magnusson (1973) found that 
subjects who observed interviews of increas- 
ing length showed no systematic differences 
in differentiation. A possible reconciliation of 
these conflicting findings involves the fact 
that the subjects in the Koltuy, Crockett, 
and Miller studies engaged in reciprocal com- 
munication with one another, while those in 
the Ekehammar and Magnusson study did 
not. 

The major problem in interpreting the re- 
sults of the studies cited above is that in no 
case was reciprocity manipulated orthogonally 
to duration. In a preliminary study (Powell 
& O'Neal, Note 1), such a manipulation was 
in fact implemented. Half of the subjects 
engaged in either 5 or 15 minutes of face-to- 
face interaction with a target person. These 
“live” sessions were videotaped as they oc- 
curred and then replayed to the remaining 
half of the subjects. The results were en- 
tirely consistent with the present hypotheses. 
However, since the reciprocal groups enjoyed 
such subtle cues as color and three-dimen- 
sional presentation, as opposed to the black- 
ınd-white two-dimensional cues observed by 

nonreciprocal groups, the reciprocity ma- 
ition may have been confounded with 

of informational cues. The present 

udy ainates this confounding by present- 
ommunication conditions via the same 

ıı nedium, thus allowing for unam- 


retation. 
METHOD 
Over Design 
Group ale subjects were exposed to either 
5 or 15 of either reciprocal or nonreciprocal 
commun from one of two female targets. 
The ex tal design was thus a 2(Reciprocity) 
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X 2(Duration) X 2(Target) factorial. Thu -cciprocal 
conditions involved direct interaction with ‘se target 
over a two-way audiovisual system. E. target's 
performances during these “live” sessions v < video- 
taped as they occurred and subsequently ayed to 
the nonreciprocal groups, who could hear ‘ie com- 
plete dialogue. Afterward, various mes = were 
obtained to assess the degree to which t! ubjects 
demonstrated accuracy, confidence, and rentia- 
tion in their ratings of the target with wi :n they 
had experienced communication. 
Subjects 

The subjects were 64 women in intr: luctory 
psychology at Newcomb College who hac volun- 


teered to participate in a “person perception experi- 
ment.” They were randomly assigned, 8 subjects at 
a time, to the various treatment conditions. 


Apparatus 


The equipment that comprised the two-wa dio- 
visual system was as follows: three Sony A` 3600 
videotape machines, three Sony AVC-140 ideo 
cameras, one Setchell-Carlson 2100VTR blac. and- 
white video monitor with a 16 X 20 inch (4 04X 
50.80 cm) screen, two Sony CVM-192U blaci nd- 
white video monitors with 12X16 inch (4 8X 
40.64 cm) screens, three microphones, and nine »ad- 


phone sets. In the reciprocal sessions, a came: 
focused on the upper half of the target's bod; 
image was videotaped and simultaneously ` ns- 
mitted via the large monitor viewed by the su! octs 
(in another room). The target’s voice was rece ded 
and simultaneously received through the sub, cts 
headphones. Two other cameras were focused om the 
subjects and transmitted through the two 12 16 
inch (30.48 X 40.64 cm) monitors viewed by the 
target; four subjects appeared on one of the meni- 
tors and the remaining four on the other. These iwo 
monitors were placed side by side. The subj «ts 
voices were recorded along with the target’s image 
and voice. 


Dependent Measures 


Adjective checklist. An adjective checklist was 
constructed in a manner similar to that reported by 
Cline and Richards (1960). From the 555 adjectives 
compiled by Anderson (1968), 96 were selected and 
arranged into three separate lists with 48 pairs per 
list. The words were all high (above 3.46) in mean- 
ingfulness and covered the range in likableness from 
5.73 (sincere) to 1.64 (pessimistic). The adjectives 
within a given pair were closely matched in likable- 
ness (mean within-pair difference = .04), At 14 days 
prior to the experiment, each target indicated from 
each of the three lists the adjective of each pair that 
best described herself. For each target, a criterion 
checklist was constructed from among 20 adjectives 
selected by the target three times out of three; 
these “characteristic” adjectives were then paired 
with 20 “uncharacteristic” ones, those selected by 
that target not more than once. Specific pairings 
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were det: mined by within-pair proximity in likable- 
ness: Fi!) characteristic adjective was paired with 


the unc) ~acteristic one that was closest to itself in 
likablen Each criterion checklist was readmin- 
istered i. ‘he appropriate target at 10 days, 5 days, 
and 1 ds prior to the experiment; mean test-retest 
reliabilit.. for these checklists were .98 and .97 for 
targets ud B, respectively, The subjects used these 
checklist 9 rate the target with whom they had 
commun: ted. An accuracy score was calculated by 
counting number of the subject’s responses on 
the chec. st which coincided with the target's own 
response 

Refine * A semantic differential, consisting of 
15 bipo items on 9-point scales, was also em- 
ployed i: obtain accuracy scores. The ordinal na- 
ture of U- scale required controlling for the various 
response ses mentioned above (see Footnote 1). 
Christen (1970) demonstrated that his “refined 
D* sta was more successful in eliminating 
response ases than were several alternative pro- 
cedures, so it was chosen for the present analy- 
sis.? At days, 5 days, and 1 day prior to the 
experime each of the two targets responded to 
the sem differential by (a) rating herself on 
each ite) ind (b) rating the “typical Newcomb 
sophom¢ on the same items. The mean test- 
retest re ility coefficients for the self-ratings and 
“typical’ ings, respectively, were .90 and .93 for 
Target wd .89 and .94 for Target B. The dif- 
ference een the target’s modal self-rating and 
modal t ıl rating on each item constituted that 
target's rion scale. The subjects, in turn, rated 
both the zet and the typical Newcomb sophomore 
on this and item-wise deviation scores were 
obtained \ before. Refined D* scores were computed 
by takin the squared difference between the sub- 
ject’s de ion score and the corresponding criterion 
deviatio: re and summing over the 15 items. Re- 
fined D* ; us measured the extent to which the sub- 
ject and ‘he target agreed on the magnitude and 
direction »1 the latter’s deviation from the norm or 
average ©: the group to which she belonged (both 


targets were Newcomb sophomores). 

Questic aire completion latency. The length of 
time required by each subject to complete the adjec- 
tive checklist, the target-rating semantic differential, 
and the typical-rating semantic differential was un- 


obtrusive measured. Presumably, the less the 
latency, the less hesitant the subject was in respond- 
ing to the questionnaires. 


Amount of participation. As a check on the pro- 
Posed relationship between the amount of hypothesis 
testing and the dependent variables, the number of 
Questions and comments directed toward the target 
by each subject was unobtrusively recorded. Of 
Course, this measure was obtainable only from sub- 
Jects in the reciprocal groups. 

Postex perimental questionnaire, A questionnaire 
Was administered to obtain the subjects’ judgments 
of (a) their confidence in their ratings of the target, 

the perceived accuracy of those ratings, (c) 
their perceived similarity to the target, (d) the 
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extent to which the target expressed her “true feel- 
ings” during her performance, (e) their own interest 
and attentiveness during the experiment, and (f) 
their liking for the target. All ratings were made on 
5-point Likert-type scales, with the exception of the 
perceived accuracy item, which was a 9-point scale. 


Procedure 


Two days prior to the experiment, h target 
participated in a 5-minute practice discussion session 
with a panel of four Newcomb undergraduates; the 
discussion topic was “extramarital sex.” The practice 
sessions were held as an attempt to reduce variation 
in the target’s performances during the experimental 
sessions, 

The actual experiment proceeded as follows. After 
each eight-subject group had assembled, a female 
experimenter Jed them into the discussion room, 
where they were seated. The subjects’ video moni- 
tor, headphones, microphones, and cameras had al- 
ready been positioned in this room. The experi- 
menter informed the subjects that the experiment 
would involve their impressions of a stranger (the 
target) as a function of experiencing a brief discus- 
sion of “human sexual behavior” with her, They 
were told that the stranger was a Newcomb sopho- 
more who had volunteered to participate in the 
discussion. Thereafter, the target was referred to as 
the “volunteer.” 

Reciprocal-15 condition. Subjects in this condition 
were told that the discussion would last for 15 min- 
utes, allowing 5 minutes for each of three discussion 
subtopics: “premarital sex,” “abortion,” and “homo- 
sexuality,” in that order. They were told that the 
volunteer was in another room and that they would 


11t might be noted that this procedure circum- 
vents the measurement problems pointed out by 
Cronbach (1955, 1958) in that (a) the adjective 
checklist is a dichotomous nominal scale, thus elimi- 
nating “elevation” and “differential elevation” com- 
ponents, and (b) each adjective pair member is 
equated for social desirability, so that “stereotype 
accuracy” is controlled for. 

2 According to Christensen (1970), refined D? 
eliminates rating response biases by allowing the 
judge to reveal such biases in both his ratings of 
the target and his ratings of “typical.” To the extent 
that the biases exhibited in rating these two stimulus 
objects are identical, they should “cancel out” when 
difference scores are obtained. Christensen concluded 
from the results of his validity study that refined 
D? eliminated response biases more adequately than 
did either “differential accuracy” (Cronbach, 1955), 
“interpersonal sensitivity” (Bronfenbrenner, Harding, 
& Gallwey, 1958), or “interpersonal accuracy 
(Richards & Cline, 1963). Nevertheless, ifs he Pa 
poses the major advantage of refined D a that a 
subject need rate only one given target, w! ahha the 
other measures require the subject to rate at least 


two targets. 
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have to communicate with her via the video moni- 
tor and headphones; they were assured that the 
volunteer could also see and hear them. The experi- 
menter then explained: “This discussion is being 
held under such highly structured conditions so that 
we can control for any extraneous factors which 
might contaminate the data.” The experimenter 
emphasized that each of the subjects should partici- 
pate about equally and that all subjects could ask 
the volunteer any question they so desired. Subjects 
were requested to pay particularly close attention 
to everything the volunteer said or did, since the 
accuracy of their impressions of her would be tested 
after the discussion. At this point, the target (blind 
as to the experimental condition) appeared on the 
subjects’ monitor. The experimenter initiated the 
discussion of the first subtopic, premarital sex, by 
asking the target the question: “Do you feel that 
premarital sex is more morally right for males than 
for females?” This question was included as an ice- 
breaker; it was the only standardized question used 
throughout the discussion. The target had been pre- 
viously instructed to express her own opinions— 

‘reely, elaborately, and candidly—to all questions 

ddressed to her. The discussion was allowed to 

roceed without interruption for 5 minutes, at which 

time the experimenter changed the subtopic to abor- 
tion. After discussing this subtopic for 5 minutes, 
the subtopic was changed to homosexuality. After 5 
more minutes, the discussion was terminated and the 
audiovisual system turned off. During the discus- 
sion, the experimenter had unobtrusively recorded 
the number of questions and comments posed by 
each subject. 

Reciprocal-5 condition. This condition proceeded 
like the reciprocal-15 condition, except that subjects 
were told that the discussion would last for 5 min- 
utes and that one subtopic, premarital sex, would 
be discussed.* 

It should be noted that all four reciprocal sessions 
(one 5- and one 15-minute session per target) were 
run in a single afternoon, This schedule was intended 
to reduce variation in the targets’ performances. 

Nonreciprocal-15 condition. Subjects in this con- 
dition were informed that they were to observe a 
15-minute “live” discussion via their monitor and 
headphones on the subtopics of premarital sex, abor- 
tion, and homosexuality, in that order. The discus- 
sion they actually observed was one of the two 
videotaped reciprocal-15 discussion sessions, depend- 
ing upon the particular target they were to observe. 
Otherwise, this condition proceeded much like the 
reciprocal-15 condition. 

Nonreciprocal-5 condition. Subjects in this condi- 
tion observed one of the two videotaped reciprocal-5 
discussion sessions, depending on the target they were 
to observe. They were told that the discussion would 
last 5 minutes and that the subtopic of premarital 
sex would be ussed. In all other respects this 
condition proceeded as for the nonreciprocal-15 con- 
dition. 

Having participated in (or merely observed) a dis- 
cussion with the target, the subjects completed the 


RICHARD S. POWELL AND EDGAR C. O'NEAL 


adjective checklist and the two semantic dificrentials 
in a randomized order. The instruction: 
adjective checklist and the target-rating 
differential asked subjects to rate the volu 
way you think the volunteer might rate 
she were completely honest with hersel 
structions on the typical-rating semantic 
requested subjects to rate the typical 
sophomore in a similar manner. The e 
emphasized that subjects should take as 


on the 
semantic 


required and answer all items carefully. ^ they 
had finished each form, they were to turn c their 
answer sheet, which would signal the expe ' enter 
to administer the next form. The experin) r ex- 
plained that since each subject would be wi on 
different forms, which required different : of 
answers, they should not rush their answ just 
because their neighbor had turned her she ver. 
These instructions were intended to minimi roup 
“norm setting.” An unobtrusive observer ( id a 
one-way mirror) recorded the amount of ! the 
subject used to complete each of the questio: sires 
(i.e. the amount of time that elapsed be the 
subject turned over her sheet). After the ects 
had finished the adjective checklist and ntic 
differentials, the experimenter administered (hv vost- 
experimental questionnaire. Upon completion this 
questionnaire, the subjects were debriefed, © oked 
for their cooperation, and dismissed. 
Resurts * 

Accuracy 

The mean adjective checklist and -uned 
D? accuracy scores are presented by e*peri- 
mental condition in Table 1. Inspection o: the 

3 At the operational level, this procedure con’ ounds 
communication duration with number of s\))iopics 


discussed. At the conceptual level, however, duration 


per se is of little significance, while certain processes 
that covary with duration are of utmost impor tance, 
In the present case, we are concerned with duration 


only as it functions to limit the number of hypothe- 
ses the subjects can test. Thus, although we have 
conveniently chosen to label our independent vari- 
able as “duration,” perhaps a more appropriate Jabel 
would be “number of hypotheses testable.” 

4A preliminary multivariate analysis of variance 
was performed on the following variables: adjective 
checklist, refined D?, target-relevant questionnaire 
completion latency, self-reported confidence, i 
accuracy, and perceived similarity polarity. This 
torial analysis yielded a highly significant multiv 
Reciprocity X Duration interaction, Wilk’s 
Rao’s F(6,55) =12.29, p<.01. The analys 
revealed reliable multivariate main effects for reci- 
procity, Wilk’s \=.26, Rao’s F(6, 55) = 25.90, p 
<.01; and duration, Wilk’s A= .37, Rao’s F(6,55) 
= 15.35, p<.01. Separate univariate analyses On 
these variables were therefore warranted. 


F(1, 56) 
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is that on both measures, accuracy 


‘erably greater in the reciprocal-15 


han in any of the other conditions 
` and that these latter means did 
’ differ from one another. A Reci- 
Duration X Targets analysis of 
2s computed on both of these mea- 
‘pected, a significant Reciprocity 
» interaction was obtained on the 
iecklist, F(1,56) = 8.94, p < .01, 
1e refined D? measure, F(1, 56) = 
01. A significant reciprocity main 
btained on the adjective checklist, 
25.27, p < .O1, and on the refined 
F(1, 56) = 12.91, p < .01. Dura- 
‘ided a reliable main effect on the 
hecklist, (1,56) = 29.25, p< 
‘ined D2, F(1,56) = 6.07, p< 
parent from the obtained pattern 
hat these main effects were due 
rely to the Reciprocity X Dura- 
tion. Targets was not a reliable 
variance on either measure, and 
e the remaining interactions. In 


summary oth of these measures seem to 
confirm accuracy hypothesis. 
Confiden 
The ar unts of time used by each subject 
to compl: © the questionnaires were collapsed 
into two parate latency indices. A “target- 
televant” index consisted of the sum of the 
adjective -hecklist and target-rating semantic 
TABLE 1 
MEAN ADJECTIVE CHECKLIST AND REFINED D* 
ACCURACY SCORES 
m — —— 
Duration 
15 5 
Reciprocity minutes minutes 
—— — 
Reciprocal 
Adjective checklist 17.06, 12.94, 
Refined D? 96.44, 22.00» 
Nonreciprocal 
Adjective checklist 13.13, 11.94, 
Refined D? 6.31, 12.25, 
aint: Cells within each measure having different subscripts 
“a at the O01 level by the Newman-Keuls procedure. For 
Mše of presentation, refined D? means have been transformed, 
(150 — M), so that higher numerical values represent 


ter accu 


Refined D? measures the extent to which 


the subject and target agreed on the magnitude and direction 


Of the ta 


er's deviation from the norm or average of the group 


Which she belonged. 


TABLE 2 
MEAN QUESTIONNAIRE CoM 


ELF- 


PLETION SPEED, 
3 PERCEIVED 


RES 
Duration 
15 Fä 
Reciprocity minutes minute 
Reciprocal* 
Speed 384 27o 
Self-reported confidence 4.19, 2 
Perceived accuracy 8.00, 
Nonreciprocal* 
Speed -26n 
Self-reported confidence 2.63 
Perceived accuracy 2.63, 


à Speed 
Self-reported confidence means are on a 5-point sı 
accuracy means are on a 9-point scale, 


differential latency scores, while a “target- 
irrelevant” index comprised the typical-rating 
semantic differential scores. Analysis of the 
target-relevant latency scores yielded a highly 
reliable Reciprocity X Duration interaction, 
F(1,56) = 12.14, p < .01, as well as main 
effects for reciprocity, F(1, 56) = 15.91, p< 
.01, and duration, F(1, 56) = 10.12, p < 01. 
All other effects were negligible. On the other 
hand, analysis of the target-irrelevant latency 
scores revealed no significant effects. It would 
therefore seem safe to assume that the target- 
relevant latency effects were attributable to 
the experimental manipulations rather than 
sampling biases or group norm setting. 

For ease of interpretation, the target-rele- 
vant questionaire completion latency means 
were transformed, as (1/M), into mean speed 
scores. These mean speed scores, along with 
the mean self-reported confidence and per- 
ceived accuracy scores, are displayed by ex- 
perimental condition in Table 2. Inspection 
of the table indicates that on all three mea- 
sures, the reciprocal-15 group was more con- 
fident: Subjects in this group completed the 
questionnaires with greater speed (P < Ol), 
reported more confidence (? < 01), and per- 
ceived their ratings as more accurate ($ < 
(01) than did subjects in any of the other 


5 All probability Jevels reported throughout this 


article are two-tailed. 
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groups, the latter of which did not signifi- 
cantly differ from one another. An analysis 
of variance on self-reported confidence re- 
vealed reliable effects for the Reciprocity x 
Duration interaction, F(1, 56) = 5.57, p< 
.05, the reciprocity main effect, F(1,56) = 
25.13, p < .01, and the duration effect, F(1, 
56) = 12.52, p < .01. A moderately signifi- 
cant targets effect was also obtained, F(1, 
56) = 4.26, p < .05. This finding, although 
not surprising, is nevertheless not particularly 
detrimental to our hypothesis, since targets 
did not interact with either reciprocity or du- 
ration, thus indicating that the latter operated 
independently of the former. The three-way 
interaction was also negligible. An analysis of 
variance on perceived accuracy indicated a 
highly reliable Reciprocity x Duration inter- 
action, F(1, 56) = 35.13, p < .01, in addition 
to significant reciprocity, F(1,56) = 90.61, 
$ < .01, and duration, F (1, 56) = 33.03, p < 
.01, main effects. The remaining effects were 
negligible. 

It should be noted that as was the case 
with the accuracy measures, the reciprocity 
and duration main effects obtained on the 
confidence measures were almost entirely ac- 
counted for by the Reciprocity X Duration 
interaction, as indicated by the pattern of 
means shown in Table 2. 


Differentiation 


Two measures of differentiation were de- 
rived from the raw target-rating semantic dif- 
ferential items. The first measure dealt with 
both the magnitude and direction of interitem 
correlations. Following Koltuy’s (1962) line 
of reasoning, smaller intercorrelations should 
reflect greater differentiation. Extending this 
rationale, it could be argued that more dif- 
ferentiating judges should exhibit not only 
smaller interitem correlations but also more 
variation in the direction of those intercorre- 
lations. In other words, greater differentiation 
should be represented by a reduced tendency 
to rate the target in a generally positive (neg- 
ative) manner. This assum) ‘ion requires that 
the semantic differential isc 0s be coded in 
such a way that a high nu» rical value on 
any given item represents a hi 1 positive rat- 
ing, and a low numerical \ represents a 
high negative rating. After ng the items 
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in this manner, interitem correlation iwatrices 
were obtained for each of the four oxperi- 
mental groups (collapsed across ets). 
Box’s (1950) test for equality of va iance- 
covariance matrices was then applic the 
correlation matrices. Since this p dure 
tests for differences between the deter, nants 
of the matrices being compared, it is = -itive 
to differences in the size as well as th -lirec- 
tion of the correlations. Each group rela- 
tion matrix was compared with eac. other 
group matrix. The results were co: istent 
with expectations: The matrix for t cip- 
rocal-15 group was significantly a «rent 
from that for the nonreciprocal-15 : oup, 
F(55,2906) = 1.44, p < .05, and that the 
nonreciprocal-5 group, F(55, 2906) 40, 
p < .05. There were no other reliable ffer- 
ences. These results are rendered som what 
more interpretable upon consideration the 
mean interitem correlations display by 
experimental condition, in Table 3. Ac ling 
to the table, the reciprocal-15 group priced 
the smallest mean algebraic correlati as 
well as the smallest mean absolute ela- 
tions. These means apparently indicate, =» ex- 
pected, that the reciprocal-15 group ex! ited 
not only smaller intercorrelations bu also 
more directional variation. A somewhat nex- 
pected pattern of results emerges upon a ‘oser 
inspection of these means: While the + \pro- 
cal groups, as predicted, differentiate. nore 
(produced smaller correlations) over ‘ime, 
the nonreciprocal groups apparently dii eren- 
tiated less, thus nullifying any duration main 


effect. Since our hypothesis argues thei the 
differentiation shown by the nonreciprocal 
groups should be relatively unaffected by 


ë Since Box’s (1950) test requires calculating the 
determinant of each matrix, it was necessary to re- 
duce the number of input variables. This was be- 
cause the number of subjects per group (16) was too 
close to the number of semantic differential items 
(15). Whenever the number of variables approaches 
the number of subjects, the determinant of any cor- 
relation matrix approaches zero. This zero value is 
purely an artifact of the sample size upon which the 
correlations are based and is obtained regardless of 
the sizes or directions of those correlations. Conse- 
quently, 5 of the items were randomly discarded, 
so that the order of each input matrix was 10 rather 
than 15, thus allowing for a more valid use ot 
Box's test. 


| 


COMMUNICATION FEEDBACK AND DURATION 


TABLE 3 
MEAN I» “RITEM CORRELATION AND PERCEIVED 
Simi ity POLARITY DIFFERENTIATION 
ScorEs 
Duration 
i5 5 
Rec ty minutes minutes 
Recij l 
r 09 17 
At er 23 26 
Poi à 1.694 69, 
Nonr rocal 
r 29 21 
Ab: er -36 „31 
Po x -63b 15b 
Nole. Whe .»propriate, cells having different subscripts 
lifer at the ©: level hy the Newman-Keuls procedure. 
"Polarity i are on a scale from 0 to 2. 
duration, ‘vis finding would seem to be in- 
consisten'. ‘Nevertheless, the Box test revealed 
no reliab\ differences between the two non- 
reciproca! oups; we are therefore compelled 
to conc! that for these groups, duration 
had no © ct that would not be expected by 
chance aie. The Box results also lead us to 
conclude at there was no reliable duration 
simple «ect for the reciprocal groups as 
well, des: ‘e the difference suggested by the 
Means, ’ vertheless, the Box test did show 
teliable «ferences between the reciprocal-15 
group ar both of the nonreciprocal groups, 
While it cid not show any differences between 
the reciprocal-5 group and either of the non- 


teciproca! groups. Thus the results of the Box 
lest seem more-or-less supportive of our 
hypothesis 

A second measure of differentiation was 
obtained from a factor analysis of the raw 
larget-rating semantic differential items. It 
Seems likely that more differentiating judges 
Should exhibit more multidimensionality in 
their ratings of the target than less differenti- 
ating judges. Accordingly, differentiation was 
Measured as the number of important initial 
factors extracted, where an “important” fac- 
for is defined as any factor accounting for 
More than 13% of the total variance (i.e., 
“genvalue greater than 2). It should be noted 
that this measure provides an adequate test 
of the differentiation hypothesis and yet 
Woids two commonly encountered problems 
m the use of factor analysis: subjective in- 


terpretation of the factors and 

tational method. Separate facio 

were performed for each of the i 

mental groups (collapsed across tary 

initial factors were extracted by the | 
components solution, without iteration 

4 presents the proportions of the total 

ance accounted for by each factor fo: 
first five unrotated factors. According to il 
table, the reciprocal-15 group loaded on fou; 
factors with eigenvalues greater than 2, 
whereas the other three groups all loaded on 
only two factors with such eigenvalues. It is 
interesting to note that these latter three 
groups all loaded quite heavily on Factor I 
(eigenvalues around 4.9), after which there 
is a precipitous decline. In contrast, the recip- 
rocal-15 group loaded only moderately high 
on Factor I (A = 3.6) and then exhibited a 
moderate decline in loadings across Factors 
II, III, and IV. Consistent with the differen- 
tiation hypothesis, this pattern of results 
would seem to suggest that reciprocal-15 
group subjects exhibited more multidimen- 


TABLE 4 


PROPORTIONS OF TOTAL VARIANCE ACCOUNTED FOR BY 
Major Unroratep Factors 


Factor 

Condition STi N SCA 
Reciprocal—15 

Figenvalue* 3.60 3.45 245 217 1.24 
Proportion? 524. ERO) | ahO. epee tone 
Cumulative 

proportion® 24 47 63 .77 86 
Reciprocal—5 

Eigenvalue* 485 444 1.91 106 74 
Proportion” 32, 0% 1S. 0% 05: 
Cumulative 

proportion® 5 a) RY Slee tot 


Nonreciprocal—15 


Eigenvalue® 4.59 2.60 1.91 1.56 1.33 
Proportion AK A naD O AA K IE 09: 
Cumulative 


proportion® 


Nonreciprocal—5 


Eigenvalue* 5.13 4.20 1,52 1.04 .80 

Proportion” 34 28 10 .07 = .05 

Cumulative K 
proportion® ET 162) G72" 1 19° BA 


a Eigenvalue associated with the factor. à 
b Proportion of total variance corresponding to eigenvalue, 
«Cumulative proportion of total variance corresponding 


to eigenvalue, 


oo 
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sionality in their ratings of the target than 
did subjects in the other three groups. 

A third measure of differentiation was de- 
rived from the perceived similarity item on 
the postexperimental questionnaire. Following 
the rationale suggested by Koltuv (1962), 
greater polarity on such an item should repre- 
sent greater differentiation. The polarity 
scores were expressed as the absolute devia- 
tions from the scale midpoint. Referring back 
to Table 3, it is apparent that the mean po- 
larity scores are in line with expectations: 
The reciprocal-15 group demonstrated more 
polarity than any of the other three groups 
(p < .01), the latter of which did not show 
any significant differences. An analysis of 
variance was performed on the polarity scores, 
which yielded a reliable Reciprocity X Dura- 
tion interaction, F(1,56) = 12.19, p< .01, 
as predicted. Significant main effects, which 
were largely attributable to this interaction, 
were obtained for reciprocity, F(1,56) = 
9.63, p < .01, and duration, F(1, 56) = 7.38, 
p<.05. All remaining sources of variance 
yielded negligible effects. In addition to the 
perceived similarity polarity scores, an analy- 
sis of variance was conducted on the raw 
perceived similarity scores. Results of this 
analysisinicated no reliable effects. Thus 
‘7s no support for the argument that the 
fects were due to differential per- 

larity. 


Her 
pola 
ceived 


Additional Measures 


The process of hypothesis testing has been 
proposed as an explanatory construct in the 
pre concep tualization of accuracy, confi- 
dence. and differentiation in person percep- 
tion. a rather crude index of hy- 
pothesis-testi the amount of participation 
measure pr s a check on this conceptuali- 
zation as well as a check on the duration 
However, since this measure 

from subjects in the re- 

aly partial checks are 
«pected, the reciprocal 

nore than did the re- 

o } x mber of questions ad- 
ubjects in the former 
to 20 (M= 7.69), 
- group ranged from 
hermore, subjects in 


ithough 


maniy 
was 


cipro 


ciprocal 
dressed to the target 
group ra 
whereas that in ih 
0 to 3 (M =1.38) 
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the reciprocal-15 group asked questions at a ` 


greater rate: The mean number of «uestions 
asked per minute was .51 for this g:oup but 
only .28 for the reciprocal—5 group. Fhus, it — 


would seem that the duration ulation | 
was effective in creating differenties! oppor- 
tunities for hypothesis testing. As » check 
on the proposed functional aspect: of the 
hypothesis-testing process, amount ©. partici- 
pation was used to predict the accur«y, con- 
fidence, and differentiation scores of . bjects 
in the reciprocal conditions. The cor -lations 
between amount of participation anv che ad- 
jective checklist, refined D°, target- levant 
questionnaire completion latency, seli- “ported 
confidence, perceived accuracy, and © larity 
measures, respectively, were .72 (p =~ .01), 


—.50 (p< .01), —.29 (ms), 30 (ns), „41m 
(p< .01), and .46 (p<.01). All o! these 
correlations were consistent with pecta- 
tions. Apparently, then, the hypothesis-iesting 
concept is supported by the data. 

One item on the postexperimental qu »stion- 


naire asked subjects to rate the ex wnt to 
which they viewed the experiment «a inter- 
esting. Analysis of this item revealed +o reli- 
able effects. All groups reported a mo »vately 
high level of interest (M = 3.9). A milar 
item asked subjects to rate their atten’ veness 
during the experiment. An analysis of this 
item revealed one significant effect: the reci- 
procity main effect, F(1, 56) = 4.98, è < .05. 
The grand means for the reciprocal and non- 
reciprocal groups (collapsed over durations 
and targets) were 3.72 and 4.16, respectively 


indicating that the reciprocity effec: 
tributable to greater self-reported attentive- 
ness for subjects in the nonreciprocal groups. 
This finding, along with the finc of nof 
reliable interest effects, would seem 
against an explanation of the previously men- 
tioned results in terms of group differences 
on these variables. 

Another questionnaire item asked subjects 
to indicate their liking for the target. An 
analysis of this item showed no considerable 
effects. 

A final questionnaire item instructed sub- 
jects to rate the extent to which the target 
expressed her “true feelings” during her pet}? 
formance. This item was included as an ab 
tempt to rule out the possibility that having 


COMMUNICATION FEEDBACK AND DURATION 


observed a prerecorded videotape, the non- 
reciprocal groups may have perceived the 


target's performance as less credible, since as 
} far as y knew, the videotape could have 
been s The results obtained on this item 
were | r complicated. The mean perceived 
honesty scores for the reciprocal-15, recip- 
rocal sreciprocal-15 and nonreciprocal—5 
group: pectively, were 4.56, 4.81, 4.88, 
and 4 An analysis of variance yielded 
[no sigi main effects, but it did show 
a sign iprocity X Duration interac- 
tion, } = 7.18, p < 05. Furthermore, 
the ‘I ‘ Reciprocity interaction was 
reliab 16) = 4.34, p < .05, as was the 
Target Duration interaction, F(1, 56) = 
7.18, / 5. At the present time, these un- 
“expect idings seem uninterpretable. How- 
ever, e these data are apparently not 
systematically related to any of the findings 
reported ‘hus far, it would seem difficult to 


account for the major results of this study 


in terms of group differences in perceived 
honest; 
Discussion 
The ~csults provide strong confirmation of 
the hypothesis that the correspondence be- 
tween «9's perceptions of another person and 


that other person’s self-perceptions increases 
at a grezter rate with the duration of the com- 
munication when such communication is re- 
tiproca!, When subjects were provided with 
feedbacic from the target, their accuracy in- 
creased more over time than when subjects 
Were no! so provided, This pattern of results 
merged consistently, which is impressive in 
view of the fact that the accuracy measures 
vere obtained using quite different procedures, 
There is also strong evidence to indicate that 
ubjects who had access to the target’s feed- 
ack demonstrated a greater increment in con- 
idence over time than did subjects who did 
lot have access to feedback. This interaction 
Yas obtained on two self-report measures of 
Onfidence as well as an unobtrusive behavioral 
asure. Finally, the results indicate that dif- 
entiation in the subjects’ ratings of the 
arget increased at a greater rate as time 
Togressed when subjects were given feedback 
lan when they were not. This pattern of 
sults was consistently obtained on three 
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measures of differentiation, despite the fact 
that these indices were remarkably different 
in nature. 

The hypothesis-testing concept suggested 
earlier would seem to account for these re- 
sults quite adequately. The perceiver formu- 
lates hypotheses about the target in accord- 
ance with his idiosyncratic informational 
demands. Feedback from the target can either 
confirm or disconfirm these hypotheses, so 
that the perceiver’s alternative interpretations 
and inferences are successively eliminated, 
thereby resulting in greater accuracy, confi- 
dence, and differentiation. Since only those 
subjects who engaged in reciprocal communi- 
cation with the target could explicitly test 
their hypotheses, that is, only they had access 
to interpersonal feedback, and since more 
hypotheses can be formulated, tested, and 
confirmed when more time is available, the 
consistently obtained Reciprocity x Duration 
interaction could be expected. Moreover, as 
expected, relatively high correlations between 
the number of responses made toward the 
target by the subjects and most of the vari- 
ous measures of accuracy, confidence, and 
differentiation were obtained. These cor- 
relations suggest that the hypothesis-testing 
concept has some construct validity. 

A possible alternative explanation of these 
results is that subjects who had reciprocally 
communicated with the target were better 
able to devote their full concentration to 
the communication, due to their more active 
involvement. However, there is reason to be- 
lieve that this type of interpretation is un- 
feasible. For one thing, such an approach 
would predict only main effects and thus 
could not account for the consistently obtained 
Reciprocity X Duration interaction. There is 
also the distinct possibility that it was the 
reciprocal groups rather than the nonrecip- 
rocal ones who were the most distracted. 
Besides probable distraction from the other 
subjects who also participated in the discus- 
sion, each subject may also have been dis- 
tracted by having to plan each overt response 
so as not to appear “foolish” or “ignorant” 
in public. The results lend credence to this 
argument: The reciprocal groups reported 
reliably less attentiveness. 

An interesting—although somewhat coun- 
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terintuitive—conclusion might be drawn from 
the results of this experiment: The amount 
of information per se is not a critical determi- 
nant of either accuracy, confidence, or differ- 
entiation in person perception. It may be 
recalled that the nonreciprocal conditions 
were directly derived from (i.e., replays of) 
the reciprocal conditions and were presented 
to subjects over precisely the same audio- 
visual medium. The amount of information 
per se should therefore have been constant 
for these groups. Nonetheless, the reciprocal 
groups demonstrated, with remarkable con- 
sistency, reliable increases in accuracy, con- 
fidence, and differentiation as time progressed, 
whereas the nonreciprocal groups, with equal 
consistency, did not show such increments. 
If the hypothesis-testing approach is indeed 
valid, it might be concluded that the amount 
of information that has been validated via 
interpersonal feedback, rather than the 
amount of information per se, is the crucial 
factor in person perception. 


REFERENCE NOTE 


1. Powell, R. S., & O’Neal, E. C. The influence of 
communication reciprocity and duration upon 
accuracy, confidence, and differentiation in person 
perception. Unpublished manuscript, Tulane Uni- 
versity, 1974, 


REFERENCES 


Anderson, N. H. Likableness of 555 personality-trait 
words. Journal of Personality and Social Psychol- 
ogy, 1968, 9, 272-279. 

Attneave, F. Applications of information theory to 
psychology. New York: Holt, Rinehart & Winston, 
1959. 

Bieri, J. Complexity-simplicity as a personality vari- 
able in cognitive and preferential behavior. In D. 
W. Fiske & S. R. Maddi (Eds.), Functions of 
varied experience. Homewood, Ill.: Dorsey Press, 
1961. 

Box, G. E. P. Problems in the analysis of growth 
and wear curves. Biometrics, 1950, 6, 362-389. 

Brewer, R, E., & Brewer, M. B. Attraction and 
accuracy of perception in dyads. Journal of Per- 
sonality and Social Psychology, 1968, 8, 188-193. 


RICHARD S. POWELL AND EDGAR C. O'NEAL 


Bronfenbrenner, U., Harding, J., & Gallwey, M. The ` 


measurement of skill in social per In D. C, 
McClelland (Ed.), Talent and society. New York; 
Van Nostrand, 1958. 

Christensen, L. Validity of person-percr tion accu- — 
racy scores. Perceptual and Motor Sk: 1970, 30, 
871-877. 

Cline, V. B., & Richards, J. M., Jr uracy of . 
interpersonal perception: A general tr! ? Journal 
of Abnormal and Social Psychology, 1 60, 1-7. | 

Crockett, W. H. Cognitive complexity impres- y 
sion formation. In B. A. Maher (Ed Progress ` 
in experimental personality research (Vo. 2), New — 
York: Academic Press, 1965. 

Cronbach, L. J. Processes affecting scores “under- 
standing of others” and “assumed ilarity.” 
Psychological Bulletin, 1955, 52, 177-19. | 

Cronbach, L. J. Proposals leading to anal. ic treat- | 
ment of social perception scores. In R. giuri & 
L. Petrullo (Eds.), Person perception inter- — 
personal behavior. Stanford, Calif.: Stan «ed Uni- 


versity Press, 1958. 4 


Ekehammar, B., & Magnusson, D. Interi.. it cor- | 
relations as a function of amount of juc mental 
information about the ratee. Perceptual an’ Motor 
Skills, 1973, 36, 373-374. 

Jones, E. E., & Gerard, H. B. Foundations social 


psychology. New York: Wiley, 1967. 

Koltuv, B. B. Some characteristics of in’ »ajudge 
trait intercorrelations. Psychological Mos- raphs, 
1962, 76(33, Whole No. 552). fi 


Mead, G. H. Mind, self, and society. Chicago Uni- 
versity of Chicago Press, 1934. 

Mehrabian, A., & Reed, H. Some determin: nts of 
communication accuracy. Psychological letin, 
1968, 70, 365-381. 

Miller, A. G. Amount of information and » mulus 
valence as determinants of cognitive com: iexity. 


Journal of Personality, 1969, 37, 141-157. 

Newtson, D. Attribution and the unit of perception 
of ongoing behavior. Journal of Personalily and 
Social Psychology, 1973, 28, 28-38. 

Richards, J. M., Jr., & Cline, V. B. Accuracy com- 
ponents in person perception scores and the scoring 
system as an artifact in investigations of the gen- | 
erality of judging ability. Psychological Reports, 
1963, 12, 363-373. 

Stryker, S. Conditions of accurate role-taking: A 
test of Mead’s theory. In A. M. Rose (Ed.), 
Human behavior and social processes. Boston: 
Houghton Mifflin, 1962. 


(Received March 3, 1976) 


EES 


Person to person, 
person to society- 


Random House social psychology texts 
make the vital connections 


SOCIAL PSYCHOLOGY: Explorations in Understanding 
Chief Academic Adviser, Ken Gergen, Swarthmore College 


This research-based introductory text features in-depth coverage of all major topics in social 
psychology. It draws not only from research in social psychology but also from research findings 
in many other disciplines. Using a micro-to-macro presentation of social psychology, the book 
examines relations between an individual and an informal group, and then the relations between 
these groups and society as a whole. Instructor's Manual available, 


CRM/Random House/1974/637 pages hardcoyer/Codes: 31012, 31024 (IM) 


SOCIAL PSYCHOLOGY AND MODERN LIFE 
Patricia Niles Middlebrook, Central Connecticut State College 


The leader in its field for the past two years, this student-oriented text has been enthusiastically 
received for, among other things, its consistently lively writing style supplemented by examples 
familiar to the student. After an opening chapter on research, the text focuses on the influence of 
other people on the individual in the formation of self-concept, attitudes, and motives, Person 
perception is then discussed, and the text concludes by reviewing how people interact, Instructor's 
Manual available, 


Alired A. Knopt/1974/626 pages hardbound/Codes: 31726, 31158 (New text questions), 31831 (IM) 


AN INTRODUCTION TO SOCIAL PSYCHOLOGY, Second Edition 
James A, Schellenberg, Western Michigan University 


A concise introduction to social psychology — now in an expanded paperback edition — with an 
emphasis on interpersonal behavior. By exploration of relevant empirical data from selected case 
histories and experiments, the text actively involves the reader in elementary theory building. 
affiliation; aggression; social influences; atti- 


The material is grouped around seven general topics: 
tudes; groups; social norms; and personality in society. Instructor's Manual available. 


Random House/1974/350 pages paper/Codes; 31829, 31832 (IM) 


For more information write to: 


> 


Random House /Alfred A. Knopf 


College Review Desk 64, 400 Hahn Rd., Westminster, Md. 21157 


Publication 
Manual 


of the American 
Psychological Association 


Second edition, 1974. Cur- 
rent guide to the preparation 
of manuscripts for APA and 
related journals. This edition 
adds many examples, includ- 
ing an entire sample paper. 
New sections on student 
papers, metrication, and 
mathematical copy. En- 
larged sections on content 
and organization of an arti- 
cle, typing the manuscript, 
and all aspects of APA style, 
including reference lists and 
tables. 136 pp. $4.50. 


Please note that payment 
must accompany all orders 
amounting to $15 or less ex- 
cept for those orders sub- 
mitted on institutional pur- 
chase order forms. 


American Psychological 
Association 
Publication Sales Dept. 
1200 17th Street, N.W. 
Washington, D.C. 20036 


Invitation 
| to examine. without obligation, the Jat ahlem 
Workshop Reports. 


\ 


Sciences 
Research Report 


Life Sciences Research Reports (ISSN 0340-8132) 
Vol, 2 


Appetite and 
Food Intake 


Report of the Dahlem Workshop on 
Appetite and Food Intake 
Berlin 1975, December 8 to 12 


Editor: Trevor Silverstone, St. Bartholomew's Hospi 
London. 


|" This publication is a unique document in the litera 

of appetite. It contains not only comprehensive review: 
of the state of knowledge as seen by scientists from h 
different disciplines as physiology, biochemistry, phi 
cology, nutrition, psychology, psychiatry, anthropolccy, 
and internal medicine; there are also four reports 
synthesize the current thinking of these scientists aft 
|_wrestling with the problems in a group situation, 


CONTENTS 

Introduction: T, Silverstone, — 

The Central Nervous System and the Regulation of 
Feeding. J. E. Blundell, Rapporteur; D. A, Booth, 

L. De Ruiter, S. Garattini, R. M. Gold, J. F. Marshall, G. 
Mogenson, S. Nicolaidis, T. W. Robbins, E. T. Rolls, ` 
Slangen, E. Stellar, E. M. Stricker, P. Teitelbaum, R. J. 
Wurtmann. — 

Peripheral Mechanisms and the Regulation of Feediny. 
F. W. Wolff, L. M. Bartoshuk, D. A. Booth, G. A. Bray, 
C. S. Chlouverakis, P. B, Curtis-Prior, A. E. Harper, J. N. 
Hunt, D. Novin, M. B. Russek, K. Schneider, G. P. Smith 
Psychological and Cultural Aspects of Feeding and 
Appetite in Man. S. C. Wooley, Rapporteur; O. W. 
Wooley, L. M. Bartoshuk, M. J. C. Cabanac, R. Ferstl, 

G. W. R. Gutezeit, D. J. McFarland, M. Oetting, V. & 
Pudel, J. Rodin, P. Rozin, F. J. Simoons. 

Feeding and Weight Disorders in Man. J. S. Garrow, Rap- 
porteur; A. H. Crisp, H. A. Jordan, J.-E, Meyer, G. F. M. 
Russel, T. Silverstone, A. J. Stunkard, T. B. Van Itallie. = 
Approaches to Feeding Control: D. A. Booth. — Author 
Index, — Subject Index. =o 
15-day Free Examination Offer: request your copy today 
from 


Dahlem Konterenzen Delbttickstrasse 4 C. D-1000 Berlin 33 | 


| Dahlem Konferenzen | 


(Contents continued) 


P! »sophical Eminence, Beliefs, and Zeitgeist: An Individual-Generational Analysis/ 
Dean Keith Simonton- cna sedane ae aS ease EAE e eee a a 


T: rda Normative and Comparative Biocultural Social Psychology/Albert Pepitone 


Ju cation and Compensation: Rosier Skies for the Devalued Victim/Douglas T. 


Kenrick, John W. Reich, and Robert B. Cialdini ...........00.ccceceeeeseues 
Ce ipensatory Aspects of Dreams: An Empirical Test of Jung’s Theory/George 
Domino’ a syaa e rsx Sale ten eee aes eT sei ee eat 
El: ic Shifts of Opinion: Determinants of Direction and Durability/Robert B. 


“aldini, Alan Levy, C. Peter Herman, Lynn T. Kozlowski, and Richard E. Petty 


The Power of Suggestion: Another Examination of Misattribution and Insomnia/ 
Richard R. Bootzin, C. Peter Herman, and Perry Nicassio ........00.00.ceeuns 


The Recognition and Elimination of Inconsistencies Among Syllogistically Related Be- 
iefs: Some New Light on the “Socratic Effect”/Marilyn Henninger and Robert S. 


WA pret hii dict ba Poa cns Gantt Soo a OAT AS MT ORO REL ETE EEE E 


Use of Consensus Information in Causal Attributions as a Function of Temporal 
resentation and Availability of Direct Information/Nina S. Feldman, E. Tory 
Higgins, Marylie Karlovac, and Diane N. Ruble ...... 0.6. c veces ve ee neces 


Ou. of the Laboratory and into Wonderland: A Critique of the Kinder, Smith, and 
Gerard Adventure with Perspective Theory/Harry S. Upshaw .....-.....0.0005 


Plo ng Perspective Research in Perspective: Reply to Upshaw/Tom Smith, Harold B. 
Gerad, Gnd Donald Rab nee EAr aN DAAA ArT AT ede ela SE Oy 


Co» aitive Tuning and the Attribution of Causality/John H. Harvey, Stephen G. 
Harkins, and’ Dorothy K. KOgentro vi rasoio nau cyc nee y cds ar Voes can tees 


Relationship of Preoperative Fear, Type of Coping, and Information Received About 
Surgery to Recovery from Surgery/A. Marilyn Sime ... 00.00.00 cccueveevenes 


Social Facilitation of Word Associations: Further Questions/Thomas O. Blank, Ilene 
Staf, and Phillip- SROVE? occse cure. cv ence neip ceuecs sees ws aona te s AEREE S 


Frequency Perception of Successes as a Function of Results Previously Obtained by 
Others and by Oneself/Robert Vreven and Joseph R. Nuttin ......++0++00s00-+ 


Communication Feedback and Duration as Determinants of Accuracy, Confidence, and 
Differentiation in Interpersonal Perception/Richard S. Powell and Edgar C. 
O'Neal oeei silty so eis doo inganear re omad sA ne ele ees a Rs 


630 
641 


654 


658 


663 


673 


680 


694 


699 


704 


708 


716 


725 


734 


746 


28 JAN i372? 


Facts you 


* 


need 


to 


Graduate Study in Psychology 
1977-78 is an indispensable 
resource book for Prospective 
Sraduate students and college 
counselors, 


It provides specific information 

on 500 graduate programs in 

the United States and Canada: 

# their application procedures, 

* admissions requirements, 

* degree requirements, 

* tuition, 

* financial assistance, 

* internships, 

* minority considerations, and 
more, 


It includes general information on 

* how to apply to graduate 
school, 

* types of government stipends 
available, 


OW. 


* availability of child day-care 
facilities, 

* information of use to foreign 
students. 


Also, for the first time, Graduate 
Study in Psychology includes a 
listing of postdoctoral programs 
for Psychologists desiring to 
change their area of specialization. 


Graduate Study in Psychology 
1977-78 is approximately 600 pages 
and costs $6. To order your copy, 
please send full payment or 
institutional purchase order to: 


Publication Sales Department 
` S American Psychological 
Association 

1200 17th Street, N.W. 
Washington, D.C. 20036 


ae of Personality 


Social Psychology 
sn E 


NOVEMBER 1976 


Equity Versus Equality as Factors in Children’s Moral Judgments of Thefts, Charity, 
and Third-Party Transfers/Philip Brickman and James H. Bryan ............. 757 


Norm Change Over Subject Generations as a Function of Arbitrariness of Prescribed 
Norms/Mark K. MacNeil and Muzafer Sherif 0.22.00. ccc cec cece cseceeenas 762 


Effects of Noise and Perceived Control on Ongoing and Subsequent Aggressive Be- 
havior/Edward Donnerstein and David W. Wilson ..........000.cec cece eees 174 


A Longitudinal Field Investigation of the Impact of Group Composition on Group 
Performance and Cohesion/James R. Terborg, Carl Castore, and John A. 


O rity se cs ELA e o aa ENN ONAN EE eee. AA Oem OEE iat bol 782 
Validation of Personality Traits: A Multicriteria Approach/John McGowan and 

SORRI GOP oe dk ccs es rA Spgs eile s deal ein pays ae A eae a a E A S 791 
Acquisition of Information in Children’s Bargaining/Christine Eiser and J. Richard 

OB ASEP Oo N s Cena TL ee SAE EG TOPS lai 's Cd RT Re ae oc okey Se ~ 7196 
The Function of Group Size and Ability Level on Solving a Multidimensional Com- 

plementary “Task/ Thor Bgo UBER si. cinovsie en sib sien oa de maaa sta ia aaen 805 
Defensive Attribution and the Kelley Cube/Laura Stevens and Edward E. Jones .... 809 


Operant Autonomic Conditioning and Rod-and-Frame Test Performance/Thomas R. 
McCanne and CE A SONHO ie os ciate vive eRe vn sia io aie «ot eR eRe CO 821 


Frequency Perception of Individual and Group Successes As a Function of Com- 
petition, Coaction, and Isolation/Ludo Janssens and Joseph R. Nuttin ........ 830 


Groups and Crowds as Social Entities: Effects of Activity, Size, and Member Simi- 
larity on Nonmembers/Eric S. Knowles and Rodney L. Bassett ........-+.+++ 837 


Personality and Situational Influences on Changes in Prejudice: A Replication of 
ie Cook’s Railroad Game in a Prison Setting/Linda A. Foley ......+++++eee000e 846 


(Continued on inside back cover) 


PUBLISHED MONTHLY BY THE AMERICAN PSYCHOLOGICAL ASSOCIATION 


<s=-- SCHOOL DESEGREGATION 


by Harold B. Gerard, University of California, Los Angeles 

and Norman Miller, University of Southern California 

This thought-provoking six-year study of desegregation in River 
California, is distinguished from other desegregation reports on 

fronts: the extensive pre-desegregation testing and planning 
ducted by the authors; the long-term nature of the study; and t 

curacy of the scientific method applied. A volume in Perspecti\ 
Social Psychology, a series edited by Elliot Aronson. 

315 pages, 1976, $17.95 


SOCIAL PSYCHOLOGY IN TRANSITION 


edited by Lloyd H. Strickland, Carleton University 
Frances E. Aboud , McGill University 
and Kenneth J. Gergen, Swarthmore College 


This volume reflects the wide variety of diagnoses of and proposals for solving t 

flicts that are currently plaguing the field of social psychology. Of particular not 
transcript of a lively debate, and its evaluation, concerning social psychology’s 
structure.” An interesting mixture of opinions from both authorities and less ex p< a 
social psychologists lends an even balance to the work’s relevance to this ever- ng 
discipline. approx. 345 pages, illus., 1976. $24.50 


INTRINSIC MOTIVATION 


By Edward L. Deci, University of Rochester 
“I would like to see Intrinsic Motivation become a very in ortant 
text in psychology.” —Ralph Norma Haber 


. a very important and valuable contribution to the liter: 
motivation . . . a most worthwhile enterprise.” —D: E. I 


A volume in Perspectives in Social Psychology, a series edited | 
liot Aronson. 361 pages, 1975, $14.95 


REMINISCENCE, MOTIVATION.AND PERSONALITY 
A Case Study in Experimental Psychology 


by H. J. Eysenck and C. D. Frith, University of London 

Here is an up-to-date review of the wealth of recent literature on reminiscence in learn- 
ing—both motor and verbal. Examining in detail individual learning differences, the au- 
thors present for the first time a new theory of reminiscence in motor skill learning that 


carefully consolidates all findings in the area and closely parallels recent physiological the- g 


ories concerning the neural basis of learning. approx. 420 pages, illus., 1976, $27.50 


PLENUM PUBLISHING CORP. 


227 West 17 Street, New York, N.Y. 10011 
D In U.K.: Black Arrow House, 

2 Chandos Road, NW10 6NR, London, England 
d Prices subject to change without notice 


theJanguageofscience Prices slightly higher outside the U.S 


Volume 34 


Journal of Personality, m 227. 
and Social Psycholog 


Editor: 
Associate Editors: 


Irwin Altman 
University of Utak 

Norman H. Anderson 
University of California, San Diego 


T Alert Bandura 


Stanford University 
D 1 J. Bem 
vtanford University 
Seymour M. Berger 
niversity of Massachusetts 
John C. Brigham 
Florida State University 
Urie Bronfenbrenner 
Cornell University 
J. Merrill Carlsmith 
Stanjord University 
Isidor Chein 
New York University 
rt B, Cialdini 
izona State University 
Thomas Cook 
Northwestern University 
Walter H. Crockett 
University of Kansas 
James E. Dittes 
Yole University 
Norman T. Feather 
Flinders University oj South 
Australia, Bedford Park, Australia 
enneth Gergen 
warthmore College 
č. Gross 
sity of Missouri 
m Haythorn | 
rida State University 


Wil 
Flo 


Sı 


es Adams 
Uni 


'rsity of North Carolina 


Theodore X. Barber 
Medjord State Hospital, 
Medford, Massachusetts 


arry Collins 
University of California at 
Los Angeles 


lb Dermer 


versity of Wisconsin—Milwaukee 


Trene eze p 
University of Pittsburgh 

George R. Goethals 
Williams College 


THE AMERICAN PSY 


Copyright © 1976 by t 


November 1976 


John T, Lanzetta, Dartmouth College 
Alice H. Eagly, University oj Massachusetts 


Number 5 


Date 24: 22: 


mherst 


Anthony G. Greenwald, Ohio State University 
Albert Pepitone, University of Pennsylvania ` 


William Scott, James Cook University, Townsville, Australia 


Consulting Editors 


Clyde Hendrick 
Kent State University 
David S. Holmes 
University of Kansas 
Louise Kidder. 
Temple University 
Robert E, Kleck 
Dartmouth College 
Milton E. Lipetz 
University of Colorado 
Clifford E. Lunneborg 
University of Washington 
R. Clark MeCaule 
Bryn Mawr College 
Albert Mehrabian 
University of California, 
Gerald A. Mendelsohn 
University of California, Berkeley 
Messick k 
'ersity a California, 
Santa Barbara 
Judson Mills 
University of Maryland 
Herbert L. Mirels 
Ohio State Uw 
William N. Morris 
Dartmouth College 


isbett 
ity of Michigan, Ann Arbor 


Berkeley 


ity 


avin : f 
ysity of Wisconsin—Madison 
Dean Pruitt , 3 

State University of New York 
at Bufalo 


Issue Consultants 


David Green ORR 
Curnegic-Mellon University 
William M. Kartines 
Florida International University 
Helmut Lamm 5 
University. of Mannheim, 
‘Mannheim, West Germany 


Susan Milmoe 
Lajayette College 
e] O. Raynor Ses 
TS ate University of New York 
at Bufalo 
Steven Reiss man 
University of Ilinois 
at Chicago Circle 
Seymour Rosenberg 
Rutgers University 


Milton F. Rosenbaum 
University. of Towa 
David J, Schneid 
Brandeis Uni 
Marvin Shaw 
University of Florida 
Lawrence Stricker 
Educational Testing Servic 
Princeton, New Jersey 


Lloyd H. Strickland 


ity 


ë, 


Carleton University, Ottawa, Canada 


Fred L. Strodtbeck 
University o] Chicago 
Suedield 
ersity of British Columbia, 
Vancouver, Canada 
James T. Tedeschi 
State University of New York, 
Albany 


Harry, Upshaw 
University of Hlinois at Chicago 
Circle 
Elaine Walster 
University of Wisconsin 
Ladd Wheeler e 
University of Rochester 
Robert $, Wyer, Jr 
University of Ilinois at 
Urbana-Champaign 
Robert Ziller 
University of Florida — 


Madison 


Thilip Zimbardo 
Stanford University i 


Lawrence Sev 
idmore C 


Dennis Sullivan 
Dartmouth College 


Abraham Tesser 
University of Georgia 


William A. Tiller 
Stanford University 


Stuart Valins 
State University of New York 
at Stony Brook 
Robert Frank Weiss 
University of Oklahoma 


David J. Winter 
Wesleyan University 


CHOLOGICAL ASSOCIATION, Inc. 


he American Psychological Association, Inc. 


Yu. wo Arann 


Bureau Edo! Pay: 


Ssear 


The Journal of Personality and Social Psychology publishes original research reporis in the areas 
of social psychology and personality dynamics. The Journal includes articles on the following, topic 
social motivation; attitudes and attitude change; social interaction; verbal and nonverbal commu: 
cation processes; group behavior; person perception; conformity and deviation; and personality 
structure and dynamics. Methodological articles, studies primarily concerned with the development 
of measuring instruments, replications, and reports of negative results or failures to replicate pub- 
lished work are acceptable if they are judged to make a substantial contribution to knowledge, Low 
priority is given to papers concerned with personality assessment or the development and validation 
of assessment instruments. Manuscripts in these latter categories should be brief, not more than nine 
typewritten pages, The Journal does not include studies of abnormal processes or populations. 
Manuscripts submitted after April 1, 1976, should conform to the policies of the Editor-elect, as 
stated in his editorial in the January 1976 issue. 


Manuscripts: Submit manuscripts in quadruplicate, prepared in accordance with the Publication 
Manual of the American Psychological Association (2nd ed.) and the instructions stated in the 
Editor-elect’s editorial in the January 1976 issue of the Journal (Vol. 33, pp. 1-7). Copies should 
be highly legible. Only the original will be returned to the author. Mail manuscripts to the Editor- 
elect, Anthony G. Greenwald, Ohio State University, 404C West 17th Avenue, Columbus, Ohio 
43210. Required supplementary information: A submitted manuscript will not be reviewed for put 

lication without supplementary information in the following categories: (a) a cover letter that in 
dicates the relation (or lack of any relation) of the submitted manuscript to other currently sub- 
mitted, in-press, or recently published papers of the same research program; (b) one copy of cach 
such currently submitted, in-press, or recently published paper; (c) irately from any report in 
the manuscript, one copy of a statement of the research procedures in sufficient detail to serve as 
a guide to conducting an exact replication; and (d) also separately from any report in the manu- 
script, one copy of the summary tables (e.g., correlation tables, analysi 


is of variance summaries) for 
the major analyses reported in the manuscript. This supplementary information is solely for the use 
of the Editor in determining comprehensiveness of the manuscript, and it will not be returned to 
the author. 


After a manuscript is accepted for publication, authors will be expected to provide assurance that 
their raw data are available for reanalyses by other investigators and that their research procedures 
were consistent with the American Psychological Association’s Ethical Principles in the Conduct of 
Research with Human Participants (1973), except insofar as may be detailed in the manuscript. 


Copyright and Permissions: All rights reserved. Written permission must be obtained from the American Psycho- 
logical Association for copying or reprinting text of more than 500 words, tables, or figures. Permissions are normally 
granted contingent upon like permission of the author, inclusion of the APA copyright notice on the first page of repro- 
duced material, and payment of a fee of $10 per page of APA material. Permission fees are waived for authors who wish 
to reproduce their own articles, Address requests for permission to the Permissions Office, American Psychological 
Association, 1200 Seventeenth Street, N.W., Washington, D.C. 20036, 


Subscriptiong: Subscriptions are available on a calendar year basis only (January through December). Nonmember 
rates for 1976: $60 domestic, $64 foreign, $6 single issue. APA member rate: $25. Write to Subscription Section, APA, 


Reprints: Authors may order reprints of thelr articles from the printer when they reccive galley proofs, 


\ 


Bawk ees and Back Volumes: For information regarding back issues or back volumes write to Order Depart- 
ment, he 


Microform Editions : For information regarding microform editions write to any of the following: Johnson Associates, 
Inc., P.O, Box 1017, Greenwich, Connecticut 06830; University Microfilms, Ann Arbor, Michigan 48106; or Princeton 
Microfilms, Princeton, New Jersey 08540. 


Change of Address: Send change of address notice and a recent mailing label to the attention of the Subscription 
Section, APA, 30 days prior to the actual change of address, APA will not replace undelivered copies resulting from 
address changes; journals will be forwarded only if subscribers notify the local post office in writing that they 
will guarantee second-class forwarding postage, 


Published monthly in two volumes per year, six issues per volume, by the American Psychological Association, Inc., 
1200 Seventeenth Street, N.W., Washington, D.C. 20036. Printed in the U.S.A. Second-class postage paid at Washing- 
ton, D.C., and at additional mailing offices, 


Journal Staff 
Anita DeVivo, Executive Editor Robert J. Hayward, Advertising Representative 
Kendall C. Patterson, Production Supervisor Anne Redman, Subscription Manager 


Rita Jaskowitz and Dinah Sunday, 
Production Editors 


J Journal o! Personalit ; 
1976, r, No & se beet Psychology 


Ec ty Versus Equality as Factors in Children’ 
en's Moral ; 
of Thefts, Charity, and Third-Party eS eae 


Philip Brickman and James H. Bryan 
Northwestern University 


Previous research has found that charitable acts and third-part} 

more favorably judged when they increase equality than Ses Way estou 
equality. The present study asks whether this will be the case when the 
transfers decrease rather than increase the equity of the distribution and 
explores how equity and equality each affect subjects’ judgments of fairness. 
Ninety-six fifth-grade boys and girls judged a model who made either a chari- 
table, selfish, or third-party transfer that either increased or decreased the 
equity of a distribution and either increased or decreased the equality of a 
distribution. Transfers were favorably judged when they increased equity, 
regardless of whether they increased or decreased equality. Transfers were 
also favorably judged when they increased equality, regardless of whether 
they increased or decreased equity. Thus the present study finds evidence that 
equity and equality preferences can operate independently and simultaneously 


Dee a a 


and suggests a general method by which their relative strength can be assessed, 


| past study (Brickman & Bryan, 1975) 
we voted that theft and charity may both be 


vie -d as special cases of a general transfer 
of ids from one party to another. Evalua- 
tio - of these transfers may depend upon who 
me vs the transfer (e.g-, the beneficiary or 


the donor) as well as the shape of the distri- 
becion that results from the transfer (e8, 
more equal and equitable or less equal and 
equitable). Tt was found that transfers mak- 
ine the distribution of rewards in a four- 
person group more equal were judged more 
favorably than transfers that made the distri- 
bution less equal, at least if they were chari- 


table transfers (initiated by the person losing 
goods) or third-party transfers (initiated by 
a party who neither gained nor lost through 


the transfer). 
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In that study it was not possible to dis- 
tinguish between equality and equity as bases 
of reward because no information was pro- 
vided for observers to calculate how much 
reward each person deserved for equity. 
Equity is the case in which each person re- 
ceives a reward that is justified by his or her 
contribution, or more precisely, proportion- 
ate to other people’s rewards in the same way 
as the person’s inputs are proportionate to 
other people’s inputs (Adams, 1965; Walster, 
Berscheid, & Walster, 1973). Equality is, of 
course, the case in which everyone receives 
an equal amount of reward. Equal rewards 
are equitable only if each person has also 
made equal contributions towards the rele- 
vant goal. In the Brickman and Bryan (1975) 
study, no information was provided as to how 
hard people worked or how much they ac- 
complished and, hence, no conclusions could 
be drawn as to how much reward would have 
been equitable for each to receive. 

If a transfer increases the equity of a dis- 
tribution (by taking from someone who has 
more than they deserve on the basis of their 
work and giving to someone who has less than 
they deserve on the basis of their work), will 


it still matter whether the transfer also in- 
f the dis- 


creases or decreases the equality 0 
tribution? If a transfer increases equality, 
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EFFECT OF TRANSFER ON EQUALITY 


Increases Decreases 
EFFECT OF 
TRANSFER 
ON EQUITY 
WORK WORK 
DONE DONE 
Increases 
REWARD REWARD REWARD REWARD 
BEFORE AFTER BEFORE AFTER 
TRANSFER TRANSFER TRANSFER TRANSFER 
Work WORK 
ONE NE 
Decreases So 
» a 
REWARD REWARD REWARD REWARD 
BEFORE AFTER BEFORE AFTER 
TRANSFER TRANSFER TRANSFER TRANSFER 


Ficure 1, Transfers that increase and decrease equity and equality. 


will people view this transfer more favorably 
than one that decreases equality even if it 
does not increase equity? It is our belief that 
transfers in general will be approved to the 
extent that they increase equity, regardless 
of how they affect equality. Charitable trans- 
fers, however, would seem to constitute a 
special case in that effects on equality rather 
than effects on equity may be the important 
feature. Charity makes more sense as an act 
that increases equality than as an act that 
increases inequality. Furthermore, charity 
justified by a principle of equality would seem 
appropriate regardless of whether or not it 
increased equity (i.e., regardless of whether 
the donor was giving away goods that he or 
she had rightfully earned or mistakenly re- 
ceived). In any event, it is clearly important 


to know that distributions and the 
that produce them may not all be j 
a single criterion but by different 
that are salient on different occasions. 

Thus, the critical features of the present 


design are whether the transfer increases or 
and 


ed by 
tandards 


decreases equality in a distribution 
whether the transfer incre cr 
equity in the same distribution. 
sibilities are shown in Figure 1. The d 3 
tion at the top of each cell represents the 
amount of work done by each of four persons 
(in this experiment, the number of test book- 
lets completed by each). The distribution to 
the left of the arrow in each cell represents 
the amount of reward received by each person 
prior to the transfer (in this experiment, the 
number of chips left for each person by an 
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experimental assistant). The distribution to 
the right of the arrow in each cell represents 
the amount of reward received by each per- 
son after the transfer. If the height of the 
bars in the distribution representing work 
done coincides with the height of the bars in 
the distribution of reward to the left of the 
arrow, then the initial distribution of reward 
was equitable and the transfer made the dis- 
tribution inequitable. If the height of the 
bars in the distribution representing work 
done coincides with the height of the bars in 
the distribution of reward to the right of the 
arrow, the initial distribution of reward was 
inequitable and the transfer made the distri- 
bution equitable. Finally, if the height of the 
bars to the left of the arrow is equal and that 
to the right of the arrow is unequal, the ini- 
tial distribution was equal and the transfer 
made it unequal, whereas the reverse is true 
if the initial height of the bars is unequal and 
the final height is equal. 

The study also manipulated whether the 
transfer was surreptitiously made by the per- 
son gaining from the transfer (theft), losing 
from the transfer (charity), or not directly 
affected by the transfer (third-party trans- 
fer), as in Brickman and Bryan (1975). 
Thus, the full design of the present study is 
a 3X2x2 factorial involving agent of 
transfer (donor, recipient, or third-party), 
effect of transfer on equality (increases or de- 
creases), and effect of transfer on equity 
(increases or decreases). Dependent variables 
were people’s approval of the transfer, the 
transfer agent, and the resultant distribution. 


METHOD 


Subjects and Experimenter 


Ninety-six fifth-grade students, 45 males and 51 
females, served as subjects for this investigation. 
Eight subjects were randomly assigned to each of 
the 12 experimental conditions. All subjects were 
enrolled in a public school located in a suburb of 
Chicago. The experimenter was a female undergradu- 


ate. 


Materials 


Twelve short (approximately 4 minutes each) 
videotaped scenes served as stimuli for the subjects. 
The tapes were shown via a Panasonic tape-recorder 


unit. 


Procedure 


All subjects were tested in a school conference 
room. The experimenter seated the subjects in front 
of the video screen and then sat behind the monitor 
to be out of view and to manipulate the videotape 
recorder. 

Each subject heard the following instructions given 
by the experimenter: “We want you to look at a 
little film of a child from another school who did 
not know that we were filming her, Please watch 
very carefully because after you see the film, I am 
going to ask you some questions about what you 
have just seen. Do you have any questions?” 

After viewing the videotape, subjects were pro- 
vided with a short questionnaire. Upon completion, 
each subject turned in the survey, was thanked for 
helping in the study, and was dismissed. 


Videotape Script 


The film first establishes the nature of the task 
and the reward by means of a conversation between 
an experimenter and her assistant (both female un- 
dergraduates). The experimenter explains how glad 
she is to be able to give rewards to the children who 
have helped her in the experiment. The rewards 
are chips, which can later be exchanged for toys 
and candy from a display visible on a table at the 
side of the room. One chip is said to be earned for 
each test booklet completed by a child. 

To establish how many chips each child has 
earned, the experimenter reminds the assistant of 
how many booklets each of the day's four subjects, 
Alice, Jane, Mary, and Susan, has completed, She 
then indicates that she must leave and asks the 
assistant to put out the chips the children have 
earned since they will be coming by shortly to pick 
them up. Thus, the lack of correspondence between 
number of booklets completed and number of chips 
left for a child (necessary in the conditions where 
the initial distribution is inequitable) is produced 
by mistakes the assistant apparently makes in 
counting out the number of chips for each of the 
four children. The assistant places the chips by each 
child’s name and the stack of booklets the child has 
completed. 

As the assistant completes this task, a telephone 
rings and she goes into another room to answer it. 
While the assistant is talking on the telephone, Mary, 
a 10-year-old girl, enters the room, identifies her- 
self, and inquires about picking up her chips. The 
assistant explains that she cannot go into the other 
room because she is on the telephone but that Mary 
is welcome to go into the room alone and pick up 
the chips by her name. The purpose of having Mary 
alone was to establish the opportunity for her to 
transfer chips to or from various piles of the other 
subjects and have this transfer go unnoticed, While 
alone in the testing room, Mary counts the chips in 
each pile. She then rearranges the chips according to 
the transfer appropriate to the experimental condi- 
tion and announces the new chip distribution for 
herself, Alice, Jane, and Susan. Mary then gathers 
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up her pile of chips, calls goodbye to the assistant, 
and leaves the room. 


Manipulation of Independent Variables 


The display of test booklets and chips in front 
of each child’s name card was used to establish the 
equity and equality of the initial and final distribu- 
tions in a direct and graphic way. In the equal con- 
ditions, all subjects had a stack of 24 chips. In 
unequal conditions, one subject had 36 chips, one 
had 12, and the remaining two had 24 each. Trans- 
fers always involved moving 12 chips from one 
stack to another. In the equitable conditions, the 
number of chips each person had corresponded to 
the number of test booklets piled behind them. In 
the inequitable conditions, the chips received by two 
of the girls did not correspond to the number of 
test booklets they had completed, one receiving 12 
more chips and the other receiving 12 less chips 
than they deserved, Whether the transfer involved 
theft, charity, or the act of a third-party Mary 
made clear by adding chips to the pile next to her 
name, taking them away from the pile next to her 
name, or transferring them between two piles neither 
of which bore her name. 


Measurement of Dependent Variables 


Subjects responded to the following questionnaire 
items: Do you approve of what Mary did? Do you 
think Mary is a good person? Do you like Mary? 
Do you think what Mary did was fair? Do you 
think the amounts of chips people had after Mary 
moved the chips were fair? Do you think the amounts 
of chips people had after Mary moved the chips 
would make people happy? Subjects answered these 
questions by checking either not at all, a little, some, 
pretty much, or very much. Answers were scored 
from 0 (not at all) to 4 (very much). Subjects were 
also asked to recall how many test booklets each 
girl had completed and how many chips each girl 
had both before and after the transfer, 


REsuLtTs 


Dependent variables were analyzed by 
means of an analysis of variance using type of 
transfer, effect of transfer on equality, and 
effect of transfer on equity as independent 
variables. Initial tests also used sex of subjects 
as a factor but found no sex differences of 
any consequence. Thus, sex was ignored in 
subsequent analyses. Data from the manipu- 
lation checks indicated that subjects in gen- 
eral correctly perceived and recalled the num- 
ber of booklets completed by Mary, Alice, 
Jane, and Susan, as well as the number of 
chips each girl had both before and after the 
transfer, 


The pattern of results on each of our six 
dependent variables was identical, and all six 
were highly intercorrelated. (A single common 
factor accounted for 66.5% of the variance, 
with all items having loadings of at least .725 
on it.) Scores were added, and a single index 
of approval combining all six items was used. 

All main effects were significant, As in 
previous research, the transfer was more 
strongly approved when it represented an act 
of charity (M = 2.37) than when it repre- 
sented a third-party transfer (M = 1.22) or 
when it represented a theft (M = 1.02). This 
effect was significant, F (2,84) = 17.17, p< 
:001. Transfers were also more strongly ap- 
proved when they increased the equity of the 
distribution (M = 2.07) than when they de 
creased equity (M = 1.00). This effect wa 
significant, F(1,84) = 27.85, p < .001. 
nally, transfers were judged more favora 
when they increased equality (M = 1.75 
than when they decreased equality (M = 
1.32). This effect was significant, F(1, 34) 
= 4.66, p < 05. No significant interactions 
were found. 


Discussion 


We did not succeed in showing that charity 
is justified only when it increases equality 
(regardless of equity), whereas theft or third- 
party transfers are justified only when they 
increase equity (regardless of equality). How- 
ever, we did succeed in showing that both 
equity and equality are, independently, bases 
on which all three kinds of transfers are ap- 
proved or disapproved. Other authors have 
speculated that both equity and equality can 
Serve as bases of justice (Brickman & Camp- 
bell, 1971; Lerner, 1974; Rawls, 1971), and 
other resarchers have found evidence for 
equality as well as for equity both among 
children (Lane & Coon, 1972; Leventhal. 
Popp, & Sawyer, 1973) and in older samples 
(Brickman, 1975; Morgan & Sawyer, 1967). 
However, past studies have always forced 
subjects to choose between a distribution that 
is more equitable or a distribution that is 
more equal and have hence been unable to 
establish the simultaneous presence of both 
motives. The present study pioneers a simple 
and graphic method by which acts that affect 
the equity of a distribution can be cleanly 
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disentangled from acts that affect the equality 
of a distribution. Such acts, however carried 
out, are regarded as more fair if the resulting 
distribution is more equitable than the initial 
distribution. They are also regarded as more 
fair if the resulting distribution is more equal 
than the initial distribution. 

When subjects took into account whether 
or not the distribution was equal or equitable, 
they were considering what the philosopher 
Robert Nozick (1975) has called a patterned 
principle of justice. Subjects made more fa- 
vorable judgments when considering distribu- 
tions that conformed to a greater extent to 
either a pattern of equity or a pattern of 
equality. When subjects took into account 
whether the action or the transfer that had 
produced the distribution was just, regardless 
of whether the final distribution conformed to 
a particular pattern, they may have been 
reflecting what Nozick has called a nonpat- 
terned conception of justice. What matters in 
this conception is only whether the distribu- 
tion has been justly derived (by an act of 
charity rather than by an act of theft) and 
not what the pattern or shape of the distribu- 
tion is. Our results appear to indicate that 
both the shape of distributions and the man- 
ner in which they are derived are taken into 
account by subjects in their judgments of 
fairness. It seems clear that assessing the 
relative importance of patterns, which may be 
taken to correspond to ends, and transfers, 
which may be taken to correspond to means, 
is a worthy task for future research. 
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Norm Change Over Subject Generations as a Function of 


Arbitrariness of Prescribed Norms 


Muzafer Sherif 
Sociology Department 


Mark K. MacNeil 


Pennsylvania State University 


Experimental groups making judgments of autokinesis were enculturated with 
one of two arbitrary norms by planted subjects. Participants in sessions 
changed through successive “generations” in accordance with the example of 
Jacobs and Campbell, The principal focus was the effect of differing degrees 
of arbitrariness on norm formation, persistence, and change, Conformity to 
imposed arbitrary norms was found to be an inverse function of the degrees 
of arbitrariness. The two prescribed arbitrary norms were set at clearly 
separated distances from the natural norm and were clearly differentiated 
from each other. Both of the arbitrary norms in this study had a much 
broader latitude (7 inches, or 17.78 cm) than that of Jacobs and Campbell 
(2 inches, or 5.08 cm). Therefore, as would be predicted from latitude theoriz- 
ing, the enculturation and transmission of arbitrary norms were more effective 
and more persistent through generations. The more apparent than real differ- 
ences between the findings of this study and those of Jacobs and Campbell 
are reconciled through discussion of the nature of norms. 


Since they are the product of interaction 
among individuals, norms are subject to 
change as a consequence of further inter- 
actions. A norm that forms without the pres- 
ence of arbitrary influences is a “natural 
norm” for a given group and the existing 
conditions. A norm formed under “arbitrary” 
influences may subsequently be experienced 
as not fitting the prevailing stimulus condi- 
tions, and in conséquence, interactions among 
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individuals may lead to norm change (Sherif 
& Sherif, 1969). 

For social units in which the membership 
changes by means of a gradual replacement 
process, as in a family, the norms that are 
implanted in each successive new member 
through enculturation are basically derived 
from the past. Their present form is likely to 
have been determined, to some degree at 
least, from past conditions, and such norms 
are passed on to the new member in the form 
in which they currently exist, that is, as they 
have emerged under past conditions. The 
conditions under which a given norm formed 
may or may not still prevail. To the extent 
that the norm being implanted in the new 
member does not represent the norm that 
might emerge currently without the influence 
of past conditions, the norm implanted in the 
new member will be arbitrary. The degree of 
arbitrariness, then, is determined by the dif- 
ference between the conditions under which 
the norm originally formed and the conditions 
(other than the norm’s past form) that pre- 
vail at the present time. As Sherif and Sherif 
(1969) have stated: 


One important variable affecting the transmission 
of norms and the extent of conformity to them 
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by new members is the degree of their arbitrariness 

relative to current conditions facing the group. 

When a norm is transmitted by older members 

to a new generation of members, its “arbitrariness” 

can be defined relative to the conditions in which 
the new generation functions. The norm develop- 
ing in those conditions without an enculturation 
process by older members can be termed least 

arbitrary or “natural.” (p. 215) 

A relevant example of persistence and 
change is observed in the following account 
of enlisted men in the United States Army 
who, prior to World War IJ, followed an old 
custom when speaking to an officer: They 
used the article the and the third person 
singular of the verb (e.g., “Does the lieuten- 
ant want his coffee now?”). Initially, the “old 
soldiers” imposed this custom on the new 
selective service “civilian soldiers” with some 
success. By the time the war had been in 
progress a year, however, only a few “old 
timers” continued to use this form of address. 
As an Army norm the custom had disap- 
peared. Lack of a general societal norm of 
language honorifics, increased possibilities of 
moving from enlisted to officer status, and the 
great influx of nonprofessional soldiers during 
the war were all conditions that made this 
norm seem highly arbitrary. Consequently, 
it did not persist. 

An example of an arbitrary norm that per- 
sists despite the efforts of some to change it 
and the considerable awkwardness in its 
usage is the prescribed spelling of English. 
This normative system was developed mainly 
through the attempts of Norman-French 
scribes to transcribe English words (sounds) 
using French orthographic conventions. Pre- 
conquest spellings, the work of Anglo-Saxon 
scribes unburdened by continental spelling 
rules, are far simpler and more straight- 
forward. The principal use of the dictionary 
is to check on the spelling rather than the 
meaning of words. When we spell phonet- 
ically, the form of the words is quite different 
from “normal” usage. It is evident that our 
present system of spelling involves rules, that 
is norms, that are highly arbitrary, yet they 
persist. The spelling of English words would 
be vastly different if this normative system, 
introduced through historical accident, could 
be removed and the spelling rules allowed to 
develop under current conditions. 
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The above examples, and all instances of 
norm change in actual groups, embody com- 
plicated and multifaceted motivational com- 
ponents and established reciprocities, both in 
terms of organizational structure and norma- 
tive orientations. The motivational compo- 
nents involved are significant to the members 
of the cultures and groups concerned. The 
great distance of our present starting point 
from the eventual goal of moving experimen- 
tal analysis to these more complicated areas 
of persistence and change of norms is clear. 
In the present study the experimental norms 
were established without the motivational 
basis of norms formed in actual groups. The 
subject’s motivation was his desire to do well 
in the presence of other individuals like him- 
self and to resolve the uncertainties he felt 
in the situation. 

In a study by Jacobs and Campbell (1961) 
plants (i.e., experimenters’ collaborators posing 
as subjects) were employed to establish an 
extreme cultural norm in a situation involving 
the judgment of autokinetic movement. The 
adoption and survival of the arbitrary norm 
were then studied as the plants, one by one, 
were taken out of the group and naive mem- 
bers were introduced, in turn, in their place, 
The naive members then unwittingly became 
the further transmitters of the norm to still 
newer entrants. 

Jacobs and Campbell employed six condi- 
tions. Two of these conditions were controls, 
with either one naive subject alone or three 
naive subjects together forming a natural 
norm, or baseline, for comparison with arbi- 
trary norms. The remaining four conditions 
were experimental, and they differed primarily 
in terms of the size of each generation (from 
two to four) and the initial number of plants 
(from one to three). In general, it was found 
that under all experimental conditions, there 
was a sharp decay toward the natural norm. 

Of particular interest is Jacobs and Camp- 
bell’s (1961, p. 650) experimental condition 
“X_4-3” in which they had four-member 
generations with three plants in the initial 
generation. The plants presented an arbitrary 
norm of 15-16 inches (38.10-40.64 cm) 
under conditions in which the average judg- 
ment for the natural norm was approximately 
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4 inches (10.16 cm). This was the condition 
in which Jacobs and Campbell had the 
highest plant-to-naive ratio and the condition 
in which plants were in the group over the 
greatest number of generations. Since only 
one plant was removed at the end of each 
generation, it was not until the fourth genera- 
tion, under this condition, that the four-mem- 
ber group was made up entirely of naive sub- 
jects; that is, subjects were exposed to the 
prescribed arbitrary norm over more genera- 
tions and, consequently, a greater number of 
interactions than under the other conditions 
in the Jacobs and Campbell study. Although 
Jacobs and Campbell (1961) expected that 
“the strongest conditions might produce an 
arbitrary culture which would persist without 
apparent diminution” (p. 654), they reported 
that the number of plants had little clear-cut 
effect. 

When the relative structure, that is, the 
degree of ambiguity, in the different aspects 
of the total judgment situation are examined, 
three major areas of consideration become 
apparent: (a) the structure of the physical 
stimulus, (b) the structure in the social situ- 
ation, and (c) the structure in the norm. 

The first to consider is the degree of struc- 
ture in the physical stimulus. Asch’s (1951) 
judgment situation, in which subjects com- 
pared the lengths of lines against a standard, 
did possess a relatively high degree of struc- 
ture; that is, the subjects could make valid 
judgments and could match the lines cor- 
rectly. Asch’s subjects did make a high pro- 
portion of correct judgments when they were 
not under social pressure. Further evidence 
to this point is that despite the uniformity 
of the plants’ judgments in the Asch study, 
only approximately 37% of the judgments re- 
vealed the impact of social influence (Asch, 
1951). Looking at the whole situation in 

which norms may form in relation to a phys- 
ical stimulus, when the structure of the phys- 
ical stimulus is relatively high, the potential 
effect of social influence is proportionately low. 

The autokinetic judgment situation pro- 
vides a physical stimulus that is quite low in 
its degree of structure. In fact, to determine 
the nature of the stimulus in terms of the 
range of judgments, a consensual base must 
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be determined as is the case with psycho- 
social scales, because there is no physical 
reality. The only “reality” is the subjectively 
based reality of the individuals’ span of 
judgments, which does establish certain limits. 
The light moves only subjectively; there is 
no physical movement. The ease with which 
social influence shifts subjects away from the 
natural range of judgments is further evi- 
dence of the relative lack of structure of 
the autokinetic stimulus situation. When the 
physical stimulus lacks structure, one con- 
fident plant would be expected to be able to 
cause distinctive, if only temporary, shifts 
in the perception and judgments of naive 
subjects, and such is the case with auto- 
kinesis (Goldberg, 1954; Kidd, 1958; Pace 
& MacNeil, 1974). 

Why, then, did the arbitrary norm not 
form and persist, as prescribed even under 
the conditions of greatest social influence in 
the Jacobs and Campbell (1961) study? A 
closer examination of the location and struc- 
ture in the situation is revealing. If the degree 
of structure of the physical stimulus is low 
and the structure with regard to social influ- 
ence is high, and there is still a failure of 
the social norm to form at a prescribed de- 
gree of arbitrariness and to persist, the prob- 
lem must lie in the structure of the arbitrary 
norm itself, its reasonableness under the 
conditions, its credibility. 

The nature of a norm involves more than 
simply its central tendency. Most social 
norms include a latitude, a range of accept- 
able percepts or behaviors and are seldom 
an all-or-nothing punctate (Sherif & Sherif, 
1969, pp. 192-196). With regard to auto- 
kinetic judgment, there is invariably a range 
of judgments around a model tendency. When 
the central tendency of the judgments made 
in a particular autokinetic situation falls at 
4 inches (10.16 cm; approximately what 
Jacobs and Campbell, 1961, reported for 
their “natural” condition), the range of the 
norm is approximately 7 inches (17.78 cm); 
that is, the latitude of the norm is bounded 
by judgments of 1 inch and 7 inches (2.54 
and 17.78 cm), Only approximately 2.5% of 
the judgments made fall beyond these bounds, 
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In view of the concept of a norm as a 
reference scale, the norm that Jacobs and 
Campbell prescribed, as given by their plants 
at 15-16 inches (38.10-40.64 cm), is not 
only highly arbitrary in terms of its locus, 
but it appears to be extremely unrealistic in 
terms of its latitude. It is small wonder that 
upon removal of the plants from the group, 
this norm neither effectively transferred to 
the naive subjects nor persisted over genera- 
tions of membership change. The inherent 
structure of the prescribed norm, per se, was 
more than sufficient to overcome the ambi- 
guity inherent in the autokinetic situation. 

Further, Jacobs and Campbell’s 8-foot 
(2.4-m) subject-stimulus light distance addi- 
tionally constricted the lability of apparent 
movement. The subject-stimulus light dis- 
tance in the present study was 16 feet 
(4.8 m). The effects of various stimulus fac- 
tors in the autokinetic situation are revealed 
in the studies of Sherif and Harvey (1952) 
and Hood and Sherif (1962). Among the fac- 
tors involved are the duration of the stimulus 
light presentations, the subject-stimulus light 
distance, the intensity of the stimulus light, 
the subject’s subjective impression of the size 
of the laboratory area, and so on. 

The process of reversion to the natural 
norm through new generations offered by 
Jacobs and Campbell (1961) is a plausible 
one: 


The deterioration of the cultural norm occurs, in 
part, because each naive respondent makes judg- 
ments lower, somewhat closer to the natural norm, 
than do confederates or elder citizens serving as 
his mentors (p. 655). 


This explanation certainly sounds reason- 
able and is undoubtedly the case when there 
are factors such as the natural norm in the 
autokinetic situation or factors that are per- 
ceived by members of new generations as 
reasons for change in real social units. When 
we observe the persistence of apparently arbi- 
trary norms in our own and other cultures, 
we are reminded that the real-life condition 
resembling the “natural norm” of autokinesis 
frequently does not exert sufficient influence 
to change the norm. Norms often remain in 
arbitrary states over many generations. The 
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stimuli to which social norms pertain are, of 
course, often highly unstructured. The refer- 
ents of some social norms must be highly 
unstructured for such norms to persist at the 
highly arbitrary levels at which we find them. 
The experimental situation in which judg- 
ments of autokinesis provide social norm pro- 
totypes, as used by Sherif (1935) and others, 
can offer a set of conditions analogous to real 
life (Hood & Sherif, 1962; Sherif, 1935; 
Sherif & Harvey, 1952; Walter, 1955; 
Whittaker, 1958). 

An extension of the present line of research 
dealing with the fate of arbitrary norms, or 
traditions, is relevant. Varying two aspects of 
arbitrariness in an experimental situation in- 
volving a game, Weick and Gilfillan (1971), 
by manipulating the rules of procedure and 
the difficulty of playing according to those 
rules, extended the Jacobs and Campbell 
paradigm to a less structured and relatively 
more natural interpersonal interaction format. 
Male subjects were placed in social units of 
three members each. They were prohibited 
from communicating with each other in the 
course of determining what number each sub- 
ject would present in order that their indi- 
vidually presented numbers (limited to digits 
between O and 10) would add up to match 
a target number, which was given by the 
experimenter. The procedure used for accom- 
plishing this, the goal of the game, consti- 
tuted a procedural norm for the social unit, 
Any single procedural norm was considered 
“arbitrary,” since there were at least 25 pro- 
cedures that could be used. However, some 
procedures were easier to follow than others. 
The difficult procedures involved rules that 
were more complicated and required a 
greater degree of formalization than the easy 
procedures. 

Weick and Gilfillan (1971) assigned either 
easy or difficult procedures to the three sub- 
jects making up the initial generation of each 
experimental social unit. Each new member 
added to the social unit was given instruc- 
tions by the experimenters as to the specific 
procedures for the unit, and an “old” subject 
was removed as the new member joined the 
unit. This procedure was followed to generate 
11 generations of subjects playing the game 
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according to the normative procedures for 
their own social unit. 

In the Weick and Gilfillan (1971) study, 
it was found that arbitrary procedures that 
were difficult and complicated were changed 
to easier procedures by the fourth generation. 
Arbitrarily assigned procedures that were easy 
were adhered to throughout the experiment, 
although a number of equally easy procedures 
were available. 


Hypotheses 


On the basis of cultural evidence and 
experimental findings, hypotheses were ad- 
vanced concerning norm persistence and 
change in judgments of the extent of auto- 
kinetic movement. 

In terms of both their latitude and focus, 
norms will persist for shorter or longer dura- 
tions, depending on the degree of arbitrari- 
ness under which they are established. Specif- 
ically, the following five predictions were 
made: 

1. A norm formed in the autokinetic situ- 
ation without experimental enculturation (a 
natural norm) will be transmitted to succeed- 
ing generations and will persist with minor 
variations. 

2. When a majority of individuals in an 
experimental group give judgments within a 
prescribed range and around a common mode 
(experimental enculturation), new partici- 
pants will adopt their norm, given that the 
enculturation norm is not extremely arbitrary 
for the defined conditions of norm formation. 

3. When the cultural agents (planted sub- 
jects) are removed one at a time in successive 
generations, the extent of transmission of the 
enculturated norm to the just-succeeding 

generation will vary inversely with the degree 
of arbitrariness of the enculturated norm, 

4. As individual participants change over 
successive generations, there will be a general 
shift from the more arbitrary enculturated 
norms toward the least arbitrary (natural) 
range and mode. 

5. The persistence of the enculturated 
norm over succeeding generations with chang- 
ing participants will vary inversely with the 
degree of its arbitrariness; conversely, the 
change from the enculturated norm will vary 
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positively with the degree of arbitrariness of 
the enculturated norm. 


METHOD 


Testing these hypotheses required the establish- 
ment of norms under different degrees of arbitrari- 
ness and the study of their persistence and change 
through successive subject generations, holding other 
conditions constant. 

The study was designed to test the hypotheses in 
the autokinetic situation, varying the degree of arbi- 
trariness of the norm to which the individuals were 
exposed (enculturation norm). The implantation, or 
enculturation, process occurred during the inter- 
action of four subjects who gave vocal estimates of 
the distance of apparent movement. In a given 
session, following each series of 30 judgments, that 
is, each generation, the participants were successively 
changed by the removal of one experienced subject 
and the introduction of one naive subject (Jacobs 
& Campbell, 1961). 

The autokinetic equipment employed is described 
in Sherif and Harvey (1952; pp. 284-285) and Hood 
and Sherif (1962). The experimental room was 
14 X 35 feet (4.2 X 10.5 m) with subject-light dis- 
tance set at 16 feet (4.8 m). Exposure time of the 
light was 3 sec, and provision was made for four 
subjects. In the course of each generation, 30 judg- 
ments were voiced aloud by each of the four sub- 
jects. At the end of each generation, there was a 
5-minute rest period under extremely dim red 
illumination. During the rest period, the old sub- 
ject, or plant, was removed and the new subject 
was introduced. 

The least arbitrary, or natural, norm (Condition 1) 
formed during the interaction of naive subjects 
throughout successive generations. Thus, the encultu- 
ration norm was produced by the first generation of 
naive subjects. In the first generations, under Condi- 
tions 2 and 3, there were three “planted” subjects, 
who gave arbitrary estimates within a range and 
around a mode prescribed by the experimenter, and 
one naive subject. The plants were removed succes- 
sively during the first three generations, a naive 
subject being added each time. All subjects were 
naive by the fourth generation. 

In Condition 2 (moderately arbitrary), the planted 
subjects gave estimates within the range of 9-15 
inches (22.86-38.10 cm) around a mode of 12 inches 
(30.48 cm). In Condition 3 (most arbitrary) the 
prescribed norm range was 15-21 inches (38.10- 
53.34 cm), and the mode was 18 inches (45.72 cm). 
In both of these conditions, in the course of each 
generation, each planted and naive subject voiced 30 
judgments of the extent of movement. The pre- 
scribed judgments given by the planted subjects were 
given in an apparently random order; the actual 
prescribed distributions resembled a normal curve 
with the focus of judgments at 12 and 18 inches 
(30.48 and 45.72 cm), respectively. 

In Condition 1 (natural norm), the process of 
formation and transmission of the natural norm was 
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studied through eight generations of subjects. In 
Conditions 2 and 3, the enculturation process lasted 
for three generations, and the transmission by naive 
subjects continued for another eight generations. 
Two replications of each condition were conducted. 

Subjects and “plants” were male, high school stu- 
dents between 16 and 19 years of age. Control re- 
quirements were that no subject know, or have an 
established social relation with, any other subject 
in the experimental situation, Subjects were told that 
the purpose of the experiment was to find out how 
accurately people could estimate the distance a light 
moved in the dark. The need for this information 
by military and space programs was stated. 


Resuts 


The concept of norm, defined as a latitude 
of behavior within which exists a focus of 
behavior, requires that a measure of conform- 
ity and a measure of typicality be used to 
fully represent the persistence and change of 
norms as they are transmitted across succeed- 
ing generations. Analysis of the changes of 
the norms that occurred, predictably, as a 
function of the degree of arbitrariness of the 
original norm, was made using both conform- 
ity and central locus measures. Conformity to 
the prescribed latitude under each condition, 
that is, the degree of arbitrariness, was defined 
as the proportion of judgments made by the 
naive (new) member of each generation that 
fell within the originally established normative 
range. For the moderately and most arbitrary 
norms (Conditions 2 and 3, respectively), 
latitudes were established during encultura- 
tion, with the plants giving judgments, from 
9-15 inches (22.86-38.10 cm; Condition 2) 
and 15-21 inches (22.86-53.34 cm; Condition 
3J: 


TABLE 1 


Norms FORMED AND TRANSMITTED THROUGH EIGHT 
GENERATIONS IN N CONDITION 1 ee 


Gener- Range: 25th-75th 

ation pereaujes Mdn M 
1 48 (10. 16-20.32) 6.0 (15.24) 6.0 (15.24) 
2 2-7 (5.08-17.78) 5:5 (13.97) 5.2 ES 
3 3-5 (7 k 4.0 (10.16) 3.8 (9.65) 
4 2-6(5. 4.0 (10.16) 4.1 (10.41) 
5 3-6 (7.6: 4.5 (11.43) 4.2 (10,67) 
6 3-6 (7, 4.5 (11.43) 5.0 (12.70) 
7 3-7 (7.62- 17. 78) 4.5 (11,43) S51 re 
8 3-6 (7,62-15.24) 4.5 (11.43) 4.6 (11.68) 


Note, In each generation there were four naive subjecta 
making 30 judgments, with two replications. 

a All judgments we: n to the nearest inch. Numbers in 
parentheses give metric conversions, 


The natural, or least arbitrary, norm (Con- 
dition 1), which emerged with a range of 
2-8 inches (5.08-20,32 cm), matched in lati- 
tude the range of the moderate and most ar- 
bitrary conditions, that is, a width of 7 inches 
(17.78 cm; see Table 1). Since subjects were 
randomly assigned to both the conditions of 
arbitrariness, and to the generations within 
each condition, a completely randomized 
factorial analysis of variance was applied 
to the proportions following transformation 
to arc sine radians (Kirk, 1968; see Table 2). 

Norm persistence and change, as reflected 
by the central tendencies of the norms that 
emerged under the differing conditions of ar- 
bitrariness, were assessed by examining the 
medians of the judgments of all four of the 
subjects in each generation. Since a social 
norm in its entirety cons of the judg- 
mental range, which of necessity includes the 
foci, of all members of a social unit, it is this 
norm in its entirety to which the new member 


TABLE 2 


ANALYSES OF VARIANCE Across CONDITIONS: PROPORTION OF JUDGMENTS WITHIN THE ORIGINAL Norm by Eac 


OF Au L Susy 


Medians 
Source af MS ET: MS F 
Levels of arbitrariness (A) 2 17.435 53.577944 115.842 65.409*** 
Transmission generations (B) 7 873 2.683* 19.332 10.916*** 
AXB 14 438 1.345 5.918 3.342"* 
Within-cell error 24 325 1771 

a Transformed to are sine radians. a 

*p <05. 

wD < 005, 


+p < 0001. 
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is exposed. The courses of the norms across 
generations were then studied in terms of the 
central tendencies of the norms by a com- 
pletely randomized factorial analysis of vari- 
ance, using the medians of judgments as the 
basic data representing the foci of judgments 
for each generation. The factorial analysis 
was the most conservative assessment of ef- 
fects applicable, since the design required 
taking into account the complex pattern of 
the interrelationships of the subjects. 

Hypothesis 1. As predicted in the first hy- 
pothesis, the norm formed under the least 
arbitrary, or natural, condition (Condition 
1), with no planted subjects in the situation, 
Was transmitted to succeeding generations of 
subjects with minor variations. In terms of 
conformity to the latitude of the norm, the 
proportions of judgments falling within the 
original range of the natural norm (in are 
sine radians) showed the least variability 
across generations of the three conditions, that 
is, levels of arbitrariness as evidenced by the 
simple main effect test, F(7, 24) = 523, ns 
(see Table 2). 

In terms of the focus of judgments, repre- 
sented by the average of the medians of judg- 
ments of all four subjects for each genera- 
tion (see Table 3), Condition 1 was the least 
variable of the three conditions of arbitrari- 
ness across generations, with a simple main 
effect test, F(7, 24) = .848, ns (see Table 2). 
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In terms of both the transmission of the 
focus of the norm and the adoption of the 
range of the norm by the new subjects in each 
generation, it is evident that the least varia- 
bility of the three levels of arbitrariness across 
transmission generations was found in Condi- 
tion 1, the naturally established norm. 

The natural norm in this study was formed 
without the imposition of a prescribed arbi- 
trary norm and was transmitted through suc- 
cessive generations. It provides the baseline to 
evaluate the persistence and change of norms 
produced under conditions consisting of the 
imposition of arbitrary norms. 

Hypothesis 2. Figure 1 and Table 3 present 
the percentage of judgments within the pre- 
scribed ranges for Conditions 2 and 3 , through 
three enculturation and eight transmission 
generations. The second hypothesis, that un- 
der the more arbitrary Conditions 2 and 34 
the naive participants in the enculturation 
generations would adopt the range and mode 
used by the planted subjects, was supported, 
the extent of this conformity being relatively 
greater in Condition 2 than in the most arbi- 
trary Condition 3. In Condition 3 (most arbi- 
trary), over 90% of the naive subjects’ judg- 
ments were within the prescribed range during 
the first two enculturation generations and 
79% in the last when only one planted sub- 
ject remained. In Condition 2 there was al- 
most complete conformity to the moderately 


CONDITION 2 


(Moderately Arbitrary) 


CONDITION 3 
(Most Arbitrary) 
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Ficure 1. Conformity to enculturation norms in the autokinetic situation. (Prescribed norm for 


Condition 2, moderately arbitrary, is 9-15 inches, 
tion 3, most arbitrary, is 15-21 inches, 


or 22.86-38.10 cm. Prescribed norm for Condi- 


or 38.10-53.34 cm.) 
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TABLE 3 
Means, MEDIANS, AND PERCENTAGE OF JUDGMENTS FOR ALL SUBJECTS AND FOR Bacu Naive SUBJECT 
WrrHiN P] RIBED RANGE BY GENERATION FOR CONDITIONS 
Condition 28 F 
within within within 
Generation M Mdn all Ss M Mdn all Ss naive S 
Enculturation 
Ey 11.8 Gan ) 98 99 17.4 17.0 95 95 
(29.98) 29.72 (44.20) (43.18) 
a (028.40) 048) ees Ge Ate Geib 
Es 11.8 12.0 7 95 16.9 16.5 139 69 
(29.98) (38.48) (42.93) (42.91) 
Transmission 
Ti 11.2 11.2 91 93** 15.9 16.5 63 gys 
T Gye WS s s ag EE, 
3 i .5 S: 2.5 
3 (26.92) Q66 (33.27) G15) 
T; 10.5 10.2 74 60** 10.6 10,5 18 159% 
A CE D a os A AE E 
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Note. For both conditions, » = 2 per cell. Numbers in parentheses are centimeters, 
a 9-15 inches (22.86-38.10 cm) mode = 12 inches (30.48 cm), 
b 15-21 inches (38.10-53.34 cm), mode = 18 inches (45.72 cm). 
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‘ arbitrary prescribed range throughout the en- 
culturation generations. 

Figure 2 and Table 3 present the medians 
of naive subject judgments, by generation, for 
enculturation and transmission generations. 
This measure of norm typicality confirms the 
evidence of conformity to prescribed norm 
latitude by indicating that subjects in Condi- 
tion 2 distributed their judgments around 
medians that were very close to the pre- 
scribed mode of 12 inches (30.48 cm) during 
the enculturation phase. Naive subjects in 
the enculturation phase of Condition 3 (most 
arbitrary) centered their judgments within an 
inch of the prescribed mode of 18 inches 
(45.72 cm) in the first two generations, and 
within an inch and a half in the third gen- 
eration. Such variation is well within that 
found for the least arbitrary, or natural, con- 
dition. 


Hypothesis 3. Following enculturation, 
after all planted subjects had been removed, 
the extent to which the prescribed norm was 
transmitted to the just-succeeding generation 
varied, as predicted, with the degree of arbi- 
trariness of the enculturation norm. In Condi- 
tion 2, approximately 93.5% of the judgments 
of the naive subjects in the first transmission 
generation fell within the prescribed range as 
compared with approximately 55% in the 
most arbitrary Condition 3. This difference is 
statistically significant, (24) = 1.8748, p < 
.05. The results show that enculturation had 
taken place and that the initial transmission 
occurred to an extent depending on the arbi- 
trariness of the prescribed norm, Thus, to this 
effect, Hypothesis 3 is supported. 

Hypothesis 4. Hypothesis 4 predicted a 
general shift from the norms initially estab- 
lished (enculturated) in Conditions 2 and 3 


F 
a 


MEDIANS OF GROUP JUDGMENTS, INCHES 
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El E2 E3 TI Mies 


CONDITION 3 
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TRANSMISSION 
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CONDITION 2 
(Moderately Arbitrary ) 


CONDITION 1 
(Natural ) 


GENERATIONS 
TISTA T5 T6 T? T8 


Ficure 2. Averages of judgment medians of replications. (There is no prescribed norm for 
Condition 1, natural. The prescribed norm for Condition 2, moderately arbitrary, is 9-15 inches, 
or 22.86-38.10 cm, with a mode of 12 inches, or 30.48 cm. The prescribed norm for Condition 3, 
most arbitrary, is 15-21 inches, or 38.10-53.34 cm, with a mode of 18 inches, or 45.72 cm.) 


toward the least arbitrary, or natural, range 
and mode over successive generations. The 
shift of percentages within the prescribed 
ranges, as shown in Figure 1 and Table 3, as 
well as the medians shown in Figure 2 and 
Tables 2 and 3, strongly support this hypothe- 
sis. Differences among the medians by simple 
main effect tests under all three conditions of 
arbitrariness are significant up until Trans- 
mission Generation 8, F(2,24) = 2.541, p > 
10, ns. 

Hypothesis 5. The distinctive prediction of 
the present research was that persistence of 
an enculturated norm over succeeding gen- 
erations, with changing participants, would 
vary inversely with the degree of arbitrari- 
ness of the prescribed norm. Conversely, the 
trend of change toward the natural norm 
would vary directly with the arbitrariness of 
the enculturated norm. This prediction was 
supported. 

A comparison of the shifts of the means of 
the judgments made under Conditions 2 and 
3 (moderate and most arbitrary) by Trans- 
mission Generations 1-8 indicates a more 
rapid change toward the natural norm under 


the most arbitrary Condition 3 than under 
the moderately arbitrary Condition 2 (see 
Figure 2 and Table 3). Until Transmission 
Generation 4, when under the most arbitrary 
condition the focus of the norm leveled out, 
the rate of change under the most arbitrary 
condition is conspicuously greater. By Trans- 
mission Generation 3, 56.4% of the total 
change in the mean that occurred in Condi- 
tion 3 (most arbitrary) had already taken 
place. By Transmission Generation 4 in Con- 
dition 3, 81.9% of the downward change in 
the mean had occurred. In the moderately 
arbitrary Condition 2, only a third of the 
total change occurring in that condition had 
been accomplished by the fourth generation. 
It was not until the sixth generation in Condi- 
tion 2 that as much as 50% of the total shift 
occurred. By the sixth generation, 77% of the 
change occurring in the means throughout 
Condition 2 was completed. From Table 3 
and Figure 2, it can be seen that the median 
of judgments was higher in Condition 2 from 
the fourth generation onward than in the 
most arbitrary condition (Condition 3). 
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In terms of conformity, the effect of the 
degree of arbitrariness on the persistence and 
change of the norms is equally apparent. Fig- 
ure 1 and Table 3 indicate that by Trans- 
mission Generation 4, the norm under the 
most arbitrary condition (Condition 3) has 
only 5% of the judgments within the pre- 
scribed norm and continues to contain less 
than 3% of such judgments through Trans- 
mission Generations 5-8. Under the mod- 
erately arbitrary condition (Condition 2), 
the norm still has over 719% of the judgments 
within the prescribed range during the fourth 
generation, and even in the eighth generation 
it has above 26% within the original arbi- 
trary range. The greater persistence of the 
prescribed norm in Condition 2 is also ap- 
parent in Table 3 and Figure 1, which show 
the extent of conformity to the prescribed 
ranges. 

Newman-Keuls tests of the differences be- 
tween means of the transformed proportions 
for the different conditions of arbitrariness, 
across generations, were computed, The mean 
of the transformed proportions for Condition 
3 (most arbitrary) was .524, for Condition 2 
(moderately arbitrary) it was 1.752, and for 
Condition 1 (least arbitrary-natural) it was 
2.60. Comparing Condition 1 with Condition 
3, the observed (3,24) = 5.147, p< 01. 
Comparing Condition 2 with Condition 3, the 
observed g(2, 24) = 3.044, p< .05. The im- 
plication of the order of the means of the dif- 
ferent levels of arbitrariness supports the pre- 
diction that while enculturation was effective, 
the total amount of judgments within the 
range, hence the differential rate of change, 
was ordered from the greatest persistence in 
Condition 1 to the least persistence (greatest 
change) in Condition 3. 


Discussion 


An understanding of the procedures fol- 
lowed by Jacobs and Campbell (1961) and 
the critical modifications in the nature of the 
prescribed norms made for this study is es- 
sential for understanding the differences in 
the outcome of each. Jacobs and Campbell 
presented an ingeniously elegant experimental 
design, which although lacking the highly 
confounding inputs found in norm formation 


during the normal socializing process, does 
provide a straightforward and accurate lab- 
oratory analogy to the formation and change 
of norms over the membership generations 
found in social units such as families and 
groups. 

Jacobs and Campbell used planted subjects 
to inculcate a norm. The latitude of their 
prescribed norm, 15-16 inches (38.10-40.64 
cm) was, however, excessively arbitrary un- 
der conditions in which a mode of approxi- 
mately 4 inches (10,16 cm) was natural. When 
this arbitrariness of latitude was combined 
with the arbitrariness of a prescribed mode at 
approximately 15 inches (about 38 cm), an 
unrealistic level of extreme arbitrariness re- 
sulted. In terms of published data on norm 
formation in the autokinetic situation, and 
particularly in the small laboratory room 
(8-foot, or 2.4-m, subject-stimulus distance) 
used by Jacobs and Campbell, this prescribed 
range and mode was so extremely arbitrary 
for the conditions, that the subjects neither 
accepted fully nor retained the norm, Analo- 
gies in the cultural area are difficult, since 
almost all cultural norms permit some indi- 
vidual variations, whereas 1 inch in the range 
of autokinetic judgments amounts almost to 
no variation psychologically. The results, not 
surprisingly, were-a lack of compliance to the 
prescribed norm, a lack of persistence of the 
somewhat lower norm that did form, and a 
rapid collapse of that norm, 

The results, as reported by Jacobs and 
Campbell (1961), suggest that their initial 
enculturation attempt was not highly effec- 
tive. Therefore, the predictions in the present 
study concerned the formation of the norm 
during the enculturation process as well as 
the effect of arbitrariness on the norm as it 
persisted and changed. 

It was predicted and found, in both the 
Jacobs and Campbell study and our experi- 
ment, that the norm formed under the least 
arbitrary, that is, the natural, condition was 
transmitted to succeeding generations of sub- 
jects with minor variations and with an in- 
creasingly greater degree of stabilization. 
Under the conditions of greater arbitrariness 
(our Conditions 2 and 3), the naive partici- 
pants in the enculturation generations did 


adopt the range and mode presented by the 
planted subjects, the extent of their conform- 
ity being relatively greater in the moderately 
arbitrary condition (Condition 2) than in 
the most arbitrary condition (Condition 3). 

Following enculturation, when all planted 
subjects were removed, the extent to which 
the prescribed norm was transmitted to the 
just-succeeding generation of naive subjects 
varied inversely with the degree of the arbi- 
trariness of the enculturation norm. Thus, 
given the fact that Jacobs and Campbell’s 
condition represented a greater degree of ar- 
bitrariness (because of its very constricted 
latitude) than our most arbitrary condition 
(Condition 3), the results in both experiments 
are predictable by the following proposition: 
The degree of enculturation of an experi- 
mentally introduced norm (of low degree 
of self-involvement), and its transmission 
through succeeding generations, is an inverse 
function of the degree of its arbitrariness. 

It is noteworthy that using an altogether 
different, but conceptually equivalent setup, 
Weick and Gilfillan (1971) confirmed the 
validity of the proposition. The task of the 
subjects in their experiment was more con- 
crete and, perhaps, more challenging than just 
estimating the distance a point of light was 
perceived to be moving in complete darkness. 

In social life, in the actuality of normative 
and functional processes of human groups, 
the norms are not merely transitory events in 
time. Their rise, change, persistence or decay 
over time, and transmission through genera- 
tions, are understandable only when the con- 
ditions of their formation and the course of 
their natural history, under various circum- 
stances, are included in the account. There- 
fore, for the following reasons it can be de- 
servedly acclaimed that Newcomb’s Benning- 
ton research (Newcomb, Koenig, Flacks, & 
Warwick, 1967) will survive the test of time 
as a significant landmark in social psychol- 

ogy: (a) Newcomb specified the conditions of 
interaction (in the newly founded, New Deal 
permeated Bennington college community) 
that brought about changes in the norms and 
values of the participants; (b) he followed 
the fate of these changes—or the resistance 
to them—in the participants, over time, 
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through the senior year of the college gen- 
eration; (c) going far beyond that, Newcomb 
and his associates assessed the persistence— 
or change—of these norms oyer a 20-year 
period of time. 

If the social psychologist is to broaden his 
perspective (as he should) in the study of 
the internalization of norms and their fate 
over years and generations under various cir- 
cumstances, he must either turn to the his- 
torian’s works or become a historian himself, 
Under the constraining conditions of the lab- 
oratory, the internalization of norms and their 
persistence and change can be studied only 
as microsimulations of the actualities of the 
problem area. In a real sense, every labora- 
tory experiment is (and should be) a simula- 
tion. But simulation is a most difficult under- 
taking; microsimulation of situations has to 
embody in miniature the essential properties 
of the actualities of the problem studied. A 
simulation study is not simply a “fancy” 
technique, although uncritically it may be 
treated as such; it has degenerated to that 
level, in various quarters in recent decades, 
Lack of concern with the issue of the iso- 
morphism between the technique used in 
simulation and the actualities of a problem 
area is nothing less than scientific irresponsi- 
bility, no matter how impressive the results of 
such nonchalance may appear. 

The only justification for our own and 
similar laboratory situations in the study of 
norms is their essential, common structural 
characteristics, which are as follows: 


1. A fluid, unstructured stimulus situation 
exists that allows for various alternatives 
(within limits) for the distribution of judg- 
ments. 

2. This unstructured stimulus situation re- 
sults in the subjects’ distributing their judg- 
ments without experimental prescription 
(Condition 1) into a normative latitude of 
their own within the limitations set by condi- 
tions of the setup (distance, intensity, dura- 
tion of the light, etc.). During this process, 
they (consciously or unconsciously) influence 
each other with respect to the task at hand. 
The erratic distribution of responses is dis- 
comfiting to the participants, while stabiliza- 
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tion is settling for them. Thus, in this highly 
ambiguous situation, where there are no sure 
external cues (reference points) to facilitate 
confident judgments, participants rely more 
on each others’ judgments. This dependency 
on social cues increases proportionately to 
the degree of unstructuredness (uncertainty) 
in the situation. 

3. With the experimental prescription, the 
subjects can be psychologically pulled to the 
prescribed latitude during the actual applica- 
tion of influence (enculturation sessions), 
and the influence persists for some genera- 
tions if the prescription (as in Condition 2) 
is not too much removed from what they by 
themselves would settle for under this circum- 
stance. However, if the external prescription 
is too far off (as in Condition 3), subjects do 
not completely accept the norm as it is pre- 
sented, even during the application of influ- 
ence, and the effect of this influence is transi- 
tory, as evidenced by the more rapid collapse 
of the prescribed arbitrary norm in the course 
of the transmission sessions. 


It cannot be argued that the task in our 
experiment even approximates the self-in- 
volvement of the subjects, as might the issues 
of politics or social reforms, which may be 
studied in field research of attitude change 
and persistence. In self-involving issues (like 
politics or religion), attitudes tend to persist 
for long periods and may persist through 
generations, even when they become mal- 
adaptive relative to changing objective condi- 
tions. This was illustrated earlier in the arti- 
cle. More striking illustrations can be readily 
selected from political, religious, and social 
areas. An area of research concentrating on 
the problem of persistence of attitudes and 
myths that have become dysfunctional—mal- 
adaptive in terms of prevailing conditions at 
present—may well be one of the needed de- 
velopments in social psychology. We consider 
the line of research represented by Jacobs and 
Campbell (1961), Weick and Gilfillan (1971), 
and our studies as a baseline start along this 
area. The fruitfulness of this line of research 
can be established only by experimentally 
demonstrating that it can be extended to 
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handle situations (tasks) that are consider- 
ably self-involving for the subjects. If this 
difficult feat can be accomplished, we will be 
gaining the needed tie-in between the unob- 
trusive type of laboratory rigor and the close- 
to-life realism of field studies (such as repre- 
sented by observation, interview, and attitude 
validation through assessment of actual be- 
havior). 
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Effects of Noise and Perceived Control on Ongoing and 


Subsequent Aggressive Behavior 


Edward Donnerstein and David W. Wilson 
Iowa State University 


Two experiments examined the effects of high-intensity (95 dB [A]) noise 
on ongoing and postnoise aggressive behavior. In Experiment 1, subjects 
were angered or treated in a neutral manner and given an opportunity to 
aggress against another subject while being exposed to high-intensity (95 dB) 
or low-intensity (55 dB) noise. Results indicated that high-intensity noise 
facilitated aggression for previously angered individuals. Experiment 2 ex- 
amined postnoise aggression in which subjects completed a math task under 
high-intensity noise with or without perceived control over the noise. In 
comparison to a no-noise control, it was found that angered subjects with 
no control revealed an increase in aggression, whereas perceived-control subjects 
were no different from no-noise subjects. Results are discussed in terms of 
the recent Glass and Singer work on noise and task performance and the 


effects of perceived control on mediating the effects of stressful conditions. 


Recently there has been increasing concern 
with the effects of various environmental 
stimuli (e.g., noise, pollution, overcrowding) 
on the quality of human life. While much of 
this research has focused on health- or task- 
performance-related problems, recent studies 
have begun to deal with interpersonal be- 
haviors such as aggression (Baron & Bell, 
1975), attraction (Griffitt, 1970), and altru- 
ism (Sherrod & Downs, 1974). The effects of 
noise, in particular, have received much at- 
tention. In a significant and systematic in- 
vestigation of high-intensity noise on physio- 
logical responses and task performance, Glass 
and Singer (1972) have indicated that “un- 
predictable and uncontrollable noise should 
affect aggressiveness, exploitative behavior, 
liking for others, and general irritability in 
interpersonal relations” (p. 159). While there 
has been some evidence of noise effects on 
prosocial responses such as helping behavior 
(Sherrod & Downs, 1974), except for studies 
on low-intensity noise (Geen & O’Neal, 1969) 
and field research (Ward & Suedfeld, 1974), 
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there has been to date no systematic attempt 
to examine the effects of high-intensity noise 
on more negative interpersonal behaviors such 
as aggression. It was the purpose, therefore, 
of the present studies to examine within the 
framework of the Glass and Singer research 
the varying effects of high-intensity noise on 
interpersonal aggression. 

In their research, Glass and Singer investi- 
gated the effects of differing forms of noise 
(ie., high or low intensity, predictable or 
unpredictable, controllable or uncontrollable) 
on both ongoing and subsequent task per- 
formance and physiological responses, Since 
their results indicated markedly different pat- 
terns for ongoing and subsequent perform- 
ance, the present authors conducted two 
Separate studies in which noise effects were 
examined with respect to ongoing aggressive 
behavior (Experiment 1) and aggressive be- 
havior following exposure to high-intensity 
noise (Experiment 2). In addition, another 
important variable in the Glass and Singer 
research, that of perceived control, was also 
examined in light of its potential consequences 
on aggressive behavior. 


EXPERIMENT 1; ONGOING AGGRESSIVE 
BEHAVIOR 


An interesting finding of the Glass and 
Singer (1972) work was the somewhat rapid 
adaptation, both physiologically and behay- 
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iorally, to high-intensity unpredictable noise. 
Thus, in terms of ongoing simple task per- 
formance, high-intensity noise did not impair 
performance, In addition, while there was an 
initial increase in physiological arousal, after 
a few minutes individuals returned to a base 
level indicative of an adaptation to the noise. 
The intent of the present study, then, was 
to examine ongoing aggressive behavior under 
noise conditions similar to those of Glass and 
Singer. In the present study, male subjects 
were either angered or treated in a neutral 
manner by a confederate and subsequently 
given an opportunity to aggress against the 
confederate via the administration of electric 
shock. While delivering shock, subjects were 
subjected to either a low-intensity (52 dB 
[A]) or high-intensity (95 dB) unpredictable 
and uncontrollable noise. Unpredictable and 
uncontrollable types of noise were employed 
because they were the forms most conducive 
to potential effects in the Glass and Singer 
work. Based on the Glass and Singer research 
and recent work in the aggression area, one of 
three possible predictions could be made with 
regard to the effects of high-intensity noise on 
ongoing aggressive behavior. 

First, given the results of the Glass and 
Singer work, it might be predicted that there 
would be an initial increase in aggressive re- 
sponding under high-intensity noise due to a 
general increase in the level of arousal. This 
should occur for both angered and nonangered 
subjects since Glass and Singer indicate that 
noise produces a general irritability in inter- 
personal relations. As subjects became more 
adapted to the noise, however, there would be 
a decrease in arousal and subsequent aggres- 
sion. If this is the case, then higher shock 
levels should be exhibited in the early shock 
trials of the present study, with later trials 
revealing a gradual decrease in intensity level. 

A second possible prediction is that aggres- 
sion will be facilitated by exposure to high- 
intensity noise for previously angered sub- 
jects. This is based on the notion that any 
source of emotional arousal will tend to fa- 
cilitate aggression in individuals who are pre- 
disposed (i.e., angered) to aggress (Bandura, 
1073). Research in which individuals have 
been angered and subsequently exposed to 
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arousal via strenuous exercise (Zillmann, 
Katcher, & Milaysky, 1972), erotic stimuli 
(Donnerstein, Donnerstein, & Evans, 1975; 
Zillmann, 1971) or drugs (O’Neal & Kauf- 
man, 1972) have lent support to this posi- 
tion. Recent research in this area suggests 
that subjects come to label their general 
arousal level from previous anger and exter- 
nal sources (e.g., erotic films) as heightened 
anger (Koneéni, 1975b). This is, of course, 
analogous to the Schachter-Singer two-factor 
theory of emotion (Schachter & Singer, 1962) 
and the recent work of Zillmann on excita- 
tion transfer (Zillmann, 1971; Zillmann et 
al., 1972). Given that subjects are in con- 
stant contact with the source of their anger, 
it would be expected that their level of anger 
should remain constant or even increase. In 
addition, based on research utilizing the shock 
paradigm as a means of aggressing (e.g., Buss, 
1961), it would even be expected that shock 
levels should increase over trials due to a 
disinhibition of aggression (Goldstein, Davis, 
& Herman, 1975). 

A third possible prediction is an inhibition 
of aggression for angered subjects due to the 
added aversiveness of the high-intensity noise, 
Baron and Bell (1975) recently found that 
high ambient temperature served to reduce 
aggression in previously angered individuals. 
They speculated that rather than aggression, 
escape from the aversive situation was the 
dominant response in the subject’s hierarchy, 
In order to avoid delays such as victim pro- 
tests or experimenter censure from the use of 
high shocks, subjects reduced their shock 
levels to terminate the experiment more rap- 
idly. If this is the case in the present study, 
it would be expected that shock levels would 
remain low over trials, or even reveal a de- 
crease, for angered subjects exposed to the 
aversive high-intensity noise. 

In summary, the first experiment investi- 
gated the effects of high-intensity noise on 
ongoing aggressive behavior. Given recent 
research and theorizing in the area, aggres- 


1 Recently, Harris and Huang (1974) have shown 
that when subjects can attribute their arousal to the 
noise, there is not an increase in aggressive behavior. 
This, of course, would support the interpretation 
given above. 
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sion was expected to either (a) increase and 
then gradually decrease, (b) be facilitated in 
angered individuals, or (c) reveal an inhibi- 
tion due to the aversiveness of the situation. 


Method 
Subjects 


The subjects were 40 male undergraduates who 
volunteered for extra course credit in introductory 
psychology. 


Apparatus 


Shock was ostensibly administered via a modified 
Buss (1961) “aggression machine.” There were eight 
shock buttons to be used for shock administration 
plus one additional button that ostensibly signaled 
a correct response via a light. A voltage meter above 
the buttons was calibrated to indicate that higher 
numbered shock buttons delivered stronger shocks. 
A set of four lights in the upper right corner was 
used for communication between the subject and 
learner. Noise was transcribed from a standard white- 
noise generator onto a tape record’ng. Decibel read- 
ings were taken at earphone level with two settings, 
55 dB and 95 dB, having been selected for experi- 
mental conditions, Noise was administered via a 
stereo tape recorder and headphone system. 


Procedure 


Upon arriving for the experiment, the subject was 
taken to an experimental room where the aggression 
machine and a number of pieces of bogus physio- 
logical equipment were located. Soon a confederate 
of the experimenter arrived posing as another sub- 
ject. Both subjects were told that the experiment 
dealt with the effects of various forms of stress on 
learning. They were further informed that they 
would both be asked to take some form of a learn- 
ing task under stress. The subjects were then given 
an “informed consent form” to sign, which noted, 
in addition to the above, that they would be asked 
both to administer and receive electric shock during 
the experiment and possibly be exposed to high- 
intensity white noise. After subjects signed the form, 
the experimenter supposedly randomly chose the 
subject to take the first task and took the confed- 
erate to another room where he would presumably 
begin studying for the second task. The subject was 
told that he would be asked to write a short essay 
on a recent social issue under the stress that the 
essay would be evaluated by the other subject 
through the use of electric shock. It was noted that 
the evaluation would range from 0 to 10 shocks, 
with higher shocks indicating a poorer rating. In 
addition, a written evaluation would be given. This 
type of anger manipulation is similar to that em- 
ployed in other studies (e.g., Baron, 1974; Berko- 
witz & Geen, 1966). 

Anger manipulation. The subject was then given 
5 minutes to write a short essay on the topic, 
“whether Richard Nixon should have been par- 
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doned.” After the elapsed time, the experimenter 
returned and took the essay to the other subject, 
supposedly for him to read. He then returned and 
attached two electrodes to the subject’s fingers. The 
subject was told that when the four lights on the 
aggression machine panel were lighted, the evaluation 
would take place. The experimenter then left the 
room and shortly thereafter signaled the onset of the 
evaluation. Subjects in the anger condition were 
given nine shocks of .5-second duration. Nonangered 
subjects received only one shock. The experimenter 
returned to the subject’s room with a questionnaire, 
supposedly completed by the other subject, that 
evaluated the essay and the subject on a number of 
bipolar scales. Subjects under anger conditions 
received negative ratings, whereas nonangered sub- 
jects were given positive evaluations. 

Exposure to noise and administration of aggres- 
sion, After receiving the two forms of evaluation, 
the subject was informed that the other subject 
(who is now referred to as the learner) would now 
take his task under stress, He was told that the 
learner would be taking a paircd-associate learning 
task under the stress of electric shock for incorrect 
responses. Since the experimenter would be involved 
in monitoring some physiological measures, the sub- 
ject was informed that he would administer the task 
and the stressor. The subject was told to call out 
the stimulus word over the intercom and wait for 
the learner’s answer, which would be signaled by 
lighting one of the four answer lights on the shock 
apparatus. The subject was further told that if the 
response was correct, he was to signal the learner by 
pushing the button labeled “light,” which would 
ostensibly inform the learner that he made a correct 
response. The subject was then told that when the 
learner made an incorrect response, he was to deliver 
an electric shock. It was noted that any arbitrarily 
determined intensity could be delivered but that 
each button on the shock apparatus represented a 
different shock intensity of increasing magnitude. The 
subject was then handed a list of 30 consonant- 
vowel-consonant nonsense syllables of medium J’ 
value (Noble, 1961) and a list of correct responses. 

At this point the experimenter indicated that he 
was interested in trying out a new form of stressor 
on physiological reactions while an individual was 
performing a relatively simple task. The subject was 
informed that while he was administering the learn- 
ing task, he was to wear a set of headphones over 
which blasts of white noise would be delivered 
periodically. It was further explained that his gal- 
yanic skin response would be monitored during this 
time. The subject was presumably chosen for this 
supposed pilot test because the administration of the 
learning task was a relatively simple procedure. The 
experimenter then attached two electrodes from a 
bogus galvanometer to the subject, instructed him to 
put on the headphones, and noted that he would be 
signaled when to begin the learning task. 

Shortly thereafter, the subject was signaled to be- 
gin the learning task. The subject verbally admin- 
istered the 30 nonsense syllables and received back 
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from the learner 20 incorrect and 10 correct re- 
sponses randomly distributed over five blocks of six 
trials each. Since the experimenter controlled the 
learner’s signaled responses, the task took approxi- 
mately 8 minutes. During this time the subject was 
subjected to unpredictable aperiodic 1-second noise 
bursts. The bursts occurred on the average every 4 
seconds, with half the subjects exposed to high- 
intensity (95 dB) noise and the remainder to low- 
intensity (55 dB) noise. 

Following the administration of aggression and 
exposure to noise, the subject was given a brief 
questionnaire to complete. The questionnaire asked 
him to rate on a 7-point scale his reactions to the 
noise and the ratings he received from the other 
subject. These scores served as manipulation checks 
for the various conditions. Finally the subject was 
completely debriefed as to the true nature of the 
experiment and thanked for his participation.” 

The experiment was conducted by two experi- 
menters and two confederates who were balanced 
across all conditions. Since neither of these factors 
had any significant effeçts on the results, only the 
data for the Anger and oe factors are reported 
in the subsequent analyses? 


Results and Discussion 


Manipulation Checks 


Subjects were asked to rate on 7-point 
scales the quality of rating they received for 
their essay and how angry they felt after 
having received the rating. Analyses revealed 
that angered subjects perceived their rating as 
poorer, F(1,36) = 361.25, p< .001; Ms = 
6.9 and 1.1, and indicated more anger, F(l, 
36) = 34.42, p<.001; Ms=3.8 and 1.0, 
than their nonangered counterparts. Addi- 
tional questions asked subjects to evaluate 
the noise they had been exposed to during the 
learning task. Subjects exposed to high-inten- 
sity noise rated the noise as louder, F(1, 36) 
= 43.18, p < .001; Ms = 4.9 and 2.4, more 
annoying, 7(1, 36) =18.54, p< .001; Ms = 
4.7 and 2.5, more aversive, F(1,36) = 35.40, 
p< .001; Ms = 4.4 and 2.0, and were more 
anxious for the experiment to end, F(1, 36) 
= 5.12, p < .05; Ms = 4.5 and 3.3. The pre- 
ceding data, then, would tend to support the 
effectiveness of the anger and noise manipu- 


Jations. 


Aggression data 

A 2x25 (Anger X Noise Intensity X 
Trials) analysis of variance (ANOVA) was 
conducted on the shock-intensity data. Signifi- 
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Ficure 1. Mean shock intensity as a function of 
noise and anger. 


cant sources of variation were obtained for 
anger, F(1,36) = 132.63, p < .001, noise in- 
tensity, F(1, 36) = 12.59, p < .01, the Anger 
x Noise Intensity interaction, F(1,36) = 
14.01, p < .01, and trials, F(4, 144) = 2.64, 
p < .05. The trials effect indicated that sub- 
jects increased their shock levels from the 
first to the last block of trials. The Anger x 
Noise Intensity interaction, which is pre- 
sented in Figure 1, indicated that while non- 
angered subjects were not affected by differen- 
tial noise, angered subjects did reveal dif- 
ferent patterns of responding. More specifi- 
cally, angered subjects exposed to high-inten- 
sity noise displayed more aggression than 
their counterparts exposed to low-intensity 
noise, F(1,36) = 5.29, p < 05. 

These results would tend to support the 
facilitation and cognitive labeling hypothesis 
noted in the introduction. Without some prior 
aggressive instigation, high-intensity noise 
had no effect on ongoing aggressive behavior 
in the present experiment. That some form of 
cognitive guidance is needed, in this case 
anger, to produce an aggressive facilitation is 
analogous to research employing other modes 

2 During the debriefing, subjects were questioned 
as to any suspicion regarding the experiment. In 
both Experiments 1 and 2 no subjects expressed 
suspicion concerning any of the manipulations or 
hypotheses under investigation. 


of arousal (e.g., Konečni, 1975b; Zillmann et 
al., 1972). Though it is possible that subjects 
did adapt to the noise, as would be suggested 
by the Glass and Singer (1972) work, the 
facilitation for angered subjects and the trials 
effect suggest that this may not have an effect 
on aggressive responding. The results seem 
more supportive of the disinhibition hypothe- 
sis of Goldstein et al. (1975), in that once 
subjects begin a somewhat high aggressive 
response they tend to become more extreme in 
their behavior over time. It is possible, how- 
ever, that the length of time (8 minutes) 
employed in the present study did not allow 
for complete physiological and behavioral 
adaptation, This could be examined in future 
research by systematically varying the time 
of exposure to high-intensity noise. For the 
present, however, the results indicate that 
aversive noise can be a facilitator of ongoing 
aggressive behavior in individuals who are 
predisposed to aggression. 


EXPERIMENT 2: SUBSEQUENT BEHAVIOR 
AND CONTROL 


The first experiment demonstrated that 
aversive noise could affect ongoing aggressive 
behavior; however, the work of Glass and 
Singer (1972) has indicated that the most 
negative consequences should occur in post- 
noise conditions. Glass and Singer’s research 
showed impaired task performance and intol- 
erance for frustration following aversive-noise 
exposure. Although it was initially believed 
that postnoise impairments were a function 
of the “psychic costs” of adaptation, more 
recent theorizing indicates that the effects 
are a product of “behavioral residues’—a 
function of exposure to noise independent of 
adaptation. The behavioral consequences are 
seen as a function of the intensity of the 
stressor, in which cognitive factors such as 
predictability and controllability interact with 
the physical aspects of the stressor to deter- 
mine its aversiveness. Thus, one purpose of 
the present experiment was to determine 
whether prior noise exposure would affect 
subsequent aggressive behavior. In addition, 
it was of interest to examine the effects of 
perceived control in regard to such behavior, 
Based on the research of Glass and Singer and 
others, a number of tentative hypotheses can 
be suggested. 
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First, given that subjects are less tolerable 
of frustration after noise exposure, it would 
be predicted that subjects exposed to high- 
intensity noise and later angered should dis- 
play, in comparison with a no-noise control, 
increased aggressive behavior against their 
instigator. One important question might be 
whether nonangered subjects who are also 
exposed to aversive noise might also display 
an increase in aggressive responding when 
compared with a no-noise control. In a recent 
study by Sherrod and Downs (1974), sub- 
jects exposed to “distracting” background 
noise during task performance showed de- 
creased postnoise altruism. In their own re- 
search, Glass and Singer suggest that a gen- 
eral irritability in interpersonal relations 
might occur following exposure to aversive 
events such as high-intensity noise. In order 
to examine this possibility, subjects in the 
present experiment were either angered or 
treated in a neutral manner following noise 
exposure. 

A second prediction in the present study is 
that perceived control over high-intensity 
noise should reduce the behavioral aftereffects. 
Glass and Singer found that control over 
noise exposure did not impair subsequent task 
performance, In addition, Sherrod and Downs, 
in the study noted above, observed that per- 
ceived control over noise exposure did not 
inhibit altruistic behavior. As noted earlier, 
the behavioral aftereffects observed in the 
Glass and Singer work were a function of the 
aversiveness of the stressor. Perception of 
some control over aversive events seems to 
reduce the feeling of helplessness during noise 
exposure and to reduce the magnitude of the 
stress response. Thus, a sense of control over 
noise termination, even though not actually 
exercised, tends to reduce the aversiveness of 
the noise and the subsequent behavioral im- 
pairments. Lefcourt (1973) has also stressed 
the fact that control and predictability 
ameliorate the effects of stressful events in 
both humans and nonhumans, In order to 
examine the possibility that perceived control 
might also affect subsequent aggressiveness, 
subjects in the present experiment were ex- 
posed to varying noise conditions while per- 
forming a math task. One third of the sub- 
jects were exposed to high-intensity (95 dB) 
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noise that was both unpredictable and un- 
controllable. This is the type of condition that 
should produce the most behavioral after- 
effects. Another third of the subjects were 
exposed to the same noise but were given 
additional information that led them to per- 
ceive that they had control over terminating 
the noise. The final group of subjects served 
as a no-noise control against which to evalu- 
ate experimental differences. 


Method 
Subjects 


The subjects were 60 male undergraduates in in- 
troductory psychology who volunteered for extra 
credit. None of these subjects had participated in 
Experiment 1. 


Procedure 


The initial portion of the experiment was similar 
to that of Experiment 1. The subject, along with 
the confederate, was informed that the experiment 
involved the effects of stress on learning, signed the 
informed consent form, and was chosen to take the 
first task. After taking the confederate to another 
room, the subject was given instructions concerning 
the writing of the essay and how the other subject 
would evaluate its merit. The subject was then 
given 5 minutes to complete this task. Thus, up to 
this point, the procedure was identical to that in 
Experiment 1. 

Noise exposure and manipulation of control. Upon 
returning to the subject’s room to collect the essay, 
the experimenter informed the subject that he would 
be given a second task to complete while the other 
subject read the essay and also performed a short 
task under stress. The subject was told that the task 
he would be given was a math test, which he would 
try to complete under the stress of high-intensity 
noise. The subject was then handed a sheet of 41 
math problems (addition, multiplication, etc.) and 
given further information on the noise. Subjects 
under high-noise/no-control conditions were in- 
structed to put on the headphones and wait for the 
experimenter to signal when to begin the task. Sub- 
jects under the high-noise/control condition were 
given additional information, similar to that em- 
ployed by Glass and Singer (1972), which led them 
to perceive that they had control over noise termina- 
tion, Subjects were told that any time they wanted 
the noise terminated while working on the math 
problems, they needed only to say the word termi- 
nate over the intercom. At that point the noise 
would stop and they could continue with the re- 
mainder of the task without noise exposure. It was 
further noted, however, that a number of subjects 
had previously used this option and the experi- 
menters were currently interested in obtaining more 
subjects who maintained noise exposure throughout 
the entire task. The experimenter noted that he 
would appreciate it if the subject kept the noise on 
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but that the choice to do so was entirely up to the 
subject. None of the subjects elected to terminate 
the noise. In the no-noise condition, subjects were 
informed that they were to take the task under no 
stress but were instructed to wear the headphones 
to block out any background noise. After a signal 
from the experimenter, the subjects were given 7 
minutes to work on the task. During this time, sub- 
jects under noise conditions were given unpredictable, 
aperiodic 1-second white-noise bursts of 95 dB at 
the average rate of every 4 seconds. 

Anger manipulation and administration of aggr 
sion. After the elapsed time for the task, the experi- 
menter returned and indicated that it was time for 
the subject to receive his evaluation from the other 
subject for the essay, The procedure was the same as 
in Experiment 1, with angered subjects receiving 
nine shocks and a poor written evaluation and non- 
angered subjects receiving one shock and a positive 
evaluation, Following the evaluation, the subject was 
given the information regarding the administration 
of the nonsense-syllable task and use of the aggres- 
sion machine as done in the previous study. The 
remaining procedure was the same as before except 
that the learning task involved 24 trials in which 
the learner made 18 incorrect and 8 correct re- 
sponses. Following completion of the task, the sub- 
ject completed a brief questionnaire in which he 
rated on 7-point scales his reactions to the noise, the 
essay rating, the other subject, and his freedom in 
terminating that noise. The subject was then com- 
pletely debriefed as to the true nature of the study 
and thanked for his participation. 

The experiment was conducted by three experi- 
menters and three confederates who were balanced 
across conditions. None of these factors affected the 
results and are thus eliminated in future analyses. 


Results and Discussion 
Manipulation Checks 


Analyses on ratings of the noise by the two 
noise-exposure groups indicated that they did 
not differ from each other on the measures of 
loudness or annoyance. However, subjects 
under the noise/with-control condition found 
the noise less aversive than noise/no-control 
subjects, #(38) = 2.28, p<.05; Ms = 3.6 
and 4.3, respectively. This would tend to sup- 
port the contention of Glass and Singer 
(1972) that a feeling of control over stressful 
events lessens their aversiveness. An addi- 
tional question with regard to the experi- 
mental groups was their feeling of freedom to 
terminate the noise. Noise/with-control sub- 
jects indicated a higher degree of freedom 
than noise/no-control subjects, (38) = 4.05, 
p < .01; Ms = 6.2 and 4.0, respectively. 

Additional analyses were conducted on the 
subjects’ reactions to the essay rating. Results 
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Ficure 2. Mean shock intensity as a function of 
noise, control, and anger. 


indicated that anger subjects perceived their 
rating as poorer, F(1,54) = 862.05, p< 
001; Ms = 6.7 and 1.3, were angrier, F(1, 
54) = 23.01, p< .001; Ms=2.9 and 1.2, 
and did not feel as good toward the confed- 
erate, F(1, 54) = 45.85, p < .001; Ms = 6.5 
and 4.4, compared with their nonangered 
counterparts. 


Math Test 


A one-way ANOVA with the three noise 
levels as groups was conducted on the num- 
ber of problems completed, the number of 
problems correct, and the percentage of cor- 
rect problems. There were no significant ef- 
fects. These results suggest that subjects were 
able to adapt to the noise during ongoing task 
performance. Such data are analogous to those 
obtained by Glass and Singer (1972) and oth- 
ers (e.g., Sherrod & Downs, 1974). 


Aggression Data 


A 2x3 (Anger X Noise) ANOVA was 
conducted on the shock intensity data. Sig- 
nificant sources of variation were obtained for 
anger, F(1,54) = 260.10, p< .001, noise, 
F(2,54) = 12.47, p < .01, and the Anger x 
Noise interaction, F (2, 54) = 3.96, p < .025. 
The interaction is presented in Figure 2. As 
can be seen, although noise did not affect 
nonangered subjects, it did produce differen- 
tial responding for angered individuals, More 
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specifically, subjects under noise/no-control 
were more aggressive than no-noise, F (1, 54) 
= 11.97, p<.01, and noise/with-control, 
F(1, 54) = 29.70, p < .001, subjects, with the 
latter two groups not significantly different 
from each other. 

These results suggest that postnoise aggres- 
sion is not a function of a general irritability 
in interpersonal relations but is dependent on 
some form of aggressive instigation. One pos- 
sible explanation is that subjects under no- 
control conditions were less tolerable of the 
essay rating received from the confederate. 
This notion would be supported by the task 
performance data of Glass and Singer (1972) 
and would seem to accord well with the ob- 
tained results of the present experiment. An- 
other possibility is that subjects under no- 
control conditions, who found the noise more 
aversive, maintained some form of residual 
arousal when they were receiving their essay 
evaluation. In terms of the recent work by 
Zillmann (Zillmann, 1971; Zillmann et al., 
1972) on excitation transfer and the cogni- 
tive-labeling hypothesis referred to earlier 
(ie., Koneéni, 1975a), it is possible that sub- 
jects came to label their composite emotion 
as a heightened form of anger. While the self- 
report data do not lend support to this inter- 
pretation, perhaps due to problems of the 
scale, it would be of interest for future re- 
search to examine this possibility through the 
use of physiological measures of subjects at 
various times during a similar experiment. 
Whatever the final explanation, the more sig- 
nificant finding is that perceived control over 
the noise eliminated any negative conse- 
quences due to noise exposure. These results 
are similar to those found for altruistic be- 
havior (Sherrod & Downs, 1974) and further 
support the importance of some form of con- 
trol over stressful events noted by Glass and 
Singer (1972) and Lefcourt (1973). 


GENERAL CONCLUSIONS 


It was the interest of the present two ex- 
periments to examine the relationship between 
exposure to aversive noise and aggressive be- 
havior. The model chosen as a context for this 
investigation was the research of Glass and 
Singer (1972) on task performance. The re- 
sults for an interpersonal behavior (in this 
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case, aggression) seem to parallel those of 
Glass and Singer for postnoise behavior, 
though differing somewhat for ongoing re- 
sponses. An important additional factor in 
the present studies is the necessity of aggres- 
sive provocation for enhancing aggression 
from exposure to high-intensity noise. Re- 
cently, Konečni (1975a) has indicated that 
prior anger arousal should be considered when 
one examines the effect of aversive stimula- 
tion on interpersonal behavior. The present 
results seem to support this contention. 

The findings concerning the effects of per- 
ceived control as a cognitive factor in noise 
exposure should direct researchers in the fu- 
ture to examine other variables that might 
mediate the noise-aggression relation. Factors 
such as expectation, relative deprivation, ne- 
cessity, choice, and others examined in the 
Glass and Singer research program would be 
of obvious interest. As noted earlier, the un- 
derlying mechanism for the effects of these 
factors should also be given further consid- 
eration. 

In general, the results of the present ex- 
periments suggest that high-intensity noise 
can have negative effects on interpersonal 
interactions. While these effects seem to de- 
pend on a number of mediating factors (i.e., 
anger, perceived control), there now seems to 
be evidence for reductions in prosocial (Sher- 
rod & Downs, 1974) and increases in anti- 
social behaviors. Given the research on other 
environmental stressors (i.e. overcrowding, 
high temperatures) and their effects on inter- 
personal behaviors, it would seem that re- 
search should now direct itself toward the 
examination of variables that might mediate 
and consequently reduce the negative conse- 
quences of various environmental stimuli on 
human behavior. 
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A Longitudinal Field Investigation of the Impact of Group 
Composition on Group Performance and Cohesion 
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In a Jongitudinal field experiment, 42 groups were assembled on the factors 
of individual ability and attitude similarity, resulting in a 2X2 crossed 
analysis of variance design. These groups then worked on six projects spanning 
a 3-month time period. For each project, measures of group performance and 
group cohesion were taken. Analyses showed that high-ability groups exhibited 
the best performance, whereas attitudinally similar groups expressed the 
greatest cohesion. In addition, correlational data suggested that the influence 
of attitude similarity on cohesion was not immediate but developed over time 
and that across the six projects, performance was both positively and nega- 


tively related to cohesion. 


One of the most pervasive questions con- 
cerning group effectiveness involves group 
composition, How do the skills, personality 
characteristics, likes and dislikes, etc., of indi- 
vidual group members influence the output of 
the group? If answers could be found to 
questions like these, then groups could be 
assembled so as to maximize the occurrence 
of desirable group outcomes. 

To this end, some research in group compo- 
sition has focused on the skills and abilities of 
individual group members. It was assumed 
that group effectiveness could be enhanced 
by selecting group members on the basis of 
technical competence. Standard selection 
procedures for group composition were based 
on this assumption. However, in their review 
of the group literature, McGrath and Altman 
(1966) concluded that while individual abil- 
ity appears to predict individual performance, 
there is little evidence that group performance 
can be reliably predicted from knowledge of 
member ability. 

Yet this need not always be the case. 
Steiner (1972) argued that group perform- 
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ance depends on the adequacy of the re- 
sources members have at their disposal and 
upon the manner in which these resources are 
used. Determination of relevant resources 
depends on an analysis of the demands and 
characteristics of the task. Thus, through con- 
sideration of task demands, it should be pos- 
sible to assemble effective work groups on the 
basis of relevant individual abilities. In a 
more recent review of the group literature 
since McGrath and Altman (1966), evidence 
has been cited that supports this notion (cf. 
Davis, Laughlin, & Komorita, in press). It 
has already been shown that job analysis is 
valuable for individual selection and place- 
ment (McCormick, Jeanneret, & Mecham, 
1972). More research is required to determine 
the degree to which such job or task analyses 
can be useful for group composition. 

A second variable of interest to group re- 
searchers has been the interpersonal compat- 
ability of the group members. It can be as- 
sumed that for tasks involving coordination 
or joint activity, member compatability be- 
comes an important determinant of group 
performance. This factor is necessary so that 
the group may function without problems of 
communication or authority, or any other 
interpersonal problems. Available research 
suggests that actual group performance as 
well as anticipated performance and satisfac- 
tion can be increased by assembling group 
members on the basis of self-selection (Van 
Zelst, 1952), need compatability (Reddy & 
Byrnes, 1972), and attitude similarity (Cas- 
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tore & DeNinno, Note 1, Note 2), The rela- 
tionship of such selection procedures to com- 
patability and performance is, however, 
poorly delineated and not well understood. 

In an attempt to better comprehend how 
such characteristics affect compatability, the 
theoretical formulations and supporting re- 
search of Byrne (1971), Heider (1958), and 
Newcomb (1961) appear useful. They sug- 
gest that the similarity of attitudes and val- 
ues that individuals hold may be important 
determinants of their ability to interact ef- 
fectively. When a high concordance exists on 
attitudinal issues, interpersonal interaction is 
facilitated; when a low concordance exists, 
interpersonal interaction is inhibited or can 
even take the form of hostility. Continuing 
this line of reasoning, one would expect that 
on group tasks requiring member interaction, 
attitude similarity would lead to effective 
group performance while attitude dissimilarity 
might well suppress effective group perform- 
ance. 

In spite of the practical value of knowing 
the effects of group composition on group 
outcomes, reviews of small group research 
have reported that in general, there exists a 
lack of attention to the problems of group 
composition (Davis et al., in press; Helm- 
reick, Bakeman, & Scherwitz, 1973; McGrath 
& Altman, 1966). Further, these reviews 
have mentioned several potential problem 
areas that characterize group research and 
make effective studies of group composition 
more difficult. First, research on small groups 
has been conducted almost exclusively in 
laboratory settings. Second, ad hoc groups 
constructed by the researcher for the pur- 
poses of the experiment have been studied 
more often than naturally occurring work 
groups. And third, the functioning of the 
group has typically been examined at only 
one point in time as opposed to longitudinal 
research where repeated observations are 
taken. The extent to which such laboratory 
studies with ad hoc groups working for short 
time durations have internal validity as well 
as external validity is questionable. Anderson 
(1961) stated that individuals in groups re- 
quire a certain amount of time together before 
they begin to behave as a group. As a result, 
the time duration of the experiment may in- 
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fluence the obtained results. Similarly, Lorge, 
Fox, Davitz, and Brenner (1958) warned 
against generalizing principles found with ad 
hoc groups to groups that interact over time. 
Clearly, meaningful group research should be 
designed with these factors in mind. 

One purpose of the present investigation 
was to examine the effects of group member 
ability and attitude similarity on group per- 
formance in a longitudinal field experiment. 
It was hypothesized that (a) homogeneously 
high-ability groups would perform better than 
homogeneously low-ability groups and (b) 
attitudinally similar groups would perform 
better than attitudinally dissimilar groups. 

A second group outcome of considerable 
importance is the cohesiveness of the group. 
Cohesiveness is assumed to be a desirable 
group outcome, since it is typically associated 
with accurate communication, high satisfac- 
tion, and low absenteeism and turnover (Lott 
& Lott, 1965; Stogdill, 1972). Further, co- 
hesiveness is based in part on the rewards 
obtained through group membership (Cart- 
wright, 1968; Lott & Lott, 1965). These 
rewards can stem from pleasant interpersonal 
relations within the group as well as from the 
attainment of valued outcomes that are asso- 
ciated with successful group performance, 
Therefore, based on previous theory and re- 
search suggesting that interaction with atti- 
tudinally similar people is more rewarding 
than interaction with attitudinally dissimilar 
people (Byrne, 1971; Castore & DeNinno, 
Note 1, Note 2) and on the assumption that 
high group performance is more rewarding 
than low group performance, it was hypothe- 
sized that groups high on attitude similarity 
would express greater cohesiveness than 
groups low on attitude similarity. Accord- 
ingly, if ability is related to group perform- 
ance, and if high performance is more reward- 
ing than low performance, then ability should 
be related to cohesion. Thus, it was hypothe- 
sized that homogeneously high-ability groups 
would express greater cohesiveness than homo- 
geneously low-ability groups. 

A final hypothesis can be made concerning 
the relationship between group performance 
and group cohesion. Before stating the hy- 
pothesis, however, it would be useful to 
describe briefly the nature of the task used in 
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this study. Group performance constituted 
one fourth of the subject’s final grade in a 
course on land-surveying techniques. The 
group task consisted of six standardized field 
projects used by the Civil Engineering De- 
partment. Successful completion of this course 
was required for admission into the highly 
competitive civil engineering program. There- 
fore, based on this information and discus- 
sions with the course instructors, it was 
assumed from the beginning that high perform- 
ance on the projects would be more reward- 
ing than low performance and that all groups 
would be motivated to do well on the six field 
projects. Returning to the cohesion-perform- 
ance hypothesis, Stogdill (1972) has sug- 
gested that only under conditions of high 
group drive will group cohesion be positively 
related to group performance. Under condi- 
tions of low group drive, in contrast, cohesion 
and performance will be negatively related. 
Assuming then that high-drive conditions 
were present, it was hypothesized that group 
performance would be positively correlated 
with group cohesion, 


METHOD 
Subjects 


The subjects were 127 male and 6 female under- 
graduate students enrolled in five sections of an 
introductory course on land surveying. At the first 
class meeting, subjects were told by the course 
instructors that the experimenters were interested in 
examining the performance of the survey groups and 
that the faculty had given the experimenters permis- 
sion to undertake the experiment if the students 
consented. Participation was thus voluntary. No stu- 
dent refused to be included in the study. 


Description of the Task 


Three- and four-person groups worked on six field 
projects that covered basic techniques in land sur- 
veying. For each project, all members of a group 
received the same group grade. Performance on 
these projects constituted one fourth of the student’s 
total grade in the course. It was assumed at the out- 
set of the study that these conditions would pro- 
vide a situation in which the subjects would be 
highly motivated. 

Specifically, each project contained three separate 
parts or subtasks. For each subtask, students had to 
occupy three positions: operating the plumb line, 
working the transit, and writing down the results. 
Students were required to rotate through the three 
positions across assignments so that every student 
had at least one turn at every position. Using Stein- 
er’s (1972) classification scheme, this task can best 
be conceptualized as one that was divisible (i.e. 
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division of labor), with specified matching to speci- 
fied positions (i.e., students were assigned to posi- 
tions), and where group performance was additive 
(i.e., performance on each project was the sum of 
the three subtasks). In addition, the task had dis- 
junctive properties. In a disjunctive task, the per- 
formance of the group is determined by one group 
member. Since the course instructors stated that the 
student operating the transit had the greatest influ- 
ence on the accuracy and, hence, grade received on 
the particular project the group was working on, 
the task was similar but not identical to a disjunc- 
tive task, 


Assembly of Groups 


Groups were assembled to be homogeneously high 
or low on ability and homogeneously high or low 
on attitude similarity, resulting in a 2X 2 crossed 
analysis of variance, 

From discussions with course instructors, it was 
concluded that scores on the quantitative section of 
the Scholastic Aptitude Test and cumulative grade 
point averages would provide the best indices of 
task-relevant abilities. The following procedure was 
used to classify groups. For each subject, his/her 
Scholastic Aptitude Test score and grade point. 
average were converted to z scores. Assigning equal 
weight to each s score, a weighted sum was com- 
puted. Subjects were then placed in rank order on 
the basis of these scores. Homogeneously high-abil- 
ity groups were assembled from students with scores 
above the median, and homogeneously low-ability 
groups were assembled from students with scores 
below the median. 

For attitude similarity, subjects responded on a 
6-point scale to 20 attitude statements taken from 
the Survey of Attitudes Questionnaire (Byrne, 1971). 
These statements covered such innocuous topics as 
state income tax, legal drinking age, athletics, etc. 
In each of the five class sections, a correlation 
matrix was computed that correlated subjects on the 
basis of their responses to the 20 attitude state- 
ments. Within each ability level, homogencously 
high-attitude-similarity groups were assembled from 
subjects who had high correlations with each other, 
and homogeneously low-attitude-similarity groups 
were assembled from subjects who had low correla- 
tions with each other. As a check on the manipula- 
tion, the grand mean of the intercorrelations for the 
similar groups was compared with the grand mean 
of the intercorrelations for the dissimilar groups. The 
two means were significantly different from each 
other: For similar groups, M =.57, SD=.09; for 
dissimilar groups, M = .25, SD =.10; t(40) = 11.10, 
$ < 001.1 


1 Degree of attitude similarity was also computed 
by examining the difference scores between group 
member response profiles to the 20 attitude state- 
ments (see Blum & Naylor, 1968, pp. 72-74). The 
two procedures showed over 90% agreement in 
classification of groups. The correlation index was 
used for all analyses, 
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Tn all, there were (a) 7 three-person groups and 
2 four-person groups classified as high in ability and 
in attitude similarity, (b) 11 three-person groups 
and 1 four-person group classified as high in ability 
and low in attitude similarity, (c) 10 three-person 
groups and 2 four-person groups classified as low in 
ability and high in attitude similarity, and (d) 7 
three-person groups and 2 four-person groups classi- 
fied as low in ability and low in attitude similarity. 
The six female students were randomly dispersed 
among the 42 groups. Also, the four-person groups 
were distributed as equally as possible among the 
four cells given an odd number of four-person 
groups. 


Assessment of Dependent Variables 


For each of the six projects, group performance 
and group cohesion were assessed as dependent mea- 
sures. Group performance was determined by in- 
structors’ grades using a 20-point grading scale with 
20 being the highest grade obtainable. Points were 
assigned on the basis of how close the group’s calcu- 
lations matched the known figures for each project. 
The known figures were based on instructors’ compu- 
tations. Therefore, it was possible for all groups to 
attain a high grade if all groups arrived at the 
known answer, The groups worked independently, 
Group cohesion was based on group member re- 
sponses to three questions that were typed on a 
separate page and attached to each project. The 
operationalization of cohesion used here was similar 
to that reported elsewhere (cf., Schachter, 1951; 
Schachter, Ellertson, McBride, & Gregory, 1951). 
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Subjects were asked to respond to the following 
three questions individually: (a) “How would you 
describe the way you and the other members of 
your survey party ‘got along’ together on this task?” 
(b) “Would you socialize with the members of your 
survey party outside of class?” and (c) “Would you 
want to remain a member of this survey party on 
future projects?” Responses were made on a 7-point 
scale, with high scores associated with favorable 
responses. For each of the six projects, the three 
cohesion items were correlated with each other to 
examine whether the items were measuring the same 
construct. Within each trial, all items were signifi- 
cantly intercorrelated at the .05 level or better 
(median r= .65). The cohesiveness of a group for 
any given trial was equal to the summation of the 
group member responses to the three questions (mini- 
mum possible score =9, maximum possible score = 
63). Responses from four-person groups were com- 
puted in the same manner with the exception that 
four-person group totals were multiplied by .75 so 
as to equate their scores with three-person groups. 
In cases of missing data, the mean of the existing 
members of the group was used in the computation 
of the group score. Missing data did not exceed 7% 
of the total for any given project. 


ResuLtTs 


All analyses were conducted with the group 
as the unit of analysis. For each dependent 
variable, a 2X 2% 6 between-within un- 
weighted-means analysis of variance with re- 


TABLE 1 


Low ability 


“High attitude Low attitude 

similarity? similarity’ 
Trial no. M S M SD 

1 
Performance 17.11 1.97 16.25 -97 16.00 3.64 15.00 447 
Cohesion 48.79 6.65 47.50 6.40 50.17 5.31 48.89 4.68 
Performance 17.00 2.40 17.67 1.56 16.17 3.69 15.33 415 
Cohesion 50.00 6.27 46.58 6.92 50.42 4,54 47.89 6.97 
Performance 19,11 1.36 18.92 1.24 18.33 2.61 18.22 3.19 
Cohesion 51,00 6.04 48.08 5.33 52.75 3.28 48.56 4.72 
Performance 19.56 ` 1.33 19.17 .94 18.75 1.29 18.56 1.59 
Cohesion 51.22 5.85 47.25 6.74 52.67 4.98 47.78 5.85 
Performance 18.89 1.27 18.58 2.02 17:25- 2.56 17.56 2.07 
Cohesion 50.56 7.45 46.75 6.78 51.68 5.57 45.78 5.93 
Performance 17.33 1.73 17.67 1.97 17.58 1.08 17.89 1.90 
Cohesion 51.00 6.87 46.67 8.14 50.50 9.38 46.18 5.83 
an =9, 


by = 12, 


J. TERBORG, C. CASTORE, AND J. DENINNO 


786 
TABLE 2 
SUMMARY OF ANALYSES OF VARIANCE FOR PERFORMANCE AND COHESION 
Source MS a r E 
Performance 
Ability (A) 48.26 1 5.33* -03 
Attitude similarity (B) 2.22 1 29. 
AXB 26 1 03 
Error 9.05 38 
Trials (C) 54.89 2 11.76** ue 
AXC 4.29 5 92 
BXC 1.78 5 38 
AXBXC 1,33 5 29: 
Error 4.67 190 
Cohesion 
A 30.56 1 18 
B 764.02 1 4,42* 10 
AXB 3.31 1 02 
Error 172.86 38 
> 15.63 5 1.28 
AXC 4.01 5 33 
BXC 16.82 5 1.38 
AXBXC 3.44 5 28 
Error 12.19 190 
*p <05, 
p< 01, 


peated measures on the third factor was con- 
ducted. The factors corresponded to high/low 
ability, high/low attitude similarity, and six 
trials (projects), respectively. Means and 
standard deviations are presented in Table 1. 
Since the design included unequal sample 
sizes, prior to analyses, the homogeneity of 
variance assumption was examined. Based on 
Bartlett’s test for unequal Ns, the hypothesis 
of equal cell variance was not rejected for 
the group cohesion variable (p> -15) but 
was rejected for the group performance varia- 
ble (p< .01). Therefore, the obtained F 
ratios for performance data should be in- 
terpreted as approximate F ratios. However, 
it is not known whether the computed F ra- 
tios for the performance data were negatively 
biased or positively biased. As shown in Table 
1, the high-ability groups had smaller stan- 
dard deviations than the low-ability groups. 
If this restriction in variance for high-ability 
groups were due to a ceiling in possible per- 
formance ratings (recall that 20 was the 
maximum performance score), then the group 
performance mean for high-ability groups 
could be restricted, resulting in a smaller mean 
difference between groups and a potentially 
negatively biased test statistic. On the other 


hand, the existence of a ceiling effect could 
also result in a positively biased test statistic 
simply due to the smaller observed variance. 
In all, for the performance data, the results 
should be interpreted with these considera- 
tions in mind. 

Results of the analyses of variance for per- 
formance and cohesion are presented in Table 
2. Specifically, high-ability groups demon- 
strated better overall performance than low- 
ability groups (M = 18.10 for high-ability 
groups; M = 17.22 for low-ability groups; 
$ < .02).° Computation of the eta-squared 
statistic (Æ?) showed that ability accounted 
for 3% of the performance variance. Attitude 
similarity had no statistical effect on group 
performance. There was also a significant 
main effect for trials (p < .001; E? = AET 
There were no significant interactions. Fur- 
ther, results of the Newman-Keuls statistic 
on performance data indicated that Trials 6, 
5, 3, and 4 were all significantly greater than 
Trial 1 (p< .01) and that Trials 5, 3, and 


2So as not to penalize those students who were 
placed in low-ability groups, the course instructors 
were asked to add one point to these students’ 
Project grades prior to computation of their final 
course grades, 
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4 were significantly greater than Trial 2 (p 
< .01). No other comparisons reached statis- 
tical significance. 

There was a significant main effect for 
attitude similarity on cohesion. Attitudinally 
similar groups expressed greater cohesion than 
attitudinally dissimilar groups (for similar 
groups, M = $0.89; for dissimilar groups, M 
= 47.38; p = .04; E? = 10). There were no 
other significant main effects nor interactions 
for cohesion. 

The correlations between attitude similar- 
ity and cohesion, and performance and co- 
hesion for the six trials are presented in 
Table 3, Specifically, the relationship between 
attitude similarity and cohesion tended to 
increase over trials, with the last three trials 
significant at p < .10 or better. Cohesion was 
also positively correlated with performance on 
the first trial (r = .28, p<.10) and nega- 
tively correlated with performance on the 
last trial (r = —.30; p < .05). 


\ DISCUSSION 


The results of this experiment gave support 
to the hypotheses that group performance is 
dependent on the skills and abilities of the 
individual group members and that group 
cohesion can be facilitated by the construc- 
tion of attitudinally similar groups. The data 
also suggest that the impact of such attitude 
or value compatability on cohesion is not 
immediate but requires time to take effect, 
and that over time, performance can be both 
positively and negatively correlated with co- 
hesion. No support was found for the hypothe- 
sized relationships between group member 
ability and cohesion nor between group mem- 
ber attitude similarity and performance. 

Addressing the observed relationship be- 
tween group ability and group performance, 
it appears that two factors must be considered 

. when assembling work groups according to 
standard selection procedures. First, it is 
obvious that task-relevant skills and abilities 
must be identified and measured. Some type 
of job analysis would certainly be useful here. 
Second, it is suggested that considerable at- 
tention be directed toward the characteristics 
and demands of the task. In the present 
study, successful performance on the task was 


TABLE 3 
SUMMARY OF CORR: 


Variable t 2 3 4 5 6 


Attitude similarity/ 
cohesion 11 20 23 


28* 12 —16 


27% 34+ 28* 
—22 05 —30* 


Cohesion/performance 


Nole. Decimal points are omitted. n = 42. 
* p <.10, two-tailed test. 
** p <.05, two-tailed test. 


largely determined by the ability of the per- 
son operating the transit. Since all group 
members rotated through this position, it is 
clear that homogeneously high-ability groups 
would perform better than homogeneously 
low-ability groups. However, if students were 
allowed to permanently place the most capa- 
ble group member behind the transit, then it 
would no longer be necessary for groups to be 
composed of all high-ability members. In this 
case, the ability of one group member could 
compensate for the lack of abilities of other 
group members. This raises some interesting 
questions regarding the processes surrounding 
the optimal use of available group resources: 
How does a group or leader in the group de- 
cide what task positions each member should 
occupy? Would the instrumentality between 
group performance and group rewards affect 
this task assignment process? Would member 
compatability be important? Future research 
on group composition would benefit if studies 
were designed to address this general issue. 
Nevertheless, it would appear that the de- 
mands and characteristics of the task be con- 
sidered so as to better specify the technical 
skill mix required for optimal group perform- 
ance. 

The observation that attitude similarity did 
not influence group performance may be best 
explained by again considering the nature of 
the task. It was assumed that similarity of 
attitudes would minimize disfunctional be- 
haviors among group members. To use Stein- 
er’s (1972) terms, attitude similarity would 
be expected to reduce losses due to faulty 
group processes. However, Steiner also stated 
that when group members are assigned to po- 
sitions by some external source, as was done 
in this experiment, the effect of group pro- 
cesses on performance is reduced. Therefore, 
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the nature of the task may have limited the 
degree to which attitude similarity could 
influence group performance. 

The impact of attitude similarity on co- 
hesion demonstrates once again the pervasive 
effect of this variable. Congruence of atti- 
tudes has been found to influence jury deci- 
sions (Mitchell & Byrne, 1973), dating be- 
havior (Byrne, Ervin, & Lamberth, 1970), 
the dollar amount of loans (Golightly, Huff- 
man, & Byrne, 1972), and interview decisions 
(Peters & Terborg, 1975) to name just a few. 
Given that group cohesion is related to com- 
munication, satisfaction, turnover, and ab- 
senteeism (Lott & Lott, 1965; Stogdill, 1972), 
the use of this inexpensive and easily admin- 
istered technique for assessing this aspect of 
group composition would seem to merit fur- 
ther investigation, 

The finding that group ability was not 
related to cohesion is less clear. It was as- 
sumed that if ability were related to perform- 
ance and if high performance were reward- 
ing, then ability should be related to cohesion, 
The relationship between ability and perform- 
ance was demonstrated. If the proposed causal 
Sequence were correct, then it would appear 
that for these groups, high performance was 
not rewarding. Recall, however, that it was 
assumed from the beginning that all groups 
would value high performance over low per- 
formance. Consequently, verification of this 
assumption was not attempted vis à vis sub- 
ject ratings. An indirect measure can be ob- 
tained, however, by examining the level of 
attained performance. To the extent that 
overall group performance was high, it can be 
inferred that the groups valued high perform- 
ance, Examination of performance cell means 
in Table 1 shows that the overall mean score 
of the high-ability groups was 18.10, and the 
overall mean score of the low-ability groups 
was 17.22. Based on the instructors’ ratings, 

where a score of 20 was an “A+,” the high- 
ability groups received on the average a grade 
of “A—,” while the low-ability groups re- 
ceived a “B+.” Considering that grades were 
assigned on the basis of how close the group’s 
calculations came to the known figures, and 
that scores did not necessarily have to follow 
a normal distribution, these high-performance 


means should support the assumption that the 
groups valued high performance over low per- 
formance. The observed lack of a relationship 
between ability and cohesion, then, would not 
appear to be due to the fact that high per- 
formance was not rewarding. Admittedly post 
hoc, a more tenable explanation could be that 
this difference in grades between high- and 
low-ability groups was too small to elicit dif- 
ferential feelings of task rewards and accomp- 
lishment. If this assumption were correct, to 
test the hypothesis that ability would be re- 
lated to cohesion would require not only that 
ability affect performance, and that high per- 
formance be rewarding, but that the difference 
between high and low performance be of suf- 
ficient magnitude to produce differential 
rewards, 

Of both methodological and theoretical 
interest are the obtained correlations between 
attitude similarity and cohesion and between 
performance and cohesion. Examination of 
Table 3 shows that attitude similarity and 
cohesion were not significantly correlated 
until the fourth project. Had the study ended 
after one or two projects, no relationship be- 
tween attitude similarity and cohesion would 
have been observed. For this reason the 
results, or lack of results, sometimes found in 
the typical group experiment done with ad 
hoc groups over short time durations appear 
questionable. 

The correlations between cohesion and per- 
formance are just as striking. Stogdill (1972), 
in his review of cohesion and productivity, 
reported 12 significant positive correlations, 
11 significant negative correlations, and 11 
nonsignificant correlations. The results ob- 
tained in this study provide additional evi- 
dence for the equivocality between cohesion 
and performance. In this instance, had the 
experiment stopped after the first project, the 
hypothesized positive correlation between 
group performance and cohesion would have 
received some support. However, by the time 
of the sixth and final project, the two varia- 
bles were significantly negatively correlated. 
Unfortunately, in the present experiment it 
was assumed that all groups would be moti- 
vated to obtain high scores, and independent 
measures of group drive were not assessed. If 
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they had, then Stogdill’s predictions concern- 
ing the moderating effect of group drive on 
the performance-cohesion relationship could 
have been examined. Conversations with the 
instructors at the conclusion of the course 
suggested two related potential reasons for the 
findings. First, although as argued earlier, 
high performance was valued over low per- 
formance, after a short time the groups might 
have become aware that almost no group 
would receive a grade lower than a B for this 
portion of the course. Second, the high-ability 
groups were virtually assured of receiving an 
A by the end of the third—fourth project. Thus, 
once the inevitability of receiving a high grade 
had been established, the group drive could 
have shifted from a task success orientation 
to a social orientation, According to Stogdill’s 
model, this shift would have produced the 
obtained results. Research in which group 
drive and cohesion can be experimentally 
manipulated is required in order to better 
understand how these two variables are re- 
lated. 

Although no predictions were made for a 
trials main effect, there was a significant dif- 
ference for performance. Grades tended to be 
lowest on Projects 1 and 2, highest on 
Projects 3 and 4, and then dropped off slightly 
on Projects 5 and 6. Again, a postexperi- 
mental discussion with the course instructors 
suggested one possible reason for these find- 
ings. Although all of the projects were rated 
as equal in difficulty, the instructors stated 
that Projects 3 and 4 covered the most in- 
teresting material; Projects 5 and 6, the least 
interesting material; while Projects 1 and 2 
covered the material that could be considered 
of intermediate interest. If these judgments 
were correct, then these results would be in 
partial agreement with the findings of Pep- 
insky, Pepinsky, and Pavlik (1960). They 
concluded that group performance is highest 
on tasks that are characterized by variety, 
decision making, and coordination. All of 
these are factors that can make the task more 
interesting. This finding again emphasizes the 
need to consider the task as a determinant of 
group performance (see Hackman, 1968, 1969, 
for a more complete discussion of the im- 
portance of task characteristics). The fact 


TABLE 4 


TOTAL OBSERVED FREQUENCIES 
or TEAM MEMBERS Pr 
Across THE Six Pro. 


FOR THE NUMBER 


Ability /atti 
similarity Absent Present 


High/high 5 oe 169 
High/low 5 217 
Low/high 8 226 
Low/low 12 156 


that Projects 5 and 6 were the least interest- 
ing coupled with the realization that by this 
time, nearly all groups had realized that they 
would be receiving a high grade are also con- 
sistent with the above argument that group 
drive could have changed from one of task 
success to a social orientation, 

Finally, given the existence of missing 
data and research evidence showing that 
group cohesion is related to the withdrawal 
behaviors of absenteeism and turnover, it was 
decided to do an analysis on the frequency 
of missing data (one type of withdrawal be- 
havior) according to conditions. The total 
frequencies for the present-absent factor over 
the six assignments in each experimental 
condition are indicated in Table 4. A factorial 
chi-square analysis was performed on these 
data according to procedures outlined in 
Winer (1962, pp. 631-632). Expected cell 
values were estimated from marginal totals 
(total x?(3) = 7.1613, p < .07). Partitioning 
the total chi-square value so as to yield ef- 
fects equivalent to the main effects for ability, 
attitude similarity, and their interaction indi- 
cated the following results: For ability, y*(1) 
= 3.3092, p < .07; for attitude similarity, 
x°(1) = .7578, ns; for the Ability X Attitude 
Similarity interaction, y°(1) = 3.0943, p< 
.08. Finally, an alternative partitioning of the 
total chi-square to yield a comparison of the 
low ability-low attitude similarity condition 
against the other three conditions (cf., Max- 
well, 1961, pp. 52-61) indicated significant 
results: x?(1) = 6.733, p < .01. This latter 
chi-square value represents 94% of the vari- 
ance in the total chi-square. Thus, it would 
appear safe to conclude that there was sig- 
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nificantly greater absenteeism in the low abil- 
ity-low attitude similarity condition (7.14%) 
than in any of the other three, which experi- 
enced only 2.25%-3.14% total absenteeism. 
To the extent that such absenteeism may be 
taken as reflecting withdrawal behavior, these 
data lend further support to the previously 
noted findings and contentions. 

In all, the results of this field experiment 
suggest that groups can be assembled so as 
to maximize the occurrence of desirable group 
outcomes. But perhaps more importantly, 
the results also stress the importance of 
longitudinal research. While this type of data 
collection is usually more costly (and prob- 
lematic) than one-time laboratory studies, the 
utility of the data for understanding group 
behavior may well be worth the added ex- 
pense and anguish. 
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Validation of Personality Traits: A Multicriteria Approach 


John McGowan and John Gormly 
Rutgers—The State University 


The following study investigated the behavioral validity of energetic behavior 
as a personality trait. Sixty-four members of two college fraternities were rated 
by their peers for energetic activity. The split-half reliabilities of the ratings 
were r = .90 and r = .87 for the two fraternities (p < .001). Eight people from 
the upper quartile of energetic behavior scores and eight people from the lower 
quartile were studied in naturalistic settings and were given a structured inter- 
view regarding their physical activities. The correlations between trait scores 
and interview data (r=.65) and between trait scores and performance in 
naturalistic settings (r = .70) were substantial. The findings support the posi- 
tion that while trait scores will not generally be highly predictive of specific 
behavioral acts, they will be highly predictive of performance that is averaged 
across of variety of behaviors related to the trait score. 


Considerable discussion about the validity 
of personality traits has been presented in the 
recent literature (see Alker, 1972; Bem, 
1972; Bowers, 1973; Gormly & Edelberg, 
1974; Mischel, 1973), The major impetus for 
this reconsideration of the trait viewpoint 
came from Mischel’s (1968) review of re- 
search in the areas of consistency in behavior 
across various settings and the predictive util- 
ity of trait scores. In general, the consistency 
in behavior across situations and the correla- 
tions between trait scores and behavior have 
been unsubstantial. Mischel and others have 
concluded from these data that personality 
traits were not rooted in the behavior of 
people and that personality traits have no 
validity. 

Gormly and Edelberg (1974), however, 
demonstrated that interpersonal aggressive- 
ness had traitlike properties and was rooted 
in the behavior of people described by the 
trait. Independent ratings of subjects by 
groups of their peers approached perfect 
agreement in assigning the trait, and this find- 
ing was replicated on a second set of subjects. 
Furthermore, when naive observers rated sub- 
jects for interpersonal aggressiveness after 
watching them behave in a controlled labora- 
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tory setting, they had substantial agreement 
with ratings of peers who shared a homelike 
environment with the subjects, Physical ap- 
pearance and voice were ruled out as major 
determinants of assigning the trait; it strongly 
appears that the trait is descriptive of regu- 
larities in behavior. 

In a later paper, Gormly and Champagne 
(Note 1) argued that many personality traits 
are descriptive of regularities in behavior and 
that the criteria of high predictive values be- 
tween traits and specific acts or high consis- 
tency in behavior across situations were in- 
appropriate for deciding the validity of a 
trait. According to Gormly and Champagne, 
personality traits describe the variance that a 
group of behavioral acts have in common. Al- 
though this common variance is only a small 
proportion of the total variance, it is real and 
meaningful. From this line of reasoning, they 
conclude that trait scores can be highly pre- 
dictive of a variable that represents a person’s 
performance averaged across several inter- 
correlated acts, whereas the trait scores will, 
in general, only be marginally predictive of 
specific acts. To demonstrate these properties 
of trait scores, they had subjects rated by 
peers for their degree of energetic behavior, 
Then the subjects came to the laboratory to 
answer questions about their activities and 
to perform a variety of tasks which require 
quick action. The median correlation between 
energetic behavior and the specific measures 
was .22, but the correlation between the trait 
and the person’s average performance was .78. 
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Clarification of the meaning of personality 
traits must await empirical demonstrations 
that some set of trait labels actually de- 
scribes traitlike properties of behavior. It is 
highly unlikely that all of the many thousand 
descriptive adjectives applicable to people are 
describing common elements of a person’s be- 
havior in many settings, An adequate valida- 
tion of personality traits will be a major un- 
dertaking. The purpose of the present study 
is twofold. The study was designed to provide 
additional data on the validity of energetic 
activity as a general component of behavior 
and to demonstrate a multicriteria approach 
toward validation of a trait. Trait scores were 
generated from the following three sources: 
(a) peer ratings of energetic behavior, (b) 
self-report of physical activities using a struc- 
tured interview, and (c) objective measures 
of behavior in naturalistic settings. Our hy- 
pothesis is that specific measures will be mod- 
estly related to peer ratings, but when the 
person’s scores are averaged over the specific 
measures, the average performance will be 
highly related to the ratings of energetic be- 
havior by peers. 


Method 


Subjects. Sixty-four male undergraduates from 
Rutgers University served as subjects for the first 
part of this study: 32 men came from one fraternity, 
and 32 men came from another fraternity, They 
were paid for their participation. 

Procedure. The subjects came to the laboratory 
to be tested individually. They were told that we 
were interested in their judgments about character- 
istics of people with whom they shared a homelike 
environment. 

The subjects were given a set of rating sheets. 
Each sheet contained a trait name, a list of the 32 
people from their fraternity who were to be rated, 
and the following instructions, which the experi- 
menter also read aloud: 


If the trait is a characteristic of any of the people 
listed below, print ves before that person’s name. 
If the trait is not a characteristic of the person, 
then print No before his name. Each Person should 
have either a yes or no before his name. 


The raters made judgments about their peers for 
15 traits, including the salient trait for this study: 
“energetic—physically active.” 

After the peer ratings were completed, the sub- 
jects were given a structured interview in regard to 
their physical activities. The interviewer asked the 
subjects to identify the sports that they had ac- 
tually participated in during the preceding year. They 
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had before them a rather exhaustive list of sports. 
In addition they were asked the following four 
questions: “On the average, how many hours a week 
do you spend in sports activities? What is the long- 
est distance you have ever walked in a day? What 
is the longest distance you have ever swam in a 
day? What is the longest distance you have ever 
ran in a day?” Data from the interview were ob- 
tained from 29 members of Fraternity A and 30 
members of Fraternity B. 

In the second part of this study, 16 subjects were 
recruited from the original sample so we could ob- 
serve their behavior in naturalistic settings. Based 
on the ratings of the peers, 8 subjects were taken 
from the upper quartile of energetic behavior scores, 
and 8 subjects were taken from the lower quartile, 
The experimenters received a schedule of activities 
from each participant (i.e, the classes and activities 
the subjects attended regularly). The subjects knew 
that the experimenters would be observing them in 
public settings sometime during the semester; they 
were, however, unaware that they had been selected 
based on their energetic behavior score and were 
unaware of the behavior the experimenters were re- 
cording. The experimenters did not know which sub- 
jects had been judged as energetic and which had 
been judged as nonenergetic. 

The experimenter recorded behavior in two set- 
tings: while the subjects were walking on campus 
and when the subjects were seated in class. The fol- 
lowing six areas of behavior Were measured: (a) the 
individual’s walking speed—distance/time, (b) the 
average distance per step, (c) the rate of speed 
while going up stairs—number of steps/time, (d) 
the rate of overt head movements—number of 
moves/time, (e) the rate of postural adjustments 
(observable movements of the spine) while seated 
in class—number of moves/time, (f) after comple- 
tion of the observations, the experimenter made a 
judgment of whether the subject was energetic or 
physically inactive. 


Results 


Our first analysis describes the degree of 
agreement among the raters for attributing 
energetic behavior. The ratings within each 
fraternity were arbitrarily divided into two 
groups; a person’s score for each set of raters 
was the proportion of raters who said the in- 
dividual was energetic. The scores were cor- 
related to determine the split-half reliability 
of the judges. The correlation coefficients were 
-90 for Fraternity A (n = 32) and .87 for 
Fraternity B (n = 32). In the analyses that 
follow, the person’s energetic behavior score 
represents the total proportion of judges who 
said he was energetic. 

All data were converted to rank-order 
scores, and the Spearman rank correlations 
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procedure was used. The data were averaged 
for the five scores from observing the subjects, 
for the five self-report answers regarding activ- 
ities, and a third score was produced by av- 
eraging the observation score with the self- 
report score. The intercorrelations of these 
three scores with the trait scores are pre- 
sented in Table 1. 

The correlation between energetic behavior 
and the average of the ranks for the five per- 
formance scores was substantial (.70 for the 
16 subjects). The correlation between the 
trait score and the average of the interview 
data was also high at .65 (for the total group 
participating from Fraternity A, 7 = .59, n = 
30, and from Fraternity B, r = .66, n = 29). 
Peer ratings of energetic behavior had their 
highest correlation with the combined aver- 
ages of performance and interview scores 
(r = 374). 

The discriminative power of trait scores 
for predicting the upper half and the lower 
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TABLE 1 


CORRELATIONS AMONG PEER 


SPEARMAN RANK 
3 © BEHAVIOR, PERFORMANCE 


OF PERFORMAN 


w Data 
Variable 1 2 3 4 

1, Trait score of 

energetic behavior >.90 
2. Performance in 

naturalistic settings 70 (2) 
3. Interview responses i TE S 
4. Average of 

performance and 

interview data 74° 91 (83 (?) 


Note, (?) indicates that we do not have a good estimate of 
the reliability of these scores, 


half of the averaged scores from the per- 
formance and interview data is displayed in 
Figure 1, One hundred percent of the non- 
energetic subjects were in the lower half and 
100% of the energetic subjects were in the 
upper half of the averaged scores. 
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Ficure 1. The relationship between peer ratings of energetic behavior and the average of 
performance and interview data. 
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TABLE 2 
SpEARMAN RANK CORRELATIONS BETWEEN ENERGETIC 
BEHAVIOR AND THE INDIVIDUAL VARIABLES 


Variable Ts 
Number of steps/sec AG* 
Head turns/sec 37 
Speed of climbing stairs 20 
Longest swim 30 
Hours/week in sports 6** 
Walking speed .60** 
Posture changes//sec saa" 
Longest walk A3 
Longest run A2* 
Participation in sports 

during year .64** 


Note. The internal consistency of these 10 variables (co- 
efficient alpha) is .67. 
*p <.05, 


DD <.01, 


The correlations between energetic behavior 
and the individual variables of this study 
were more modest than the correlations be- 
tween peer ratings and averaged scores, These 
correlations are given in Table 2. 

The chi-square test was used to examine the 
association between peer ratings of energetic 
behavior and the judgment of the experi- 
menter based on his observations in natural- 
istic settings. These data are given in Table 
3; the association was not significant, y°(1) 
=1.1,p> .05. 


Discussion 


The results of this investigation provide 
strong evidence that trait scores of energetic 
behavior are based on observable behavior 
that is performed in a variety of situations 
with scorable consistency. This conclusion is 
Supported by multiple criteria, which con- 
verge in demonstrating the validity of the 
trait: high interrater agreement in attributing 
energetic behavior to people with whom they 
live; substantial correlations between per- 
formance in naturalistic settings or sel f-report 
of activities and trait scores; a significant 
association between behavior in naturalistic 
settings and self-report of activities in other 
naturalistic settings. 

This study further supports the argument 
of Gormly and Champagne (Note 1) that 
whereas the association between trait scores 
and specific acts are modest, the association 
between trait scores and performance aver- 
aged across a variety of specific acts will be 


large. In the present study, the relation be- 
tween the average of the interview data and 
trait scores accounted for 42% of the vari- 
ance, the relation between the average of the 
performance data and trait scores accounted 
for 49% of the variance, and the correlation 
between the combination of performance and 
interview data accounted for 55% of the vari- 
ance. These findings are similar to those of 
the Gormly and Champagne study in which 
the correlation between trait scores and per- 
formance averaged over the other variables 
accounted for 61% of the variance. Taken to- 
gether, these studies show the involvement of 
energetic action in behavior as varied as ac- 
curacy in rotor pursuit, errors on complex 
mazes, the longest distance the subjects have 
run in a day, and speed in going up stairs. 
We accentuate the strength of association be- 
tween our trait score and behavioral score to 
illustrate the impact of averaging a person’s 
acts over a variety of behaviors that are logi- 
cally related to the trait. Of course, there are 
many factors that would change the amount 
of common variance between trait scores and 
behavioral scores. 

The rationale underlying the research re- 
ported here contrasts with that recently pre- 
sented by Bem and Allen (1974). Bem and 
Allen argue that the nomothetic assumptions 
of traditional personality trait research are 
incorrect; in this paper we accept the nomo- 
thetic model and still find a correlation be- 
tween a trait score and behavior to exceed 
-70. We hold that the assumption that per- 
sonality trait scores should be highly predic- 
tive of specific behavioral acts is inconsistent 
with the measurement and meaning of per- 
sonality traits, generally. The behavioral 
meaning of certain traits (e.g., energetic ac- 
tion and aggressiveness) is quite consistent 
for people living in similar environments 


TABLE 3 
RELATIONSHIP BETW PEER RATINGS 


EXPERIMEN JUDGMENT OF 
ENERGETIC BEHAVIOR 


Peer ratings 


Experimenter’s judgment High Low 
High 7 
Low 4 6 
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x ? 
(e.g., a college community); one can see this 
in the high degree of consistency among peer 
raters who are describing the subjects accord- 
ing to personality traits. 

Although we expected that the subjective 
judgments of energetic behavior by the ex- 
perimenter would have significant agreement 

“ with peer raters, such was not the case. In 
retrospect, the experimenter may have been 
too busy observing and recording specific be- 
havior to form a global opinion about the 
degree to which the subjects were energetic. 
Furthermore, the judgment of a single ob- 
server may be biased by idiosyncratic weight- 
ings of the signs of energetic behavior, In any 
case, the nonsignificant relationship between 
the experimenter’s judgment and peer ratings 
reduces the likelihood that the expectations 
of the experimenter regarding which individ- 
uals were more energetic contaminated the 
findings of this study. 

The tasks before us include the further 
development of a multidimensional network 
of traits that can define the structure of per- 
sonality. We believe that traits are meaning- 
ful and important aspects of behavior; we do 
not, however, believe that the list of traits 
that are securely founded in behavior will be 
large, It is still the responsibility of those 
who assert that specific adjectives indicate 
traits to demonstrate the validity of their 
traits. The present study and its predecessor 
(Gormly & Champagne, Note 1) provide a 
model for demonstrating the validity of traits. 

Although the issue of trait validity is funda- 
mental, there are other core concerns for the 
trait viewpoint. One of these concerns is the 
origin of individual differences on trait di- 
mensions. How does it come to be, for ex- 
ample, that one individual is highly energetic 
and another individual is hardly energetic at 
all? Data on the heritability of trait scores 
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(Gottesman, 1963) indicate that an important 
source of individual differences on trait di- 
mensions is rooted in the biology of people. 
Still we do not know to any degree what 
biological variables interact with experience 
to produce personality traits. Since the herita- 
bility data were published, a readily available 
technology has been developed that would 
permit extensive analyses of the performance 
of the autonomic nervous system and the 
hormonal systems, It is certainly time to map 
out the biological correlates of trait scores 
as a first step in understanding the involve- 
ment of biological factors in behavioral traits. 


REFERENCE NOTE 


1, Gormly, J., & Champagne, B. Validity in personal- 
ity trait ratings: A multicriteria approach. Paper 
presented at the meeting of the Eastern Psycho- 
logical Association, Philadelphia, April 1974. 
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Christine Eiser 
Hospital for Sick Children 
Great Ormond Street, London 


J. Richard Eiser 
Institute of Psychiatry 
University of London 


This study is concerned with effects of age and subjects’ orientations on the 
kind of information children request concerning their partners in a trading 
game. A total of 52 sixth- and eighth-grade students (male and female) 


made 20 trades, each of which consis‘ 
“goods” to their partner and receiving 
their (programmed) partner in retur 


low value (in imaginary dollars) to 


high or low cost to the player who sen 


ted of sending one of four alternative 


one of four alternative goods from 
Each of the goods was of high or 


the player who received them and of 


t them. Subjects were told the value 


to themselves of the goods they received and the cost to themselves of the 
goods they sent. They were not told the values and costs of the other player 
but could “buy” this information from the experimenter. Subjects tended 


to ask for more information concernin! 


g the value to the other of what they 


sent than concerning the cost to the other of what they received. Eighth 
graders asked for more information concerning others’ costs than sixth graders. 
The tendency to ask for more value than cost information was most marked 
among (a) those subjects who sent Jeast high-value goods to their partner 
and (b) those subjects who, on the basis of a pretest, appeared to be indi- 
vidualistically rather than competitively oriented, It is argued that differences 
in the amount of cost information requested may reflect differences in subjects’ 
interest in making valid attributions concerning their partners’ intentions. 


Analyses of cooperative and competitive 
behavior have typically taken as their start- 
ing point a consideration of the outcomes or 
payoffs that accrue to the different partici- 
pants as a result of particular kinds of be- 
havior (Carson, 1969; Deutsch, 1949; Thi- 
baut & Kelley, 1959). Experimenters have 
generally taken pains to ensure that all par- 
ticipants understand the implications of the 
full matrix of payoffs. Outside the laboratory, 
however, many transactions take place in 
which individuals do not have complete 
knowledge of the relevant payoffs, particu- 
larly as regards the costs or benefits to other 
individuals of specific kinds of behavior or 
exchange. In such circumstances, therefore, 
what kinds of missing information will par- 
ticipants be most keen to acquire, and what 
relation, if any, will there be between the 


Preparation of this article was assisted by a grant 
from the Social Science Research Council, London. 

Requests for reprints should be sent to Christine 
Eiser, Department of Psychological Medicine, Hos- 
pital for Sick Children, Great Ormond Street, Lon- 
don, England WCIN 3JH. 


acquisition of such information and their own 
intentions? 

An attempt to answer these questions was 
made by Eiser and Tajfel (1972), who re- 
quired pairs of subjects (male nonpsychology 
students of the University of London) to send 
each other a series of messages consisting of 
colored cards from a choice of five colors. 
Depending on its color, each card cost the 
sender a certain number of points and was 
worth a certain number of points to the per- 
son who received it. Subjects knew their own 
values and costs and so could calculate their 
own individual profits, but they were not told 
how much the cards they received had cost 
the other person (cost information) or how 
much the cards they sent were worth to the 
other person (value information). When given 
the chance to buy such information from the 
experimenter (at the cost of a deduction from 
their total score) subjects asked for many 
more items of information concerning the 


1 This previous study was part of a larger program 
of research by the Transnational Working Group 
on the Dynamics of Conflict. 
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value to the other of what they gave than 
the cost to the other of what they received. 
In addition, this value-cost difference was 
found to be largest for subjects classified as 
more competitive on the basis of their ex- 
pressed intentions. 

These data were interpreted in terms of the 
different strategic importance of each kind of 
information. Value information can enable 
both the competitive subject to avoid giving 
the other player any cards of high value and 
the less competitive subject to try and shape 
the other player’s responses (e.g., through 
some strategy of reciprocity). Cost informa- 


. tion has less direct instrumental value because 


it is primarily an indicator of the constraints 
on the behavioral choices open to the other 
player. A knowledge of such constraints, how- 
ever, might be useful to a subject seeking to 
shape his partner’s behavior in the direction 
of mutual cooperation. It might also provide 
him with a basis for making attributions con- 
cerning the other’s intentions (cf. Eiser, Aiye- 
ola, Bailey, & Gaskell, 1973). Kelley and 
Stahelski (1970) have suggested that com- 
petitive individuals tend to approach mixed- 
motive encounters with the assumption that 
the other participant will be competitive also, 
whereas cooperative individuals are more 
heterogeneous and, by implication, more open- 
minded in their assumptions concerning the 
other’s intentions. If such differences were 
reflected in different tendencies to seek in- 
formation relevant to making attributions 
concerning the other’s intentions, it might 
help explain the tendency for the less com- 
petitive subjects in the Eiser and Tajfel 
(1972) study to ask for relatively more cost 
information. 

One ambiguity in this analysis concerns the 
definitions of competitiveness. A distinction 
may be drawn between subjects who seek to 
maximize their own profits, without concern 
for those of the other player, and subjects 
who seek to beat the other player, even per- 
haps where this involves a reduction in their 
own individual outcomes; contrasting with 
both of these are subjects who seek to maxi- 
mize the joint outcomes of themselves and 
their partners. This may be compared with 
Deutsch’s (1960) distinction between indi- 
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vidualistic, competitive, and cooperative ori- 
entations. It is also essentially the same dis- 
tinction that is implied by the finding that 
subjects (typically schoolboys aged 15-16 
years) who are required to assign monet: cy 
awards to a member of their own in-group 
and a member of an out-group will frequently 
choose responses in the direction of produc- 
ing a greater relative difference between the 
amounts assigned to the in-group and out- 
group members (so that the in-group member 
gets more than the out-group member), even 
where this is incompatible with producing 
either the maximum absolute outcome for the 
in-group member considered individually or 
the maximum joint outcome for the in-group 
and out-group members together (Billig & 
Tajfel, 1973; Tajfel, Flament, Billig, & 
Bundy, 1971). 

Further evidence for this distinction is 
provided by McNeel (1973), who classified 
female college students as “own gain” or 
“relative gain” oriented on the basis of their 
responses in a Prisoner’s Dilemma and a 
Maximizing Differences Game and found that 
own-gain but not relative-gain subjects in- 
creased their levels of cooperation in response 
to conditional cooperation by the other player. 
Using similar game situations, Wyer and 
Malinowski (1972) have derived quantitative 
measures of individualistic and competitive 
orientations and have shown these to be re- 
lated, at least in same-sex interactions, to 
subjects’ attempts to maintain equity— 
greater individualism being observed when the 
subject perceives the other player’s inputs in 
the interaction to be similar to his own. It 
should be noted that in the Wyer and Mali- 
nowski approach there is no separate measure 
of cooperativeness. Altruism or cooperative- 
ness consists simply of being low on the com- 
petitiveness dimension. This is essentially the 
approach adopted in the present study, which 
attempts to relate subjects’ requests for in- 
formation concerning the other players’ costs 
and values to subjects’ positions both on di- 
mensions of cooperativeness—competitiveness 
and individualism. 

The measures of subjects’ orientations that 
are used also assume a comparability between 
the two kinds of experimental situations so 
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far considered—the two-person, mixed-motive 
games used by McNeel (1973) and Wyer and 
Malinowski (1972) and the intergroup situ- 
ations used by Tajfel and his associates. In 
the latter, measures of motivational orienta- 
tions are derived from subjects’ distribution 
of rewards between a member of their own 
group other than themselves and a member of 
another group. The measures of individualism 
and competitiveness we use are of this kind, 
and are, in a sense, more “projective” than 
those used by Wyer and Malinowski (1972) 
because the subject’s own outcomes are not 
affected by how he makes such distributions. 
Nonetheless, Turner (1975) has shown that 
distributions of outcomes between an in- 
group and an out-group member closely cor- 
respond to those between self and an out- 
group member, so the distinction between 
the two methodologies may be less crucial 
than might at first appear. 
Our main purpose, therefore, is to replicate 
the Eiser and Tajfel (1972) finding that 
subjects tend to ask for more value than cost 
information and to clarify the effect of sub- 
jects’ orientations on their information re- 
quests. In addition, the problem of informa- 
tion acquisition is considered within a devel- 
opmental context. It has generally been shown 
that both cooperative and competitive re- 
sponses increase with age, with competition 
emerging at slightly younger age levels (cf. 
Cook & Stingle, 1974). However, the empha- 
sis in such research has been mainly on the 
level of cooperative or competitive respond- 
ing. Our aim in this study is instead to in- 
vestigate how children’s motivational orien- 
tations may affect the information they seek 
about another person. Exactly how children 
come to anticipate the thoughts or feelings of 
others is a central issue in theories of cogni- 
tive or social development; the assumption 
is that children become increasingly capable 
of accurately predicting others’ emotions with 
age (Feffer, 1970; Feffer & Gourevitch, 1960; 
Flayell, Botkin, Fry, Wright, & Jarvis, 1968). 
In the present paradigm, it is cost informa- 
tion in particular that enables the child to 
understand the constraints on the other per- 
son’s behavior and hence to interpret interac- 
tions from points of view other than his own. 
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te) 
We therefore predicted that older children 
would ask for more information concerning 
the cost of goods to others than would younger 
children. 


METHOD 
Subjects 


Subjects were 52 pupils from a large independent ~ 
Catholic school in Kitchener, Ontario. All were white 
and from predominantly lower-middle-class and 
working-class backgrounds. They included 16 boys 
and 12 girls from Grade 6 (mean age = 11.91 years) 
and 12 boys and 12 girls from Grade 8 (mean age = 
14,09 years). The subjects were tested in groups of 
four, which were homogeneous with regard to grade 
and sex, 


Procedure 


All sessions were conducted by the first author in 
a trailer that had been parked on the school grounds. 
Subjects were seated behind screens that separated 
them both from the experimenter and from the other 
subjects in their group. 

The main part of the experiment consisted of a 
two-person trading game, which was introduced to 
the subjects as follows: 


I’d like you to pretend that we’re in the North of ` 
Canada and there are some Indians living there. 
One day, some explorers come along, and try to 
trade some goods with the Indians. In return 
the Indians want to trade their goods with the 
explorers. Now, two of you are explorers, and 
two are Indians. Each explorer has to trade with 
one of the Indians. On the table you'll find a red 
book. If you turn it over it will tell you if you 
are an Indian or an explorer. Please don’t tell the 
others who you are till afterwards. I’m not going 
to tell you whom you're trading with, but it’s one ` 
of the three other people here. 


At this stage, before proceeding with the trading 
game proper, subjects were administered a short 
pretest designed to identify possible differences in 
their orientations toward the task. Each subject 
was required to assign certain rewards (imaginary 
dollars) to the two members of the other trading 
dyad. Thus, if Indian 1 was to trade with Explorer 
1, and Indian 2 with Explorer 2, both Indian 1 and 
- Explorer 1 independently assigned rewards to both 
Indian 2 and Explorer 2. It was explained that this < 
was “so that everyone has some dollars to start 
with” and that it did not cost the subjects anything 
to assign these rewards. They were told that the 
rewards assigned to them by the other players would 
be added on to their total scores at the end of the 
game, but until then the amount of the rewards 
assigned would not be disclosed. To make these 
assignments, subjects were given a response booklet 
containing the four matrices (one matrix per page) 
shown in Table 1. Their task was to check one 
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The Function of Group Size and Ability Level on Solving a 
Multidimensional Complementary Task 


Thor Egerbladh 
Department of Education, University of Umeå, Sweden 


The experiment was designed with reference to both an additive model and 
a complementary model reported by Steiner. A multidimensional task (i.e., 
IQ-test items) was used. The subjects were 108 students from Grades 4 and 
3. A 3X3 (Ability Levels High, Middle, and Low X Group Sizes 1, 2, and 
3) analysis of variance was performed. The complementary model was sup- 
ported by the results, and the assumption of linearity from the additive model 
seemed to hold for the complementary model, too. Significant main effects 
were found for group size and ability level ( < 01), High- and middle-ability 
students gained approximately twice as much from the dyad and triad situa- 
tions as low-ability students. The results also give some evidence for the 
importance of giving more attention to different ability levels in this type of 


small-group research. 


Several models for predicting group per- 
formance have been reported by Steiner 
(1966). Each model assumes specific task 
demands, The additive model assumes that 
the task demands require every member of a 
group to perform exactly the same function 
and predicts that potential group perform- 
ance will be a function of the sum of the 
members’ abilities. Steiner states that the po- 
tential productivity of groups will vary as a 
positive linear function of group size. This 
assumption concerning linearity is only stated 
for the additive model. Support for this model 
appears in findings reported by Wiest, Porter, 
and Ghiselli (1961) and Comrey and Staats 
(1955). Wiest et al. (1961) used jigsaw puz- 
zles, with undergraduates serving as subjects. 
Individual and dyad performances were com- 
pared in different ways. One of the main 
results was that an additive combination of 
individual scores gave a good prediction of 
group performance. Comrey and Staats used 
crossword puzzles. The subjects were under- 
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graduates, and performances of individuals 
and dyads were compared for high and low 
ability levels. Eighty-two percent of the vari- 
ance in the groups was predicted from a 
linear combination of high and low individual 
scores. 

The complementary model (Steiner, 1966) 
is, however, of greater interest in this report. 
This model assumes that the task demands 
require the group members to have both 
shared and unshared abilities, that is, “a bit 
of information that is known by one member 
of a population will ordinarily be known by 
at least some of the other members of the 
population” (Steiner, 1966, p. 281). The 
complementary model predicts that group 
performance will be less than the values of 
the summed abilities of the members but 
greater than the values of the individuals 
performing alone. Goldman (1965) used the 
items from an intelligence test (Wonderlic 
Intelligence Test) as the complementary task. 
The test was constructed for the general adult 
with speed instructions, but the subjects were 
college students and no time limit was im- 
posed. Individuals and dyads were compared 
for high, middle, and low ability levels. The 
main findings were that (a) dyads showed 
significant higher values, (b) low-ability sub- 
jects gained more from the dyad situation 
than middle-ability subjects, and (c) the 
middle-ability group gained more than the 
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TABLE 1 


MEANS AND STANDARD DEVIATIONS ror GROUP 
SIZES AND ABILITY LEVELS 


Group size 
Ability level 1 2 3 

High 

M 30.50 35.00 39.32 

SD 3.15 4.05 2.31 
Middle 

M 24.90 31.42 32.70 

SD 6.22 3.57 2.64 
Low 

M 23.67 26.17 27.48 

SD 6.15 4.32 4.03 


high-ability group. Laughlin and Johnson 
(1966) repeated Goldman’s experiment but 
used a iest (Terman Concept Mastery Test) 
constructed for the population from which 
the subjects (college students) were drawn 
as a unidimensional complementary task. By 
unidimensional they meant that the test items 
were chosen from one area (i.e., the verbal 
one). No time limit was imposed. The dyads 
performed better in almost every experi- 
mental condition, but the group effect was 
lowest for low-ability subjects. Laughlin, 

Branch, and Johnson (1969) extended this 

experiment by comparing individuals and 

triads. The triads performed significantly bet- 
ter than individuals and, as in the previous 
experiment, the group effect was lowest for 
the low-ability subjects. Branch (1971) com- 
pared high-, middle-, and low-ability subjects 
on a multidimensional complementary task. 
The task required the rank ordering of 15 
items as well as knowledge of many different 
areas (verbal, numerical, etc.). Main findings 
were that (a) triads improved more than in- 
dividuals, (b) low-ability subjects improved 
more than middle-ability subjects, and (c) 
middle-ability subjects improved more than 
high-ability subjects. Branch (1971), how- 
ever, did not report any characteristics for the 
difficulty of the task, for example, whether or 
not the task was constructed for a normal 
population. 

These last four experiments were here ex- 
tended in mainly three ways: (a) Individual 
performance was compared with both dyad 
and triad performance, which permitted a 
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test of linearity; (b) the subjects were re- 
cruited from another development level, that 
is, the subjects were 10-11 years old; and 
(c) an exact time limit was imposed. 

The main purposes of this experiment were 
to study the effects of group size (1, 2, and 
3) and different ability levels (high, middle, 
and low) on solving a multidimensional com- 
plementary task, which was constructed for 
the population from which the subjects were 
recruited. The experiment was designed with 
reference to the complementary and the ad- 
ditive models reported by Steiner (1966), 


Method 


Subjects. One hundred and eight students from 
Grades 4 and 5 were assigned randomly to the dif- 
ferent experimental conditions. The individuals were 
working alone, the dyads in pairs, and the triads in 
three groups on different ability levels (high, middle, 
and low), with six replications for every experi- 
mental condition. The experiment was performed in 
the students’ ordinary classrooms. 

Material. The even items in an 1Q test constructed 
for Grades 4 and 5 (Westrin, 1966) were used as 
the multidimensional complementary task. The 
maximum time for solving the items was limited to 
16 minutes, that is, half the time stated for the whole 
IQ test (Westrin, 1966). The Kuder-Richardson 20 
for the complementary task was .88 in this experi- 
ment. 

Design and procedure. A 3X3 (Group Sizes 1, 2, 
and 3X Ability Levels High, Middle, and Low) 
analysis of variance was performed. The assignment 
on different ability levels was done according to 
the results from the uneven items in the IQ test 
mentioned before (Westrin, 1966). The students who 
scored 27 points or more were assigned to the high 
ability level. The students who scored between 26 
and 21 points were assigned to the middle ability 
level, and the students who scored 20 points or less 
Were assigned to the low ability level. Same-sex 
groups were used. Both dyads and triads were 
placed at two desks, standing adjacent to each 
other, in order to give the same spatial ground for 
the interaction. The dyads and triads were instructed 
to cooperate; the individuals were instructed to work 
alone. The instructions given in the manual for the 
IQ test were followed, and all students were in- 
structed to do their best and to mark or write down 
the right answers as fast as possible. 


Results 


A 3 X 3 (Group Sizes 1, 2, and 3 X Ability 
Levels High, Middle, and Low) analysis of 
variance was performed on number of correct 
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items. The main effects of group size and 
ability level were significant, F(2,45) = 
12.44, p < .01; F(2,45) = 21.10, p<.01, 
respectively. The Group Size X Ability Level 
interaction was not significant, F(4,45) = 
.85. Means and standard deviations for the 
nine conditions are presented in Table 1. 

The complementary model (Steiner, 1966) 
predicted marked increases in group perform- 
ance as compared with individual perform- 
ance. Therefore, a test for planned compari- 
sons (Keppel, 1973) was performed for dif- 
ferences between group size (1, 2, and 3). 
Only the differences between 1 and 2 and 1 
and 3 were significant (p < .01). 

A test for trend (Winer, 1971) of the main 
effect of group size (1, 2, and 3) showed that 
96% of the variance was predicted by the 
linear regression equation, F(1, 51) = 27.06, 
p < .01. The increase from 1 to 3 was 29% 
for high, 31% for middle, but only 16% for 
low ability level. The results are demon- 
strated in Figure 1. 

It should be noticed that the function of 
high ability level is almost perfectly linear, 
which, however, is not the case for middle 
ability level. Furthermore, the value for mid- 
dle ability level on the individual level is al- 
most the same as for low ability level on the 
individual level. 


Discussion 


The complementary model is supported by 
the results in this experiment, that is, the 
dyad and triad values were less than the 
summed values of the individuals but signiñ- 
cantly higher than the individual values. A 
more interesting finding, however, is that the 
complementary model can be extended with 


TABLE 2 


ANALYSIS OF VARIANCE FOR GROUP Sizes 
AND ABILITY Li 


Source df MS F 
Group size (A) 2 225.20 12.44* 
Ability level (B) 2 382.15 21.10* 
AXB 4 15.35 85 

Error 45 18.11 
* p< 01 
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Ficure 1. The function of group size and ability 
level on solving the complementary task (the test 
items). 


an assumption from the additive model (i.¢., 
with the assumption of linearity). This as- 
sumption is, of course, for the present time 
limited to Group Sizes 1, 2, and 3. Future 
experiments should therefore incorporate 
Group Sizes 4, 5, 6, and so on when experi- 
ments with reference to the complementary 
model are carried out. 

The increase from 1 to 3 is markedly less 
for low than for middle and high ability lev- 
els. These results are consistent with the find- 
ings reported by Laughlin and Johnson 
(1966) and Laughlin et al. (1969) but not 
with the findings from Goldman (1965) and 
Branch (1971). A task of moderate difficulty 
for the respective subjects was used in the 
first two experiments. An easy task was used 
in Goldman’s experiment, which can be an 
explanation of the inconsistent results. Branch 
(1971) did not report normative data for 
task difficulty, and the task in his experiment 
required rank ordering of the items. 

A rather interesting finding is that the 
results for the multidimensional task in this 
experiment are consistent with the results for 
the unidimensional task reported by Laughlin 
and Johnson (1966) and Laughlin et al. 
(1969). This general finding has also been 
reported by Branch (1971). The results of 
the present study with 10- and 11-year-old 
children extend these previous results for 
adults. The performance of students from 
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paired with one of the eight information con- 
figurations: high or low distinctiveness, high 
or low consensus, and high or low consistency 
information. In line with Kelley’s predictions, 
McArthur found that the frequency of stim- 
ulus attribution was greater with high con- 
sensus, high distinctiveness, and high con- 
1 sistency. The frequency of person attributions 
< was high when distinctiveness was low and 

consistency was high, regardless of the state 
___ of the consensus variable. Orvis, Cunningham, 
7 } and Kelley (1975) have recently replicated 
“and extended McArthur’s results to various 
\ patterns of incomplete information. 


7 


A distinguishing feature of McArthur’s 
and Orvis et al.’s investigations is that both 
employed observer paradigms and pencil-and- 
paper presentations of the relevant informa- 
tion variables. Both studies constitute impor- 
tant first steps toward an investigation of the 
theory as it applies to another’s behavior but 
yield no information concerning how indi- 
viduals infer the causes of their own be- 
havior. This distinction is important in light 
of recent attribution research, which has re- 
peatedly found significant differences between 
attributions made by actors and those made 
| by observers (Jones, 1976). Since Kelley’s 
= model is intended to apply to actors as well 
as observers, a logical next step is an investi- 
gation of self-attributions made following ex- 
posure to variations in distinctiveness, con- 
sensus, and consistency regarding one’s own 
behavior. 

y A second limitation of previous research on 

= the Kelley cube (one recognized by Mc- 
Arthur) has been the use of a questionnaire 
format in which the variables of the cube are 
explicitly stated for the subjects. Whereas 
questionnaires may be used to demonstrate 
that individuals are capable of using these in- 
formation variables in a systematic fashion, 
they are mute with regard to whether they 
will use them when the variables are embed- 
ded in the natural flow of events, 

The present study was designed to test 
Kelley’s attribution model by employing a 
framework in which the individual was left to 
derive distinctiveness, consensus, and con- 
sistency information from the pattern of his 
success—failure experience on four tasks of 
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sensory judgment. Because the subject’s own 
performance level was involved in manipulat- 
ing the variables of the Kelley cube, it was 
considered highly probable that defensive at- 
tribution (cf. Shaver, 1970) would distort 
predictions drawn from the rational model in 
the direction of protecting or enhancing the 
subject’s self-esteem. In particular, successes 
would be attributed to internal factors and 
failures to external circumstances, 

Weiner, Frieze, Kukla, Reed, Rest, and 
Rosenbaum (1971) have proposed that one’s 
performance outcome may be attributed to 
various combinations of ability, effort, luck, 
and task difficulty. Ability and effort are 
properties of the person engaging in the task, 
whereas luck and task difficulty are situa- 
tional properties, external to the person. In 
addition, ability and task difficulty are seen 
as stable elements, whereas effort and luck 
are unstable because they vary from situation 
to situation. Weiner et al.’s four variables are 
easily mapped into Kelley’s three attribu- 
tional loci: person (ability and effort), stimu- 
lus (task difficulty), and circumstance (luck). 

Though it seems quite reasonable to expect 
that attributions for good and bad perform- 
ance outcomes will be defensively biased, the 
literature is actually quite confusing, Some 
studies have found support for the notion of 
defensive, self-serving attribution (Beckman, 
1970; Chaikin & Darley, 1973; Fitch, 1970; 
Johnson, Feigenbaum, & Weiby, 1964; Sny- 
der, Stephan, & Rosenfield, 1976; Streufert & 
Streufert, 1969; Wortman, Costanzo, & Witt, 
1973), whereas others have not (Chaikin, 
1971; Feather, 1969; Ross, Bierbrauer, & 
Polly, 1974; Stephan, 1975). 

We assume that these variations in results 
are jointly a function of the affective signifi- 
cance of success or failure, on the one hand, 
and the clarity and status of the kinds of 
informational variables Kelley describes, on 
the other. Weiner et al. (1971) have proposed 
that one’s perception of one’s ability at a 
task is dependent upon the proportion of suc- 
cess one has experienced at that and similar 
tasks. This notion corresponds to Kelley’s 
consistency and distinctiveness factors. When 
high consistency of performance is coupled 
with high generality, greater ability attribu- 
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tion should result. Results of a study by 
Feather (1969) indicated that when subjects 
experienced an expected success (failure), 
they tended to attribute their performance to 
their own ability (or lack of ability). When 
they experienced an unexpected success (fail- 
ure), the outcome was attributed to good (or 
bad) luck. A similar pattern of results was 
obtained by Feather and Simon (1971). 

Weiner et al. (1971) assume that task 
difficulty information is gained from social 
norms regarding the performance of others at 
the task. This corresponds to Kelley’s con- 
sensus factor. If one’s performance is consis- 
tent with a majority of other persons’ (high 
consensus), then thisshs a salient cue for task 
difficulty attributions. Low consensus, on the 
other hand, offers little information concern- 
ing the difficulty of the task but is relevant to 
one’s ability. If one succeeds when most 
others fail, it is easy to infer that one has 
considerable ability at the task. If one fails 
when most others succeed, the obvious conclu- 
sion is that one has little ability at the task. 
Evidence that task difficulty attributions in 
an observer paradigm are highly dependent 
upon consensus information was provided by 
Frieze and Weiner (1971). Low-consensus in- 
formation (that the person’s performance was 
discrepant from a majority of others) re- 
sulted in relatively high attributions to abil- 
ity, effort, and luck. High-consensus informa- 
tion (that the person’s performance was equal 
to a majority of others) increased attribu- 
tions to the task. 

Thus, from a consideration of Kelley’s cube 
and empirical findings from observer studies, 
the following predictions might be made: 


1. Ability attributions should be greater 
under low than high distinctiveness and under 
low than high consensus. The presence of high 
consistency should magnify these differences. 

2. Effort attributions should be greater un- 
der high than low distinctiveness, and greater 
under low consistency and consensus than 
under high consistency and consensus. 

3. Task attributions should be greater un- 
der high consensus and consistency than un- 
der low consensus and consistency. 

4. Luck attributions should be greater under 
high than low distinctiveness, and greater 


under low consistency (especially) and low 
consensus than under high consistency and 
consensus, 


These predictions ignore the role of the 
attributor’s own successes and failures, how- 
ever, and the intriguing question concerns 
the contribution of biasing tendencies that 
may distort attributional logic in the interests 
of self-defense. To the extent that defensive 
attribution tendencies are present, they 
should be most prominent when the subject’s 
self-esteem is most threatened (i.e., when he 
has consistently failed while others have suc- 
ceeded). Defensive attribution should take 
the form of attributing success to ability or 
effort and failure to luck or task difficulty. 


METHOD 
Subjects 


The subjects were 174 male undergraduate stu- 
dents enrolled in introductory psychology courses at 
Duke University. Subjects received credit toward a 
course requirement of research participation. Eleven 
subjects who were well distributed among the ex- 
perimental conditions were excluded for expressing 
suspicion. Two subjects were excluded for failing to 
recall accurately their performance scores, and one 
was excluded for failing to follow instructions. Thus, 
the final analysis was performed on data from 160 
subjects, with 10 subjects in each of 16 cells, 


Overview of the Design 


The design was a 2 X 2 X 2X 2 complete factorial 
with two levels of perceived performance on the 
“critical” task (success or failure), two levels of 
distinctiveness information (high or low), two levels 
of consistency information (high or low), and two 
levels of consensus information (high or low). Sub- 
jects were told that they would be participating in 
four “sensory discrimination” tasks, The first task 
called for visual matching of different sized squares, 
the second called for weight discriminations, the 
third for auditory discriminations, and the critical 
fourth task called for visual discriminations of line 
length. Subjects were given false feedback regarding 
their performance on each task, Following comple- 
tion of the final task, subjects filled out a question- 
naire asking them to assign responsibility among 
four factors (ability, effort, task difficulty, and luck) 
for their performance on that task. Performance 
feedback was manipulated by the random assign- 
ment of success or failure on the fourth task. Con- 
sistency information was introduced by manipulating 
the subject’s perceived performance on the first task 
in relation to his performance on the fourth. The 
similarity of the two tasks was stressed. Distinctive- 
ness information was provided by manipulating the 
subject’s perceived performance on the two dissimilar 
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middle tasks in relation to his performance on the 
fourth one, Consensus information was provided in 
the form of a graph, which supposedly depicted the 
range of scores obtained from persons who had 
previously participated. The graph was placed inside 
the questionnaire so that all subjects were exposed 
to it before answering the questions. 


Procedure 
Subjects signed up for an experiment entitled 


- “sensory discrimination tasks” and were run in 


groups of two, three, or four. When the subjects 
arrived at the experimental room they were seated 
at a long table, which had been partitioned into four 
stations, Each station was fully equipped with all 
necessary experimental materials and once seated, 
the subjects could no longer see one another. 

The experimenter then explained that the tasks on 
which the subjects would be working were being 
developed for future use with hospital patients. The 
cover story raised the possibility that long-term pa- 
tients may, because of their confinement, suffer 
from “decrements in sensory functioning.” The tasks 
would eventually be used to assess the extent of such 
decrements to determine whether a “sensory therapy” 
program was necessary. The current subjects were 
being asked to work on the tasks so that a standard- 
ized normal distribution could be developed. To pave 
the way for the subsequent consensus manipulation, 
the experimenter explained that a preliminary dis- 
tribution of scores had been obtained the previous 
semester from similar subjects but that it was neces- 
sary to expand the sample size to achieve a more 
reliable standardization. 

The experimenter also expressed interest in whether 
a person’s feelings about the tasks might “influence 
his overall performance.” This paved the way for 
later obtaining subjective reactions to the fourth task 
(the crucial attribution measures). It was explained 
that because of time pressure the subjects would only 
be questioned about one task and that this would 
vary from session to session. Actually, all subjects 
answered questions about the same task, always the 
fourth one. 

The subjects were told that after each response 
they gave to each task they would be informed as 
to whether the response was correct or incorrect 
through the use of the two lights at their stations. 
The use of lights in place of verbal reports was 

justified in terms of ensuring control over the 
experimental situation. The experimenter pointed out 
that knowing how others were doing at the task 
might influence a person’s subsequent performance. 
The light system prevented this possible contamina- 
tion. Subjects were instructed to keep track of their 
performance by using the scoring sheets at their 
stations. The independent variables were then 
manipulated by differential feedback on the four 
tasks described below. After completing all the 
tasks, subjects filled out an attribution question- 
naire and answered other questions about their reac- 
tions to the experiment. Finally, each group was 
thoroughly debriefed. 
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The Tasks 


Task 1: Visual-square discrimination task, For 
each of 13 trials each subject turned over a card 
with a blue square on it and tried to match it in 
size with 1 of a randomly arrayed series of 10 
standard squares on a bulletin board 8 feet (243.84 
cm) away. 

Task 2: Weight discrimination task. Subjects were ~~ 
to match each of 13 weights to 1 of 10 standard 
weights. Both sets of weights were made of split shot 
attached by monofilament strands to a wooden 
stick. The weights were concealed in two separate 
boxes and each weight could be lifted by picking 
up the stick to which it was threaded through a 
hole in the box top. 

Task 3; Auditory discrimination task. On each of 
13 trials subjects tried to match a standard tone to £ 
1 of 3 following “test” tones. 

Task 4: Visual line-discrimination task, This was 
identical to the first task except that subjects 
matched the length of lines rather than the size of 
squares (this task was adapted from Phares, 1957). 


Manipulations of the Independent Variables 


Performance. Success and failure feedback on each 
of the tasks was delivered in predetermined patterns 
regardless of the subject’s actual performance on the + 
task. Each discrimination was designed with the aid 
of pretesting to be sufficiently difficult so that sub- 
jects would not be likely to question the validity of 
the feedback. Slightly different feedback patterns 
were given for success and for failure on each task. 
For the success feedback conditions subjects scored 
from 9 to 11 correct on the tasks, and for the 
failure feedback conditions scores ranged from 3 to 
4 correct out of the 13 problems. Since the sub- 
jects had to record the outcome of each discrimina- 
tion attempt, they were clearly aware of their level 
of performance. The high-low performance manipu- * 
lation hereafter refers to success versus failure on the 
critical fourth task. 

Distinctiveness. This was determined by the simi- 
larity or contrast between the subject's performance 
on the fourth task and on the middle two. Although 
Tasks 2 and 3 were conceptually similar to Task 4 
in measuring sensory discrimination, the tasks in- 
volved different senses and were therefore quite dis- 
tinctive. Subjects were always made to succeed or 
fail on both Tasks 3 and 4. High-distinctive sub- 
jects received different performance feedback after i 
the fourth task than after the middle two tasks. 
Low-distinctive subjects received the same level of 
feedback after each of the last three tasks. 

Consistency. Not only did the first and fourth 
tasks appear quite similar, but the experimenter 
verbally stressed their near identity. The fourth 
task was described as being included “as a check on 
the consistency of performance across different ver- 
sions of the same task.” Completely identical tasks 
were not used to avoid the possibility of differential 
suspicion—that is, it was feared that radically dif- 
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ferent performances on the identical task would seem 
too implausible and arouse suspicion, High-con- 
sistency subjects either failed on the first and fourth 
tasks or succeeded on both. Low-consistency sub- 
jects received different performance feedback on the 
two tasks. 

Consensus. Consensus information was provided 
through the use of a mock graph indicating the per- 


4 formance of subjects who had taken the same tasks 


$ 


during the preceding semester. Four different ver- 
sions of the graph were prepared, one for each of the 
four combinations of performance (success or failure 
on the fourth task) and consensus (high or low) 
conditions. Under high consensus, subjects failing on 
the fourth task saw that 84% of the previous sub- 
jects had performed equally poorly. Successful high- 
consensus subjects saw that 84% had performed 
equally well. Under low consensus, failing subjects 
saw that only 16% had done as poorly, and success- 
ful subjects saw that only 16% had done as well. 


P 
Dependent Measures 


One dependent measure on the final questionnaire 
asked subjects to allocate causal responsibility for 
their performance on the critical task by distribut- 
ing 100 points among four factors: ability, effort, 
task difficulty, and luck. In addition, a second item 
presented subjects with four separate 9-point scales 
(one for each of the four causal factors) with end- 
points marked “very important” and “very unim- 
portant.” The subject was asked to place a slash 
mark at the point on the line that best indicated 
how important he felt that factor had been in con- 
tributing to his performance. 

In addition, several manipulation checks were in- 
cluded. As a check on the perception of distinctive- 
ness, subjects were asked to indicate what their 
scores had been on the four tasks. For consistency, 
subjects were asked to place a slash mark on a 
9-point scale indicating how similar they had 
thought Tasks 1 and 4 to be. For consensus, they 
were asked to indicate how well they thought they 
had performed on the critical task in comparison 
with other people. This response measure was again 
a 9-point scale ranging from “much worse” to “much 
better.” 


RESULTS 
Validity of the Manipulations 


In response to the postexperimental ques- 
tion concerning how well they had done on 
Task 4, subjects provided data validating 
both the performance and consensus manipu- 
lations. As Table 1 shows, there is a strong 
main effect for success-failure, F(1, 144) = 
1,101.58, $ < .001, and the Performance X 
Consensus interaction, F(1, 144) = 256.62, p 
< 001. The experience of success or failure 
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TABLE 


VALIDATION OF PERFORMANCE AND CONSENSUS 
JIPULATIONS* 


Consensus 


Performance High Low 
Success 6.53 8.15 
4.43 2.28 


Failure 


better") 
lid on 
ple?” 


a Mean ratings from 1 (‘‘much worse") to 9 (“muc 
in answer to the question “How well do you thin! 
the line-discrimination task in comparison with other 1 


relative to others is, expectedly, amplified by 
low consensus. 

When subjects were asked to rate the simi- 
larity of the first and fourth tasks on a 9- 
point scale, con stent-condition subjects 
thought the tests were more similar (M = 
7.84) than those in inconsistent conditions 
(M = 6.73; F(1, 144) = 31.97, p<.001). 
This presents problems, as discussed below, 
since it indicates that the performance and 
similarity manipulations were not orthogonal. 

There was no separate manipulation check 
on distinctiveness, but all subjects (except the 
two who were discarded) accurately perceived 
the pattern of successes and failures corre- 
sponding to their distinctiveness condition, 


Attributions to Ability, Effort, Task 
Difficulty, and Luck 


Data analysis procedures. In order to as- 
sess the experimental hypotheses both the 
ratings and the point allocations were ana- 
lyzed for the effects of performance, consis- 
tency, distinctiveness, and consensus on attri- 
butions to ability, effort, task difficulty, and 
Juck. The choice between the two dependent 
variable formats poses a dilemma. The four 
dependent variables are conceptually inter- 
dependent to some unknown degree within 
the framework proposed by Heider and the 
other attributionists. That is, it is expected 
that internal attributions such as ability will 
vary inversely with external attributions such 
as luck. The point allocation procedure recog- 
nizes this interdependence explicitly by forc- 
ing a distribution on the responses. This has 
the advantage of theoretical compatibility, 
but the disadvantage of making it difficult to 
talk about effects on one dependent variable 
without talking about inversely covarying ef- 
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TABLE 2 


UNIVARIATE F VALUES FoR Four 
CAUSAL FACTORS 


Condition Ability Effort Task Luck 
Success-Failure (A) 7.41 23,32 64 24,407 
Consistency (B) 3.82 3.18 5.80* «01 
Distinctiveness (C) 1.60 79 OL 8.42 
Consensus (D) A2 .09 2.00 50 


Note. The only significant interaction in the rating data was 
between performance and consensus on task difficulty, F (1,144) 
= 4.10, p <,05, All reported significant univariate effects are 
verified by significant multivariate F values for the patterning 
of the four dependent variables. 

*p <.05. 
bad <.01, 
wre p <O01. 


fects on the others. Whereas the rating pro- 
cedure avoids this forced interdependence, it 
provides opportunities for systematic rating 
biases and, in the present study, turned out 
to be a more confusing task for many sub- 
jects than the point-allocation task. The 
higher variances produced by the more inde- 
pendent rating procedure seemed in part due 
to confusion and careless responding. 

As a solution to this dilemma, we first re- 
port the rating data in terms of the appro- 
priately conservative multivariate analysis. 
Following this we amplify on some of the 
more interesting and subtle features suggested 
by the ratings but featured more prominently 
in the point-allocation data. The latter throw 
into sharper relief the dynamics of attribution 
in the present instance, and more of the point- 
allocation effects exceed or approach statisti- 
cal significance. 

For the first analysis subjects’ ratings on 
each dependent variable were analyzed by a 
multivariate procedure to protect against 
underestimating the role of chance in obtain- 
ing significant differences in a (16 X 4 =) 
64-cell design, The univariate F values from 
this analysis are summarized in Table 2, and 
Figure 1 depicts the means involved in the 
main effects. 

The hypotheses. If we ignore the effects of 
success versus failure, it is clear that there is 
little overall support for the hypotheses de- 
rived from attribution theories. Ability is a 
nearly significant function of consistency, in 
line with the Kelley cube prediction, but not 
of distinctiveness or consensus. Effort is not a 
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function of any of the Kelley cube variable 
and the direction of the marginal differen: 
produced by consistency reverses the predi 
tion following from Weiner et al.’s (197 
assumption that effort is an unstable attribu 
Task difficulty is a significant function of c 
sistency but, as we shall see below, the dire: 
tion involves a reversal of the attribut 
theory prediction. The significant effects 
distinctiveness on luck also reverse the attı 
bution theory prediction. In order to begin to 
understand the experimental results and to 
recover any contact with Kelley’s attribution 
theory, it is necessary to examine the effects 
of each independent variable with particul: + 
attention to success and failure. 

Performance. The most striking result of 
the present experiment is the tendency to at- 
tribute success to ability and effort and to 
attribute failure to luck. This assignment 
pattern departs significantly from chance on 
the multivariate analysis, F (4, 141) = 11.25, 
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p <.001. These results clearly imply a de- 
fensive posture, with successful subjects tak- 
ing credit for their good performance on Task 
4 and failing subjects attributing their poor 
performance to bad luck. Surprisingly, task 
difficulty attributions were not reliably af- 
fected by performance (though failure sub- 


-« jects did attribute significantly more points 


than success subjects to the task in the per- 
centage analysis, F(1, 144) = 10.93, p <.01). 

Distinctiveness. By the logic of the Kelley 
cube, a distinctive success or failure should 
be attributed to such external factors as task 
difficulty and luck. It is quite surprising to 
find, then, that low-distinctiveness subjects 


~ attribute their performance on Task 4 to luck 


more than high-distinctiveness subjects, multi- 
variate F(4, 144) = 2.94, p < .02. To under- 
stand this finding as fully as possible, it 
should be noted that there is a nearly sig- 
nificant Performance X Distinctiveness inter- 
action, F(1, 144) =3.25, P< .07, for luck 
ratings. Whereas luck attributions are little 
affected by distinctiveness when the subject 
succeeds on Task 4, distinctiveness plays a 
strong role with failing subjects. Those who 
fail on Tasks 2, 3, and 4 make a much 
stronger luck attribution (M = 5.85) than 
those who fail after two successes (M= 
4.35), F(1, 144) = 11.07, p < .001. By com- 
parison, the luck ratings of the successful sub- 
jects do not vary as a function of distinctive- 
ness, F(1,144) = .60, ns. It appears, then, 
that the attributional principles embodied in 
the Kelley cube are in this instance subverted 
by defensiveness: Those who fail three tasks 
in a row choose an unstable external attribu- 
tion (luck) where the generalization of re- 
sponse should suggest a stable internal attri- 
bution (ability). In fact, high distinctiveness 
leads to the attribution of slightly greater im- 
portance to ability than low distinctiveness. 

Consistency. It was expected that incon- 
sistent success or failure on highly similar 
tasks would lead subjects to make unstable 
attributions either to luck or to effort. This 
was decidedly not the case. Inconsistent sub- 
jects felt that a variation in task difficulty 
level was a significant factor in their differ- 
ential performance on the two visual acuity 
tasks, F(1, 144) = 5.80, $ < .05. In spite of 


815 
80r 
8 Hi Consensus 
s 70 
= ae 
E Sa. Oy ae — Lo Consensus 
E 
5.0 ! L 


Success Failure 


Ficure 2. Effects of performance and consensus on 
the rated importance of task difficulty. (Lo= low; 
Hi = high.) 


the apparent similarity of these tasks, and 
the emphasis on that similarity by the experi- 
menter, inconsistent subjects tended to per- 
ceive the tasks as relatively dissimilar, as we 
have seen by the similarity check, and capi- 
talized on this perceived dissimilarity in their 
attributions. 

There was a marginally significant effect, 
F(1, 144) = 3.82, p < .06, of consistency on 
ability ratings. Consistent subjects attributed 
more importance to ability than inconsistent 
subjects. This, at least, is in line with the 
prediction that consistent performance should 
be attributed to a stable factor. 

Consensus. None of the main effects of con- 
sensus approached significance, but there was 
a significant Performance X Consensus inter- 
action on ratings of the importance of task 
difficulty. As Figure 2 shows, this interaction 
is almost entirely a function of those sub- 
jects failing the fourth task. When they have 
learned that others also failed the task, there 
is a firm attribution to task importance. When 
they have learned that others succeeded on 
the task, the attribution is considerably lower 
(for failure subjects only, F(1, 144) = 5.92, 
p <.05). This pattern is partly in line with 
Kelley’s attributional principles but even 
more consistent with considerations of defen- 
siveness. Thus, subjects are ready to seize on 
task attribution when they have failed along 
with others; here defensive attribution and 
the Kelley cube operate in the same direction. 
However, when subjects have succeeded, there 
is even a slight reversal of the effects of con- 
sensus. Those who have company in success 
attribute their performance to the task much 
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TABLE 3 


ANALYSIS OF VARIANCE SUMMARY FOR Four CAUSAL FACTORS 
(BASED ON ARCSIN VALUES OF PERCENTAGES) 


Variable Ability F Effort F Task P Luck F 
Success-Failure (A) 12.41*** 225709 10.93** 31.07*** 
Consistency (B) 5.78* 3.26 19,51*** <1.00 
Distinctiveness (C) 3.21 147 5.29* 16.95*** 
Consensus (D) 2.48 4.48* 3.54 6.24* 
AXB <1.00 1.77 3.90 <1.00 
AXC <1.00 <1.00 1,24 2.79 
AXD 1.38 1.42 3.40 3.25 
BXC <1.00 <1.00 1.12 <1.00 
BXD <1.00 <1.00 <1.00 <1.00 
CXD 7.24** <1.00 2.50 1.22 
AXBXC <1.00 <1.00 <1.00 <1.00 
AXBXD <1.00 1.49 <1.00 <1,00 
AXCXD <1.00 3.63 2.25 <1.00 
BXCXD <1,00 <1.00 <1.00 <1.00 
AXBXCXD <1.00 <1.00 1.60 37 

Note. df = 1 in all cases except the within-subjects variable, where df = 144, 
*p <.05, 
**p < 01. 

+ p < 001. 


less than those who have company in failure. 
The high-consensus successful subject seems 
reluctant to attribute his success to the ease 
of the task. 


Some Observations Regarding Distinctive 
Cells: The Point-Allocation Data 


As anticipated by our earlier discussion, 
analyses of the point-allocation data were also 
conducted. As we have noted, the point-alloca- 
tion scores are not independent of each other 
and, in common with all proportional data, 
the means and variances tend to be related. 
Following Streufert and Streufert (1969), the 
variances were stabilized through the trans- 
formation, Xij = 2 arcsin V Xija, where Xia 
is the original percentage entry in each cell 
(cf. Winer, 1971). The resulting values were 
placed in four separate analyses of variance, 
one for each dependent variable. 

The results of these analyses are sum- 
marized in Table 3. Here it is apparent that 
each independent variable has a significant 
effect on at least one of the dependent varia- 
bles. In general, significance values are larger 
than in the comparable analysis of the rating 
data, though the pattern of effects is highly 
similar. 

A notable difference occurs in the effects 
of consensus. With the point-allocation data 
the effects of consensus on effort and luck 
were both significant, F(1, 144) = 4.48, p< 


05; F(1, 144) = 6.24, p < .05, respectively. 
More effort is attributed when consensus is 
high than when consensus is low. Although 
this makes little sense as an isolated finding, 
it may be in part a reflection of the inter- 
dependent ability ratings. The effect of con- 
sensus on ability approaches significance, 
F(1, 144) = 2.48, p < .20, in a direction op- 
posite to its effect on effort. Quite in line with 
the logic of the Kelley cube, low-consensus 
subjects attribute their successes and failures 
to the importance of ability more than high- 
consensus subjects. Perhaps point allocations 
to effort reflect an interdependence among in- 


1A combined mixed-design analysis of all data 
does not permit the comparisons afforded by the 
separate analyses since the percentage estimates (and 
their arcsin transformations) add to unity. How- 
ever, a total analysis does test differences among 
the attribution categories and provides a check on 
the legitimacy of comparisons based on the four 
separate analyses. Such an analysis was conducted, 
combining the four between-subjects factors with 
the four within-subjects factors of causal assign- 
ment. As necessary, all between factors and their 
interactions produced F ratios at or Near zero. All 
two-way interactions between the independent (be- 
tween-subjects) variables and the within-subjects 
causal assignment variable were significant, indicating 
that different patterns of causal attribution were 
indeed made as a function of performance, distinc- 
tiveness, consistency, and consensus. This legitimizes 
the separate analyses for each of the four dependent 
variables. 
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TABLE 4 
MEAN Arrrinutions BY INDIVIDUAL CELLS (RATINGS AND PERCENTAGES) 
Ability Effort Luck 

Conditions Descriptionė Rating % Rating % Rating % Rating % 
1. (HHHH) 64 304 6.7 34.8 6.5 26.6 3.2 8.2 
2. (LHHH) 5.5 26.7 6.1 21.5 69 35.7 44 16.1 
3. (HHHL) 7.2 47.5 7.5 24.8 6.0 16.5 2.8 11.2 
4. (LHHL) 6.0 36.0 6.5 25.5 5.3 20,8 4.7 17.7 
5. (HHLH) 5.8 34.8 6.5 32.5 6.2 20.0 3.6 12.7 
6. (LHLH) 5.6 30.2 5.9 27.0 6.7 21.7 5.5 21.1 
7. (HHLL) 5.8 36.0 6.6 31.2 6.1 21.4 3.0 11.4 
8. (LHLL) 4.9 18.0 4.5 21.0 5.3 21.5 6.6 39.5 
9. (HLHH) 5.5 24.5 7.0 34.9 6.5 26.8 44 13.8 
10. (LLHH) 44 18.7 34 12.7 8.2 57.5 4.0 11.1 
11. (HLHL) 5.6 38.5 6.5 19.0 7A 34.5 3.0 8.0 
12. (LLHL) 5.7 29.5 5.4 16.0 7.2 36.0 4.3 18.5 
13. (HLLH) 6.7 25.5 7.3 38.0 me -26:5 3.6 11.0 
14. (LLLH) 4.3 22.2 4.6 17.9 7.2 36.8 4.9 23.1 
15. (HLLL) 5.2 30.4 6.3 30.5 6.9 22.5 4.0 16.6 
16. (LLLL) SFFEF/So 4.7 16.6 5.0 15.6 6.7 D382 64 32.6 


Nole. H = high; L = low; S = success; F = failure. 


^The notations refer to experimental conditions in the order from left to right: performance, consistency, distinctiveness, and 


consensus (all high or low), 


» The notations refer to successive experiences. (Thus, SFFS/: 


failed the next two, succeeded on the fourth, and learned that 
ness, high consensus, success condition, 


ternal factors: As ability goes up in impor- 
tance, effort comes down. 

The effects of consensus on luck attribu- 
tion are more easily explained. It seems rea- 
sonable to attribute luck as an explanation of 
performing differently than most other sub- 
jects, even though Kelley’s cube proposes 
that external attributions (such as luck) 
should follow high consensus. Here it would 
seem that the unstable nature of luck is more 
important than its external status. There is 
no reason why everyone should be lucky or 
unlucky together. Defensive attribution also 
plays a role in that it is especially the failing 
subjects who seize upon lack of consensus 
to attribute their performance to poor luck, 
This is confirmed by the nearly significant 
Performance X Consensus interaction, F(1, 
144) = 3.25, p <.10, and by the fact that 
the consensus difference for failing subjects is 
highly significant, F(1, 144) = 8.14, p < .01, 
and the difference for success subjects is neg- 
ligible, F(1, 144) < 1.00, ns. 

To penetrate the significant Success-Fail- 
ure X Consistency  Distinctiveness X Con- 
sensus (A X B X C X D) interaction, and 
to understand some of the other attributional 
subtleties, it is necessary to look at the mean 
point allocations for individual cells. Table 4 


So means that in this condition subjects succeeded on the first task, 
others also succeeded, This is the high consistency, high distinctive: 


presents both the rating and percentage 
means for each of the 16 cells in the experi- 
mental design. Our present interest is in ex- 
amining the most distinctive cells to determine 
whether the outcome makes sense by Kelley’s 
attributional logic. To bring out the natural 
interdependence of the dependent variables, 
the discussion will emphasize the point-allo- 
cation data, though we cover only those in- 
stances in which the rating data are in basic 
agreement with the percentage figures, 

The AX BX CX D interaction on luck 
involves each independent variable, As we 
have already noted, luck attribution is espe- 
cially attractive when the subject fails and 
others succeed. This tendency is even stronger 
when the subject has failed Tasks 2 and 3 
(_FFF/S, = 36.05) and strongest when the 
subject has failed all tasks (Cell 8, FFFF/S, 
= 39.5).* It seems undeniably clear that luck 


2 Comparable notations will be used throughout 
the paper to indicate individual cells or combinations 
of cells. Thus S/F. refers to all those subjects who 
succeeded and then learned that others had generally 
failed. SSSS/F, refers to the single cell: success, high 
consistency, low distinctiveness, and low consensus, 
F ___F/F, refers to the two cells in which sub- 
jects consistently failed and learned that others 
failed as well on Task 4. 
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attributions are the preferred explanation for 
failure. In general, the worse things get, the 
more luck is assigned. As Table 4 shows, the 
cell that is especially high in ability attribu- 
tion (47.5%) is Cell 3 (SFFS/F,). Here the 
subject succeeds where others failed and he 
himself has failed on the two preceding tasks. 
Luck can hardly be involved because he suc- 
ceeded on a very similar task before. The task 
cannot be an especially easy one because oth- 
ers have failed it. The fact that the subject 
has failed Tasks 2 and 3 perhaps adds a little 
urgency to the acceptance of personal credit 
through ability attribution. Differential attri- 
bution of ability versus motivation is here 
restricted to the point-allocation data, how- 
ever, suggesting that forcing a distribution 
yields more psychologically meaningful infor- 
mation than the simple rating procedure. 
Another interaction is of borderline signifi- 
cance in the point-allocation data. This is the 
Performance X Consistency interaction on 
Task Attribution, F(1, 144) = 3.90, p < .06. 
The main effect of consistency on task attri- 
bution is a clear reversal of prediction in both 
the point-allocation and the rating data. It 
was expected that subjects would class the 
first and fourth tasks together and, therefore, 
that task attributions would be higher when 
the subject either succeeded or failed on both 
tasks. However, the main effect must be in- 
terpreted in light of the nearly significant 
interaction. It is entirely produced by the 
failure subjects; those who fail after an 
earlier success are apparently especially moti- 
vated to blame the task. It is as if, having 
already succeeded on another task of the 
same type (visual discrimination), the sub- 
jects are tempted to see the fourth task as an 
unfair or aberrant example of this type. 
Furthermore, the relationship between con- 
sistency and task attribution can best be ex- 
plained with reference to a single anomalous 
cell (Cell 10, Table 4, SSSF/F,) where the 
failing subject has not only succeeded on the 
first task but has succeeded on the second 
and third tasks as well. In addition, others 
have generally failed on the crucial fourth 
task. The mean task attribution percentage in 
this cell is 57.5, 20 points higher than the 
next nearest cell! If we take the perspective of 
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these subjects, it now seems less astounding 
that the original prediction was disconfirmed 
and the means even reversed—especially if 
subjects generalized the high-consensus infor- 
mation to include the first three tasks. When 
subjects have been generally successful and 
then fail the fourth task along with the ma- 
jority, it must be something about the task. 
The free responses of these subjects on a 
postexperimental questionnaire indicate often 
elaborate distinctions that are drawn to jus- 
tify the perceived dissimilarity of the square- 
and line-judging tasks. 

Because of the built-in interdependence of 
the percentage scores (and the conceptual in- 
terdependence of the rating scores) it is also 
of interest to examine distinctive tendencies 
to avoid certain kinds of attributions. Thus, 
it turns out that the most luck-avoidant cells 
are those in which there has been a distinctive 
success (for the _FFS Cells 1, 3, 11, and 13, 
mean luck attribution = 9.6, all other cells 
M = 19.52, t=4.31, p< .001). After two 
successive failures the subject who succeeds is 
not likely, it appears, to chalk up his success 
to luck. 


Discussion 


There is little question that the attribu- 
tions for a failing performance are heavily 
influenced by the apparent need to defend 
self-esteem. The most striking and pervasive 
finding is that subjects feel responsible for 
doing well but externalize the attributions for 
failure. This defensive attribution tendency is 
most obviously reflected in the large main 
effects for performance on ability, effort, and 
luck, but the intriguing thing about the re- 
sults is the complex mixture of rational and 
emotional considerations they reflect. 

Generally, subjects who have experienced 
some degree of success appear to be more 
rational than uniformly failing subjects. Sub- 
jects confronting the worst circumstances are 
the most defensive. Probably the most dev- 
astating condition is Cell 8, FFFF/S,. The 
attributional pattern in this cell involves the 
subject’s denial of any reflection on his abil- 
ity (M = 18.0%) and a special emphasis on 
the importance of luck (M = 39.5%). This 
is a clear violation of Kelley’s model since the 
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perfectly rational subject should conclude 
that he is definitely responsible for his fail- 
ure. After all, he has responded consistently, 
indiscriminately, and in deviation from con- 
sensus. We should entertain the possibility, 
however, that subjects view luck as an in- 
ternal, if temporary, state. Thus, they may 
be trying to tell us that “this is just not my 
lucky day.” Such a possibility mutes the raw 
defensiveness interpretation of luck attribu- 
tion after failure and raises questions about 
Weiner’s treatment of luck as a truly external 
and unstable factor. 

According to Kelley’s model, Cells 1 and 2 
should be full of externalizing subjects. In 
both cases consistency, distinctiveness, and 
consensus are high, The externalization ten- 
dency is much stronger when failure is in- 
volved (FSSF/F,) than when success is in- 
volved (SFFS/S,), since defensive attribution 
and rational attribution operate in the same 
direction in the failure case. But the most 
interesting finding is that external attribution 
(task plus luck) is even slightly less in the 
FSSF/F, case than in the dire FFFF/S, case. 
Using the percentage data, in the first case a 
score derived by subtracting the internal 
means from the external means is 3.6. In the 
latter case the score is 22.0. This comparison 
reverses the Kelley prediction and shows, once 
again, the power of defensive attribution 
when things are not going well. 

When circumstances are less dire, how- 
ever, patterns of rational information process- 
ing are more apparent. Individual cell com- 
parisons (especially with the point-allocation 
data) highlight the subjects’ sensitivity to 
variations in consistency, distinctiveness, and 
consensus. The reversal of the consistency 
prediction seems to be less an example of irra- 
tionality or defensiveness than an instance of 
the vulnerability of the similarity manipula- 
tion to performance differences. As the simi- 
larity manipulation check revealed, subjects 
view Tasks 1 and 4 as dissimilar to the ex- 
tent that they perform differently on the two 
tasks. And yet, it is difficult to conceive of a 
stronger similarity manipulation that does not 
involve identical tasks. Consensus seems to 
overpower the predicted consistency effects. 
The high task attribution in Cell 10 (SSSF/ 
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Fo) no doubt derives in part from the diff- 
culty of inferring bad luck when others have 
done poorly on the same task. 

It is more difficult to explain why consen- 
sus affects effort attribution so that those in 
high-consensus conditions see themselves as 
striving harder than low-consensus subjects. 
This effect comes mainly from the success- 
distinctive conditions. When the subject has 
a history of failure and he finally succeeds 
when others fail, ability seems to be a more 
attractive explanation than effort. The sub- 
jects are motivated to claim responsibility for 
success because they have previously failed, 
and the fact that their success is not shared 
by others appears to justify the more attrac- 
tive self-attribution of ability. 

In a recent review of “self-serving biases in 
the attribution of causality” Miller and Ross 
(1975) question most of the evidence sup- 
porting such postperformance bias, They are 
especially skeptical concerning the tendency 
toward external bias after failure, acknowl- 
edging that there is more support for in- 
creases in internal attribution after success. 
This acknowledgment is accompanied by 
three alternative “cognitive” explanations for 
the association between success and internal 
attribution: (a) People tend more to expect 
success than failure, and people are more 
likely to accept responsibility for expected 
outcomes. (b) Especially when success is in- 
creasing, there is often a perceived covaria- 
tion between doing better and trying harder. 
(c) People are biased in associating control 
only with the occurrence of a desired out- 
come; effort associated with negative outcomes 
tends to be ignored. 

In the present study we have tried to sort 
out the cognitive from the motivational (self- 
serving) factors, but it is certainly difficult to 
ignore the latter, and they seem to be operat- 
ing more strongly in the denial of responsi- 
bility for failure than in the acceptance of 
responsibility for success. We have noted, for 
example, that the attributional rules of the 
Kelley model are most likely to be subverted 
or overthrown if a person’s performance is 
irremediably bad. Thus the consistent, low- 
distinctive, low-consensus, failure attributes 
his fate primarily to bad luck. 
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What is especially surprising about the 
degree of self-serving attributional bias is that 
the tasks would seem to be relatively non- 
involving. As compared with anagrams, 
teaching skill, or social sensitivity, one’s abil- 
ity to discriminate weights, colors, and visual 
shapes hardly seems a likely source of great 
pride or fearful threat. And yet we see college 
undergraduates shaping their responses to en- 
hance the meaning of success and dampen the 
meaning of failure. Miller and Ross express 
the intuitive expectation, in spite of their hard 
and skeptical look at empirical data, that self- 
serving causal attributions probably do exist 
in spite of the methodological flaws in the 
demonstration of defensive attribution to date. 
The present results certainly support this in- 
tuition, even though they also show the op- 
eration of more rational, “cognitive” factors. 
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Forty subjects were administered the port 
and after operant autonomic conditionin: 
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able rod-and-frame test (RFT) before 
g training or control experiences. Ten 


subjects were assigned to each of four groups: a heart rate conditioning group; 


a false heart rate feedback group; a galv: 
and a control group. Half of the subjects 
informed of the response-reinforcement 


anic skin-response conditioning group; 
in the three experimental groups were 
contingencies, and half were not in- 


formed, Informed subjects in the conditioning groups exhibited significantly 
lower portable RFT scores after conditioning. The results suggest that aware- 


ness of and control over autonomic fun: 
minant of RFT performance. 


The rod-and-frame test (RFT) developed 
by Witkin and his colleagues as a measure of 
perceptual style (Witkin, 1949; Witkin & 
Asch, 1948a, 1948b; Witkin, Dyk, Faterson, 
Goodenough, & Karp, 1962; Witkin, Lewis, 
Hertzman, Machover, Meissner, & Wapner, 
1954) has been related to a number of other 
personality dimensions (e.g., lack of self- 
awareness, Minard & Mooney, 1969; inner- 
and outer-directedness, Riesman, 1950; and 
passivity in dealing with the environment, 
Rosner, 1957). In their 1962 survey of the 
literature on RFT performance, Witkin and 
his colleagues cited several studies in which 
experimental attempts to influence perform- 
, ance were unsuccessful. Witkin et al. (1962) 
concluded that performance on the RFT is 
“resistant to change by experimental means, 
and ... appears to be a stable characteristic 
of a person” (p. 373). 

However, several other studies that have 
since appeared seem to indicate that changes 
in RFT scores can be produced by experi- 
mental manipulations. Jacobson (1966) dem- 
onstrated that sensory deprivation resulted in 
* lower RFT scores for college males and fe- 
males, whereas Goldstein and Chotlos (1966) 
achieved similar results by exposing chronic 
alcoholics to sensory deprivation. Chess, 
Neuringer, and Goldstein (1971) and MecWil- 


Requests for reprints should be sent to Thomas R. 
McCanne, Department of Psychology, Northern Tli- 
nois University, DeKalb, Illinois 60115. 
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ctioning may be an important deter- 


liams, Brown, and Minard (1975) have also 
reported significant decreases in RFT scores 
among alcoholics who participated in thera- 
peutic treatment programs. Two recent studies 
(Linden, 1973; Pelletier, 1974) have indi- 
cated that the practice of meditation for 
several months has an effect on perceptual 
style as measured by the RFT and the Em- 
bedded-Figures Test. The results of all of 
these experiments suggest that focusing atten- 
tion on internal cues and bodily sensations 
may facilitate performance on the RFT. 
Interestingly, in an early series of studies, 
Witkin and his colleagues (Wapner & Witkin, 
1950; Witkin & Wapner, 1950) had enter- 
tained the hypothesis that individual differ- 
ences in RFT performance might be attribu- 
table to variability in body sensitivity or 
proprioceptive awareness. This hypothesis was 
rejected, although later research by Silver- 
man, Cohen, Shmavyonian, and Greenberg 
(1961) indicated that subjects classified as 
field independent in RFT performance ex- 
hibited better ability to discriminate stimuli 
on the surfaces of their bodies than subjects 
classified as field dependent in RFT perform- 
ance. In a more recent series of experiments, 
Cohen and his co-workers (e.g., Cohen, Sil- 
verman, & Shmavonian, 1962; Cohen, 1967; 
Cohen, Silverman, & Shmavonian, Note 1) 
have suggested that field-independent sub- 
jects respond to classical conditioning and 
sensory deprivation experiments with more 
highly differentiated patterns of physiological 


822 THOMAS R. McCANNE AND CURT A. SANDMAN 


responding than field-dependent subjects. 
Sandman (1975) has also provided data indi- 
cating that field-independent subjects respond 
to stressful situations with more highly dif- 
ferentiated patterns of physiological response 
than do field-dependent subjects, and he has 
suggested that field-independent subjects may 
rely more on physiological cues to evaluate 
their perceptions of stressful stimuli than field- 
dependent subjects. 

All of the work cited above suggests that 
awareness of bodily cues and sensations is 
associated with RFT performance. However, 
Nevill (1974) recently has suggested an al- 
ternate interpretation of some of these results. 
Although she did not examine changes in 
RFT performance, she demonstrated that per- 
formance on the RFT is affected by the de- 
gree to which an individual is dependent on 
others, She has argued that the increased RFT 
performance noted following sensory depriva- 
tion may be due to the decreased importance 
of external cues in such situations and has 
suggested that sensory deprivation may lead 
to decreased dependency on others. 

One means of further examining the rela- 
tionship between awareness of bodily cues 
and RFT performance is the use of operant 
conditioning techniques to produce changes in 
bodily awareness. Such conditioning methods 
have been used extensively in the last 10 
years to increase attentiveness to bodily re- 
sponses (for selective reviews of the litera- 
ture in this area, see Blanchard & Young, 
1973; Katkin & Murray, 1968; Kimmel, 
1967; McCanne & Sandman, 1976). 

Brener and his colleagues (Brener, 1974; 
Brener, Kleinman, & Goesling, 1969) have 
provided data indicating that operant rein- 
forcement serves to “calibrate” the available 
intrinsic information by defining what types 
of internal sensations are associated with ex- 
ternal feedback. Moreover, operant condition- 
ing techniques have the advantage of encour- 
aging an individual to focus on internal cues 

without becoming less dependent on external 
sources of information. That is, in order to 
utilize operant physiological feedback, an in- 
dividual must simultaneously attend to in- 
ternal bodily cues and external reinforcement 
(Brener, 1974). Hence, such conditioning 


approaches provide another method for exam- 
ining the salience of Nevill’s (1974) argu- 
ments concerning sensory deprivation and 
RFT performance. 

One purpose of the present study was to 
examine the impact of operant autonomic 
conditioning on RFT performance. If aware- 


ness of bodily cues is an important determi- ” 


nant of RFT performance, then operant 
autonomic conditioning should result in lower 
RFT scores. Such a change, if noted, would 
be independent of the decrease in dependency 
on external cues, which Nevill (1974) has sug- 
gested as an important determinant of RFT 
performance. 


Physiological Responses and Attention 


Another purpose of the present research was 
to examine the relationships between specific 
physiological responses and RFT perform- 
ance. One important determinant of RFT per- 
formance is the ability of an individual to 
separate an item from its context (Witkin et 
al., 1962), that is, to attend to one portion of 
the environment selectively while simultane- 
ously excluding other parts of the environ- 
ment. Although it is obvious that the task is 
a relatively complicated one requiring a num- 
ber of complex skills, it is also clear that at- 
tentional processes play an important role in 
determining performance on the task. 

One physiological modality that may be 
related to the attentional components of RET 
performance is heart rate. In a series of in- 
genious experiments, the Laceys and their 
colleagues (e.g., Lacey, Kagan, Lacey, & 
Moss, 1963; Lacey & Lacey, 1970, 1974) 
have demonstrated that changes in heart rate 
are related to the performance of tasks that 
require varying degrees of “attention to the 
environment” (Lacey & Lacey, 1970). In 
particular, heart rate deceleration has been 
found to occur during tasks that require en- 
vironmental attention and has been related to 
enhanced performance in reaction time tasks, 
whereas heart rate acceleration has been noted 
during tasks that require minimal attention 
to environmental stimuli. 

On the basis of neurophysiological evidence 
obtained from animal studies (e.g., Bonvallet, 
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d Dell, & Hiebel, 1954), the Laceys have spec- 


ulated that the cardiovascular changes noted 
during attention-demanding tasks may affect 
brain functioning by means of an afferent 
feedback loop and thus may have an impact 
on attention. Recent studies by McCanne and 
Sandman (1974) and Sandman, McCanne, 


” Kaiser, and Diamond (in press) have pro- 


vided some support for the instrumental ef- 
fects of heart rate changes on attention within 
a tachistoscopic recognition paradigm, 

Although there has been considerable de- 
bate concerning the Laceys’ hypothesis (e.g, 
Elliott, 1972; Obrist, Webb, & Sutterer, 1969; 
Obrist, Webb, Sutterer, & Howard, 1970), 
there is a substanital amount of evidence 
indicating that there are relationships between 
heart rate changes and attention to environ- 
mental events. Moreover, the hypothesis 
seems quite pertinent to performance on the 
RFT. It seems reasonable to speculate that 
awareness of and ability to control operant 
heart rate may be more important in deter- 
mining RFT performance than awareness of 
other. physiological responses. Accordingly, 
the second purpose of the present study was 
to examine the impact of operant heart rate 
control on RFT performance and to compare 
this impact with that of operant control over 
another autonomic modality, galvanic skin 
response (GSR). 


METHOD 
Subjects 


Subjects were 40 female undergraduates from an 
introductory psychology course at Ohio State Uni- 
versity who participated in the experiment. Subjects’ 
ages ranged from 19 to 22 years. All subjects were 
required to participate in experiments as part of the 
introductory psychology course and received experi- 
mental credit for participating in the study. 


General Design 


Ten subjects were assigned randomly to each of 
four groups: heart rate conditioning, false heart rate 
conditioning, GSR conditioning, and control. Half 
of the subjects in each group were informed what 
physiological responses were being conditioned and 
half were not informed, Testing with the RFT was 
performed before and after conditioning or control 
experiences. Thus, the statistical model for the study 
was a 4X2X2 (Group X Information X Testing) 
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factorial design with repeated measures over the last 
factor.2 


Apparatus 


Subjects were tested individually in a sound-at- 
tenuated and electrically shielded chamber, The 
stimuli were projected on a screen in the chamber by 
a Kodak Carousel slide projector programmed for 
automatic advance and stimulus Presentation with 
BRS-Foringer digibit logic units. An intercom SyS- 
tem connected the experimental chamber and the 
control chamber so that the experimenter could con- 
tinuously hear the subject and could communicate 
with her when necessary, Physiological recording 
was done with a Grass Model 7B polygraph equipped 
with appropriate bridges, preamplifiers, and driver 
amplifiers. The voltage or resistance measures were 
written out on Grass ink-writing oscillographs pro- 
viding graphic representation of the physiological 
activity. 

Heart rate electrode placement. Grass E5S cup 
electrodes filled with EKG Sol were placed on the 
lower left rib cage and the right collar bone. The 
signal was amplified by a Grass wide band AC pre- 
amplifier and averaged by a Grass cardiotachometer, 
which provided a beat-by-beat record of the heart 
rate, 

GSR electrode placement, A curved Ag-AgCl 
electrode of 3 cm? placed on the volar surface of 
the second phalange of the nonpreferred hand served 
as the active electrode, and a curved armband of 
58 cm? placed on the upper arm served as the in- 
active electrode of a monopolar placement, A con- 
stant current of 20 „A supplied by a Con-Sol GSR 
preamplifier was impressed at the active site, which 
made that portion of the subject’s epidermis lying 
between the active and inactive electrodes one arm 
of a resistance bridge, A bandage with a 2-cm" hole 
in it restricted the current density to 10 nA/em® at 
the active electrode, whereas the current density at 
the reference electrode was .314 A/cm", A .05M 
NaCl solution suspended in an inert plastic medium 
served as electrode paste. Surface body oils were 
removed by swabbing the electrode sites with acetone. 


Procedure 


The subject was greeted by the first author when 
she arrived at the laboratory. Subsequent pretest 
testing, electrode placement, and posttest RFT ad- 
ministration were conducted by a female laboratory 
assistant who was blind to the treatment conditions, 
Each subject was administered the portable RFT 


1 Subjects in the control groups were all given the 
same instructions. Hence, the instructional set varja- 
ble did not apply to these subjects, However, for 
purposes of factorial analyses, subjects in the con- 
trol groups were randomly split into an informed 
group and an uninformed group, 
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after having been greeted by the experimenter.? 
Individual subjects were then seated in a reclining 
chair in the experimental chamber. Before the condi- 
tioning procedure began, the subject was informed 
that the purpose of the study was to determine the 
extent to which individuals could learn to control 
certain physiological responses. The subject was 
informed that she would be seeing a series of red 
and green slides and was told that she was to pro- 
duce “correct” physiological responses during the 
slides. Throughout conditioning, heart rate accelera- 
tion was paired with red slides and heart rate decel- 
eration with green slides. Subjects in the “informed” 
conditioning groups were also told which changes in 
physiological responding were “correct” (i.e., heart 
rate changes or GSR changes), whereas subjects in 
the “uninformed” conditioning groups were told that 
changes in physiological responding were “correct” 
but were not told which modality of response was 
to be rewarded. Each of the subjects in the three 
conditioning groups (heart rate, false heart rate, and 

GSR) was then given a series of operant condition- 
ing trials. 

All subjects were given feedback by means of a 
small, real-time clock that ran continuously as long 
as subjects produced responses of criterion magnitude 
in the correct direction, All subjects were told that 
they should try to keep the clock on as much as 
possible, and they were informed that the subject 
who accumulated the most time on the clock would 
be contacted and paid $20 when the experiment was 
over, Subjects were instructed to remain as still as 
possible and not to change their respiratory patterns. 
After the experimental procedures, each subject was 
again administered the portable RFT. 

Subjects in the control group received no exposure 
to the experimental apparatus at all. These subjects 
were administered the RFT, were then given a series 
of pencil-and-paper tasks, which took approximately 
the same amount of time as the conditioning tasks, 
and were then administered the RFT again. 

Heart rate conditioning procedure. All subjects 
were given 40 conditioning trials, 20 acceleration 
trials, and 20 deceleration trials. All conditioning 
trials were 30 heart interbeat intervals in length and 
were separated from each other by a variable inter- 
trial interval of 10-30 seconds. The order of stimu- 
lus presentation was randomized, with the restric- 
tion that the same color stimulus could not occur on 
more than three successive trials. Following the in- 
structions was a brief rest period, during which a 
basal heart rate was determined by counting the 

number of heart beats in a 1-minute sample. This 
sample constituted the basis for the initial accelera- 
tion and deceleration criteria. 

The heart rate signal from the cardiotachometer 
was fed into two electronic circuits (BRS-Foringer 
level detectors, Model 210), which detected changes 
in the voltage level of the cardiotachometer. These 
circuits automatically controlled the delivery of the 
reinforcement. One circuit was used during accelera- 
tion trials, and the other during deceleration trials. 
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An operant shaping procedure was employed dur- ~ 
ing the conditioning trials. Prior to the first condi- 
tioning trial, the voltage detectors were set at 2 
beats per minute above and below the subject’s 
basal heart rate. For the first acceleration trial, the 
subject had to produce a heart rate 2 or more beats 
per minute above her base rate to obtain reinforce- 
ment, and during the first deceleration trial the rein- 
forcement criterion was 2 or more beats per minute _ 
below her base rate. 

On subsequent conditioning trials, the reinforce- 
ment criteria depended on how well or poorly the 
subject had done on the preceding trial. If a sub- 
ject produced 22 or more interbeat intervals of cri- 
terion magnitude during one trial, the criterion for 
the next trial with the same stimulus was made 
more difficult by raising or lowering the heart rate 
level 2 beats per minute. Conversely, if a subject e: 
produced eight or fewer “correct” interbeat intervals 
during a trial, the criterion for the next trial with 
the same stimulus was made 2 beats per minute 
easier. 

For subjects in the false heart rate conditioning 
groups, the conditioning procedures outlined above 
were not employed. Rather, these subjects were 
provided heart rate feedback by means of a pre- 
programmed, random sequence. Reinforcement for 
these subjects occurred during approximately 59% of 
their heart beats. 

GSR conditioning procedure. Each subject in the 
GSR conditioning groups received 60 conditioning 
trials, 30 with each color slide. However, these sub- 
jects were told that they were to produce the same 
response during both red and green slides. Informed 
subjects were told that they would be reinforced for 
producing decreases in resistance during red and 
green slides. They were further instructed that de- 
creases in resistance occurred during sweating. Each 
conditioning trial was 10 seconds long and was pre- 
ceded by a variable intertrial interval of 10-30 sec- 
onds. Decreases in resistance were detected by an 
electronic level detector, which automatically con- 
trolled the delivery of reinforcement during each 
trial. 

An operant shaping procedure was employed 
throughout. Prior to the first conditioning trial, the 
voltage detector was set to a voltage that corre- 
sponded to a decrease in resistance of 100 Q from 
the subject’s basal GSR reading (determined by 
means of a 1-minute sample following instructions). 
Thus, the subject had to produce a decrease in re- 
sistance of 100 © in order to receive reinforcement. 
The reinforcement continued as long as the subject’s 
resistance level remained below the criterion setting 
on the voltage detector. If a subject obtained rein- 
forcement for 10 seconds on two consecutive trials, 
the criterion for the next trial was lowered 100 Q. 
Conversely, if the subject failed to obtain reinforce- 


2 All subjects were given the sequence of eight 
RET trials described by Witkin et al. (1962), 
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* ment during two consecutive trials, the criterion for 
the next trial was raised 100 Q. 


Data Reduction 


Heart rate. The length of each trial was measured 
to the nearest millimeter on the oscillograph record. 
Since each trial was 30 interbeat intervals long, the 
length of the trial was determined by the subject’s 
average heart rate during the trial. Since the paper 
speed for all subjects was 15 mm/sec, the average 
heart rate on each trial was defined by the formula 
HR = 30 X [(15 mm/sec X 60 sec)/Y mm], where 
HR is heart rate and Y is the length of the trial in 
millimeters. This formula was used to determine 
the subject’s average heart rate for each trial. 

A similar procedure was employed to determine 
the subject’s average heart rate during the 10 inter- 
beat intervals immediately preceding each trial. For 
the pretrial, the formula defining the average heart 
rate is HR =10 X [(15 mm/sec X 60 sec)/Y mm]. 

GSR, The decrease in resistance of each GSR 
response greater than 100 2 was computed for every 
response that occurred during each trial. These re- 
sponses were then summed to yield a measure of 
overall decrease in GSR resistance during each trial. 
The same scoring method was used to compute a 
measure of overall decrease in GSR resistance during 
each pretrial. 


RESULTS 
Rod and Frame Task 


The absolute error in degrees from the 
true upright was recorded for each of the 
eight trials on the pre- and post-RFT. The 
errors were then summed to yield a measure 
of perceptual style for each subject for both 
the pretests and posttests. Pretest and post- 
test means and standard deviations for both 
informed and uninformed subjects are pre- 
sented in Table 1.8 A 4 X 2 X 2 analysis of 
variance with repeated measures over the last 
‘factor was computed using these sums. 

The analysis of variance yielded a highly 
significant main effect for the pre/post fac- 
tor, F(1,32) =15.18, p= .001. No other 
significant main effects or interactions were 


TABLE 1 


MEANS AND STANDARD DEVIATIONS FOR PRETEST 
AND Postrest RFT SCORES 


Pretest Posttest 
Subjects M SD M SD 
Informed 24.27 8.02 16,73 8.56 
Uninformed 33.40 18.37 28.75 14.45 
Control 30.30 14.01 30.70 17,90 


825 


Pane FHRC GROUP 


N, 
^, HRC GROUP 


"= c crour 


N w GSRC GROUP 


N 
AHRC GROUP 


ROD AND FRAME SCORE 
P 
= 


",GSRC GROUP 


t, FHRC GROUP 


—— 
PRE-TEST POST-TEST 
UNINFORMED SUBJECTS 


PRE-TEST POST-TEST 
INFORMED SUBJECTS 


Ficure 1. Changes in rod-and-frame test perform- 
ance for conditioning and control groups (C = con- 
trol; HRC = heart rate conditioning; FHRC = false 
heart rate conditioning; GSRC = GSR conditioning). 


detected. The changes in RFT performance 
are illustrated for each group of subjects in 
Figure 1. Inspection of the results illustrated 
in Figure 1 indicated that subjects in all of 
the conditioning groups exhibited decreases 
in RFT scores following conditioning, where- 
as subjects in the control groups displayed 
virtually no change in RFT performance fol- 
lowing the control experience. 

Although the analysis of variance failed to 
indicate a significant Groups X Testing inter- 
action, post hoc simple effects comparisons 
(Winer, 1972) were computed for each of 
the groups to determine whether the changes 
in RFT scores were significant for each. These 
tests were conducted because of a priori in- 
terests in the magnitude of changes within 
each of the conditioning groups. The simple 
effects tests indicated that informed subjects 
in the heart rate conditioning, false heart rate 


3 It should be noted that the means for RFT per- 
formance (see Table 1) indicate that the subjects in 
the present experiment would be classified as field 
independent according to norms for the portable 
RFT established by Oltman (1968). The means and 
standard deviations presented in Table 1 are compa- 
rable to those presented by Morf, Kavanaugh, and 
McConville (1971) for field-independent females, 
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Ficure 2. Heart rate during acceleration and de- 
celeration training trials (HRC = heart rate condi- 
tioning; FHRC = false heart rate conditioning). 


conditioning, and GSR conditioning groups 
exhibited significant improvement in RFT per- 
formance after conditioning (simple effects 
F = 7.08 for heart rate conditioning group, 
5.25 for false heart rate conditioning group, 
and 4.17 for GSR conditioning group; p < 
.05 for each). The post hoc tests indicated 
that subjects in both control groups and in 
the uninformed conditioning groups did not 


exhibit significant changes in RFT perform- 
ance. 


Physiological Results 


To investigate the effects of the heart rate 
conditioning and false heart rate feedback on 
control of operant heart rate, the average 
heart rates during acceleration training and 
during deceleration training trials were com- 
puted separately for subjects in the heart 
rate conditioning and false heart rate condi- 
tioning groups. A 2 X 2 X 2 analysis of vari- 
ance with repeated measures over the last fac- 
tor was computed using these heart rate val- 

ues. The three factors were groups (heart rate 
conditioning vs. false heart rate conditioning), 
instructions (informed vs. uninformed sub- 
jects), and training (acceleration training vs. 
deceleration training). The analysis of vari- 
ance indicated a significant training main 
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effect, F(1, 16) = 8.89, p <.01, a significant ` 
Groups X Training interaction, F(1,16) = 
10.01, p <.01, and a significant Groups X 
Instructions X Training interaction, F(1, 16) 
= 7.92, p < .05. The significant training main 
effect indicated that the average heart rate 
during acceleration training (M = 78.42) was 
above that noted during deceleration (M = ~ 
77.23). However, the significant Groups X 
Training interaction indicated that subjects in 
the heart rate conditioning group produced 
higher heart rates during acceleration training 
(M = 77.34) than during deceleration train- 
ing (M = 74.90), whereas subjects in the 
false heart rate conditioning group produced. 
virtually identical heart rates during both 
types of training (M = 79.49 for acceleration 
trials and 79.56 for deceleration trials), The 
significant Groups X Instructions X Training 
interaction is illustrated in Figure 2. 
Inspection of Figure 2 indicates that only 
the informed subjects in the heart rate condi- 
tioning group produced significantly higher 
heart rates during acceleration training than ` 
during deceleration training (simple effects F ` 
= 10.69, p < .01). Hence, the differences be- 
tween the heart rate conditioning and false 
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Ficure 3. Pretrial to trial changes in galvanic skin 
response responding during conditioning. 
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heart rate conditioning groups (noted by the 
Groups X Training interaction) were also 
affected by the instructions given to subjects. 

To investigate the effects of instructional 
set on control over operant GSR responses, 
the average change in GSR resistance was 
calculated for all 60 conditioning trials and 
for the 10 seconds preceding each of the 60 
trials (pretrial sample) for each subject in 
the GSR conditioning group. A 2 X 2 analysis 
of variance was then computed using these 
scores. The two factors were instructions 
(informed subjects vs. uninformed subjects) 
and response sample (pretrial responses vs. 
trial responses). This analysis yielded a sig- 
nificant response sample main effect, F(1, 8) 
= 14.68, p < .01, and a significant Instruc- 
tions X Response Sample interaction, F(1, 8) 
= 6.30, p < .05. The significant interaction 
is illustrated in Figure 3. 

As Figure 3 indicates, uninformed subjects 
produced larger pretrialtrial decreases in re- 
sistance than informed subjects during GSR 
conditioning. The pretrial—trial decrease in re- 
sistance was highly significant for uninformed 
subjects (simple effects F = 22.72, p < .01) 
but not for informed subjects. In contrast to 
the operant heart rate data, these results 
suggest that lack of information about con- 
tingencies of reinforcement may facilitate the 
production of changes in GSR responding. 


Relationships Between Initial RFT Scores 
and Physiology 


The initial RFT scores of informed and 
uninformed subjects in the heart rate condi- 
tioning and false heart rate conditioning 
groups were correlated with the differences in 
heart rate obtained by these subjects during 
acceleration training and deceleration train- 
ing. The heart rate difference scores provided 
an index of the extent to which subjects ob- 
tained control over their operant heart rates. 
Similarly, the initial RFT scores of subjects 
in the GSR conditioning group were corre- 
lated with pretrialtrial changes in GSR. 
RFT scores did not correlate significantly 
with either heart rate difference scores (7 = 
—.18, n = 20) or GSR difference scores (7 = 
—.23, m = 10). 


Discussion 


The pretest-posttest changes in RFT per- 
formance noted in the present study are com- 
parable to those reported in previous sensory 
deprivation and meditation experiments. 
Moreover, the changes in RFT performance 
due to operant autonomic conditioning are 
not attributable to decreased dependency on 
external cues, since subjects experiencing con- 
ditioning have to attend to external reinforce- 
ments as well as to internal cues. The changes 
in RFT scores do not necessarily negate Nev- 
ill’s (1974) demonstration that dependence 
on external cues can affect RFT performance 
but rather suggest that attentiveness to bod- 
ily cues can also have an impact on perform- 
ance independent of external dependence. 

The results for the heart rate conditioning 
groups suggest that awareness of and control 
over operant heart rate changes are related to 
accuracy in RFT performance. The apparent 
impact of operant heart rate conditioning on 
RFT performance supports the Laceys’ (La- 
cey & Lacey, 1970, 1974) hypothesis concern- 
ing the relationship between heart rate and 
the performance of attentidhal tasks and is 
consistent with previous reports indicating 
that heart rate changes can have an impact 
on visual perception (McCanne & Sandman, 
1974; Sandman et al., in press). 

The changes in RFT performance noted 
for informed subjects in the false heart rate 
conditioning group are somewhat paradoxical, 
Although these subjects did not achieve sig- 
nificant control over their heart rates, they 
nevertheless produced significant decreases in 
RFT scores. One possible explanation for such 
results might be that these subjects did 
become more attentive to the internal cues 
related to heart rate but were unable to exert 
control over their hearts because of the dis- 
tracting feedback. However, this explanation 
was formulated post hoc and must be ad- 
vanced cautiously until more conclusive data 
are reported. 

The changes in operant heart rate noted 
for subjects in the heart rate conditioning 
groups were essentially similar to those noted 
in previous reports (e.g., Bergman & Johnson, 
1972; McCanne & Sandman, 1975). The 
large changes in GSR noted for uninformed 
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subjects but not for informed subjects were 
somewhat paradoxical. However, increases in 
GSR responding have been noted during ori- 
enting (e.g., Sokolov, 1963) and during emo- 
tional experiences (e.g., Edelberg, 1972), and 
several authors have suggested that operant 
changes in GSR responding are mediated by 
cognitive emotional events (e.g., Shean, 1970; 
Stern & Kaplan, 1967). It seems likely that 
the nonspecific instructions given to unin- 
formed subjects may have resulted in larger 
orienting responses and in more emotional 
activity than the specific instructions given to 
informed subjects. 

The RFT results for the two GSR groups 
were also somewhat paradoxical. The subjects 
in the informed GSR conditioning group (who 
produced significant changes in RFT per- 
formance) did not acquire significant control 
over pretrial—trial changes in operant GSR 
responding, whereas subjects in the unin- 
formed GSR conditioning group (who did not 
achieve significant changes in RFT scores) 
produced highly significant _ pretrial-trial 
changes in GSR responding. However, the 
lack of significant change in RFT scores noted 
for uninformed subjects may be attributable 
to increased emotional “activation.” Hence, 
the RFT results indicate that a certain level 
of awareness and control of GSR responding 
may facilitate RFT performance but suggest 
that large increases in GSR activation do not 
result in significant decreases in RFT scores, 

Finally, it should be noted that the present 
results were obtained with female subjects 
utilizing the portable RFT. Generalization of 
the results to other populations and other 
methods of administering the RFT must await 
further experimentation. However, the RFT 
results are similar to those noted in previous 
experiments (e.g., Jacobson, 1966; Linden, 
1973), which indicate that changes in RFT 
performance are related to sensitivity to in- 
ternal bodily cues. 
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Frequency Perception of Individual and Group Successes As 
a Function of Competition, Coaction, and Isolation 
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University of Leuven, Louvain, Belgium 


The frequency perception of successful and unsuccessful outcomes obtained in 

serial tasks was investigated in two experiments. Subjects were assigned to 

serial tasks in which the outcome stimuli (“right” and “wrong”) immediately 

followed subjects’ responses to each task item. Subjects’ perceptual judgment 

about the number of “rights and wrongs” obtained was requested at the end 

of the serial task (incidental event perception). In Experiment 1, subjects per- 

forming in groups of four were compared with individual performers both 

in competing and noncompeting (coacting) conditions, In Experiment 2, the 

influence of verbal communication and competition versus isolation was ex- 4 
amined. Significant and predicted differences in frequency perception were 

found (a) in group performers as compared with individual performers, (b) 

in groups with verbal communication as compared with groups without verbal 

communication, and (c) in competing groups as compared with noncompeting, { 
especially isolated, groups. The frequency perception of success is discussed 

in the context of impression formation processes, and its impact on person 

perception and learning is mentioned. A tentative explanation in terms of 


affective and cognitive processes related to social interaction is submitted, 


In previous studies it has been shown that 
the frequency perception of the relative num- 
ber of successful and unsuccessful outcomes 
obtained in a serial task tends to be distorted 
as a function of personality factors as well as 
of social and contextual conditions (D’Amo- 
rim & Nuttin, 1972; Lietaer, 1966; Nuttin & 
Abreu, in press; Nuttin & Greenwald, 1968; 
Vreven & Nuttin, 1976). Conversely, such 
distortions are likely to have an impact on 
man’s perception of himself (and others) inso- 
far as he conceives of himself in terms of what 
he succeeds in doing. Moreover, successful 
outcomes or rewards are assumed to influence 
learning inasmuch as they are perceived. Dis- 
tortions on the perceptual level may, there- 
fore, be expected to have an impact on some 
learning phenomena, 
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In a typical experiment on the frequency 
perception of successes and failures, the sub- 
ject is given a series of short estimation tasks; 
each response is immediately followed by a 
clearly communicated “right” (i.e., a success) 
or “wrong” (i.e., a failure). This series of 
feedback in terms of “right” and “wrong” 
constitutes the meaningful auditory or visual 
stimuli whose relative frequency in the series 
the subject is assumed to “experience” or 
“perceive.” Without having received any pre- 
vious instructions regarding the frequency of 
positive and negative outcomes, each subject 
is required, immediately following the serial 
task, to give his perceptual impression of the 
number of successes and failures obtained. 
Since no explicit intention to perceive or to 
count the outcomes is supposed to be present, 
the perceptual process involved is considered 
a kind of “incidental” event perception, as is 
often the case in daily life (Nuttin, 1969, p. 
673). Before being asked to give his percep- 
tual impression of the number of successes 
and failures obtained, the subject is usually 
requested to give his global impression: Did 
he receive more successes than failures or the 
opposite, or was there about an equal number 
of both? Immediately afterwards, he is asked 
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to estimate the number of “rights” and 
“wrongs” on a total number of, say, 20 tasks 
he has performed. It has been generally ac- 
cepted that processes such as short-term mem- 
ory, estimation, guessing, etc., are implied to 
play a part in the subject’s perceptual impres- 
sion. The point is to know what factors sys- 
tematically influence this complex impression 
formation process that is assumed to underlie 
other important processes such as self-percep- 
tion, perception of others, and repetition or 
nonrepetition of responses in learning. 


EXPERIMENT 1 


Among the social factors previously investi- 
gated, we mention the subject’s attendance to 
either a very successful or very unsuccessful 
performance by a socius just a few moments 
before the subject’s own performance (D’Amo- 
rim & Nuttin, 1972; Lietaer, 1966). The 
present experiment sought to determine to 
what extent a subject’s perceptual impression 
of the relative number of successes obtained is 
influenced by the fact that he performed (a) 
as a member of a small group or as an indi- 
vidual and (b) in a competitive or in a non- 
competitive (in the present case, coactive) 
condition. 


Method 


Procedure. The genera] procedure used in the four 
conditions of Experiment 1 conforms to the typical ex- 
periment described above. In order to make the experi- 
ment meaningful to the subjects, elaborate instructions 
and preexperimental practice trials were provided. Sub- 
jects were told that their ability to make realistic 
estimations of the number of objects presented to 
them would be tested. Although quite independent 
of general intelligence as it is usually measured, this 
ability is thought to be related to an important per- 
sonality trait labeled realism, in the sense of having 
a realistic and correct view of things, which is most 
important in practical life. As a result of this em- 
phasis on the interest of the task, the outcomes were 
expected to be perceived by our subjects as ego-in- 
volved successes and failures. A series of slides were 
then successively projected on a screen. Their actual 
number (22) was not mentioned to the subjects at 
this time. Each slide represented pell-mell a certain 
number of identical objects (between 25 and 95) 
such as trees, books, sheep, etc. The subjects were 
told to look at the slide as a whole and that it 
would be presented for 4 sec only. No counting of 
objects would be permitted. Subjects were then asked 
to give the estimated number as soon as the slide 
disappeared by pushing buttons on a keyboard in 


front of them, being informed that the actual num- 
ber of objects represented would never be higher 
than 95 or lower than 25. They were told that their 
estimations should not be made in units but in mul- 
tiples of 5 (e.g, 35 or 70, and not 36 or 71). 

Each subject was then informed that most subjects 
were interested in knowing whether or not their es- 
timations were realistic. Therefore, after each re- 
sponse, the subject would be told whether his estima- 
tion was bad or good. “Good” or “right” meant that 
his estimation was as good an approximation of the 
real number as an average subject would be able to 
make; “wrong” meant that his estimation was not 
within the limits allowed. This outcome was then 
communicated to the subject by means of a light 
signal on a display apparatus in front of him, Ac- 
tually, the whole series of slides and outcomes was 
preprogrammed in a minicomputer. All necessary pre- 
cautions were taken to avoid the subject’s suspecting 
that successes and failures were experimentally in- 
duced, For all of the experimental conditions re- 
ported here, 50% successes and 50% failures were 
given. The usual alternations in presentation of slides 
and outcomes were made. Three introductory slides 
were presented and estimated by the subject in or- 
der to familiarize him with his task; however, no 
outcome was given for these preliminary items. At 
the end of the serial task, each individual subject 
was presented with a sheet of paper on which he 
was invited to make one more realistic estimation: 
(a) Did he receive more successes, more failures, or 
about an equal number of both in responding to the 
slides? (b) Given the fact that there were 22 slides 
altogether, how many of them were “right” and how 
many “wrong” (ie, how many times did he see the 
“right” and the “wrong” signal)? In an additional 
question, subjects were asked to evaluate their per- 
formance in subjective terms: Were they satisfied 
or not satisfied with their performance? 

Subjects. Subjects were 128 young soldiers from 
an army base in Belgium. They got acquainted with 
each other over a few weeks and volunteered for the 
experiment. None of them had previously participated 
in a similar experiment. 

Conditions. The task just described was simultane- 
ously performed (a) by two individual performers 
and (b) by two groups of four subjects each per- 
forming as a group. Each of these two “units” (two 
groups or two individuals) performed in either (a) 
a competitive or (b) a coactive condition, Each of 
these four conditions was run according to the fol- 
lowing special procedures complementing the general 
procedure given above. In the competitive intergroup 
condition, instructions were given to the effect that 
rivalry was aroused between the two groups present. 
Subjects were told that the purpose of the experi- 
ment was to see which group was able to make the 
best (most realistic) estimations. Each group was 
simultaneously ushered into the large experimental 
room and was seated at a table. During the 4-sec 
general presentation of a slide, each member of a 
group had to make his own estimation. Then a 10- 
sec discussion period was provided in which each 


TABLE 1 


MEAN PERCENTAGES OF PERCEIVED INDIVIDUAL 
AND GROUP SUCCESSES, 
EXPERIMENT 1 


Individual Group 

Condition successes successes 
Competition 47.61 57.32 
Coaction 48.35 52.64 


group member verbally communicated his impression 
with other group members. From this discussion, a 
group decision was determined. As soon as a signal 
was given, a representative from each group (the 
one who happened to be seated in front of the key- 
board) then made the response on the keyboard. The 
two responses given by the two group representatives 
were then immediately compared with the real num- 
ber via the computer (the computer room had al- 

ready been shown to the subjects) . 

The outcome was then communicated to each 
group by a programmed signal indicating “worse” 
or “better,” which meant that the response of that 
group was worse or better than the response of the 
other group. Subjects were told that no outcome 
would be given if the two responses happened to 
be the same. At the end of the serial task, each sub- 
ject was given a sheet of paper on which he had 
to indicate (at a separate table) his personal impres- 
sion as to the number of successes and failures ob- 

- tained by his group. 

In the coactive group condition, the two groups 
were told that they would be completely independent 
of each other but that they would be working si- 
multaneously in the same room in order to save time 
for the experimenter. Within each group, responses 
were given and discussed in the same way as men- 
tioned above. The responses were judged by the com- 
puter in absolute terms (“right” and “wrong”) by 
comparing them with the performance expected by 
the average subject, as mentioned above. In this 
condition, the group members did not know one an- 
other’s results. 

In the competitive individual condition, the two 
individual performers were treated as just described 
for each of the two group units, except that each 
individual had to make his own response without 
any prior discussion. Thus, the outcome obtained 
was an individual outcome. In addition, outcomes 
were communicated in terms of better and worse 
with relation to each individual. 

As in the competitive individual condition, the 
coactive individual condition was adapted from the 
coactive group condition. 

Hypotheses. Cooperation is assumed to engender 
positive interpersonal relations among group mem- 
bers (Deutsch, 1949; Julian & Perry, 1967; Lott & 
Lott, 1965). Moreover, experimental investigations 
(Davis, 1969; Sherif & Sherif, 1969) as well as field 
Studies (Fiedler, 1967; Julian, Bishop, & Fiedler, 
1966) have supported the hypothesis that the cathexis 
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of the intragroup relations is a positive function of 
intergroup competition. The cathexis of the inter- 
personal relations is expected to generalize to the 
situation as a whole (Deutsch, 1949). In the present 
study it is expected that this cathexis will generalize 
to the performance outcome and to the perceptual 
impression of the relative number of successes and 
failures obtained. Hence, although the same number 
of successes is experimentally induced for both indi- 
vidual and group conditions, a relative “overpercep- 
tion” of group successes as compared with individual 
successes is predicted. It is also expected that mem- 
bers of competing groups will perceive a higher fre- 
quency of successes than members of coactive groups 
(cf. Ferguson & Kelley, 1964). 

No clear expectations are formulated concerning 
the impression formation of individual successes and 
failures as a function of competition. On the one 
hand, competition among individuals may engender 
negative affect. On the other hand, competition be- 
tween individuals is an arousal condition of achieve- 
ment motivation by which subjects may tend to 
overestimate their successes (Abreu, Note 1). 


Results 


A two-way analysis of variance was carried 
out. The two main effects were the group 
versus individual outcomes and the competi- 
tive versus coactive conditions, There was an 
equal number of observations in each condi- 
tion, The percentage of perceived successes 
was used as the computational unit. 

As expected, the number of successes was 
overestimated in the group outcome conditions 
as compared with the individual outcome 
conditions. The difference in frequency per- 
ception between group and individual out- 
comes was highly significant, F(1, 120) = 
14.817, p < .005. On the contrary, competi- 
tion versus coaction was not significant, nor 
was the interaction. However, a separate 
analysis of the group outcomes indicates that 
the number of successes was overestimated in 
the competitive condition as compared with 
the coactive situation, #(30) = 1.717, p < .05, 
one-tailed. 


EXPERIMENT 2 


In Experiment 1, no clear-cut effect of 
competition versus coaction was found in the 
two conditions as a whole. In the individual 
conditions, especially, no effect was shown. 
Some investigations (Cottrell, 1972) have 
suggested that coaction, even without knowl- 
edge of one another’s outcomes, may give rise 


to rivalry between individuals since they 
know, among other things, that the experi- 
menter is likely to compare their results. This 
presumption seems very plausible indeed. 
Therefore, in Experiment 2, on the one hand, 
coactive conditions were replaced by isolation, 
in which each individual or group had to per- 
form in the absence of any other subject or 
group. On the other hand, a new variable was 
manipulated, namely, the degree of interindi- 
vidual contact or communication between 
members within the competitive as well as 
within the isolated groups. It was expected 
that social interaction and intragroup CO- 
hesiveness would be minimal in groups with 
no verbal communication, whereas intergroup 
competition should enhance intragroup CO- 
hesiveness and interaction. It was also pre- 
dicted that the frequency of perceived success 
would increase as a function of social inter- 
action. 


Method 


Conditions. In a 2 X 2 factorial design the effects 
of intragroup verbal communication and intergroup 
competition were investigated (four conditions). 
With respect to individual performers, subjects per- 
forming in isolation were compared with two indi- 
vidually performing subjects who competed with 
cach other, Experiments were run as described above, 
with the following exceptions. In groups without 
verbal interaction (either competitive or isolated), 
each member was instructed to give his own estima- 
tion for each slide presented to the group by push- 
ing buttons on the keyboard in front of each subject. 
The group response was then immediately calcu- 
lated by the computer by averaging the deviations 
made by subjects of the same group. Thus, the qual- 
ity of each subject’s estimation contributed to the 
group result. The outcome was communicated to the 
competitive and isolated groups, respectively, in com- 
parative and absolute terms, as mentioned above. 
The procedures used in the other conditions are the 
same as those for Experiment 1. 

Subjects. Subjects were 192 young soldiers who 
volunteered for the experiment. 


Results 


A two-way analysis of variance was carried 
out on the conditions with group outcomes. 
The most important variable, verbal interac- 
tion, was highly significant, F(1, 116) = 
9.900, p < .005: The perceptual impression 
of the number of successes increased as a 
function of verbal interaction. The main effect 
of verbal interaction was also significant in 
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MEAN PERCENTAGES OF 


Ep INDIVIDUAL 
AND GROUP SUCCESS! ENT 2 


MENT 2 


Competition Isolation 


Individual 48.63 43.53 
Group 
With verbal interaction 52.23 47.13 
Without verbal interaction 45.10 


43.73 


the analysis of the competition versus isola- 
tion factor. In fact, whereas the main overall 
effect of competition versus isolation was not 
clearly significant, F(1, 116) = 3.731, 05 < 
p < .10, a separate analysis showed a highly 
significant difference between competitive and 
isolated groups in the verbal interaction con- 
dition, ¢(58) = 2.918, p< .005, one-tailed, 
and no effect in the condition without verbal 
interaction, t(29) =.478. Even the general 
group effect of more perceived successes in 
group conditions as compared with individual 
ones disappeared for group conditions without 
verbal interaction (see Table 2). Thus, mem- 
bers of groups without intragroup communi- 
cation did not perform as did the other 
groups in that respect. Finally, in individual 
conditions (see Table 2), successes were over- 
estimated by competing individuals as com- 
pared with isolated subjects, #(58) = 2.205, 
p< .05, two-tailed, whereas no such signifi- 
cant difference was found between competi- 
tive and coacting subjects in Experiment 1. 
Thus the hypothesis appears to be supported 
that coaction may produce some rivalry and, 
therefore, in this context, does not greatly 
differ from competition, whereas competition 
is more clearly differentiated from the iso- 
lated individual situation of Experiment 2. 


GENERAL DISCUSSION 
Spread of Affect 


The research program, of which the experi- 
ments reported here are a small part, is in- 
tended to shed light on the tendency in some 
people to perceive themselves and their ac- 
tivities as successful, whereas others with the 
same objective outcomes tend to perceive 
themselves as failures. Similarly, as stated in 
a well-known proverb: “Some men brag, oth- 
ers seem too modest” (cf. Schneider, 1969). 
This shows that the elated or the depressed 
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self-perception of these people is not always 
based on their objectively obtained behavioral 
results. Our purpose was to investigate the 
extent to which such tendencies can be found 
at work on the elementary level of perceptual 
frequency impressions and what kind of fac- 
tors are responsible for the distortions im- 
plied. Some of these factors may be common 
to distortions in frequency perception of neu- 
tral events (Vreven & Nuttin, 1976). Others, 
on the contrary, may be limited to dynamic, 
emotional, and ego-involved phenomena such 
as the behavioral outcomes studied here. 

It seems plausible to admit that in situa- 
tions where clearly observable data are not on 
display, impressions are formed on the basis 
of general tendencies and vague cognitive 
elements. These impressions will have an im- 
pact on behavioral and verbal responses, es- 
pecially when more decisive elements are lack- 
ing. Some findings in other fields, in memory 
and learning among others, tend to support 
this thesis regarding the impact of “impres- 
sions.” It was found, for instance, in our 
laboratory that most subjects keep a rather 
accurate impression of the number and serial 
position (either in space or in time) of suc- 
cesses (rewards) received in a serial task, al- 
though no concrete memory traces are pre- 
served for individual items. This general 
mnemonic impression of rewards seems to be 
responsible for the more frequent repetition 
of responses in the neighborhood of actually 
rewarded items (cf, the spread of affect phe- 
nomenon, Nuttin, 1949). A similar effect was 
found by d’Ydewalle & Eelen (1975) in a 
study showing that the proportion of response 
repetition tends to follow the proportion of 
successes received in the absence of concrete 
memory traces for individual outcomes. Sum- 
marizing, it is assumed that “impressions” of 
a cognitive or affective nature may play a 
rather important role in behavioral situations 
that are too complex to allow for clear and 
objective perception. Processes such as im- 
pression formation, guessing, and the like are 
convenient substitutes for true perception and 
cognition (cf. Nuttin, 1976). In these vague 

and more subjective types of cognizing, man 
tends to impose his own frames of reference 
rather than submitting to facts. 
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With regard to the results reported here and 
the hypotheses formulated above, positive 
interactions with the group of people one is 
working with, as well as states of self-satis- 
faction, tend to produce a positive climate 
and “optimistic” impressions about the work 
done. At the same time, some cognitive ele- 
ments, such as the implicit opinion that esti- 
mations obtained from a group of people will 
be more accurate than perceptual impressions 
given by a single subject, are likely to pro- 
duce more confidence in group performers 
than in individual subjects. A spread of this 
affective and confident climate to outcomes 
and events situated in the same context is 
hypothesized here. It may be conceptualized 
in terms of generalization and affective trans- 
fer processes or—more intuitively—as an 
“irradiation” of the “warm glow of success” 
as suggested for a different type of phenom- 
enon by Isen (1970). In more cognitive terms, 
it could be said that positive affective feelings 
or impressions spread by making subjects 
expect more success. Perceptual impressions 
about the number of successes received are 
assumed, then, to follow the lines of expecta- 
tions as long as no obviously contrasting ele- 
ments impose themselves. Thus, it is sub- 
mitted that affectively based expectations 
will be responsible for distortions in the 
perception of emotionally loaded phenomena, 
such as the behavioral outcomes studied here, 
whereas more cognitively based expectations 
(e.g., based on subjects’ previous experience 
with similar materials and tasks) will be able 
to account for the so-called assimilation effect 
found in the frequency perception of neutral 
events (Vreven & Nuttin, 1976). Attention 
should be paid to the fact that in perceiving 
the number of outcomes obtained, a spread of 
negative affect manifests itself in some peo- 
ple for whom the failure element tends to 
dominate (Nuttin & Greenwald, 1968, p. 26). 


Gradient of Social Interaction 


The degree of social interaction that is as- 
sumed to be active in the process described 
above is next considered. Combining the vari- 
ous conditions realized in Experiment 1 and 
2, the following degrees of social interaction 
are distinguished. Among group performers, 
the lowest degree of intragroup interaction is 
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TABLE 3 
MEAN PERCENTAGES OF PERCEIVED INDIVIDUAL 
AND Group SUCCESSES, EXPERIMENTS 1 AND 2 


COMBINED 
Mean Experi- 
Performers % ment 
Group 
Competing-intragroup 
communication" 54.8 1&2 
Coacting-intragroup 
communication 52.6 1 
Tsolated-intragroup 
communication 47.1 2 
Competing-no intragroup 
communication 45.1 2 
Isolated-no intragroup 
communication 43.7 2 
Individual 
Competing* 48.1 1&2 
Coacting 48.3 1 
Isolated 43.5 2 


“The result given is the weighted mean of Experiments 1 
and 2, 

expected in groups without verbal communi- 
cation among members, As for the groups 
with verbal communication, intragroup rela- 
tions are expected to be optimal in conditions 
of intergroup competition and coaction and 
minimal in the absence of any other group 
(isolated groups). As for individual perform- 
ers, interaction is expected to decrease as a 
function of the same three factors: interindi- 
vidual competition, mere coaction of two in- 
dividuals and, finally, individual isolation. 
Concerning the subjects’ perceptual impres- 
sions, a parallel gradient is found as shown in 
Table 3. It appears that group performers 
without intragroup communication (compet- 
ing and noncompeting), as well as isolated 
individual performers, perceive the lowest 
relative number of successes (45.1%, 43.7%, 
and 43.5%, respectively). If fact, it must be 
recognized that these individuals do not really, 
perform and interact as a group. On the con- 
trary, competing groups with intragroup com- 
munication produce the highest perceptual 
impressions of success (54.8%). As to indi- 
vidual performers, it appears that competing 
and coacting individuals obtain the same 
results (48.1% and 48.3%). This finding 
supports the hypothesis that coaction is likely 
to produce rivalry in many situations. Iso- 
lated individuals, on the contrary, have lower 
impressions of success (about 43% vs. 48% 
for coacting or competing individuals). These 


835 


results support the hypothesis that positive 
social interaction (in terms of intragroup co- 
operation and, probably, also group cohesive- 
ness) is likely to produce higher perceptual 
impressions of success. The processes in- 
volved are conceived in terms of spread of 
affect and expectation, as described above. 
As to the direct impact of interindividual 
competition alone, the influence of dynamic 
and personality factors on global perceptual 
impressions is a rather well-established fact 
that is not discussed here. 

Further research will investigate the extent 
to which competition and aroused achieve- 
ment motivation produce a tendency toward 
attribution of success to internal factors (cf. 
Weiner, 1972; Weiner & Kukla, 1970). The 
subjective value of successes attributed to in- 
ternal factors is likely to be higher than in 
the case of successes due to external causes. 
The increased value, then, of the successes 
obtained could be hypothesized to manifest 
itself in a tendency toward higher frequency 
perception of success. 
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Groups and Crowds as Social Entities: Effects of Activity, 
Size, and Member Similarity on Nonmembers 


Rodney L. Bassett 
Ohio State University 


Eric S. Knowles 
University of Wisconsin—Green Bay 


The notion that groups and crowds are social entities that have stimulus value 
for others was studied in a field experiment. Experimental social entities were 
created by positioning two, four, or six confederates near the entryway to a 
university library. Within each entity size, confederates stood at the same posi- 
tions but varied in their activity and their similarity to one another. A hidden 
camera filmed patrons entering the library. The distance maintained from the 
entity, looking behavior, and speed of 309 pedestrians were analyzed. The 
entity’s activity influenced both the distance and the looking behavior of 
“ passersby, with closer approaches and more looks directed to a crowd than a 
group activity. The findings for entity size and member similarity suggest that 
the salience of a social entity, rather than invariably producing avoidance be- 
havior, may intensify the stimulus value of the entity’s activity, Analyses of 
responses before, while, and after passing the confederates indicated that entity 
size was responded to initially, whereas activity and similarity affected behavior 
only as or after pedestrians passed by the entity, These findings integrate earlier 
studies, which had looked at either experimental crowds or interacting groups, 
and identify some of the characteristics of social entities that influence others 


in a complex social environment. 


The spatial aspects of social behavior have 
been studied primarily in terms of personal 
space (cf. Evans & Howard, 1973; Sommer, 
1959, 1969). This focus on the interpersonal 
situation has dealt with the factors that lead 
one person to approach or avoid another. A 
number of authors, however, have suggested 
that spatial analyses may apply to larger so- 
cial units, such as groups, collectivities, and 
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crowds (Lyman & Scott, 1967; Milgram & 
Toch, 1969; Patterson, 1968). Patterson, for 
instance, proposed that “it is possible that a 
small group, like an individual, has some psy- 
chological extension into surrounding space. 
. . . Group or interpersonal space may vary 
as a function of the size, intimacy, and setting 
of the interaction” (1968, pp. 358-359). In 
a more general way, Campbell (1958) has 
suggested that groups and aggregates exhibit 
some properties common to individuals and 
other entities: proximity, similarity, and 
common fate of the parts of the entity, preg- 
nance, and resistance to intrusion, From this 
perspective it is possible to look at the general 
characteristics of social entities that affect 
and influence others in a social environment. 

A number of recent investigations have 
looked at the spatial properties of interacting 
groups. Consonant with the concept of a 
bounded interaction territory (cf. Lyman & 
Scott, 1967), several studies have shown that 
pedestrians avoid walking through (Cheyne & 
Efran, 1972; Knowles, 1973; Walker & 
Borden, 1976) or walking near (Knowles, 
Kreuser, Haas, Hyde, and Schuchart, 1976) 
a stationary interacting group. The degree to 
which an interacting group’s territory is pro- 
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tected increases with the size of the group 
(Knowles, 1973; Knowes et al., 1976), the 
status of its members (Knowles, 1973), and 
the physical closeness of the group members 
(Efran & Cheyne, 1973) and varies with the 
sex composition of the group (Cheyne & 
Efran, 1972; Walker & Borden, 1976). These 
investigations, which have used distance as a 
dependent variable, indicate in a variety of 
ways that interacting groups tend to repel 
others in a social environment. 

In contrast to these findings, Milgram, 
Bickman, and Berkowitz (1969) showed that 
other kinds of social entities appear to attract 
people in a social environment. Experimental 
crowds of different sizes assembled on 42nd 
Street in New York City on cue and stared 
for 1 minute at a sixth floor window across 
the street. Motion pictures made from that 
window were analyzed to yield measures of 
passerby behavior. The larger the experimental 
crowd, the larger the percentage of passersby 
who imitated the crowd behavior by also 
looking at the window and the larger the 
percentage of pedestrians who slowed their 
progress and appeared to join the group. 

Although these interacting groups and ex- 
perimental crowds both influenced other peo- 
ple in the social environment, they appear to 
have influenced others in opposite directions. 
In addition to the kind of collectivity studied, 
these two areas of study have differed in the 
kind of dependent variable measured. The 
findings for interacting groups are based on 
measures of the distance maintained from the 
group, whereas the findings for experimental 
crowds are based on measures of looking be- 
havior and pedestrian speed. The purpose of 
the present study is to integrate these two 
areas of investigation by including both kinds 
of collectivities within the same design with 
the same dependent variables. A further pur- 
pose is to achieve some theoretical integration 
of these two areas by attempting to specify 
the characteristics of social entities that attract 
and repel others in the social environment. 

Clearly, the activity of the collectivity is 

an important difference between the inter- 
acting group and the experimental crowd. The 
activity may be important because it provides 
information about the exclusivity of the so- 
cial entity. Some activities, like a small group 
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discussion, are exclusive of others and imply 
the requirement of group membership for 
participation. Other activities, such as listen- 
ing to a lecture, looking up in the sky, or 
watching a demonstration, imply no require- 
ment for membership and can include others 
without disrupting the activity. The size of 
an entity should convey no direct information 
about its inclusivity or exclusivity but should 
make it a more potent or important stimulus 
to others. To this extent, size should inter- 
act with and amplify the attractive or repel- 
lent effects created by the entity’s activity. 
The findings that larger crowds attract 
greater proportions of looks from others » 
(Milgram et al., 1969), while larger inter- 
acting groups repel others further away 
(Knowles et al., 1976) are consistent with 
this hypothesis. 

The hypotheses concerning the effects of 
activity and size seem clear enough and are 
suggested by previous investigations, yet there 
is ambiguity concerning the possible effects 
of other entity characteristics. One may infer 
from Campbell’s (1958) discussion of groups 
as entities that any characteristic of a social 
entity or its members that increases its 
salience would lead to increased avoidance 
by others. Thus, making entity members 
more similar to one another would increase 
their salience and their distinctiveness from 
others, which by this hypothesis would tend 
to repel others from the social entity. How- 
ever, both crowds and interacting groups al- 
ready exhibit properties of entities; that is, 
they are already distinctive by virtue of their 
proximity and common activity. Yet, these 
properties of a social entity do not uniformly 
repel others; they interact with the effects 
of the activity of the entity. These con- 
siderations suggest an alternative hypothesis— 
that any increase in the salience of a social 
entity would increase the effect of the entity’s 
activity. Thus, making entity members more 
similar to one another would make a crowd 
more attractive but make an interacting 
group more repellent to nonmembers. 


METHOD 
Overview 
Experimental social entites were created by placing 


two, four, or six confederates at an alcove archway 
just inside the main entrance to a university library 
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Ficure 1. Experimental setting. (Key features of 
the library entryway include the three measuring 
lines [A, B, and C], the position of the confederates 
[D], and the hidden camera on top of the card 
catalog.) 


The activity and the size of the social entity and 
the similarity of the entity members were varied in 
a 2X3%X2 design. The 12 conditions, each lasting 
2 minutes, were replicated 5 times for a total of 60 
observational periods. People entering the library be- 
tween conditions were recorded as control subjects. 
Time-lapse movie film taken at 1 frame per second 
recorded the behavior of people as they entered the 
library, passed by the alcove, and entered the main 
lobby of the library. These films were used to mea- 
sure three aspects of passerby behavior: (a) the 
lateral distance away from the alcove before, while, 
and after passing the confederates, (b) the number 
of looks directed toward the alcove in each of three 
measurement areas, and (c) pedestrian speed, re- 
corded as the number of frames of film, through 
three measurement areas. Figure 1 presents a dia- 
gram of the experimental setting indicating the 
location of the confederates, the entrance turnstiles, 
the hidden camera, and the measurement areas. 


Independent Variables 


Activity. The confederates engaged in either a 
group or a crowd activity. The crowd activity in- 
volved looking up at the ceiling of the alcove; the 
confederates stood silently gazing at a spot on the 
ceiling just inside the alcove. The group activity 
was a discussion; the confederates looked at each 
other and talked about school-related matters in a 
normal manner. Prearranged marks on the floor 
indicated where the confederates were to stand. 
Regardless of the activity, members of the same-size 
social entities stood at the same places, facing in 
the same direction. 

Entity size. Two, four, or six people were placed 
at the alcove. Six people were placed in a horseshoe 
or semicircle arrangement that was open toward the 
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alcove. Entities of four were created by absenting 
the two end people closest to the alcove, and entities 
of two retained only the two outermost people (see 
Figure 1). Entity size was increased by adding peo- 
ple toward the alcove, thereby assuring that the 
intrusion of the confederates into the pedestrian 
pathway was constant for each entity size. The pool 
of confederates rotated membership in the smaller 
entities and rotated positions within the entity 
among conditions and replications. Each entity had 
an equal number of males and females. 

Member similarity. The confederates either wore 
their normal street clothes (low similarity) or put 
on a blue and red pullover jersey (high similarity). 
The jersey selected was one that would be uncom- 
mon on campus (in fact it was sold as an under- 
shirt) and one that would not identify the confed- 
erates with any particular organization other than 
the stimulus entity. With the jersey worn over their 
regular shirts, the confederates were quite distinctive. 


Apparatus 


Pedestrian behavior was filmed with a 16-mm 
Arriflex motion picture camera that was hidden from 
view. A time-lapse impulse generator operated the 
camera motor at a rate of 1 frame/sec with a ł-sec 
exposure time. The camera, on a 6-inch- (15-cm-) 
high stand, was placed on top of a card catalog 
cabinet directly in front of and 40.4 feet (12.3 m) 
away from the library entrance, A large cardboard 
box with a small hole cut in the front was placed 
over the camera. The box hid the camera from 
pedestrian view until they had passed through the 
measurement area. The impulse generator placed 
behind the card catalog produced audible clicks, 
which were noticeable to some people in the library 
lobby. Pedestrians entering the library and passing 
through the experimental area, however, were unable 
to hear the apparatus and appeared to be unaware 
of the hidden camera. 

The Kodak Super-X film used to record pedestrian 
behavior was viewed on a Kodak Film Analyst 
projector with a frame counter, This projector 
allowed a frame-by-frame advance so that the be- 
havior of each subject could be recorded through a 
sequence of exposures. 


Procedure 


The camera was prepared and started at the 
beginning of each of two morning sessions and ran 
continuously through the session. A different pool 
of confederates, recruited from introductory psychol- 
ogy classes, participated on each day. While the 
camera was being prepared, the confederates were 
given a 10-minute training session which stressed 
the importance of maintaining their exact positions 
and provided coaching in performing the two activ- 
ities. The confederates were not informed of the 
hypotheses until after the experimental session was 
completed. The confederates and an experiment co- 
ordinator remained in a reference room immediately 
behind but not accessible from the experimental 
alcove. From a prearranged list of conditions, ran- 
domized within replications, the coordinator pre- 
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pared the appropriate number of confederates and 
sent them to the alcove, where they assumed their 
positions over prearranged marks and engaged in 
the appropriate activity for a 2-minute period. At 
a signal from the coordinator, the confederates re- 
turned to the reference room, and the next social 
entity was prepared and sent to the alcove, 


Subjects 


Once developed, the films were projected onto the 
wall of a small coding room. Each person entering 
the library was counted and assigned a subject 
number. In approximately 24 hours of filming, 1,003 
people entered the library. A number of these people 
Were not counted as subjects because (a) they entered 
with friends (n= 36), (b) they did not walk com- 
pletely through the experimental area, turning down 
a hallway to their right, entering one of the alcoves, 
or veering to their left across the library lobby 
(n=177), (c) they entered the library while the 
confederates were changing conditions (n= 147), 
(d) they had to change paths to avoid another 
pedestrian (n=37), or (e) the camera view was 
obscured on at least three frames so that it was 
impossible to score the subject's foot placements 
or looking behavior (n= 145). In addition, 50 
people encountering the experimental conditions were 
discounted because one of the confederates moved 
(n= 6), someone else stood sufficiently near the 
alcove to change the perceived size of the entity 
(n= 24), or for other miscellaneous disruptions 
(n = 20). Of the remaining 411 people, 309 encoun- 
tered one of the stimulus entities and were counted 
as experimental subjects, whereas 102 entered when 
no entity was present and were counted as control 
subjects, The experimental and control subjects did 
not differ significantly in sex or race composition ; 
64% of the experimental subjects and 54% of the 
control subjects were male, xP (1) = 3.34, p> .05, 
whereas 6% of the experimental subjects and 10% 
of the control subjects were black, x7(1) = 1.56, 
p> A. 


Measures 


Each subject’s behavior was Scored on a record 
sheet that represented the Projected image of the 
library floor. As subjects came to one of the two 
library turnstiles, they were assigned a subject num- 
ber and their sex, race, and condition were recorded, 
Starting from the entrance turnstile, their path and 
looking behavior were recorded for each frame of 
film until they exited from view. The subject’s path 
was noted by marking for each frame where the 
right foot fell on the floor. Two recorders also 
judged whether or not the subject was looking to 
the right toward the alcove. These record sheets for 
each subject allowed the calculation of the dependent 
variables used in this study. 

Distance. The lateral distance away from the 
alcove was determined at three places along the 
pathway. On the recording sheet, three lateral lines, 
perpendicular to the traffic flow, were drawn. One 
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line, 7.6 feet (2.3 m) from the turnstile and 5.1 feet ™ 
(1.6 m) in front of the social entity, and another 
line 6.3 feet (1.9 m) beyond the entity corresponded 
to tile lines on the library floor. An additional line 
was drawn on the recording sheet that bisected the 
entity. The path of the subject’s right foot was 
drawn by connecting the center points of the right 
foot placements. The subject’s distance away from 
the right-hand wall was measured in centimeters 
from where this drawn path intersected each line. 
These three distance measures were then corrected 
for the effects of perspective.t One centimeter of 
the corrected measure was equal to 3.54 inches 
(9 cm) across the hallway floor. The 7-foot (2.1-m) 
width of the entryway corresponded to 23.7 cm on 
the record sheet, On this scale, the confederates in- 
truded a maximum of 5.3 cm into the entry way 
space, Thus, the sum of the three corrected distance 
measures could range approximately from 10 em to 
66 cm. 

Speed. The three lines drawn on the recording 
sheet defined three areas of the library entryway: 
the 7.6-foot (2.3-m) length between the turnstile and 
the first line, the 5.1-foot (1.6-m) length between 
the first and second line, and the 6.3-foot (1.9-m) 
length between the second and third measurement 
line. The speed of the subject through each of 
these areas was measured as the number of frames 
of film in which the subject’s right foot fell within 
the area. 

Looking. As each right foot placement was re- 
corded, the scorers also judged whether or not the 
subject was looking in the direction of the alcove, 
The film was not sufficiently resolute to determine 
eye direction independent from head orientation, so 
the scoring convention was adopted of recording 
right head orientation. Two scorers worked together 
and judged whether the subject’s-head was turned 
in the direction of the alcove. The number of frames 
in which the subject was looking toward the alcove 
was recorded for each of the three measurement 
areas. 


Reliability 


Two teams of trained scorers recorded and mea- 
sured the behavior of the same 38 subjects. These 
Scores were used to assess the reliability of the mea- 
surements used in this study. The summed measures 
of distance each team made were highly similar 
(r = .950) ; the correlations for the first, second, and 
third measurement lines were .891, .996, and .853, 
respectively, The measures of speed through the 
experimental area were also highly correlated (r= 
-886). The reliability of the looking measure was 
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r assessed in two ways. First, the number of looks 
scored for each subject were highly correlated (r= 
.941). Second, of the 189 frames of film scored for 
these 38 subjects, the two teams agreed on 57 
(30.2%) that a look was present and on 118 (62.4%) 
that a look was not present; only 14 (7.4%) of the 
frames produced a disagreement. 


RESULTS 
Analyses 


A2x2X3 x 2 (Sex of Subject x Activ- 
ity X Size X Similarity) between-subjects 
analysis of variance was conducted on each of 
the three dependent variables summed across 
measurement times. In these analyses, control 
subjects were disregarded. The critical alpha 

w levels for each analysis were adjusted by the 
Bonferroni procedure to control the experi- 
ment-wise significance level.* An additional 
analysis evaluated differences between experi- 
mental and control conditions. Analyses of 
covariance were performed on the distance 
measure, covarying on the entrance turnstile 
used, and on the looking measure, covarying 
on the number of steps taken. In both cases, 
the results of the covariance analysis repro- 
duced the analysis of the untransformed data 
and, thus, are not presented. Because of the 
small proportion of blacks, race was disre- 
garded in these analyses. 


Distance 


The analysis of variance of the summed 
distance measure revealed a significant main 
effect of activity, F(1,285) = 11.72, p< 
.003, and a significant Activity X Similarity 
interaction, F(1,285) = 7.72, p < .018. Fig- 
ure 2 presents the summed distance measures 
for the Activity X Similarity interaction.* 
Although subjects tended to stay farther 
away from the group activity than the crowd 
activity, this effect was much stronger when 
the confederates were similar to one another. 
Tests of the simple effects of activity within 
levels of similarity indicated that there was a 
significant activity difference when the con- 
federates were dressed similarly, F(1,285) = 
17.21, p< .003, but not when they were 
dressed normally, F(1, 285) = 2.44, p> 1. 
This difference in trends resulted primarily 
from the responses to the crowd activity. 

The sex of the subject, though showing no 
significant main effect, did interact with the 
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Activity 
Ficure 2. The effects of activity and member 


similarity on the sum of the corrected distance mea- 
sures (in cm). 


activity of the social entity to affect the dis- 
tance of the path, (1,285) = 8.74, p< 
.009. Males responded more strongly to the 
activity of the entity than did females. Al- 
though the direction of the simple activity 
effect was similar for males and females, it 
was significant for males, /(1, 285) = 18.48, 
p < .003, but not for females, F(1, 285) = 
UST, 2 SSi 

Inspection of the distances maintained at 
each of the three measurement lines revealed 
the points at which deflection occurred. The 
deflection away from the social entity was 
evident at the first measurement line and 
became accentuated as the subjects passed by 
the confederates; the differences between 
experimental and control subjects were sig- 


2The probability levels in the analyses of the 
summed measures were adjusted for three dependent 
variables. Probability levels in the analyses of indi- 
vidual measurement times were adjusted for nine 
dependent measures. 

3To convert these measures into entryway dis- 
tances, consider each centimeter of the summed dis- 
tance measure equivalent to 3 cm across the 2.13-m 
entryway, Experimental subjects had an average 
summed distance of 40.42 cm, which represents a 
geographical distance of 1.21 m across the entryway. 
That is, experimental subjects took a path across 
the three measurement lines that averaged a little 
more than midway (579%) across the space. In con- 
trast, control subjects, with a summed distance mea- 
sure of 34.89 cm, took a path that averaged 1.05 m 
across the entryway, slightly less than midway (49%). 
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nificant at each of the measurement lines. The 
difference between the two activities was not 
significant at the first measurement line, F(1, 
285) = 1.47, p> .2, but did become signifi- 
cant at the second, F(1,285) = 12.29, p< 
.009, and third, F(1, 285) = 10.08, p < .018, 
measurement lines. The activity by similarity 
interaction also was not apparent at the first 
measurement line, F(1,285) <1, but was 
significant at the second line, F(1,285) = 
12.50, p < .009, and was visible though non- 
significant at the third measurement line, 
F(1, 285) = 5.21, p>.2. The activity and 
similarity of the social entity only affected 
pedestrian distances while passing and after 
they passed by the confederates; initially, 
subjects showed a similar deflection from all 
entities regardless of their characteristics, 


Speed 


The speed at which experimental subjects 
passed through the experimental area showed 
no significant effects of conditions. Also, there 
were no differences between experimental and 
control subjects in the number of frames of 
film recorded. However, this measure was not 
particularly accurate because (a) subjects 
were seen for a relatively short time, approxi- 
_ mately 5.0 seconds, and (b) the time-lapse 
camera introduced variability of +1.0 second, 


Looking 


The analysis of looking behavior, summed 
across measurement areas, revealed strong 
effects for entity size, F(2, 285) = 12.22, p< 
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Ficure 3. The effects of entity size and activity 
on the total number of looks directed toward the 
social entity. 
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.003, and activity, F(1,285) = 13.69, p < *` 


.003, as well as a significant interaction be- 
tween these two variables, F(2, 285) = 4.12, 
’<.05. No other significant effects were 
obtained. The entity size by activity interac- 
tion is presented in Figure 3. There was a 
general tendency for subjects to look more at 
larger social entities, but this effect was much 
stronger for the crowd activity, F(1, 285) = 
24.44, p < .003, than it was for the group ac- 
tivity, F(1, 285) = 2.00, p> .1. The differ- 
ences between activities in the number of 
looks drawn were significant only for four and 
six confederates, F(1,285) = 8.14, p < .015, 
and F(1, 285) = 22.36, p < .003, respectively. 
Two confederates received similarly low num- 
bers of looks regardless of the activity, (1, 
285) <1. 

The Entity Size X Activity interaction was 
also inspected in each of the three measure- 
ment areas. In the first area, there was a sig- 
nificant size effect, F(2,285) = 7.19, p< 
-009, with larger entities drawing more looks, 
but no significant difference between the two 
activities, F(1,285) = 1.19, p> .3. A sig- 
nificant activity difference appeared in the 
second measurement area, F(1, 285) = 7.03, 
P < .009, and was present, though nonsignifi- 
cant, in the third area, F(1, 285) = 4.02, p< 
-072, with the crowd activity drawing more 
looks. The entity size difference was main- 
tained through the second measurement area, 
F(2, 285) = 14.59, p< 009, and was still 
visible, though nonsignificant, in the third 
measurement area, F(2, 285) = 3.33, p> .2. 
The Size x Activity interaction was not sig- 
nificant in any of the measurement areas, 
largest F(2, 285) = 3.72, p > .2. Thus, larger 
social entities drew more looks, even before 
subjects passed the confederates. Subjects, 
however, did not respond differentially to the 
two activities until they were close to or 
passed the confederates. 


Correlations among dependent variables 


The relationships among dependent varia- 
bles were inspected Separately for experi- 
mental and control subjects and are presented 
in Table 1. For both sets of subjects, there 
was virtually no relationship between the 
distance of the path taken and the number of 
looks or the pedestrian speed, either for the 
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summed measures reported in Table 1 or with- 
in the particular measurement times. Speed 
and the number of looks, however, were re- 
lated for the experimental subjects (r = 791, 
$ < 001) but not for the control subjects (r 
=.177, p>.05). The difference between 
these two correlations was highly significant 
(Z = 7.73, p < .001). When the relationship 
between speed and looking was inspected by 
measurement areas for experimental subjects, 
strong relationships appeared within measure- 
ment areas (r = 387, .419, and .923 for the 
first, second, and third areas) but not between 
measurement areas (7s range between —.084 
and .051). 


Discussion 


As suggested by the comparison of the Mil- 
gram et al. (1969) study of experimental 
crowds and the studies of interacting groups, 
the activity of the social entity appears to 
have a major influence on the behavior of 
passersby. A crowd activity—staring at the 
ceiling—drew more looks from passersby and 
caused less of a path deflection than did the 
group discussion, Consistent with the Milgram 
et al. study, entity size increased the number 
of looks drawn from passersby to the crowd 
activity. There also was a tendency, though 
nonsignificant, for larger interacting groups to 
receive more looks, The more general effect of 
entity size on looking behavior was, as hy- 
pothesized, to amplify the effect of the en- 
tity’s activity. Size, however, did not affect 
the distances at which pedestrians passed the 
entity. This finding is in contrast to the 
Knowles et al. (1976) study, which did find 
increased deflection from larger interacting 
groups. However, the present study looked at 
the deflection from groups standing in a circle 
with their backs to the pedestrians, a con- 
figuration that Knowles et al. found to pro- 
duce a deflection that was less than from a 
group standing in a straight line facing out- 
ward. Although the present study did not 
involve the same levels of control or sample 
sizes, it does underscore Knowles et al.’s con- 
clusion that the effect of group size on de- 
flection distances is not particularly strong, 
especially for a circle formation. 

The similarity of the confederates influ- 
enced the distances maintained from the en- 
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TABLE 1 
iS AMONG DEPENDENT 
IMENTAL AND 


WITHIN-CELL CORRELATI 
VARIABLES FOR E 
CONTROL § 


Summed measures 1 


1. Distance 


2, Looks —.108 


3. Speed ~.012 791 


Note, Correlations for control subjects are reported above the 
diagonal anc correlations for experimental subjects are reported 
below 


tity but did not influence the number of looks 
received. When the experimental entities were 
more similar to each other and, therefore, more 
distinctive from the passersby, they drew 
pedestrians closer to the crowd activity. Simi- 
larity, however, had no effect on the responses 
to the group discussion, The findings for both 
similarity and size are consistent with the 
hypothesis that characteristics of the social 
entity or its members that make the unit 
more salient or more of an entity intensify 
the effects of the activity. 

The findings indicate that the activity of 
the social entity has a major influence, that 
is, attracting or repelling passersby. The role 
of salience appears to be to increase the ef- 
fects of the activity. The effect for member 
similarity is particularly intriguing because 
the confederates wearing common jerseys 
were more different and distinguishable from 
the passersby. It may seem reasonable to 
suppose that this manipulation of salience 
would have led to greater avoidance of the 
social entity, yet it did not. Subjects walked 
closer to the experimental crowd when the 
confederates wore the jerseys. Alternatively, 
the distinctiveness of the similar confederates 
may have led to their being perceived as an 
outgroup and dismissed by passersby. Ryen 
and Kahn (1975), however, found that 
greater distance was maintained from out- 
groups than from neutral or ingroups. Also, if 
the passersby in the present study were dis- 
counting the similar crowd, which allowed 
them to walk closer, member similarity should 
also have reduced the number of looks di- 
rected toward the crowd. However, subjects 
looked just as often at the similar crowd as 
they did at the normally dressed crowd. 


844 


Thus, rather than supporting the hypothe- 
ses that member similarity leads to greater 
avoidance or to greater discounting of the 
Social entity, the present findings provide 
more support for the hypothesis that the sali- 
ence of a social entity intensifies the influence 
of the entity’s activity on passersby. Although 
this position is more Supported by the data, 
it did not receive complete confirmation, The 
effects of similarity were observed only on the 
distance measure, and entity size showed 
effects only on the looking measure. 

The analyses of the dependent variables by 
measurement areas—before, while, and after 
Passing the confederates—proved to be in- 
structive. Upon entering the library, subjects 
appeared to respond only to the fact that the 
confederates were standing by the alcove, In 
Comparison to the control subjects, all experi- 
mental subjects showed a deflection away from 
the alcove, but the amount of the deflection 
was unaffected by the characteristics of the 
entity. At this initial stage, larger social 
entities drew more looks from the subjects, 
but looking behavior was unaffected by the 
activity or the similarity of the members, As 
the subjects reached and passed by the con- 
federates, the particular characteristics of the 
Social entity began to affect the pedestrian 
behavior. Subjects deflected their paths fur- 
ther away from the group discussion and, for 
the experimental crowds, walked closer to the 
similar than dissimilar confederates. At these 
later stages, the activity of the social entity 
also began to affect looking behavior, with 
subjects looking more at the crowd activity. 

For the distance measure, the lack of an 
initial response to similarity or activity may 
have been affected by the fact that the en- 
trance and particularly the turnstiles fun- 
neled the flow of traffic into the library. How- 
ever, the looking measure, which would not be 
affected by such funneling, also showed no 
differences in the first measurement area. In 
addition, the scorers had the impression that 
subjects were responding to different aspects 
of the situation during different stages of 
their transit through the experimental area, 
Prior to passing the confederates, subjects 

appeared to be looking primarily at the group 
or crowd members. However, as they moved 
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abreast and beyond the confederates, subjects 
encountering the crowd activity appeared to 
change their focus to the alcove ceiling at 
which the confederates were looking, whereas 
subjects encountering the group discussion 
either ceased looking or continued to look at 
the group members. 

More generally, this investigation indicates 
that different kinds of social groupings can 
be studied together and compared, Individu- 
als, interacting groups, crowds, and other 
kinds of collectivities in a public environment 
are social entities that, at least at a molar 
level, can be described on a set of common 


dimensions. Several such taxonomies have | 


been Proposed, though with little empirical 
investigation. Milgram and Toch (1969) sug- 
gested that groups and crowds vary in their 
boundary clarity and permeability, depending 
upon the activity, configuration, and other 
characteristics of the collectivity. Campbell 
(1958) proposed that the “entitativity” of 
any aggregation, the degree to which it exhib- 
its the properties of an entity, can be de- 
scribed by the configuration of the parts, 
their proximity and similarity to one another, 
and the degree to which they share common 
fate in terms of activity or effects, 

This investigation suggests that, in addi- 
tion to developing sets of analytical dimen- 
sions on which to describe various kinds of 
Social groupings, it is possible to evaluate the 
influence of these entity characteristcs on 
nonmembers, However, the present findings 
Suggest that the nature of this influence may 
be quite complex, involving several stages. 
Initially, subjects appeared to make an un- 
differentiated response to any social entity; 
they noticed, looked at, and began to avoid 
all of the experimental entities, After this 
initial orientation, subjects began to be in- 
fluenced by the particular characteristics of 
the social entity. Although each dependent 


variable was considered a measure of the ` 


approach or avoidance responses of the sub- 
jects, it appears that the three measures were 
not isomorphic with one another, They tended 
to be uncorrelated, except for looking and 
speed, and were influenced by different aspects 
of the social entity, To understand the com- 
plex social dynamics operating in public en- 


GROUPS AND CROWDS AS SOCIAL ENTITIES 


vironments requires measuring the kinds of in- 

fluence created by various social entities and 

identifying those entity characteristics that 
affect the direction and amount of influence. 

The present study, which is an initial step in 

this direction, suggests that the activity of the 

entity is a major determinant of the kind and 
amount of influence on others and that the 
size and similarity of the entity, when influen- 
tial, act to intensify the effects of the collec- 
tive activity. 
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Personality and Situational Influences on Changes in Prejudice: 
A Replication of Cook’s Railroad Game in a Prison Setting 


Linda A. Foley 
University of North Florida 


The main purpose of this study was to replicate Cook’s laboratory study of 
racial prejudice in a natural setting. The objective was to determine the influ- 
ences of personality and situational factors on changes in prejudice. The envi- 
ronment studied was a maximum security state prison for males. The subjects 
were black inmates and white inmates admitted to the institution during a 
$ 1-month period. The different environments within the institutions were defined 
in terms of the norms as perceived by the residents of the different living areas. 

Subjects were pretested for cognitive complexity, self-esteem, and attitudes to- 

ward people in general. Interracial attitudes of these subjects were measured at 

í two time periods: as the subjects entered the institution and 3 weeks later, The 
$ results indicate that an individual tends to change racial attitudes toward the 

norms of his living area. The amount of change is dependent on the individual’s 


cognitive complexity, self-esteem, and attitudes toward people in general. 


Some proponents of integration optimisti- 
cally adhere to a belief that interaction with 
members of another race decreases conflict 
and prejudice, thus automatically improving 
© race relations. Social science literature 
= abounds with research indicating that inter- 
racial contact does have a profound effect on 
_ attitudes toward members of another race. 
| The major limitation of the research has been 
i o the unpredictability of the direction of change. 
_ Contact is a necessary but not sufficient factor 
in the decrease of prejudice. Efforts to iso- 
late the predictors of change have tended to 
emphasize one of two major categories of in- 
fluence: environmental factors or personality 
variables. Concentration on either of these 
influences alone does not explain all the varia- 
bles in contact situations, 

In a thorough review of the literature on 
the “contact hypothesis,” Amir (1969) enu- 
merated three categories of factors influencing 
the outcome of interracial contact: the con- 
tact situation, personality characteristics, and 
initial attitude. Summarizing the effect of sit- 
uational variables he concluded: 


that intergroup contact may produce different types 
of relationships. At one end are the casual and super- 
ficial relations; at the other end, the ego-involving 
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and intimate relations, Proximity and frequency of 
contact as well as other factors may exert decisive 
influence in determining the amount of intergroup 
contact and also, indirectly, the nature of the con- 
tact. As to the actual outcome, casual intergroup 
contact has little or no effect on a basic attitude 
change. Intimate contact, on the other hand, tends to 
produce favorable changes. When intimate relations 
are established, the ingroup member no longer per- 
ceives the member of the outgroup in a stereotyped 
way but begins to consider him as an individual 
and thereby discovers many areas of similarity. (p. 
334) 


Cook (1972) developed the following set of 
characteristics of contact situations which, if 
optimum, could facilitate reduction of preju- 
dice. 


The first of these was that the setting or situation 
in which the contact occurred defined as equal the 
status of the participants from the two racial groups, 


The second was that the contact situation called for 
or encouraged a mutually inter-dependent relation- 
ship-cooperation in the usual sense, 


The third was that the contact took place in a 
social climate in which the social norms of the im- 
mediate situation favored interracial association and 
equalitarian attitudes, 


The fourth was that the attributes of the Negroes 
with whom the contact occurred contradicted the 
prevailing stereotyped beliefs about them. 


Fifth, the setting was such as to Promote personal 
or intimate association, that is, association of a sort 
which revealed enough detail about the Negro par- 
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ticipant to encourage seeing him as an individual 
rather than only as a Negro, (pp. 254-255) 


In a 10-year research project, Cook (1972) 
manipulated those characteristics so that all 
Were optimum for the reduction of prejudice, 
In this project, which js referred to as the 
railroad game, anti-black white coeds were 
placed in a situation where it was necessary to 


voluntary on the part of the white subject in 
that the commitment was made before she 
was aware that one of the other participants 
£^ was black. 

j Anecdotal 
that changes i 


during the month’s interaction. However, a 
test of attitudes toward blacks in general indi- 
cated that the subjects who had changed one 


direction numbered between 35% and 40%, 
while about 40% retained essentially the 
same attitudes and approximately 20% had 
become even more prejudiced, 

The results of the railroad game indicate 
that situational factors do not explain all the 
variance in contact situations. Personality 
variables also influence changes in prejudice, 
In an attempt to explain his results, Cook 
compared the personality inventories of those 
subjects who had decreased in prejudice with 
those who had not changed. He found that 
subjects who had changed were characterized 

\by more negative self-concepts and more 
positive attitudes toward people in general, 

Studies on integrated housing (Deutsch & 
Collins, 1951; Grier & Grier, 1960) indi- 
cated that persons living in situations having 
the characteristics described by Cook de- 
creased in prejudice with interaction. Al- 
though Cook’s study incorporated all of these 
variables, 20% of the subjects actually in- 
creased in prejudice, However, Cook's study 
differed from prior studies in two major as- 
pects. First of all, the subjects did not volun- 
tarily agree to interact with a black in the 
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railroad game, In contrast, the subjects who 
decreased in Prejudice in the housing studies 
voluntarily moved into an integrated housing 
unit. The person who is willing to move into 
an integrated area most likely differs from 
someone who moves into a segregated area, 
The former is probably more open to an inter- 
racial interaction and a resultant positive 
change in attitudes toward blacks, 
Housewives in the integrated units reported 


be described as having more Ppositive attitudes 
toward people the very factor 
which Cook found predicted a decrease in 


a correlation between Positive attitudes 
toward people in general and a lack of preju- 
dice. All these findings lend support to the 
existence of individual differences between 
people who decrease in Prejudice as a result of 
ideal interracial contact and those who in- 
crease in prejudice or remain the same, Amir 
(1969) states that the favorable attitude 
change resulting from contact in the housing 
Studies could be an artifact, since a “Process 
of mutual selection might have taken place” 
(p. 333). Finding the same characteristics in 
people who voluntarily choose to live in an 
integrated area and those who decrease in 
prejudice with contact seems a strong argu- 
ment for the existence of a preselection factor 
in the choice of an integrated living area. 
Personality factors due to the preselection fac- 
tor could account for the differing success in 
Cook’s study and those of interracial housing 
(Deutsch & Collins, 1951; Grier & Grier, 
1960; Molotch, 1969), However, the existence 
of a preselection factor involved in the choice 
of an integrated housing unit does not mean 
that it accounts for all the differences in suc- 
cess of the contact situation produced in the 
laboratory by Cook (1972) and the natural 
settings, 
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There is a second major difference between 
the two situations and that is the amount of 
time the individuals spent in each of them. 
Cook’s subjects spent 2 hours a day in the 
interaction and then returned to the conflict- 
ing norms of a southern city. In contrast, the 
housewives in Deutsch and Collins’s (1951) 
study lived in the interracial environment. 
Campbell (1958) reports that the attitudes of 
white students toward blacks were signifi- 
cantly affected by integration, the direction of 
change correlating with the students’ percep- 
tion of the attitudes of their peer group and 
family. The subjects in Cook’s study were 
under the influence of the conflicting attitudes 
of their families and friends a major portion 
of each day. In contrast, individuals moving 
to an interracial environment usually have a 
simultaneous decrease in contact with their 
previous acquaintances. In addition, their 
families are under the same interracial influ- 
ences. Both of these factors support the posi- 
tive influence of the interracial interaction 
and decrease the conflict with opposing norms. 

Positive attitudes toward people in general 
is not the only personality variable related to 
prejudice. Beginning with The Authoritarian 
Personality (Adorno, Frenkel-Brunswik, Lev- 
inson, & Sanford, 1950) and The Open and 
Closed Mind (Rokeach, 1960), evidence rap- 
idly accumulated to indicate the relationship 
between cognitive structure and prejudice. 
The syndrome, related to a simple cognitive 
structure, has been similarly described by 
theorists under many rubrics: authoritarian- 
ism (Adorno et al., 1950), dogmatism (Rok- 
each, 1960), low cognitive complexity (Bieri, 
1955; Scott, 1962), and concrete cognitive 
structure (Harvey, Hunt, & Schroder, 1961). 
Each of these theorists describes the cogni- 
tively simple individual as tending to make 
extreme evaluations, having speed of closure, 
having a low tolerance for ambiguity, being 
rigid, seeking less conflicting and novel infor- 
mation (Feather, 1967), and having low re- 
call of inconsistent information (Kleck & 
Wheaton, 1967). Although evidence of corre- 
lations between the instruments used to mea- 
sure each of these constructs has been equiv- 
ocal, each measure taken singularly has a con- 
sistently high rate of predictability in terms of 
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behaviors associated with the described syn- { 
drome, Cook (1972) states that rigidity was 
not a factor in determining those subjects who 
decreased in prejudice. However, he elimi- 
nated subjects on the basis of the California 
F Scale, which has been demonstrated to cor- 
relate with rigidity (Harvey, 1965) as well as 
with cognitive complexity (Harvey, 1966). 
Extremely rigid and concrete individuals „ f 
would thus have been excluded from the = 
study. Ample evidence exists to document the 
relationship between a simple cognitive struc- 
ture and prejudice. However this relationship 
is not as direct as this evidence would appear 
to indicate. Pettigrew (1958, 1959) looked at 
the relationship between prejudice and au- 
thoritarianism in different areas of the United + 
States and in South Africa. He found the, 
predictably high correlation between preju“ < 
dice and authoritarianism in every area. How- d 
ever, he also found vast differences in the f 
amount of prejudice in the different areas | 
without significant differences in the amount 
of authoritarianism. He concluded that socio- © 
economic factors accounted for the difference 
in amount of prejudice in the areas, pointing 
to the relationship between authoritarianism 
and conformity. iz 
The research reviewed above has isolated a | 
number of situational variables which in- 4 
fluence changes in prejudice: equal status, 
positive social norms, intimate association, co- ju 
operative relationship, and positive attributes t g 
of the individuals. Studies of contact situa- "a 
tions which incorporated these aspects have .~ 
had varying amounts of success in decreasing | 
prejudice. Differing results in the studies of 1 
contact situations could be explained in terms 
of a preselection factor with personality dif- 
ferences accounting for the different results, 
Any study of changes in prejudice must take 3 
into account both the situational variables and _) 
the personality of the individuals involved. 4 
The ideal situation for studying changes in ` 
prejudice would be one in which the indi- 
viduals live in the environment but do not 
enter it voluntarily. The preselection factor is 
not present if the individual does not enter of 
his own volition. Individuals who are confined „4 
to closed institutions meet this criterion and“? 
live in an equal status interracial environment, | 
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The setting for the present experiment is a 
maximum security state institution for males. 
It is a facility housing approximately 1,800 
inmates who have committed crimes in the 
southeastern state in which it is located. On 
admission, each new inmate is assigned to a 
living area: eight-man cells, two-man cells, or 
dormitory. The method of assignment is one 
of systematic randomization. New inmates are 
assigned to the next available living space on 
the basis of race, However, within the con- 
fines of race the men are randomly assigned, 
The objective is to maintain an equal distribu- 
tion by race. 

Different geographic locations within an in- 
stitution tend to develop different norms based 
on the attitudes of the individuals in the area. 
The differences in interracial attitudes of in- 
dividuals newly admitted to the situation can 
be assumed to be a function of the environ- 
ment (norms) as well as the individuals them- 
selves, 

The purpose of this study is to replicate 
Cook’s railroad game in a field setting in order 
to compare the effect of the different situa- 
tions and intrapersonal differences on the at- 
titudes of individuals as they are admitted 
and proceed through the institution,. The 
present study expands on Cook’s research in 
order to examine black racial attitudes to- 
ward whites as well as white attitudes to- 
ward blacks. Given the nature of prisons as 
the field setting and the importance of the 
several factors reviewed as relating to preju- 
dice change, several hypotheses can be gen- 
erated for investigation, 

1. Tt is hypothesized that an individual en- 
ering an equal status total community in 
vhich there is a positive attitude toward 
minority group members tends to decrease in 
prejudice over time. If the norms were nega- 
tive toward interracial interaction in the same 
situation, an individual being initiated into 
thé institution tends to increase in prejudice 
the process. 

In the railroad game, all the variables (with 
the exception of time) in the situation were 
optimal for the reduction of prejudice, The 
present study looks at the changes in prejudice 
of residents entering three different geographic 
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areas of one institution, Each of the living 
areas has different norms for interracial in- 
teraction, These norms vary from negative to 
positive toward interracial interaction, The 
influence on interracial interaction thus varies 
with the norms of the living area. 

2. The cognitively concrete person is more 
influenced by authority and conforms more 
to norms, as well as tending to seek less in- 
formation before making a decision and seek- 
ing less novel information (Feather, 1967), 
Therefore it can be hypothesized that the de- 
crease or increase in prejudice is greater for 
the cognitively concrete individual than for 
the cognitively abstract individual, 

3. Individuals with low self-esteem are most 
influenced by norms and, therefore, tend to 
change most toward the norms of their living 
areas, 

4. Individuals with positive attitudes to- 
ward people in general are most likely to de- 
crease in prejudice after interracial interac- 
tion. (a) The individual most likely to de- 
crease in prejudice is the cognitively concrete 
individual with low self-esteem and positive 
attitudes toward people in general, (b) The 
person most resistant to change is the cogni- 
tively concrete individual with high self- 
esteem and negative attitudes toward persons 
in general, (c) The more abstract individuals 
tend to be less prejudiced to begin with and 
the amount of decrease falls between the other 
two extremes. 


METHOD 


Subjects 


Every new inmate admitted to a correctional in- 
stitution between April 26, 1974, and May 27, 1974, 
was sent a pass requesting an interview, There were 
160 men admitted during that period, Of this number, 
31 were in administrative confinement and 1 was in 
the hospital and thus were unable to participate in 
the study. Of the inmates, 128 men reported for the 
interview with 16 declining to answer the question- 
naire, leaving a total of 112 subjects, 

Every inmate who completed the first set of ques- 
tionnaires was sent a pass requesting an interview 3 
weeks later, There were 110 men notified. Of these 
men, 8 had already been transferred to another in- 
stitution and 19 others did not report for the second 
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session of the study.1 The majority of these latter 
inmates were scheduled at times which conflicted 
with their job assignments. A total of 83 inmates did 
participate in the second session. 


Norms 


The norms of the situation were quantified by a 
Black Equality Scale which was administered to ap- 
proximately one-third of the inmates in the total 
population of the institution. This scale was admin- 
istered at the beginning of data collection for the 
experiment. Every inmate from one floor of 2-man 
cells, one floor with 8-man cells, and 3 dorms were 
requested to take the questionnaire. The total sample 
of completed questionnaires was 339. 

The Black Equality Scale was modeled after, and 
incorporated into, a measure of group norms de- 
veloped by Swanson (Note 1) for use with prison 
populations. The latter scale was a modification of 
one employed by Jessor, Graves, Hanson, and Jessor 
(1968) in their study of a Tri-Ethnic community 
and follows their procedure, The scale was admin- 
istered as the first questionnaire in a packet of 11 
questionnaires in a larger research project. 


Procedure 


The first phase of the experiment consisted of as- 
sessing the norms of different areas of the institution. 
The Black Equality Scale was administered to ap- 
proximately one-third of the entire inmate population 
over a period of 24 weeks. There were two sessions a 
day, 4 days a week with approximately 20 inmates 
per session, The first session began at 8:30 AM and 
the second session began at 1:30 PM. The subjects 
were called out of the work detail by the prison staff 
in order to attend the testing session. The prison staff 
scheduled the inmates according to living area at the 
institution. 

Present at each test session was a male or female 
experimenter and two assistants. There were always 
two male and one female researchers in attendance. 
The test sessions were introduced with a general ori- 
entation by the experimenter geared to establish rap- 
port with the inmates. Then, the experimenter read 
the instructions and the first three items aloud. The 
inmates were instructed to continue at their own pace 
until the questionnaire was completed. The research 
staff was available for questions at all times. In sev- 
eral instances the questionnaire had to be individually 
administered to illiterate or semiliterate inmates. 

The second phase of the experiment focused on in- 
mates newly admitted to the institution. It consisted 
of assessing the cognitive dimensionality, self-esteem, 
and attitudes toward people as well as changes in 
prejudice of these inmates over time, This phase be- 
gan simultaneously with the first phase but extended 
for 43 weeks longer. Over a period of 1 month every 
new inmate admitted to the institution was re- 
quested to take (a) the Rating of Acquaintances 
Scale (a composite of Scott's, 1966, Rating of Ac- 
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quaintances, a measure of cognitive dimensionality, 
and of Wrightsman’s, 1962, Occupational Ratings 
Form, a measure of self-esteem and prejudice), (b) 
Kiddie Mach (Christie et al., 1973, a measure of at- 
titudes toward people), and (c) Rating of Types (a 
measure of prejudice) within 1 week of confinement. 
The inmates’ names were taken from a list of the 
unassigned inmates which was compiled weekly. 

The instruments were administered to 112 subjects — 
in 10 sessions of approximately 10-15 inmates per ses- 
sion. There were 2 sessions a day, 1 day a week for 
5 weeks, The first session began at 10:30 AM and the 
second session began at 2:30 pM, There were three 
experimenters present at each session, one male and | 
two females. The senior experimenter was a female. 
The test sessions were introduced with an orientation 
intended to establish rapport with the inmates. The 
inmates were given the instructions in groups of 5 to 
10 by one of the three experimenters. On one occa- 
sion, the instrument had to be translated to a Cuban- 
American inmate who had language difficulties. The 
group session required 1 to 14 hours. 

Three weeks after his first session, each of the 110 
inmates who completed the first set of questionnaires 
was sent a pass requesting an interview. The second 
Rating of Types scale was administered to 83 inmates 
in groups of 10 to 14 inmates, There were four ses- 
sions a day, 1 day a week, over a 5-week period. The 
sessions began at 9:30 AM, 10:30 AM, 1:30 PM, and 
2:30 pat. The only exception to this was the initial 
testing day which had an additional session at 3:30 
pM. This last session was eliminated due to the poor 
response of the inmates. 

The inmates were reluctant to take the same ques- 
tionnaire a second time. However, the prison setting 
is bereft of social outlets for the inmates’ feelings. 
There is no one in whom the men can confide with- 
out fear of exposure. Therefore, the promise of a 
private interview with one of the experimenters on 
completion of the instrument was enough incentive 
to overcome any hesitancy. 

The instrument required 15-25 min. to complete. A 
few inmates required individual administration due to 
their literacy level. When the inmate completed the’ 
questionnaire, he notified one of the experimenters , 
who would then interview him. The experimente’ 
asked each inmate a series of specific questions, cot 
firming his length of stay at the institution, his livin, 
area, and so on. These questions were followed by 
open-ended questions asking the inmate to tell the 
experimenter about his living area and the institution 
in general. Each interviewer followed the same inter- + 
view schedule but allowed the inmate to elaborate 
and digress within reason. Each session required from 
1 to 14 hours. The entire data collection period re- 
quired 7 weeks. 


1An analysis of the 19 inmates who did not 
return leads to no conclusions about their being a 
special group in terms of the personality measures. 
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RESULTS 


There are three major domains which the 
data define: (a) the differences between the 
norms of the areas which are indicated by the 
results on the Black Equality Scale, (b) the 
changes in prejudice for each area, and (c) 
the personality variables predictive of change 
in each area. 


Racial Equality Norms 


Inmates from living areas with eight men 
in the cells had the lowest norm scale mean 
(48.70, SD = 11.11). This mean score was 
lower (£ = 1.80, p < .10) than the mean for 
those living in the dorms (M = 51.89, SD = 
9.31). The norm score mean for those living in 
the two-man cells was between these extremes 
(M = 4911, SD = 11.42), but not signifi- 
cantly different from either of them. A low 
score on the Black Equality Scale is indicative 
of a negative attitude toward interracial in- 
teraction. 


Changes in Prejudice 


Every subject who responded to both the 
first and second Rating of Types Scale (N = 
70) was included in the sample. The sample 
was split into three groups based on living 
area, In the dorms, 17% of the inmates in- 
creased in prejudice, 12% remained the same, 
and 70% decreased. In the 8-man cells, 38% 
of the men increased in prejudice, 36% re- 
mained the same, and 26% decreased in preju- 
dice. In the 2-man cells, 30% of the inmates 
increased in prejudice, 10% remained the 
same, and 60% decreased in prejudice. 

According to Cronbach and Furby (1970), 
“raw change’ or ‘raw gain’ scores formed 
by subtracting pretest from posttest scores 
lead to fallacious conclusions, primarily be- 
cause such scores are systematically related 
to any random error of measurement” (p. 
68). Therefore a residual gain score, cal- 
culated by the formula: “gain = posttest 


- Score minus the pre-post regression coefficient 


times the pretest score,” was computed for 
each subject (Elliott & Voss, 1974). Cronbach 
and Furby (1970) feel there is “no need to 
use measures of change as dependent vari- 
ables and no virtue in using them” (p. 78). 
Cronbach and Furby contend that a compari- 


son of posttest scores between groups supplies 
the same information. However, in the present 
case the focus of the research is on the change 
which varies within each group dependent on 
personality variables. Therefore, a comparison 
of posttest scores by group (i.e., living area) 
tends to obscure the changes. Division of 
these groups further on the basis of person- 
ality variables (i.e., high and low dimension- 
ality, high and low self-esteem): results in such 
small ms that analyses would be questionable. 
Therefore, I have used a residual gain score 
which Cronbach and Furby (1970) describe 
as “primarily a way of singling out individuals 
who changed more (or less) than expected” 
(p. 74). This is exactly what the present 
study wishes to highlight, The sample was 
split into three groups based on living area. 
The mean residual gain score was —.006 for 
the dorms, +.02 for the two-man cells, and 
+.26 for the eight-man cells, 

A stepwise regression analysis with residual 
change scores as the dependent measure per- 
formed on the total subject sample indicated 
that there were no main effects for any of the 
independent variables. However, an interac- 
tion effect between living area, race, and self- 
esteem accounted for 33% of the variance (p 
< .14). Therefore, the data were split into 
three groups based on living area. 

A stepwise regression analysis with residual 
change scores as the dependent measure per- 
formed on the subjects in the two-man cells 
indicated that five variables entered into the 
equation accounting for 76.5% of the vari- 
ance (p < .06). The five variables most highly 
associated with prejudice change are shown in 
Table 1. Machiavellianism (attitudes toward 
people in general) was a strong influence on 
prejudice change in the two-man cells, both as 
a main effect and in interaction with race and 
dimensionality. High Machiavellian subjects 
increased and low Machiavellian subjects de- 
creased in prejudice, Blacks tended to de- 
crease in prejudice with the high Machiavel- 
lian subjects decreasing less than the low 
Machiavellian subjects. Whites with high 
Machiavellianism had the greatest increase in 
prejudice and those with low Machiavellianism 
had the lowest amount of decrease in preju- 
dice. Dimensionality also entered the regres- 
sion equation as a main effect. Low dimen- 
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TABLE 1 
Stepwise REGRESSION ANALYSIS ON RESIDUAL 


CHANGE SCORES FOR INMATES IN 
Two-Man Cetts* 


Sequential Prob- Cumula- 
Sources af SS F ability tive R? 
RXM 1 246 9.17 023 358 
DXM 1 49 | 1.83 .224 359 
D 1 1.01 3.78 098 -638 
R 1 66 2.45 167 -667 
M 1 63 2.37 173 -765 


R = race; D = dimensionality; M = Machiavellianism. 
b 6. 


sionality tended to produce an increase in 
prejudice, while high dimensionality tended 
to produce a decrease in prejudice. An interac- 
tion effect was obtained between dimensional- 
ity and Machiavellianism, The cognitively 
simple subjects (low dimensionality) had ex- 
treme prejudice scores with Machiavellianism 
seeming to operate as an intervening variable 
determining the direction of change. Low di- 
mensionality, high Machiavellian subjects had 
the highest prejudice scores and increased in 
prejudice in the two-man cells. Low dimen- 
sionality, low Machiavellian subjects had the 
lowest prejudice scores to begin with and did 
not change. 

A stepwise regression analysis with residual 
change scores as the dependent measure per- 
formed on the subjects in the eight-man cells 
indicated that three variables entered into the 
equation accounting for 11.7% of the variance 
(p < .26). The three variables most highly 
associated with prejudice change are shown in 
Table 2. No main effects entered the regres- 
sion equation and the interaction effects only 
accounted for 11% of the variance. Appar- 
ently the variables isolated by Cook and 
utilized in this study are overpowered by the 
influence of the norms in this environment. 
The situation with extremely negative inter- 
racial interaction norms decreases the in- 
fluences of individual differences. 

A stepwise regression analysis with residual 
change scores as the dependent measure per- 
formed on the subjects in the dorms indicated 
that three variables entered the equation ac- 
counting for 78.9% of the variance (p< 
0004). The variables most highly associated 
with prejudice change are Dimensionality x 
Self-Esteem, self-esteem, and Dimensionality 
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X Machiavellianism (see Table 3), This liv- 
ing area most closely duplicates the environ- 
ment and the results of Cook’s (1972) study. 
Machiavellianism and self-esteem are the same 
variables which Cook found predicted changes 
in prejudice. Dimensionality interacts with 
these variables to influence the direction of 
change. The interaction between dimensional- 
ity and self-esteem is most likely a result of 
the rather high negative correlation (ia 
—.37) between the two variables. The reason 
for this correlation is explored in the Discus- 
sion section. The interaction effect between 
dimensionality and Machiavellianism is the 
opposite of that obtained in the two-man 
cells, reflecting the tendency of low dimen- 
sionality individuals to conform to the norms 
of the area, 

In order to establish the stability of the 
results, the total subject sample was divided 
into two random groups of 35 (Group X and 
Group Y) for further analysis, A stepwise re- 
gression analysis was performed separately on 
each group. In addition to the independent 
variables used previously, additional variables 
related to living area were entered into the 
regression analyses. The analysis of Group X 
indicated that 14 variables entered the regres- 
sion equation accounting for 73% of the vari- 
ance (p < .0097). Fourteen variables entered 
the equation for Group Y accounting for 63% 
of the variance (p < .044), Of the nine vari- 
ables which replicated across the two groups, 
there was a main effect for living area and 
four interaction variables with living area as 
a component (Living Area X Machiavellian- 
ism, Living Area X Race, Living Area X Di- 
mensionality X Race, and Living Area x 
Machiavellianism x Race) confirming the ne- 
cessity for analyzing each living area sepa- 


TABLE 2 


STEPWISE REGRESSION ANALYSIS ON RESIDUAL 
CHANGE SCORES ror INMATES IN 
Eigut-Man CeLLS* 


Sequential Prob-  Cumula- 

Sources df SS È ability F tive R? 
DXRXS 1 1.156 131 .2594 -036 
DXS 1550) 127 1903 2085 
DXRXM 1 -1.013 115 > (2916 «117 


*R = race; D = dimensionality; M = Machiavellianism. 
*p <.257. 
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rately. The other replicated variables were 
race, Machiavellianism X Self-Esteem, Di- 
mensionality X Machiavellianism X Self-Es- 
teem, and Dimensionality X Machiavellianism 
X Race. 


DISCUSSION 


The thrust of this study was to replicate 
Cook’s (1972) railroad game in a natural 
setting, The objective was to differentiate the 
characteristics of the individual and situation 
which influence interracial attitudes. First the 
characteristics of the situation are discussed 
emphasizing the differences and similarities to 
the setting of Cook’s (1972) study. Next, the 
personality variables predictive of change are 
discussed, The focus of attention was the in- 
dividual as he enters into, and adapts to, an 
interracial environment. As the inmates ar- 
rived at the correctional institution, they were 
assigned to a living area. This assignment was 
a random one within each race. There were 
basically three possibilities: 2-man cells, 8- 
man cells, or dorms. The living area was de- 
termined primarily on the basis of availability. 
The three areas differed on many dimensions. 

A look at Cook’s characteristics of contact 
situations, which, if optimum, could facilitate 
reduction of prejudice, indicates the differ- 
ences among Cook’s laboratory setting and the 
three living areas in the prison environment. 


Equal Status 


All the situations within the correctional in- 
stitution “defined as equal the status of the 
participants from the two racial groups.” Thus 
the individuals involved in Cook’s laboratory 
study and the subjects in the present prison 
study had equal status within the environ- 
ment. 


Cooperative Relationship 


The laboratory situation was optimum in 
terms of a cooperative relationship. This vari- 
able probably varied according to living area 
and ‘job in the institution. Due to the numer- 
ous types and functions of interracial interac- 
tion in the prison, it was not feasible to evalu- 
ate the level of cooperation between indi- 
viduals. Thus no comparison between the 
levels of cooperation in the two settings is 
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TABLE 3 


STEPWISE REGRESSION ANALYSIS ON RESIDUAL 


CHANGE Scores FOR INMATES IN Dorms* 
Sequential Prob- Cumula- 
Sources df SS F ability F tive R? 
DXS 1 3.847 33.81 0002 594 
5 1 558 4.90 .045 680 
DXM 1 704 6.2 .067 789 
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possible, Some cooperation was necessary for 
survival in the prison, but the only areas 
which appeared to actually encourage a mu- 
tually interdependent relationship were the 
dorms. 


Norms 


The differences between the social norms of 
the immediate situation in the correctional in- 
stitution were quantified by the Racial Equal- 
ity Scale. This scale was administered to the 
inmates in the institution prior to the in- 
carceration of the subjects studied. The norms 
for the diverse living areas were defined in 
terms of what the inmates felt was appropri- 
ate interracial behavior. The data indicate 
that the norms for the dorms were most posi- 
tive toward racial equality in interaction. Of 
the inmates, 70% living in this area decreased 
in prejudice over the time period studied. The 
two-man cells had less positive interracial 
norms than the dorms. Of the inmates, 60% 
living in two-man cells decreased in prejudice. 
The men in this area were segregated by cell 
although they were integrated by cell block. 
In contrast, the dorms were completely in- 
tegrated. Therefore, the interaction patterns 
for the two areas are quite different. The two 
situations are somewhat analogous to the bi- 
racial housing study of Deutsch and Collins 
(1951), without the preselection factor. 

The 8-man cells had the lowest mean racial 
equality norm and the lowest decrease in 
prejudice (26%). The men in this area re- 
ported much interracial conflict. Tt is neces- 
sary for the reader to bear in mind that none 
of the living areas in the institution were ex- 
tremely positive toward interracial interac- 
tion. The racial equality norms in the dorms 
were positive only relative to those in the 
2-man cells and 8-man cells. All the norms 
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probably tended to be more at the negative 
end of the continuum than the extremely posi- 
tive norms developed for the railroad game. 
The norms for the dorms and 2-man cells 
most closely approximated those established 
for the railroad game, 


Attributes of the Individuals 


Cook’s study concentrated entirely on white 
attitudes toward blacks. The present study 
looks at the attitudes of both races toward 
the other. The attributes of the individual 
blacks and whites in the institution were not 
defined. However, it can be assumed that in a 
population of approximately 900 blacks and 
900 whites, the individuals of necessity would 
be diverse in attributes. Thus, some of the 
blacks and whites must have “contradicted 
the prevailing stereotyped beliefs about 
them.” The difference between actual attri- 
butes and stereotypes would not, of course, 
have been so great as that of the black coed 
in Cook’s study. But the attributes of the 
blacks would be more similar to those of the 
whites in the institution, since the population 
tended to come from the same socioeconomic 
Strata. 


Intimate Association 


The last characteristic was even more con- 
ducive to inmate association in the prison 
settings than in the laboratory, The 24-hour 
long-term interaction, in a live-in situation is 
more likely to promote personal or intimate 
association than the 2-hour, 5-day-per-week, 
month-long laboratory interaction in the rail- 
road game study, In summary, the three 
prison environments closely resembled the 
contact situation of the railroad game with 
two exceptions: the norms and the level of 
intimacy. The norms in the dorms and the 2- 
man cells most closely approximated the 
norms in the railroad game. The norms in the 
8-man cells were much more negative than 
those of the railroad game. The level and di- 
rection of change in prejudice reflected the 
norms of the situation, The level of intimacy 
possible was much higher in all the prison 
settings than the railroad game, This greater 
intimacy was due to the much longer contact 
time in the prison. 


Personality Variables 

The concern in the present study is whether, 
and in what direction, the person changes in 
his interracial attitudes due to the contact 
situation, The various living areas influenced 
the inmates assigned to them differently both 
in percentage of subjects who decreased in 
prejudice and in terms of the personality vari- 
ables predictive of change. The variables 
which entered into the regression equations 
differed for each type of environment. The 
predictor variables included in this study ac- 


counted for a large amount of variance in 
the 2-man cells (76.5%) and the dorms 


(79%). These are the areas which most 
closely resembled the situation described in 
the railroad game, Therefore, it is logical that 
the personality factors would have a similar 
effect in both circumstances, However, these 
same variables accounted for only 11.7% of 
the variance in the 8-man cells and the sig- 
nificance level was rather low (p < 257) for 
much confidence in these results, It appears 
that the variables Cook isolated are important 
determinants of prejudice change in situa- 
tions where the norms, as in the railroad 
game, are positive toward interracial interac- 
tion. However, in situations where the norms 
are more negative toward racial equality, 
other factors produce the primary influence 
on prejudice change. 

Machiavellianism was an important pre- 
dictor of prejudice change. The data support 
Cook’s conclusion that individuals with nega- 
tive attitudes toward people in general tended 
to become more prejudiced with interracial 
contact and those with positive attitudes 
tended to decrease in prejudice in the same 
situation, This was true in the present study 
for white and for black subjects in every liy- 
ing area, and was particularly true in high 
dimensionality subjects in areas with positive 
norms for interracial interaction, Cook’s 
(1972) research was conducted with college 
Students. People tend to increase in complex- 
ity with an increase in education, thus his 
subjects were probably similar to those in the 
present study who were high in dimension- 


ality. In that study, all the subjects were- 


white and the situation was a positive one 
for interracial interaction, The present study 
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indicates that the effect of Machiavellianism 
holds true regardless of the norms of the situa- 
tion or the race of the individuals involved. 

Dimensionality was the strongest predictor 
of prejudice change. Low dimensionality sub- 
jects tended to reflect the norms of their liv- 
ing area, Dimensionality in conjunction with 
the norms of the area was the most consistent 
predictor of direction of change. It would 
appear that low dimensionality subjects tend 
to conform to the norms of their living area. 
Thus dimensionality is predictive of a change 
in prejudice toward the norms of the area. 
Dimensionality and closely related variables 
(e.g., rigidity) were not found to be predic- 
tive of prejudice change in Cook’s (1972) 
study, However, Cook eliminated subjects on 
the basis of the California F Scale, thus pro- 
ducing a truncated subject population on this 
variable. It was the low dimensionality sub- 
jects who changed the most in prejudice. 
These are the very subjects who were elimi- 
nated by Cook. 

The present study employed an adaptation 
of the measure of self-esteem used by Cook 
(1972) in the railroad game. Both instruments 
use the same type of responses as the dimen- 
sionality instrument developed by Scott 
(1966). This type of response is confounded 
by the subject’s level of complexity. The 
more complex the subject’s cognitive struc- 
ture, the more likely he is to distinguish more 
levels of a trait. The scoring of the self-esteem 
scale would thus tend to produce a high nega- 
tive correlation between self-esteem and di- 
mensionality. This is particularly true in the 
present study, since the format of the instru- 
ment was also the same as Scott’s (1966) 
measure. The results obtained by Cook and 
the prison study for self-esteem are very 
similar to those for dimensionality in the 
present study, Low self-esteem or high di- 
mensionality are predictive of a decrease in 
prejudice, Although the results were obtained 
for self-esteem in the present study, they 
must be accepted with extreme caution. Di- 
mensionality is'a plausible explanation for the 
changes, However, in the dorms, self-esteem 
entered the regression equation separately 
from dimensionality, indicating a separate in- 
fluence on the results. The present study in- 
dicates that cognitive structure is a more 
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powerful predictor of a decrease in prejudice 
after interracial interaction than self-esteem. 
This effect is particularly apparent in areas 
with negative norms toward interracial in- 
teraction. 

Studies of interracial contact have been 
limited by the unpredictability of the direc- 
tion of change in racial attitudes, In a well- 
controlled laboratory study, Cook (1972) ma- 
nipulated the characteristics of the situation 
so they were optimum for the reduction of 
prejudice, He then demonstrated that positive 
attitudes toward people in general and low 
self-esteem are both predictive of a decrease 
in prejudice in this situation, The present 
study looked at three natural settings which 
varied in norms of interracial interaction, The 
results indicate that the norms of the situa- 
tion have a definite effect on the changes in 
prejudice of persons newly admitted to the 
area, An individual tends to change attitudes 
toward the norms of his living area, The 
amount of change is dependent on the indi- 
vidual’s cognitive complexity, self-esteem, and 
attitudes toward people in general. 

In summary, it appears that the three 
prison environments closely resembled the en- 
vironment of the railroad game with two ex- 
ceptions: the norms and the level of intimacy 
possible. The norms for the dorms and two- 
man cells most closely approximated those 
established for the railroad game, while the 
norms in the eight-man cells were much more 
negative.” The changes in prejudice for each 
living area reflected its norms, The decreases 
in prejudice tended to be much higher in the 
prison environment (70% in the dorms, 
60% in the two-man cells, and 26% in the 
eight-man cells) than in the railroad game 
(35%-40%). This stronger effect is most 
likely a result of the much more personal or 
intimate association in the prison environ- 
ment. The inmates were in the contact situa- 
tion for 24 hours a day. They did not have 
the option of returning to their homes for a 
majority of the day as did the subjects in the 
railroad game. The inmates, therefore, did not 
have any exposure to norms conflicting with 
those in their living areas. 

The most dramatic evidence in support of 
Cook’s (1972) findings is found in the areas 
with positive norms. The subject populations, 
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sex of subjects, environments, length of con- 
tact, and statistical analyses are radically dif- 
ferent. However, the results of the prison 
study support the principal conclusions of 
the railroad game. Little doubt remains that 
in a situation where the interracial norms are 
positive that attitudes toward people in gen- 
eral and self-esteem are highly predictive of 
change in attitudes. Cognitive complexity 
also contributes to. the predictability of the 
variables. It would appear that in areas 
where the norms are extremely negative to- 
ward interracial interaction, the situation 
greatly decreases the influence of these per- 
sonality variables. 
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Observer Bias: A Stringent Test of Behavior Engulfing the Field 


Melvin L. Snyder and Arthur Frankel 
Dartmouth College 


Jones and Nisbett argued that observers often misattribute an actor’s behavior 
to personal dispositions rather than situational constraints. Kelley has criti- 
cized evidence for this bias on grounds that manipulations of behavior have 
been stronger or clearer than manipulations of context. The current experi- 
ment circumvents this problem by manipulating perception of behavior through 
a manipulation of context. Actual behavior was held constant while varying 
observers’ beliefs about the target person's situation. Subjects watched two 
videotaped interviews without any sound track and were led to believe that 
the topic was sex in one, politics in the other. When observers believed that 
the target was in the more anxious situation (sex condition), they not only 
perceived that she was more anxious during the interview but also attributed 
to her more of a disposition to be anxious, We regard this as a misattribution. 
In another condition, observers learned of the nature of the interview only 
after they had watched it. Knowledge of context no longer affected perception 
of emotional state. The result was that observers made stronger attributions 
of a disposition to be emotional to the politics interviewee than the sex inter- 
viewee, in line with Kelley’s discounting principle. Thus, the timing of in- 
formation about context determined whether discounting or the observer bias 


was obtained. 


Jones and Nisbett (1971) proposed that 
“there is a pervasive tendency for actors to 
attribute their actions to situational require- 
ments, whereas observers tend to attribute 
the same actions to stable personal disposi- 
tions” (p. 80). They also stated: “Without 
insisting that the actor is usually right, we 
can point to many instances where the ob- 
server’s interpretation of behavior is simply 
wrong” (p. 88). They cited the tendency of 
observers to infer that a writer’s attitude 
corresponds to his essay, even though situa- 
tional constraints seem an adequate explana- 
tion for preparing the essay, for example, 
when the essay is an opening statement for 
a debate (Jones & Harris, 1967; Jones, 
Worchel, Goethals, & Grumet, 1971). It is 
implicit that to ascribe an attitude consistent 
with the essay is to attribute the essay to the 
author’s attitude. 


Requests for reprints should be sent to Melvin L. 
Snyder, Psychology Department, Dartmouth Col- 
lege, Hanover, New Hampshire 03755. We would 
like to thank the following people for comments on 
the manuscript and for discussions of relevant ideas: 
Edward E. Jones, Robert E. Kleck, John T. Lan- 
zetta, and William N. Morris. 


Kelley (1971) questioned whether such 
examples demonstrate a bias on grounds that 
“context factors have not been manipulated 
as strongly or as clearly as has the behavior 
itself” (p. 18). Thus, in the above studies of 
attitude attribution, the experimenters pre- 
pared the essays and may have inadvertently 
made them more strongly pro or con than 
the constraints would actually produce. For 
observers to infer attitudes that correspond to 
essays might merely be reasonable. To test 
this possibility, Snyder and Jones (1974) 
randomly assigned subjects to write essays in 
support of attitudinal positions. They inter- 
changed the essays produced and asked sub- 
jects to predict the author’s attitude. Even 
though subjects knew that the author, like 
themselves, had been arbitrarily assigned the 
position to support and even though the essay 
was only as pro or con as someone would ac- 
tually write under the circumstances, there 
was once again a significant propensity of ob- 
servers to infer that the author’s attitude 
corresponded to his essay. 

The current study responds to Kelley’s 
(1971) criticism in a different way. It rules 
out a weak manipulation of context factors as 
the explanation for the observer bias by using 
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a paradigm where the bias can only occur to 
the extent that subjects are aware of the con- 
text. It takes advantage of the finding that 
the perception of another’s emotional state is, 
in part, determined by the observer’s belief 
about the other’s situation (e.g., Frijda, 1969; 
Munn, 1940; Sherman, 1927; Watson, 1972). 
Thus, thinking that someone is in an emotion- 
provoking situation can result in the percep- 
tion that he is feeling the emotion provoked. 

The theme of the present paper is as fol- 
lows: The observer bias is manifested when 
an observer who believes that another is in 
an emotional situation not only perceives 
that the other is emotional during the situa- 
tion but also attributes to him a personal 
disposition to be emotional. Provided there is 
no logical basis for such an attribution—and 
we will argue there is not—we believe it 
illustrates the observer bias proposed by Jones 
and Nisbett (1971). Observers may dissociate 
the emotional state from its situational source 
and misattribute it to a disposition of the ac- 
tor. This result cannot occur because context 
is less well manipulated than behavior, for 
the perception of behavior—emotional state— 
is accomplished by a manipulation of context 
rather than actual behavior. Thus, behavior, 
whether real or merely imagined, may “engulf 
the field” (Heider, 1958, p. 54), by which 
Heider meant behavior is unduly attributed 
to the person behaving rather than situational 
constraints. 

Snyder (1974) has used the paradigm 
described, He had observers watch a video- 
tape of a male target person taking a word 
association test. Half believed that the target 
anticipated receiving electric shocks in the 
next phase of the experiment; the other half 
did not. A post hoc division of subjects by 
sex produced some evidence for the engulfing 
bias. Female subjects who thought the target 
anticipated electric shocks not only perceived 
him as feeling more apprehensive at the end 
of the word association test but also attrib- 
uted to him more of a disposition to be 
anxious, 

Logic for Concluding that Misattribution 
Has Occurred 


Jones and McGillis (1976) argued for the 
use of correspondent inference theory to es- 


tablish a rational baseline against which to 
identify attributional bias. A central tenet of 
the theory is that when behavior is expected, 
it is uninformative about the actor's attri- 
butes. The basis for perceiving the target as 
anxious in Snyder’s study was the knowledge 
that he anticipated electric shock. In other 
words, observers perceived him as anxious 
because they expected him to be anxious. Ac- 
cording to correspondent inference theory, 
this expected reaction is uninformative about 
the actor. However, subjects acted as if it 
were informative. 

Jones and McGillis (1976) noted the simi- 
larity between this concept of expectation and 
the concept of consensus in Kelley’s (1967) 
approach to attribution. Observers in Snyder’s 
study perceived the target as more anxious 
when he anticipated shock because they be- 
lieved most people would react that way. In 
Kelley’s terms, they believed that feeling 
anxious is the consensus reaction to the an- 
ticipation of shock. Orvis, Cunningham, and 
Kelley (1975) noted that high consensus im- 
plies that the attribution should be made to 
the stimulus rather than to the person. Thus, 
both Kelley’s model and correspondent infer- 
ence theory imply that perceiving another as 
emotional because the situation provokes emo- 
tion supplies no basis for concluding that he is 
an exceptionally emotional person. 

The “fly in the ointment” is that the sup- 
porting data from Snyder (1974) are, after 
all, based on a post hoc sex split. Unlike the 
female subjects, males who believed the target 
person anticipated electric shocks failed to 
perceive him as more scared and apprehen- 
sive. Therefore, they could not and did not 
show engulfing on measures of dispositional 
anxiety, for there was no perceived emotional 
state to misattribute to the target person. 
Nevertheless, the post hoc nature of the 
analysis prompted us to attempt to replicate 
the engulfing result in the current study using 
male subjects. 

Tn addition, we wanted to see whether we 
could eliminate the anticipated engulfing effect 
by manipulating the timing of information 
about the target’s situation. Zadny and Ger- 
ard (1974) found evidence that giving a sug- 
gestion about how another would behave led 
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to better recall of confirming evidence if sub- 
jects were given it before observing the other 
but not when it came afterwards. Similarly, 
observers’ beliefs about another’s situation 
may only affect their impression of his emo- 
tional state if they get the information before 
observing the other, as in Snyder (1974). It 
may have less or no impact on perceived 
emotional state when it comes afterwards, 

If the situation appears to have had no 
effect but observers expect that it should have 
made the target emotional, according to cor- 
respondent inference theory (Jones & Mc- 
Gillis, 1976), this violation of expectation is 
informative. The implication is that the target 
is relatively uninclined to be emotional, Ap- 
plying Kelley’s (1971) discounting principle 
produces the same result: “The role of a given 
cause in producing a given effect is discounted 
if other plausible causes are also present” (p. 
18). In this case the role of the person in 
producing his anxiety is discounted if the 
situation provides a plausible reason for the 
emotion. Thus, knowing that the target’s sit- 
uation provokes emotion may have opposite 
effects on attributions of basic emotionality 
depending on when the knowledge is obtained, 


METHOD 


Overview of the Design 


Male college students watched two videotapes 
without any soundtrack. Each tape presented a 
female undergraduate who was being interviewed. 
Subjects were arbitrarily told that the topic was sex 
ior one target person and politics for the other, 
They were given this information either before or 
after watching the videotape. We presumed that 
subjects regarded the topic of sex as more anxiety 
Provoking than politics. We predicted that when 
subjects found out the topic before observing the 
interview, those who thought it was sex rather than 
politics would perceive the target as being more 
anxious and would tend to misattribute this anxiety 
to a disposition to be anxious. But when subjects 
discovered the nature of the interview only after 
observing it, interview topic might not influence 
perception of the target’s anxiety. If so, discounting 
should occur. The target would then be seen as a 
more basically anxious person when the topic was 
politics rather than sex. 

Interview topic was varied within subjects, 
Whether subjects were informed of the topic before 
or after the interview was a between-subjects varja- 
ble. The basic experiment was a 2X 2, Sex versus 
Politics X Before versus After design. In addition, 
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We counterbalanced the order in which the target 
Persons were presented and the order of topics, 


Subjects 


Subjects were 73 undergraduate males enrolled in 
introductory psychology at Dartmouth College. Of 
these, 9 were eliminated for various reasons: 4 
knew one of the confederates, 2 did not follow 
instructions, and 3 were suspicious about the inter- 
view topics. The remaining 64 subjects were run in 
24 sessions with from 2-4 people per session, Whether 
information about interview topic came before or 
after observing the interview was manipulated be- 
tween sessions, However, whether the first interview 
or the second was said to be on sex or politics was 
varied within sessions, 


Procedure 


Subjects sat at a long table separated from one 
another by partitions. All had a clear view of a 
television monitor. They were given a written intro- 
duction, which can be summarized as follows: They 
were requested to watch someone being interviewed 
while, at the same time, her heart rate and galvanic 
skin response were being monitored. Our professed 
Purpose was to see how well untrained observers 
could perform the polygraph’s function and assess 
the person’s emotional state. The videotape sound 
track was eliminated on grounds that we were in- 
terested in nonyerbal behavior. Observers were told 
that the target person knew that she was being 
videotaped. Besides assessing: the person's emotional 
State, they would also get an opportunity to give 
their overall impression of the individual, 

After subjects had finished reading the introduc- 
tion, the experimenter said they would view two 
interviews, each lasting about 10 minutes, They were 
told not to communicate with the others while 
watching the videotapes, and they were requested 
not to take notes. After ensuring that subjects under- 
stood the Procedure, the experimenter either dis- 
tributed lists of the purported interview questions 
(before condition) or started the videotape (after 
condition). After condition subjects received the list 
immediately after having viewed the videotape. 
Whether subjects received the list before or after, it 
was given with the comment, “Here is a list of the 
questions that were asked; they may help.” 

Each list contained two warm-up questions as well 
as five questions about either sex or politics. Politi- 
cal questions included: “What is your impression of 
the two-party system?” and “At what age do you 
think people should be allowed to vote and why?” 
Two questions from the Sex list were: “If you knew 
an undergraduate female who engaged in sex with 
most of her dates, would you consider her promiscu- 
ous and why or why not?” and “Do you think 
partners should experiment with various sexual be- 
haviors for the sake of trying everything at least 
once?” The experimenter was blind as to which list 
each subject received. 
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Making the Videotapes 


The videotaped interviews employed in the study 
were obtained by editing master tapes of interviews 
conducted with two female confederates who were 
undergraduates at Dartmouth College. The confed- 
erates were fairly attractive and casually dressed. 
They were requested to behave naturally during their 
interviews and to avoid emotional extremes. They 
were told that their behavior should be compatible 
with having an interview on either politics or sex. 
Lipreading on the part of observers was not a 
problem. Both confederates were actually interviewed 
about their travel experiences. 

The camera was stationary and focused on the 
interviewee’s face and upper torso. To support the 
fiction that we were interested in physiology, the 
middle finger of the interviewee’s left hand was 
fitted with a plethysmograph that was visible on 
the videotape. 

The tapes were edited so that observers saw the 
interviewee respond to seven different questions. 
Each response was preceded by a segment lasting 
five-ten seconds in which the interviewee listened to 
the end of the interviewer’s question. The interview- 
ee’s responses ranged in length from about 30 sec- 
onds to 120 seconds. 


Dependent Measures 


Perception of emotional state. After viewing each 
taped interview, subjects indicated their impressions 
of the interviewee’s emotional state during the 
interview on four 9-point bipolar scales. Endpoints 
were cheerful/sad, comfortable/uncomfortable, an- 
noyed/agreeable, and upset/tranquil, respectively. 
Our interest is in how upset and uncomfortable sub- 
jects saw the target. 

Attribution of emotional disposition. After indi- 
cating their impressions of the interviewee’s emo- 
tional state, subjects were presented with three dif- 
ferent sorts of measures designed to assess the attri- 
bution of dispositional anxiety. They were compara- 
ble to those used by Snyder (1974). Each included a 
number of buffer items. One measure was a ques- 
tionnaire that asked subjects to predict the target’s 
emotional reaction to six different events. The fol- 
lowing is an example: 


She walks into her economics seminar and is 
asked to give a ten-minute talk on the previous 
night’s reading. Although she read the required 
material, she was unaware that the professor 
would ask someone to give such a talk, How does 
she feel? 


Subjects indicated how happy, hostile, and appre- 
hensive the target would feel, each on a scale from 
1-9. The five other events were: being asked by an 
acquaintance right after voting how she voted, 
meeting her sisters at the airport after a two-year 
separation, being badgered by a panhandler, inter- 
viewing for a job, and meeting an attractive male at 
a party. The dependent measure was derived by 
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summing the apprehensiveness scores across the six 
events. 

A paired-comparisons checklist was also used. 
Subjects were asked what traits, what stable person- 
ality predispositions they would expect of the target 
person if they got to know her well. Six adjectives 
were paired in all fifteen possible combinations and 
the subject had to check the one in each pair that 
he would expect to be more characteristic of the 
target person. The 6 adjectives were shy, pleasant, 
trustworthy, hostile, confident, and the one of in- 
terest—nervous. 

The other measure involved assigning percentile 
ranks on various dimensions. Subjects were asked to 
compare the interviewee with other female under- 
graduates, The following trait dimensions were in- 
cluded: responsibility, sincerity, hostility, warmth, 
likability, and the critical one—anxiety. The subject 
was to circle the percentile rank (0%-100% in inter- 
vals of 10%) that reflected the percentage of under- 
graduate females that the interviewee surpassed with 
respect to the trait in question. 

The dispositional measures were in a booklet sepa~ 
rate from the emotional state measures. For half 
of the subjects they were in the order in which they 
were described, while for the other half the order 
was reversed. All subjects began the dispositional 
measures together, and care was taken for both 
written and oral instructions to impress upon sub- 
jects that the dispositional measures concerned what 
the interviewee was basically like. 


RESULTS 
Perception of Emotional State 


Two of the four emotion perception scales 
were relevant to the perception of anxiety: 
comfortable/uncomfortable and upset /tran- 
quil. Scores on the two scales were summed 
together for each subject such that high scores 
indicated the target was perceived as upset 
and uncomfortable. The first row of Table 1 
presents the means. There was a significant 
Timing X Interview Topic interaction, F(1, 
56) = 10.66, p < .002. As predicted, when 
observers knew the nature of the interview 
before watching it, they rated the interviewee 
as being more uncomfortable and upset when 
the topic was sex than politics, (56) = 3.42 
p <-.002. However, when observers were in 
formed about the topic after they had already 
seen the interview, topic made no difference 
(p >.15). The variation in timing of infor- 
mation about the interview topic determined 
whether or not that information had an im- 
pact on perception of emotional state. We 
therefore expect topic and timing to interact 
on the attribution of emotional dispositions. 
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TABLE 1 
PERCEPTION OF EMOTIONAL STATE AND ATTRIBUTION Or EMOTIONAL DISPOSITION 
Timing of information about interview topic 
Before Interview After Interview 
Dependent measure (range) Politics Sex t Politics Sex t 
Emotional state 
Upset and uncomfortable (2-18) 8.50 11,09 3.42** 10.56 9.47 1.44 
Emotional disposition measure 
Predicted apprehensiveness across six 
situations (6-54) 34.38 38.28 2,26* 37.69 32.13 3.23%% 
Paired comparisons checklist: nervous 
(0-5) 2.31 2.63 1.09 3.22 1.97 4.2544 
Percentile rank for anxiety (0%-100%) 46.25 58.44  2.27* 63.75 40.00 4.4344 
Combined dispositional index —.56 69 2.30% 1.26 =1.35 48x 
Note, Each mean is based on 32 observations. The higher the score, the more the assigned emotionality, 
Bh Sin 
mp < 001, 
Engulfing should occur when the information who believed the topic was politics rather 


comes before the interview and affects percep- 
tion of emotional state. Discounting should 
occur when the information comes afterwards 
and is ineffective. 


Attribution of Emotional Disposition 


Table 1 presents the means for the three 
dispositional anxiety measures. Inspection of 
the means reveals that every difference be- 
tween scores for sex and politics is in the 
predicted direction. For each measure the pre- 
dicted Timing x Interview Topic interaction 
was significant: For the predicted apprehen- 
siveness measure, F(1,48) = 15.11, p< 
001; for the number of times nervous was 
chosen on the adjective checklist, F(1, 48) = 
14.10, p < 001; and for the percentile rank- 
ing of anxiety, F(1,48) = 22.43, $ < .001. 
Incidentally, the degrees of freedom were 
fewer for the dispositional measures than the 
state measure because of the addition of a 
variable, order of the three measures. Order 
made no difference. 

In the before condition, the engulfing pat- 
tern occurred, When observers thought the 
topic was sex rather than politics, they pre- 
dicted that the target person would be more 
apprehensive across six situations, marked 
nervous more often on the paired-comparisons 
checklist, and assigned the target a higher 
percentile rank on anxiety. These differences 
were all reversed in the after condition, which 
indicates that discounting occurred. Observers 


than sex assigned the target person higher 
scores on dispositional anxiety. 

Table 1 indicates the ¢ test results for these 
comparisons. One of them was nonsignificant: 
the difference on the paired-comparisons 
checklist in the before condition. Each of the 
three dispositional scores for each subject was 
converted to a z score based on the distribu- 
tion for all 64 subjects, and the three z scores 
were summed together. This combined dispo- 
sitional index was analyzed, and scores are 
presented in the last row of Table 1. The in- 
teraction was significant, F(1,48) = 25.42, p 
< .001. In the before condition, there was a 
significant engulfing effect; in the after condi- 
tion, a significant discounting effect, 

A feature of the results we had not antici- 
pated deserves some consideration. We origi- 
nally conceived of the politics condition as 
providing a neutral baseline, However, before 
politics and after politics conditions differed 
nearly as much as before sex and after sex 
conditions. It seems that the effects occurred 
about as much in the politics conditions as in 
the sex conditions. We therefore assume that 
believing the topic would be politics before 
watching the interview reduced the extent to 
which the target person was seen as upset and 
uncomfortable compared with a hypothetical 
condition in which no information is given 
about the nature of the interview. The sub- 
jects’ impression of the target person as tran- 
quil was then inappropriately reflected on the 
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dispositional measures. And when the infor- 
mation that the topic would be politics was 
given afterwards, subjects regarded the tar- 
get’s behavior as more anxious than the situ- 
ation called for, and therefore ascribed an 
anxious disposition to her. 

Finally, these results were not qualified by 
interactions with first tape versus second tape, 
target persons, order of target persons, or 
order of interview topics. 


DISCUSSION 


Because Snyder’s (1974) finding of engulf- 
ing depended on a post hoc split on subject 
sex, there were doubts about its replicability. 
Those doubts have been assuaged: The en- 
gulfing result did replicate. Moreover, we have 
established that in the emotion paradigm, not 
only females but also males display engulfing. 

In addition, we were able to show that in 
this paradigm, engulfing may not occur when 
subjects discover how much the situation pro- 
vokes emotion only after observing the target 
person in this situation. Indeed, this sort of 
engulfing cannot occur whenever knowledge 
about the situation has no impact on percep- 
tions of the target’s emotional state. In the 
absence of such impact, one would expect 
discounting (Kelley, 1971). The more reasons 
for an effect, the less the effect is specifically 
attributed to any one of them. There are more 
reasons for being anxious when the situation 
provokes that response; therefore, subjects 
should see the target person as less disposed 
to be anxious, and they did so, 

These results stand in contrast to the en- 
gulfing bias we found in the before condition, 
but they may have a bias of their own. One 
could argue that in the sex condition, subjects 
should have inferred that the target was feel- 
ing anxious but was masking that emotion, If 
so, there might have been no basis for deduc- 
ing that she had any less of a disposition to 
be anxious than in the politics condition. 
However, given that subjects in the after sex 
condition did not presume masking or other- 
wise infer an emotional state, discounting is 
the expected and rational result. It is reassur- 
ing that a phenomenon often replicated (Jones 
& Davis, 1965; Kelley, 1971) can be ob- 
tained for emotions. 
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The more exciting finding, however, in our 
view, is the engulfing effect obtained in the 
before condition. We find it exciting because 
we think it cannot be explained in the way 
Kelley (1971) interpreted previous purported 
instances of the bias. He suggested that “con- 
text factors have not been manipulated as 
strongly or as clearly as has the behavior it- 
self” (p. 18). This seems unlikely in the cur- 
rent study in that the manipulation of “be- 
havior,” perceived emotional state, was ac- 
complished by means of a manipulation of 
context, the nature of the interview. One 
would have to argue that the variation of in- 
terview topic affected perception of emo- 
tional state beyond what is attributable to 
the difference in topics. 

This strikes us as implausible. Moreover, 
Frijda (1969) has argued that knowledge of 
the situation has less influence on perception 
of emotion than the target person’s facial ex- 
pression, If knowing that the situation pro- 
yokes emotion enhances perceived emotion- 
ality to an extent that falls short of the level 
expected in such a situation, correspondent 
inference theory (Jones & McGillis, 1976) 
predicts this expectancy violation will result 
in less attributed dispositional emotionality, 
which is the opposite of engulfing. That pos- 
sibility makes it all the more remarkable that 
engulfing occurred. 

It would be rational to attribute anxious 
dispositions to one who is in an anxious situ- 
ation if he were permanently bound to the 
situation or if it were a powerfully traumatic 
event likely to induce a permanent effect on 
the target. In both the current study and in 
Snyder (1974), the situation was temporary 
and of such a scope that the possibility of 
lasting damage seems remote. Both electric 
shock and the opinion inerview, while anxiety 
provoking, are not beyond the anxiety-produc- 
ing magnitude of events common in a stu- 
dent’s life, for example, taking a final exam. 

We now consider other alternative expla- 
nations for the engulfing result. Could attrib- 
uting a disposition to be emotional to one who 
is in an emotional situation be explained as a 
derogation of an innocent victim (Lerner & 
Simmons, 1966)? To test this possibility, we 
analyzed an evaluation index obtained by 
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combining percentile rankings for warmth and 
likability. Snyder (1974) composed and ana- 
lyzed a similar index. Neither analysis found 
evidence for derogation, Upon closer scrutiny, 
this is not surprising. Both in Snyder’s study 
and in the current one, the videotapes were 
presented as records of a past event rather 
than as a live transmission, Lerner and Sim- 
mons’ past event condition also found no sig- 
nificant derogation. 

There is another possible explanation, Once 
subjects perceived the target as emotional, 
they might have simply repeated their re- 
sponses on the dispositional measures out of 
laziness rather than conviction. To discour- 
age this, in both the current experiment and 
in Snyder (1974), different formats and dif- 
ferent terminologies were used on state and 
dispositional measures. Perhaps, more import- 
antly, the results in the after condition rule 
out simple repetition. The nature of the in- 
terview had no significant effect on percep- 
tion of emotional state, but it had a highly 
significant impact on attribution of emotional 
disposition, 

We should mention a factor that may have 
worked against the engulfing effect. McArthur 
(1972) found that emotions, along with opin- 
ions, were less likely to be attributed to the 
person and more likely to be attributed to a 
provoking stimulus than either accomplish- 
ments or actions. Her study provides no way 
of determining when a bias has occurred. 
Nevertheless, that the current study employed 
the response class of emotions makes it espe- 
cially useful as a stringent test of the ob- 
server bias. 

We next consider the role empathy may 
have played. Jones and Nisbett (1971) sug- 
gested, and Regan and Totten (1975) pro- 
vided supporting evidence, that empathy de- 
creases dispositional attributions and increases 
situational ones. Snyder (1974) asked sub- 
jects to empathize with the target person and 
thus may have weakened the engulfing effect. 
From another perspective, in the Snyder 
study, despite instructions to empathize, low 
empathy may have facilitated results, Em- 
pathy may have been low if persons of the 
opposite sex were less likely to empathize with 
one another than those of the same sex. On 
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both occasions that engulfing has occurred, 
the observer-target relation was heterosexual. 
Alternatively, simple dissimilarity may be an 
important factor. Banks (1976) has provided 
some evidence that the more dissimilar the 
target is to the observer, the more likely the 
observer is to make dispositional attributions. 
Conversely, this result might be mediated by 
differences in empathy, As it stands, either 
low empathy or high dissimilarity may have 
contributed to the engulfing result. 

A legitimate question is whether the bias 
we have found is peculiar to observers. Might 
not actors show the same bias if they actually 
engaged in interviews on sex or politics? An 
experiment to test this would have to be a 
good deal more complicated than the current 
one if it were to retain the feature of con- 
trolling actual behavior while manipulating 
the nature of the interview. Actors might in- 
deed show the bias, but we predict that the 
extent to which they do is a function of how 
much attention experimental procedures direct 
to self. This is in keeping with one of the 
explanations that Jones and Nisbett (1971) 
have offered for differences in attribution be- 
tween actors and observers. They have sug- 
gested that the actor is highly salient to the 
observer, though not to himself. Manipula- 
tions of salience of self using mirrors and dis- 
tracting tasks (Wicklund, 1975), others us- 
ing body orientation (Taylor & Fiske, 1975), 
and both self and others using videotapes 
(Storms, 1973) have all found the greater the 
salience, the stronger the attributions, 

Interestingly enough, there is an analogous 
perceptual illusion (Wallach, 1959). In a 
dark room, a point of light is surrounded by 
a luminous circle. The point is more salient 
than the circle by virtue of their figure- 
ground relationship. When the circle moved, 
subjects saw it as stationary, but they per- 
ceived the point to be moving in the other 
direction. For this illusion to occur, subjects 
must have been aware of the circle. Its move- 
ment is the only reason subjects perceived any 
motion at all. Likewise, for the engulfing bias 
to occur, subjects must be aware of the con- 
text. It is the basis for the perception of an 
anxious state. Just as the movement is attrib- 
uted to the more figural point, the emotional 
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state may be attributed to the most salient 
feature, which we presume is the target 
person. 

That this engulfing bias depends on sub- 
jects’ awareness of the context needs to be 
stressed. It answers Kelley’s (1971) criticism 
of earlier instances of the bias on grounds 
that context factors may not have been 
manipulated as strongly or clearly as was the 
behavior. 
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The direct use of verbal and physiological response measures to assess the 
cognitive component of a sociopolitical attitude is made difficult by the 
multiple meanings of sociopolitical stimuli. In the present study a modified 
differential classical conditioning procedure was used to produce an unambiguous 
physiological (skin conductance) response to stimuli related to a discrete 
social concept (black relatedness and not black relatedness). It was demon- 
strated that conditioned stimuli clearly related to this specific concept pro- 
duced a conditioned response. The results also showed that a series of more 
ambiguous test stimuli related to the 1972-1974 election campaigns produced 
a gradient of skin conductance responses that was related to the degree of 
black relatedness or not black relatedness of each stimulus as measured by 
other means. The implications of the results of this study to further research 


and other indicators of the cognitive component of attitude are discussed. 


The measurement of meaning is considered 
to be a serious problem confronting social and 
behavioral scientists who work in the area of 
individual behavior (Cohen, 1964; Converse, 
1964; Luttbeg, 1968; Marcus, Tabb, & Sulli- 
van, 1974). Until we can determine the 
meanings of particular social stimuli to the 
individual, we cannot know what the indivi- 
dual is responding to. 

Social and political stimuli carry multiple 
meanings and associations, thereby undermin- 
ing conventional methodologies for measuring 
attitudinal responses that assume a single, 
consensual meaning of stimuli. Meanings and 
associations that are less ambiguous and more 
socially acceptable are readily verbalized, 
whereas meanings and associations that are 
ambiguous and/or socially unacceptable are 
inhibited. It is not readily possible to over- 
come the repression of verbal expression 
through instruction or experimental controls. 
Orne (1962) and Rosenthal (1966) demon- 
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strated that the demand characteristics of the 
interview situation and the need for individu- 
als to be consistent in their responses and 
helpful to the interviewer often shape the 
subjects’ responses. If, in addition to these 
behavior motivations, the individual’s need 
to maintain the social desirability of his atti- 
tudinal responses is considered, it becomes all 
too likely that the analysis of his verbal re- 
sponses will produce rationalistic attitudinal 
structures that will reduce the possibility of 
predicting future behavior. 

Concern over the sole reliance on verbal 
response as a measure of the various com- 
ponents of attitude led to the suggestion 
(Cook & Selltiz, 1964; Shapiro & Crider, 
1969) that these components be evaluated 
independently by using physiological as well 
as verbal responses in a cross-modal design. 
The use of autonomically mediated, peripheral 
physiological measures in the evaluation of 
emotional and arousal states has been well 
documented (Greenfield & Sternbach, 1972; 
Prokasy & Raskin, 1973; Sternbach, 1968). 
A number of studies (Cooper & Seigel, 1956; 
Cooper & Singer, 1956; Porier & Lott, 1967; - 
Rankin & Campbell, 1955) have used both 
physiological and verbal response measures to 
record the presence of prejudicial racial atti- 
tudes. Although the results of these studies 
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suggest that physiological responses can be 
employed as general response measures of the 
perceived intensity of attitudinal stimuli, they 
do not clearly establish the relationship be- 
tween any specific attitudinal meaning (cog- 
nitive component) and the presence or ampli- 
tude of the physiological response. For ex- 
ample, it can be argued that the galvanic skin 
responses (GSRs) attributed by Rankin and 
Campbell (1955) to a prejudicial attitude 
toward the race of the experimenter were 
elicited instead by the general appearance 
(height and stature) of that black person. 
The strong possibility that physiological re- 
sponses produced by the simple presentation 
of verbal or visual social stimuli are not 
clearly related to a particular aspect or 
meaning of these stimuli makes it necessary 
to use experimental controls to ensure that 
the elicited responses are specifically related 
to that aspect of the stimulus that is of pri- 
mary interest to the study. 

One experimental approach to this problem 
is to use the principles of classical conditioning 
to produce an unambiguous stimulus—response 
relationship. If the meanings of social stimuli 
are considered as learned responses, the as- 
sumption may be made that the meanings 
attributed to social stimuli are learned in the 
same way and under the same conditions as 
are other behavioral responses (Campbell, 
1963; Skinner, 1953; Staats, 1969), and the 
classical conditioning paradigm then offers a 
straightforward approach to the experimental 
study of the cognitive component of social 
attitudes. That is, the established principles of 
the classical conditioning paradigm to stimu- 
lus-response relationships should apply, and 
the rules of generalization and discrimination 
should be maintained. 

Based on this assumption, it was demon- 
strated (Staats & Staats, 1957; Staats & 
Staats, 1958; Staats, Staats, & Heard, 1961) 
that semantic generalization could be used to 
condition Osgood, Suci, and Tannenbaum’s 
(1957) evaluative, potency, and activity com- 
ponents of meaning to nonsense words that 
had no meaning of their own, They were also 
able to demonstrate that evaluative attitudi- 
nal responses to socially significant objects 
(the name of a country or a proper name) 
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can be established or altered through the use 
of classical conditioning and that a common 
denotative component of meaning can be 
transferred to nonsense words. The results of 
these experiments indicate that classical con- 
ditioning can be used to transfer the com- 
ponents of meaning from one set of words 
to another and thereby alter attitudes toward 
socially significant stimuli. It has also been 
evidenced (Staats, Staats, & Crawford, 1962) 
that the conditioning of verbal meaning can 
be accompanied by a parallel conditioned 
GSR. Recently, Tognacci and Cook (1975) 
combined some of the principles of semantic 
generalization and physiological responsivity 
to demonstrate both the direction (approval— 
disapproval) and intensity of racial attitudes. 

Given these findings, it should therefore 
be possible to use classical conditioning pro- 
cedures to transfer a physiological (skin con- 
ductance) response, normally produced by an 
aversive physical stimulus (loud noise), to the 
specific attitudinal meaning of a series of 
verbal or visual stimuli. If such a relationship 
could be established, it would then become 
possible to use the elicitation and possibly 
the amplitude of this physiological response 
to evaluate the degree to which a specific 
attitudinal meaning is generalized to other 
more ambiguous social stimuli. 

The aim of the present study is twofold: 
first, to demonstrate that classical condition- 
ing can be used to elicit a consistent physio- 
logical response to unambiguous social or 
political stimuli that are clearly related to a 
specific social concept and, second, to test the 
extent to which a series of more ambiguous 
social or political stimuli will elicit this con- 
ditioned response. Specifically, we wish to 
determine the extent to which a number of 
political issues regarding the 1972 and 1974 
election campaigns are cognitively associated 
with race. We are also interested in estab- 
lishing the degree to which some of the 
denotatively nonracial issues of these elec- 
tion campaigns, such as crime in the streets, 
welfare, and war on poverty, carried strong 
racial overtones. 

If some of the campaign issues were cog- 
nitively linked to race, then such stimuli 
should produce a gradient of physiological 
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responses related to the degree of generaliza- 
tion and association of these stimuli to the 
conditioned concept, “black relatedness.” The 
amplitude of these conditioned physiological 
responses can be compared with verbal mea- 
sures of the degree of black relatedness at- 
tributed to each stimulus. This multiple-indi- 
cator approach to the study of attitude allows 
for the comparison of self-controlled verbal 
and uncontrollable psychophysiological re- 
sponses, 


METHOD 
Subjects 


Subjects were 24 white (12 male and 12 female) 
undergraduate students who were recruited and 
paid to participate in this study. 

Stimuli. A modified differential classical condition- 
ing paradigm employed black-related and not-black- 
related words and phrases as conditioned stimuli 
(CS) and a loud (100-db.) 1-sec burst of white 
noise as the unconditioned stimulus (UCS). The CSs 
were 60 words and phrases culled from 1972 to 
1974 newspaper stories. These words and phrases 
were judged by a group of 94 male and female un- 
dergraduate students who were instructed to use a 
5-point category scale to evaluate the degree to 
which each stimulus related to black people and 
black problems. The 15 items judged as most black 
related (mean rating=4.7) and the 15 judged as 
least black related (mean rating = 1.3) were se- 
lected as the CSs for this study. A sampling of 
black related CSs includes the following words and 
phrases: Harlem, Black Panther, slavery, sickle cell 
anemia, Afro-American, Negro, NAACP, CORE, Eb- 
ony Magazine, soul food, Martin Luther King, and 
Angela Davis. A sampling of the not black related 
CSs includes the following: Iceland, Caucasian, Irish 
Republican Army, Polish-American, Playboy Maga- 
zine, smorgasbord, Chinatown, Gloria Steinem, and 
Billy Graham. 

The test stimuli were five political issues of the 
1972 and 1974 campaigns that were hypothesized 
to be differentially black related. These issues were 
matched with five topically related items hypothe- 
sized to be not black related, 

Recording and data processing: Skin conductance 
was recorded through a Beckman skin conductance 
coupler (Model 9844) from a pair of silver-silver 
chloride electrodes placed on the first and third 
fingers of the left hand. Heart rate was recorded by 
means of a Beckman cardiotachometer (Model 9857) 
from a pair of standard electrocardiogram elec- 
trodes strapped to the upper arms. Stimuli were 
visually presented on a rear projection screen 
through a one-way mirror from a slide projector in 
the instrument room. The white noise (USASI spec- 
trum) unconditioned stimulus (UCS) was presented 
through a pair of Telephonic earphones (Model 
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TABLE 1 
Ten Porrricat Test STIMULI (T) 


T items hypothesized to be 


T items hypothesized to be 
not black related» 


black related» 


Watergate 
Defense budget 
Women’s rights 
Space program 
NATO 


Crime in the street 
Welfare budget 
Civil rights 

School busing 
War on poverty 


Note. NATO = North Atlantic Treaty Organization. 

“T+ for black conditioning group; T — for not black condi- 
tioning group. 

b T+ for not black conditioning group; T — for black condi- 
tioning group. 


TDH-39). Verbal communication was maintained 
with the subject by means of an intercom, and the 
subject was constantly observed through a closed- 
circuit television system, The presentation and tim- 
ing of trials and stimuli were programmed and 
controlled by use of a bank of Grason Stadler 
solid-state logic modules. All physiological and 
event data were recorded as a continuous paper 
record on a Beckman-Type RM dynograph and 
also on a Tandberg FM tape recorder (Model 100) 
for future computer analysis. 

Procedure, Subjects were seated in a comfortable 
lounge chair in a soundproof subject room in the 
Laboratory for Behavioral Research, and electrodes 
for recording skin conductance and heart rate were 
attached. Subjects were instructed that they would 
view a series of slides displaying words and phrases, 
some black related, some not black related. Subjects 
were assigned randomly to one of two groups: Half 
of them were told that some of the black-related 
items would be accompanied by a loud noise (for 
this group the black-related items were designated 
as CS+ trials); the other half were told that some 
of the not-black-related items would be accompanied 
by the loud noise (for this group, the not-black-re- 
lated items were designated as CS+ trials). 

There has been some disagreement about the effect 
of subject awareness in the conditioning of meaning. 
Cohen (1964) and Insko and Oakes (1966) claimed 
that conditioning of meaning occurs only when sub- 
jects are aware of the contingencies of the experi- 
ment. In a study comparing children aged 5-10 
years, O'Donnell and Brown (1973) reported that 
attitude conditioning improved with age and that the 
improvement appeared to be a function of both con- 
tingency awareness and the greater facility with 
which older subjects transferred symbolic meaning. 
However, these criticisms have been countered in a 
study by McGinley and Layton (1973), who used 
higher order conditioning to demonstrate that aware- 
ness was not a factor in the process of conditioning 
meaning. 

In the present study, awareness of the contingen- 
cies of the experiment was used in a positive man- 
ner. All subjects were clearly made aware of the 
relationship between the CS and the UCS. Specifi- 
cally, half of the subjects were instructed to expect 
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Ficure 1. Comparison of mean amplitude of skin 
conductance (SC) responses to black and not black 
conditioned stimuli (CS) by the group conditioned 
to respond to black-related stimuli (solid, or black, 
bars) and the group conditioned to respond to not- 
black-related stimuli (open, or white, bars). (Shaded, 
or striped, bars indicate the difference in mean SC 
response to CS+ and CS— trials for the combined 
black and not-black conditioning groups.) 


a pairing of the UCS with some of the black-related 
CS items; the other half were instructed to expect 
this pairing to occur with some of the not-black- 
related items. Unlike the studies by Staats and 
Staats (1957, 1958), we were not interested in the 
transfer of meaning from one stimulus to another, 
nor were we particularly interested in prejudicial 
attitudes. Instead, we were looking for responses 
that would clearly demonstrate the association of 
the test stimuli to the already established attitude 
of black relatedness and the possible use of response 
amplitude to classify these stimuli along a generali- 
zation gradient of association with black relatedness. 

To control for order effects, the 30 trials were 
arranged. in three orders, and each order was pre- 
sented to an equal number of subjects in both the 
black and not-black conditioning groups. The first 
10 trials (5 CS+- and 5 CS—) were presented before 
the first test trial to permit acquisition to take 
place. The 10 test trials were then interspersed 
among the remaining 10 CS+ and 10 CS— trials, 
Each trial lasted 10 sec, and on selected CS+ trials, 
after 5 sec had elapsed the UCS was presented for 
1 sec. 

This conditioning procedure was used for two 
reasons: First, it provided a 10-sec period of ex- 
posure to each visual CS, which would ensure that 
all subjects would have time to read, understand, 
and respond to the stimulus material independent 
of their immediate focus of attention or reading 
speed; second, it maintained a CS-UCS interval that 


was as short as possible, which enhanced the prob- 
ability of conditioning while permitting enough time 
for the skin conductance response to occur. 

At the end of the session, subjects evaluated on a 
5-point category scale the extent to which each CS 
and political test stimulus (T) stimulus pertained to 
black people and black problems. 

Measures and analysis. Physiological responses 
were digitized and measured by means of a Digital 
Equipment Corporation PDP/12-30 laboratory com- 
puter. All trials were sampled by the computer’s 
analog-digital converter at the rate of three sam- 
ples/sec. Sampling was initiated by a trigger pulse 
on the FM tape 5 sec before each trial to provide 
physiological baseline measures for each trial. The 
digitized information was stored on computer tape 
for future analysis. 

Skin conductance response measures were recorded 
as the square root of the change in conductance (in 
micromhos) between the last pretrial sec and the 
maximum conductance in the first 5 sec of the CS 
trial. In the test trials, a similar measure was ob- 
tained for both the first 5 sec and last 5 sec of each 
trial. Response measures were also obtained for 
heart rate, These physiological response measures 
and the responses to the questionnaires constitute 
the data available for analysis. Three-way (Group X 
Condition X Trial) analyses of variance for repeated 
Measures were performed for each of the periods of 
interest, and Spearman rank-order correlations were 
used to assess the relationship between the physio- 
logical and questionnaire data. 


Results 


The bar graphs in Figures 1-3 show the 
relationship of the amplitude of mean skin 
conductance responses for each of the two 
experimental groups to the presentation of 
their conditioned and test stimuli, Figure 1 
shows the difference in the mean amplitude 
of skin conductance responses to both black 
and not-black conditioned stimuli by both the 
group conditioned to respond to black-related 
stimuli (solid, or black, bars) and the group 
conditioned to respond to not-black-related 
stimuli (open, or white, bars). The cross- 
hatched bars indicate the difference in mean 
skin conductance responses to CS-+ and CS— 
trials for the combined black and not-black 
conditioned groups. The data reported for 
these trials is related to the skin conductance 
response produced in the first 5 sec of each 
10-sec trial. Response information for the last 
5 sec of the CS+ trials is not reported be- 
cause it was confounded by the response to 
the UCS, which was initiated in the sixth sec 
of each CS+ trial. 
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‘The CS trials chosen for analysis were the 
trials adjacent to each of the test trials, thus 
ensuring that the possible response interac- 
tions between CS and test trials were held 
constant. It is obvious from this figure that 
a substantial difference in mean response 
amplitude between CS+ and CS— trials is 
observed for both conditioning groups as well 
as for the combined groups. The results of 
the analyses of variance for this period indi- 
cate a highly significant main effect for 
condition, F(1, 22) = 11.9, p < .003, with no 
significant main effect for group or trial nor 
any significant interaction among the three 
factors. The lack of group and trial effects in 
this analysis indicates that each set of black- 
related and not-black-related stimuli selected 
as conditioned stimuli for this study shares 
a strong, race-specific meaning within its own 

„category. The results allow us to treat the 
black-related and not-black-related words and 
phrases as CSs, despite their differences in 
denotative meaning, thereby making it possi- 
ble to use them as differential conditioned 
stimuli in a manner similar to the use of such 
simple physical stimuli as tones and lights in 
the bulk of the classical conditioning research. 
Figure 1 further indicates that the black- 
related and not-black-related conditioned stim- 
uli do not produce significantly different re- 
sponse effects when they are used as CS+ and 
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do not appear to differentially contribute to 
the combined main effect for condition. 
Figure 2 shows the mean skin conductance 
response amplitude produced by the black- 
related and not-black-related conditioning 
groups to the presentation of the 10 political 
test stimuli shown in Table 1. To provide 
data comparable to the CS trials, the response 
amplitude of the first 5 and second 5 sec of 
these test trials were analyzed separately. 
Figure 2a shows the difference in mean skin 
conductance response amplitude for each con- 
ditioning group as well as for the combined 
groups for the five political stimuli hypothe- 
sized to be black related and the five political 
stimuli hypothesized to be not black related, 
The analyses of variance for the skin con- 
ductance response on the data from the first 
5 sec produced a significant, F(1, 22) = 74, 
p<.01, main effect for condition and no 
other significant main effect or interactions. 
Figure 2b shows the difference in mean skin 
conductance response amplitude between the 
hypothesized black-related and _not-black- 
related political stimuli for the last 5 sec of 
the test trials. The analysis of variance re- 
sults demonstrate a highly significant, F (4, 
22) = 10.7, p < 004, main effect for condi- 
tion, and in this instance, the three-way 
Group X Condition X Trial interaction ap- 
proached significance, F(4, 44) = 2.2, P< 
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Ficure 2. Comparison of the mean skin conductance (SC) response amplitude produced by the 


black (solid, or black, bar) and not-black (open, 


black-related and political not-black-related stimuli. 
SC response to test stimuli (T+ and T—) trials for 
data for the first 5 sec, 1-5, and Figure 2b presents the data for 


or white, bar) conditioning groups to the political 


Shaded bars indicate the difference in mean 
the combined groups. (Figure 2a presents the 
the last 5 sec, 6-10.) 


TURSKY, LODGE, FOLEY, REEDER, AND FOLEY 


870 
A B 
1.1 
1.0} š 
SF 5 
8+ H 
a 
2.7 H 
E 
a 6p s 
B st L 
Ar H 
sor F 
Aa L 
A L 
CIVIL WELFARE CRIME WARON SCHOOL NATO DEFENSE WATERGATE WOMEN'S SPACE 
RIGHTS BUDGET POVERTY BUSING BUDGET RIGHTS PROGRAM 


Ficure 3. Comparison of the mean skin conductance (SC) response amplitude to each of the 10 
political test stimuli for the black (solid, or black, bar) and not-black (open, or white, bar) condi- 
tioning groups. (Figure 3a shows the comparison of SC responses to the political black-related 
stimuli, and Figure 3b shows the same comparisons to political not-black-related stimuli.) 


08. For each conditioned group, a separate 
two-way analysis of variance of the skin con- 
ductance responses for the 6-10-sec period 
showed that the three-way interaction was 
due primarily to the responses of the black- 
related conditioning group, F(4,44) = 2.32, 
p < .06. For the not-black-related condition- 
ing group, skin conductance responses for 
positive and negative political test stimuli 
(T+ and T—) produced no interactions. 

This effect can be explained by noting the 
difference in amplitude between the average 
T— responses of the two conditioning groups. 
The not-black-related conditioning group pro- 
duced larger average skin conductance re- 
sponses to the political black-related T— 
stimuli than the black-related conditioning 
group produced to the political not-black-re- 
lated T— stimuli, although both groups pro- 
duced equally large average conditioned skin 
conductance responses to their respective T+ 
stimuli. These results suggest that the black- 
related political items were more physiologi- 
cally sensitizing than the not-black-related 
political items. 

Figure 3 shows the skin conductance re- 
sponse amplitude for each of the political test 
stimuli produced by both the black-related 
(solid, or black, bar) and not-black-related 
(open, or white, bar) conditioning groups. 
The stimuli are arranged in two groupings 
ordered by amplitude of skin conductance re- 


sponse. Figure 3a shows the comparison of 
skin conductance responses with the black- 
related political stimuli for both the black- 
related and not-black-related conditioning 
groups, and Figure 3b shows the same com- 
parison of skin conductance responses with 
not-black-related political stimuli. In all but 
one case, the amplitude of the averaged con- 
ditioned skin conductance response was 
greater in the hypothesized direction. 

In the black-related condition (Figure 3a), 
the magnitude of skin conductance responses 
was greater for crime in the streets, war on 
poverty, civil rights, and welfare budget, all 
issues hypothesized to be associated with 
black people, issues, or problems. Conversely, 
in the not-black-related condition (Figure 
3b), the magnitude of skin conductance re- 
sponses was greater for Watergate, North 
Atlantic Treaty Organization (NATO), wom- 
en’s rights, space program, and defense bud- 
get, which are all stimuli hypothesized to be 
unrelated to black issues. 

T— tests comparing the skin conductance 
responses with each of the 10 stimuli between 
the black-related and not-black-related condi- 
tioning groups were calculated. Six of the 
differences are statistically significant at the 
p < .05 level. 

The Spearman rank-order correlations be- 
tween the rank ordered mean category scale 
responses and the rank ordered skin conduc- 
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tance responses produced statistically signifi- 
cant results. The correlations between cate- 
gory and skin conductance response for the 
black-related conditioning group was .78 ($ 
< .01), while the comparison of these re- 
sponses for the not-black-related conditioning 
group was .79 (p < .01). These results indi- 
cated a strong positive relationship between 
the category and skin conductance responses 
in identifying the political test stimuli as be- 
ing black related or not black related. The 
correlation of category scale responses be- 
tween the black-related and not-black-related 


‘conditioning groups was .92 (p < .01), indi- 


cating that the conditioning procedure that 
subjects experienced immediately prior to scal- 
ing the stimuli had no deleterious effect on 
their ability to rate the stimuli or on the in- 
tensity of these ratings. 


Discussion 


The results of the present study demon- 
strate the feasibility of using classical condi- 
tioning procedures to produce an unambigu- 
ous physiological response that is clearly 
related to a specific previously established 
attitude or meaning. The preexperimental 
questionnaire data demonstrate that the con- 
cept of black relatedness was firmly estab- 
lished in this subject population prior to their 
participation in this study. This prior learn- 
ing provided each subject with a cognitive 
set of associates that were then used to evalu- 
ate incoming stimuli. The racial association 
(black and not black) of the words and 
phrases used in this study as CSs was readily 
identified by the subjects in both condition- 
ing groups. When these stimuli were used as 
CS+ items (paired with a UCS), they elic- 
ited skin conductance responses in each group 
that could clearly be differentiated from the 
responses produced by the same stimuli when 
they were presented as CS— items. These 
differentially conditioned responses demon- 
strate that words and phrases that automati- 
cally activate a common interpretation or 
meaning can be used as conditioned stimuli 
and that physiological responses normally 
elicited by aversive stimulation can be trans- 
ferred to such CSs. 

Haying demonstrated that classical condi- 
tioning of a physiological response to a gen- 


‘ 


ko 


eral social concept (black relatedness) is 
feasible, the next logical step in the process 
is to determine the extent to which this con- 
cept is associated with other social stimuli. 
The test items used in the present study, all 
of which are denotatively nonracial, did in- 
deed produce skin conductance responses that 
seemed to reflect the strength of their associa- 
tion to the conditioned concept. Figures 2a 
and 2b show that both the black-related and 
not-black-related political test items produced 
a highly significant mean skin conductance 
response in the appropriate black-related and 
not-black-related conditioning groups, thus, 
indicating the generalization of these response 
to the social concept “black relatedness.” A_ 
comparison of the results shown in Figures 
2a and 2b shows that the responses to T+! 
stimuli are indeed conditioned responses. The 
increase in the amplitude of skin conductance 
response to the T+ stimuli in the second 5 
sec (Figure 2b) of the test trials indicates a 
strong conditioned.efmct. In all cases, the 
response to the ^ — stimuli in the first 5-sec 
period shows only the orienting response ex- 
pected of novel stimuli; no further increase 
is shown in the last 5 sec of the T— trials. 

A related question of interest is whether the 
mean amplitude of skin conductance responses 
elicited by specific stimuli can be used to indi- 
cate the degree of association of each stimulus 
to the basic concept of interest. The data in 
Figure 3 show several indications of an or- 
derly response continuum that relates respons; 
amplitude to degree of generalization. In t 
black-related conditioning group, response t 
the black-related political stimuli—most dr: 
matically, the stimulus civil rights—produce! 
a larger mean amplitude skin conductance 
response than any of the other test stimuli. 
The amplitude of the remaining stimuli in 
this category demonstrates a reasonable 
gradient of generalization response, whereas 
the not-black-related conditioning group’: 
responses to the not-black-related politic: 
stimuli are less clearly differentiated. 

These results suggest the utility of condigy 
tioned skin conductance responses in evaluat: 
ing the relationship between a concept and 
its associates. These results also support to 
some extent the application of physiological 
response amplitude within a classical condi- 
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tioning paradigm to quantify the degree of 
association between each stimulus and the 
concept. 

Although the significant rank-order corre- 
lations between the verbal and physiological 
responses to the test stimuli indicate a posi- 
tive statistical relationship between these mea- 
sures, it can be argued that the use of 5- 
point category scales to judge and select 
these items may not provide a strong enough 
measure of intensity to permit the accurate 
assessment of the stimulus items. The evalua- 
tion of test stimuli and the equating of CSs 

_ could be improved by means of cross-modal 
psychophysical scaling techniques (Cross, 
1974; Lodge, Cross, Tursky, & Tanenhaus, 
1975). The use of these psychophysical scal- 

“ig techniques should provide a better basis 

= or the selection of stimuli and provide a 
stronger test of the relationship between sub- 
jective and physiological measures of associ- 
ition, 

The results of this study are not meant to 

mply that classical conditioning is the only 
iethod available for the evaluation of the 
weaning of complex stimuli. Classical condi- 
oning is only one method for measuring the 
xtent to which the meaning of a stimulus is 
eneralized to and distinguished from other 
omplex stimuli. The cognitive content and 
tructure of the meaning of complex stimuli 
could also be inferred from the results of mul- 
tidimensional scaling (e.g., proximity analy- 
sis) or factor analyses (Carroll & Chang, 
1970; Marcus et al., 1974; Shepard, 1962), 
or one or another variety of Stroop technique 
(Dyer, 1973; Posner, 1973). 

It seems reasonable to suggest that the best 
‘approach to the measurement of meaning is 
fe~id in a multiple-indicator, cross-modal 
a} roach using the best of all of these mea- 


St es. 
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Distraction Can Enhance or Reduce Yielding 
to Propaganda: Thought Disruption 
Versus Effort Justification 


Richard E. Petty, Gary L. Wells, and Timothy C. Brock 
The Ohio State University 


Two experiments were conducted to test competing accounts of the distraction- 
persuasion relationship, thought disruption and effort justification, and also 
to show that the relationship is not limited to counterattitudinal communica- 
tion. Experiment 1 varied distraction and employed two discrepant messages 
differing in how easy they were to counterargue. In accord with the thought 
disruption account, increasing distraction enhanced persuasion for a message 
that was readily counterarguable, but reduced persuasion for a message that 
was difficult to counterargue. The effort notion implied no interaction with 
message counterarguability. Experiment 2 again varied distraction but the two 
messages took a nondiscrepant position. One message elicited primarily favor- 
able thoughts and the effect of distraction was to reduce the number of 
favorable thoughts generated; the other, less convincing message elicited pri- 
marily counterarguments, and the effect of distraction was to reduce counter- 
arguments. A Message X Distraction interaction indicated that distraction 
tended to enhance persuasion for the counterarguable message but reduce i 
persuasion for the message that elicited primarily favorable thoughts. The 

experiments together provided support for a principle having greater gen- 

erality than the Festinger-Maccoby formulation: Distraction works by in- 

hibiting the dominant cognitive response to persuasive communication and, 

therefore, it can result in either enhanced or reduced acceptance. 


arguing, thereby weakening resistance to the 
message. 


Festinger and Maccoby’s (1964) well- 
known demonstration that distraction in- 


| 


\ 
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creases acceptance of propaganda assumed 
that a person tends to engage in active 
counterarguing when confronted with a per- 
suasive message with which he disagrees, and 
that distraction could inhibit such counter- 
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The present study had two aims. The first 
aim was to examine a more general formula- 
tion than that proposed by Festinger and 
Maccoby (1964). We reasoned that if the 
dominant cognitive response to a communica- 
tion was counterarguing, then distraction 
would lead to enhanced persuasion by inter- 
fering with the counterarguing process; but, 
if the dominant cognitive response to a com- 
munication was agreeing or favorable cogni- 
tive responses rather than counterarguments, 
distraction would inhibit these favorable 
thoughts and lead to lowered acceptance. 

The second aim was to evaluate two com- 
peting explanations of distraction-acceptance 
effects, namely thought disruption versus 
effort justification. Numerous experiments em- 


1 Festinger and Maccoby (1964) included condi- 
tions in their experiment in which agreeing comments 
might be expected, but no significant effects due to 
distraction were obtained for these groups, and none 
were hypothesized. 
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’ ploying a wide variety of distractions and a 
diverse list of message topics have replicated 
the original Festinger and Maccoby result 
(e.g., Insko, Turnbull, & Yandell, 1975; Keat- 
ing & Brock, 1974; Kiesler & Mathog, 1968; 
Osterhouse & Brock, 1970; Rosenblatt, 1966; 
Rule & Rehill, 1970; Silverman & Regula, 
1968; Zimbardo & Ebbesen, 1970; Zimbardo, 
Snyder, Thomas, Gold, & Gurwitz, 1970). 
However, the nature of the mediating process 
is still disputed by some. Osterhouse and 
Brock (1970), Keating and Brock (1974), 
and Insko et al. (1975) have provided the 
strongest support for the thought disruption 
hypothesis; using a procedure developed by 
Brock (1967) and Greenwald (1968) for the 
direct measurement of cognitive responses, 
these studies showed that as the level of dis- 


~ traction increased, acceptance increased, while 


the number of counterarguments decreased. 
' Baron, Baron, and Miller (1973) recently 
reviewed many of the distraction-persuasion 
studies mentioned above as well as some un- 
published data, and concluded that there was 
only one explanatory mechanism which could 
rival the counterargument disruption hypothe- 
sis. This alternative explanation stems from 
the theory of cognitive dissonance (Festinger, 
1957). Baron et al. (1973) argued that in a 
typical distraction experiment, subjects per- 
ceive that they have chosen to participate in 
an experiment which requires that they exert 
effort in order to attend to a discrepant mes- 
sage. Subjects can justify this effort expendi- 
ture by adopting the attitude expressed in the 
communication (Cohen, 1959; Wicklund, 
Cooper, & Linder, 1967). Baron et al. (1973) 
contended that all of the distraction manipu- 
lations to date have been confounded with 
effort, and further that this confounding may 
be unavoidable. They then suggested that “the 
principal means of circumventing this sort of 
methodological problem is to devise an experi- 
ment for which the contending explanations 
make competing predictions” (p. 320). 

To meet these two aims, two experiments 
were conducted. Experiment 1 employed dis- 
crepant messages and was primarily designed 
to deal with the effort challenge to the 
thought disruption hypothesis by devising a 
situation in which the two theories would 


875 


make competing predictions.” Experiment 2 
was primarily designed to test the more gen- 
eral distraction formulation: that distraction 
works by disrupting the dominant cognitive 
response to propaganda, and thus could either 
enhance or reduce persuasion, 


EXPERIMENT 1 


Two discrepant messages were specifically 
constructed for Experiment 1. Both messages 
argued that the tuition at Ohio State Univer- 
sity should be increased by 20%, from the 
then current $266 per quarter to $320 per 
quarter, but the messages differed in their 
presentation of five key arguments. One mes- 
sage was designed to contain points that were 
logically sound, easily defendable, and more 
compelling than previous tuition increase 
propaganda (e.g., Osterhouse & Brock, 1970) 
and thus be dificult to counterargue. Another, 
easy-to-counterargue message, was designed to 
be more open to refutation and skepticism, 
and be less compelling than previous mes- 
sages. 

Given that the messages differ predomi- 
nantly in how easy or difficult they are to 
counterargue, it becomes possible to evaluate 
the two major explanations of the distraction- 
acceptance effect. The effort justification ex- 
planation, as derived from dissonance theory, 
states that with increasing levels of distrac- 
tion (and thus effort), there is increased 
acceptance of both messages since for each 
message, subjects have to justify their effort 
expenditure. Besides this main effect predic- 
tion favored by Baron (Note 1), a different 
effort prediction is possible, if one considers 
the quality of the message arguments in con- 
junction with the effortful distraction. Since 
hearing good arguments advanced against 
one’s position should be more dissonance 
arousing than hearing poor arguments (Klein- 
hesselink & Edwards, 1975; Lowin, 1967), it 
can be predicted that subjects hearing the 
difficult-to-counterargue communication will 


? The “competing predictions” approach was taken 
earlier in an experiment by Kiesler and Mathog 
(1968), but see the Baron et al. (1973, p. 320) cri- 
tique arguing that both theories could handle the 
obtained results. 
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experience more difficulty justifying their 
effort than those hearing the easy-to-counter- 
argue message. This line of reasoning predicts 
that the slope of agreement as a function of 
distraction (effort) will be steeper for the 
difficult-to-counterargue message. In any case, 
effort hypotheses expect increased acceptance 
with increased distraction for both messages. 

The counterargument disruption hypothesis, 
on the other hand, would expect an enhanc- 
ing effect only for the easy-to-counterargue 
message, because presumably only for this 
message are counterarguments generated that 
can be disrupted by distraction. In an experi- 
ment designed to test an unrelated hypothesis, 
Regan and Cheng (1973) obtained data rele- 
vant to the current hypothesis. Regan and 
Cheng constructed two messages on each of 
two topics and found that distraction en- 
hanced persuasion for “simple” (short, easily 
understood, but unconvincing) messages, but 
decreased persuasion for “complex” (long, 
difficult to understand, but convincing if un- 
derstood) messages. However, due to the con- 
founding of at least three variables (length 
of message, ease of understanding, and per- 
suasiveness) the results offer only tentative 
support for the argument that distraction 
tended to enhance persuasiveness for the sim- 
ple messages, because counterarguments were 
disrupted; and reduce persuasion for the com- 
plex messages, because favorable thoughts 
were disrupted. 

The following predictions were made for 
Experiment 1. More counterarguments would 
be generated to the easy-to-counterargue mes- 
sage than to the difficult-to-counterargue mes- 
sage, distraction would inhibit counterargu- 
mentation only for the easy-to-counterargue 
message, and a significant Message X Dis- 
traction interaction would be found on the 
attitude dependent variable indicating that 
distraction increased acceptance for the easy- 
to-counterargue message, but not for the 
difficult-to-counterargue message. Thus, the 
results were expected to support thought dis- 
ruption rather than effort justification. 


Method 
Procedure 
One hundred and thirty-two introductory psychol- 


ogy students at The Ohio State University (0.S.U.) 
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chose to participate in the experiment by selecting it 
from over 15 other possible experiments on a “sign- 
up” bulletin board. Students are informed in their 
introductory classes that they have the right to 
leave any experiment in progress and still receive 
credit that partially fulfills a course requirement. 
Additionally, subjects may fulfill the course require- 
ment by writing a term paper rather than by experi- 
mental participation. Thus, the level of choice felt 
by experimental subjects was at least as high (or 
higher) than that for subjects participating in pre- 
vious distraction-persuasion experiments. 

The design was a 4X22 factorial with four 
levels of distraction (no, low, medium, and high), 
two levels of message counterarguability (easy and 
difficult to counterargue), and two orders of the 
dependent variable booklet (list thoughts before ex- 
pressing attitude, and express attitude before listing 
thoughts). Subjects were run in groups of 6 to 12 in 
cubicles constructed so that no subject could have 
visual contact with any other subject. During any 
one session, in which one level of distraction was 
run, half of the subjects heard the easy-to-counter- 
argue message over headphones, while half of the 
subjects heard the difficult-to-counterargue message. 
After listening to the communication under the 
appropriate distraction condition and completing the 
dependent variable booklets, subjects were debriefed, 
thanked, and dismissed. 


Independent Variables 


Message counterarguability. In brief, the difficult- 
to-counterargue message stated that the tuition hike 
was recommended after an intensive 2-year investi- 
gation, that Ohio was 50th in state expenditures on 
education, that improvements were needed in the 
library system, that a mass transit system could be 
initiated, and studies had proven that if O.S.U. could 
upgrade its quality, the average salary of its gradu- 
ates would increase substantially, The easy-to- 
counterargue message stated that the tuition increase 
was recommended after a 2-month study, that Ohio 
was 10th in expenditures on education, that more 
trees should be planted on campus, that the univer- 
sity should start a bus system, and that improved 
lighting was needed in classrooms to cut down on 
student headaches. Each message was approximately 
3 minutes long. Subjects’ ratings in a pretest indi- 
cated that the messages did not differ in the extent 
to which they were “difficult to understand,” were 
“hard to follow,” or possessed “complex structure.” 

Distraction and effort. On arrival, each subject was 
handed a “monitor recording form.” The distraction 
task required subjects to monitor a large screen 
which hung from the ceiling in front of the room. 
The screen was from 2 to 5 m in front of the sub- 
jects, which required them to view it at an angle of 
approximately 45°. The subjects were informed via 
prerecorded instructions that the experiment con- 
cerned the ability to do two things at once. The 
subjects were told that while they listened to the 
message, an X would flash periodically in one of 
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the four quadrants of the screen. Their task was to 
record on “monitor recording forms” in which 
quadrant the X appeared. If the X appeared in the 
upper left-hand quadrant, they were to record the 
letters “UL” to signify upper left; if upper right, 
“UR”; if lower left, “LL”; and if lower right, 
“LR.” 

Twenty-five slides were prepared for each quad- 
rant, and the slides were randomly ordered when 
placed in the projector. For subjects in the low-dis- 
traction conditions, the Xs flashed at 15-sec intervals 
throughout the message. For medium distraction, the 
Xs flashed at 5-sec intervals, and in the high-distrac- 
tion conditions the Xs flashed at 3-see intervals. 
Each slide remained viewable until the next slide 
appeared, Subjects in the no-distraction conditions 
were given instructions identical to those in other 
conditions, but were told that “For now, no Xs will 
flash while you are listening to the message.” Sub- 
jects in the other distraction conditions were not told 
how many flashes to expect during the message. All 
subjects were instructed to try to pay close atten- 
tion to the message. 

It is important to note that the manipulation of 
distraction was also intended to be a manipulation 
of effort expenditure. Thus, the conditions might 
alternatively have been called no, low, medium, and 
high effort. 

Order of the dependent variable booklets. For half 
of the booklets, subjects answered the communication 
acceptance ratings before they recorded their thoughts 
and ideas on the message; for the other half, this 
order was reversed. This was done to check on the 
possibility raised by Miller and Baron (1973) that 
the awareness of one’s attitude, as made salient by 
the attitude scales, would influence subsequent 
counterargument production. 


Dependent Variables 


Measures of attitude. Two measures of attitude 
toward the proposed tuition increase were included. 
The first question was general and asked, “In gen- 
eral, to what extent did you agree with the speaker’s 
recommendations?” Subjects were to respond on a 
7-point scale where 7 indicated “completely agree.” 
The second question was specific and asked, “The 
current tuition at O.S.U, is $266. What do you think 
the appropriate level should be?” Subjects were to 
record a dollar figure in a blank beside the question. 
Subjects’ responses to the two attitude questions 
were converted to standard scores and summed. 

Measures of cognitive responses. Employing a pro- 
cedure adapted from Brock (1967) and Osterhouse 
and Brock (1970), subjects were given 24 minutes to 
write their thoughts on the topic of increasing tui- 
tion after they had listened to the message.* After 
recording their thoughts, subjects were instructed to 
rate their ideas in a manner adapted from Cullen 
(1968) and Cialdini, Levy, Herman, Kozlowski, and 
Petty (in press). Subjects rated their ideas as 
either + (in favor of a tuition increase), — (op- 
posed), or O (neutral). Each idea that a subject 
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wrote was submitted to 2 judges for scoring as either 
a counterargument or a favorable thought. The 
judges agreed on 92% of the statements. In cases of 
disagreement between judges, the subject’s rating was 
employed. Thus, each subject was assigned a counter- 
argument and a favorable thoughts score by the 
judges. Only the judge-assigned scores were analyzed. 

Measures of recall. When subjects had completed 
both the attitude- and thought-listing sections of the 
booklet, they were given 2 minutes to attempt to 
record as many message arguments as they could 
remember. Each booklet was rated by two judges 
(r=.86). An argument had to correctly summarize 
one of the arguments that appeared in the appro- 
priate message to be counted. Repetitions of the same 
argument were not counted. When judges disagreed 
in their independent ratings, ratings were obtained 
from two additional judges and a majority decision 
was reached among the four. 


Unequal n and Missing Data 


Unequal cell size resulted from subjects failing to 
keep appointments. Cell ns ranged from 5 to 11, Un- 
expectedly, 10 subjects failed to respond to one or 
more questions. Rather than eliminate these sub- 


3 Roberts and Maccoby (1973) suggested that 
measures of counterarguing taken during presentation 
of a message “are more easily viewed as possible 
mediators” of attitude change “since they precede 
subjects’ taking an opinion position” (p. 294). Since 
the current study involved distraction, and it was 
hypothesized that the distraction would interfere with 
subjects’ processing of the message, it would have 
defeated the purpose of the distraction to have 
given subjects an opportunity to collect and record 
their thoughts during the presentation of the mes- 
sage. In any case, half of the subjects were given an 
opportunity to express their thoughts after the mes- 
sage, but before they responded to the attitude 
scales. 

*Counted as counterarguments were statements 
directed against a tuition increase which mentioned 
specific unfavorable consequences of raising the tui- 
tion, statements which suggested alternative methods 
of raising money instead of increasing tuition, state- 
ments which challenged the validity of arguments 
raised in the message, and statements of affect op- 
posing the tuition increase. Counted as favorable 
thoughts were statements in favor of the increase 
which mentioned specific favorable consequences of 
raising the tuition, statements ruling out alternative 
ways of raising the money, statements which sup- 
ported the validity of arguments raised in the mes- 
sage, and statements of affect supporting the tuition 
increase. Similar examples of the same statement 
were counted once, The correlation between judges’ 
ratings of counterarguments and subjects? ratings 
was .79 (p<.01), and between judges’ ratings of 
favorable thoughts and subjects? ratings, .82, (p< 
.01). 
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jects from all analyses, it was decided to eliminate 
a subject from an analysis only for those measures 
for which no response was recorded. 


Results 
Independent Variables 


Order of the dependent variable booklet. It 
was not expected that the order of the book- 
lets would make any difference in the pattern 
of results. A 4X 2X2 unweighted-means 
analysis of variance on each dependent varia- 
ble generally confirmed this expectation. Main 
effects and interaction effects involving order 
were generally uninteresting and irrelevant 
for the purposes of this study (Petty, 1975). 
Therefore, on dependent variables for which 
order had no effects, the data were collapsed 
across order in subsequent analyses. 

Message counterarguability. An average of 
2.06 counterarguments was generated to the 
difficult-to-counterargue message, while an 
average of 2.89 counterarguments was gen- 
erated to the easy-to-counterargue message, 
F(1,124) = 5.81, p < .02. Furthermore, of 
the total number of written thoughts in the 
no distraction condition, 61% were coded as 
counterarguments in the easy message condi- 
tion and 37% in the difficult message condi- 
tion. 

An average of 2.21 favorable thoughts were 
generated to the difficult-to-counterargue mes- 
sage, and 2.06 favorable thoughts were gen- 
erated to the easy-to-counterargue message. 
There were no significant differences on this 
measure or on the total number of thoughts 
generated. No interactions on the cognitive 
response measures were significant. In sum, 
Experiment 1 conditions produced differential 
counterarguing, and in the easy message con- 
dition, the dominant cognitive response was 
counterarguing. 

The manipulation of distraction and effort. 
The analysis of variance on the distraction 
manipulation check yielded a significant F(3, 
116) = 19.84, p < .001 for the distraction 
factor. On a 7-point scale where 7 indicated 
that a subject was “completely distracted” 
from paying attention to the message, the 

for the no-, low-, medium-, 
racted subjects were 2.21, 3.30, 
respectively. 


mean respons‘ 
and highly 
4.16, and 4.81, 
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Because of unequal cell frequencies, and 
unequal spacing of the levels of distraction 
measured in slides per minute, all linear trend 
tests in this report were conducted according 
to Gaito (1965). A significant linear trend was 
found for the ratings of distraction, F(1, 116) 
= 52.80, p < .001. 

Responses to the question, “To what extent 
did you find the task effortful?” showed that 
the distraction manipulation was successful in 
varying perceived effort. On a 7-point scale 
where 1 indicated “completely effortful,” the 
means for the no-, low-, medium-, and highly 
distracted subjects were 4.62, 4.80, 3.68, and 
2.64, respectively. Again, an analysis of vari- 
ance for the distraction factor yielded a sig- 
nificant F ratio, F(3, 116) = 16.28, p < .001, 
and a significant linear trend, F(1, 116) = 
42.67, p < .001. In addition, a Message X 
Distraction interaction, /(3, 116) = 2.85, p 
< .05, was obtained. Although the linear trend 
interaction was not significant, the pattern of 
means indicated that rated effort increased 
more sharply with distraction for the difficult- 
than for the easy-to-counterargue message. In 
any case, these results clearly indicate that 
the faster the Xs flashed on the screen, the 
more distracted subjects reported feeling and 
the more effortful they rated the task. 


Dependent Variables 


Communication acceptance, An unweighted- 
means ANOVA on the summed standard 
scores yielded two effects: a main effect for 
message, F(1, 119) = 5.23, p < .02, indicat- 
ing that the difficult-to-counterargue message 
(X = .33) produced more agreement than the 
easy-to-counterargue message (X = —.31); 
and, the predicted Message X Distraction in- 
teraction, F(3,119) = 2.97, p < .03. The top 
panel of Figure 1 indicates that the easy-to- 
counterargue message shows the much repli- 
cated effect of increasing communication ac- 
ceptance with increasing levels of distraction. 
A test of linear trend on the means (—.84, 
—.46, —.17, and .23) proved significant, F(1, 
119) = 3.96, p < .05. The difficult-to-counter- 
argue message shows an opposite pattern— 
decreasing acceptance with distraction. A test 
of linear trend on the means (.66, .95, —.19 
and —.11) indicated a significant negative 
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Fıcure 1. Top panel: mean attitude scores in relation to message and level of distraction for 
Experiment 1. Bottom panel: mean attitude scores in relation to message and level of distraction 


for Experiment 2. 


slope, F(1,119) = 4.49, p < .05. In addition 
the linear trend interaction was highly signifi- 
cant, F(1, 119) = 8.39, p < 01. 

Distraction and thought production. In the 
easy-to-counterargue condition, the mean num- 
bers of counterarguments for the no-, low-, 
medium-, and highly distracted groups were 
3.75, 3.39, 2.65, and 1.72, respectively. The 
linear trend on these means was marginally 
significant F(1, 124) = 3.49, p < .10. In the 
À difficult-to-counterargue condition, the corre- 

sponding means were 2.43, 2.22, 1.68, and 
1.90. The linear trend on these means was not 
significant (F < 1). The main effect for dis- 
traction was reliable, F (3,124) = 2.95, p< 
.05. Using the Dunn multiple comparison 


am 


procedure (Kirk, 1968, p. 79), the amount of 
counterarguments generated at no and high 
distraction was compared for both easy- and 
difficult-to-counterargue messages. The tests 
indicated that for the easy-to-counterargue 
message, the difference between no and high 
distraction was reliable (p < .05), but there 
was no significant difference for the difficult- 
to-counterargue message in number of counter- 
arguments. 

In the difficult-to-counterargue condition, 
the mean numbers of favorable thoughts for 
the no-,"low-, medium-, and highly distracted 
groups were 2.56, 2.83, 2.47, and 1.00, re- 
spectively. The linear trend for these means 
only approached reliability F(1, 124) = 2.46, 
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p< .15. In the easy-to-counterargue condi- 
tion, the corresponding means were 1.62, 2.22, 
2.21, and 2.18. The linear trend on these 
means was not reliable (p > .25). Comparing 
the number of favorable thoughts generated 
at no and high distraction for both messages 
using the Dunn procedure indicated a near 
significant drop in favorable thoughts for the 
difficult message (P < .10), but not for the 
easy message. In sum, the number of counter- 
arguments elicited by the easy-to-counterargue 
message and the number of favorable thoughts 
elicited by the difficult-to-counterargue mes- 
sage tended to be inhibited by the distraction 
manipulation as expected by the thought dis- 
ruption hypothesis. 

Recall of the persuasive message. The ma- 
nipulation of distraction produced a signifi- 
cant effect on the number of message argu- 
ments recalled, F(3,124) = 9.07, p < .001. 
The pattern of means for the no-, low-, me- 
dium-, and highly distracted groups was 3.8, 
4.0, 3.7, and 2.5, respectively (a perfect score 
would be 5 arguments recalled). Testing all 
pairwise comparisons on these four means 
using the Dunn multiple comparison pro- 
cedure indicated that the recall scores for the 
no, low, and medium groups did not differ 
from each other, but the high-distraction 
group had significantly less recall than any 
of the other groups (all ps < .05). There were 
no interactions on this measure. 

Ancillary measures. There were no main 
effects nor interactions on the measures of 
speaker credibility, persuasive intent of the 
speaker, or enjoyment of the experiment. 


Discussion 
Distraction and Recall 


The highest level of distraction unexpect- 
edly led to a significant drop in the number of 
arguments that subjects could recall. Yet, for 
the easy-to-counterargue message, the amount 
of acceptance produced under the highest 
level of distraction was slightly greater than 
that produced under medium distraction, and 
clearly greater than that produced under no 
distraction. (See Insko et al., 1975, for a 
similar result.) Previous researchers (Fest- 
inger & Maccoby, 1964; Osterhouse & Brock, 
1970; Vohs & Garret, 1968) suggested that 
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“distraction serves to facilitate acceptance of 
a counterattitudinal message only when the 
distraction is not so severe as to inhibit recep- 
tion of the arguments contained in the mes- 
sage” (Osterhouse & Brock, 1970, p. 355). 
Certainly at some high level of distraction, 
reception of the message could be so severely 
impaired that the decrement in message learn- 
ing would outweigh all other factors. How- 
ever, Insko et al. (1975) suggest that there is 
a relatively flat slope relating recall and atti- 
tude change and, therefore, it takes a rela- 
tively large change in recall before there is 
any associated attitude change. 


Distraction and Thought Production 


The findings for the easy-to-counterargue 
message replicated the patterns found by 
Osterhouse and Brock (1970), Keating and 
Brock (1974), and Insko et al. (1975). There « 
Was more agreement with the message as the 
level of distraction increased, and addition- 
ally, there was a parallel drop in the number 
of counterarguments generated when the no- 
and high-distraction cells were compared. 

Given a message which elicits predomi- 
nantly counterarguments, the effect of distrac- 
tion is to interfere with counterargument pro- 
duction. The logical parallel, as we suggested, 
is as follows: given a message which elicits 
predominantly favorable thoughts (here, the 
difficult-to-counterargue message), the effect 
of distraction might be to interfere with fa- 
vorable thought production, and thus decrease 
message acceptance. In point of fact, under 
conditions of no distraction, the difficult mes- 
sage did elicit more favorable thoughts (38% 
of total thoughts) than counterarguments 
(36%). Of course, this difference was unreli- 
able, but having favorable thoughts disrupted 
by distraction could have accounted for the 
observed decline in acceptance (see Figure 1), 


EXPERIMENT 2 


Since Experiment 2 was designed to more 
fully explore the possibility that for a difficult- 
to-counterargue message, distraction would 
interfere with favorable thought production, 
the difficult-to-counterargue Message in Ex- 
periment 2 advocated a proattitudinal posi- 
tion and used arguments that were logically 
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~ sound, and easily defendable. The easy-to- 


counterargue message also advocated a pro- 
attitudinal position, but as in Experiment 1, 
used arguments that were open to refutation 
and skepticism. This is in line with Roberts 
and Maccoby’s (1973, p. 303) suggestion that 
distracting a receiver from generating agree- 
ing comments about a proattitudinal appeal 
should result in less acceptance of the appeal. 

Although Experiment 2 was primarily de- 
signed to explore the more general distraction 
formulation, it might also be viewed as an- 
other test of “effort justification.” Besides the 
need for a proattitudinal message to maxi- 
mize favorable thought production, Baron et 
al. (1973) have argued that “Under certain 
circumstances, the effort hypothesis predicts 
distraction effects even on proattitudinal mes- 
sages. If the effort involved is particularly 
high . . . and there are few other justifica- 
tions . . . enough dissonance could remain to 
. .. make one more extreme in one’s favorable 
views” (p. 317). 

Briefly, two other changes were made in 
Experiment 2: (a) Only one order of the 
dependent variable booklet was used since 
order had minimal effects in Experiment 1, 
and (b) only two levels of distraction were 
used (low and medium) since these levels did 
not affect recall in Experiment 1. 

The following predictions were made. More 
counterarguments would be generated to the 
easy-to-counterargue message than to the dif- 
ficult-to-counterargue message, but more fa- 
vorable thoughts would be generated to the 
difficult-to-counterargue message. There would 
be a significant Message X Distraction inter- 
action on the dependent variable measuring 
acceptance: Distraction was expected to in- 
crease acceptance for the easy-to-counterargue 
message, but reduce acceptance for the diff- 
cult-to-counterargue message. Distraction was 
expected to inhibit counterargument produc- 
tion for the easy-to-counterargue message, 
and inhibit favorable thoughts for the diffi- 
cult-to-counterargue message. 


Method ` * 
Procedure 


Fifty-four introductory psychology students at The 
Ohio State University were run in a 2 X 2 factorial 
design with 2 levels of distraction (low and me- 


dium) and 2 levels of message counterarguability 
(easy and difficult to counterargue). Thirteen sub- 
jects were run in each cell, and subjects were run in 
groups of from 4 to 9 members. The attitude ques- 
tions preceded the thought listing section in the 
booklets. After completing the booklets, subjects 
were debriefed, thanked, and dismissed. 

Message counterarguability. As in Experiment 1, 
two messages were constructed for this study. Both 
messages argued that the tuition at O.S.U. should be 
reduced by 50% from the then current $270 per 
quarter to $135 per quarter. The difficult-to-counter- 
argue message was designed to minimize counter- 
argumentation, and maximize favorable thoughts. 
The difficult message claimed that Ohio was 48th in 
per capita spending on higher education, and if Ohio 
could move to 30th place, fees could be cut in half. 
Other revenue sources discussed in the message were: 
a $22 million surplus in the state budget, the state 
sales tax, and profits from the new state lottery. 
The message also argued that tuition needed to be 
reduced because the high price of an education in 
Ohio had led the state to be 49th in the percentage 
of high school graduates going on to college. 

The easy-to-counterargue message was designed to 
minimize favorable thoughts and to maximize 
counterarguments, and thus took a different approach 
to the idea of reducing tuition. The message called 
for replacing high-prestige faculty with lower pres- 
tige, lower paid faculty; for cutting back on book 
and magazine purchases in the library; for instituting 
a 5¢ per ride charge on campus bus rides; for in- 
creasing class sizes; and for exerting more control 
over courses taken by students to insure maximally 
efficient use of instructors. Each message was ap- 
proximately 3 minutes in length. 

Distraction and effort, The distraction task was 
identical to that employed in Experiment 1. For 
the low-distraction group the Xs flashed at the rate 
of 1 every 15 sec, and for the medium distraction 
group, the rate was 1 every 5 sec. Again, this manip- 
ulation was also designed to vary effort. 


Dependent Variables 


Attitude and thought measures. The specific tuition 
attitude measure employed in Experiment 1 was used 
again in conjunction with a question which asked, 
“In general, to what extent did you agree with the 
speaker’s recommendations about lowering the tui- 
tion?” As in Experiment 1, these two attitude mea- 
sures were converted to standard scores and summed 
for each subject. The thought collection and thought 
scoring procedure paralleled that employed in Ex- 
periment 1. Manipulation checks and ratings of 
speaker credibility, persuasive intent, and enjoyment 
of the experiment were also taken. All questions 
except the specific tuition measure were responded 
to on 12-point Likert-type scales. 

Measure of recall. The last question in the book- 
lets asked subjects to try to recall as many argu- 
ments as they could from the message. The argu- 
ments recalled were scored in a manner identical to 
that in Experiment 1. 
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TABLE 1 
MEAN NUMBER OF COUNTERARGUMENTS AND 
FAVORABLE THOUGHTS IN RELATION TO 
MESSAGE COUNTERARGUABILITY AND 
LEVEL or DISTRACTION 


Easy to Difficult to 
counterargue counterargue 
Low Medium Low Medium 
distrac- distrac- distrac- distrac- 
Measure tion tion tion tion 
Counterarguments 3,920 2.615 38o ABe 
Favorable thoughts 115a 1.778 3.69 2.316 


Note, Cells in any row having a common subscript are not 
significantly different at the .05 level by the Duncan multiple 
range test. 

In the low-distraction condition, subjects received 4 flashes 
per minute. 

In the medium-distraction condition, subjects received 12 
flashes per minute. 


Results 
Independent Variables 


Message counterarguability. Significantly 
more counterarguments (see Table 1) were 
generated to the easy message (X = 3,27) 
than to the difficult message (X = 42); F(1, 
43) = 52.49, p < .0001.° Sixty percent of the 
thoughts generated to the easy message were 
coded as counterarguments, while the corre- 
sponding figure for the difficult message was 
8%. 

Significantly more favorable thoughts were 
generated to the difficult-to-counterargue mes- 
sage (XY = 3.00) than to the easy-to-counter- 
argue message (X = 1.46); F(1,48) = 14.84, 
p< .001 (cell means in Table 1). In addi- 
tion, favorable thoughts predominated in re- 
sponse to the difficult-to-counterargue mes- 
sage: 62% of the total thoughts were coded 
as favorable. In the easy-to-counterargue mes- 
sage, which used weak arguments to support 
an agreeable position, the percentage of fa- 
vorable thoughts was 27%. There were no 
significant differences between groups in total 
thoughts generated. 

The manipulation of distraction and effort. 
Subjects in the medium distraction conditions 
reported feeling more distracted (X = 6.39) 
than subjects in the low-distraction conditions 
(X = 3.93); F(1, 48) = 9.07, p < .005. Also, 
subjects in the medium-distraction conditions 
perceived the task as more effortful (X¥ = 
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6.70) than subjects in the low-distraction con- 
ditions ( = 5.00); F(1, 48) = 3.91, p < .05. 


Dependent Variables 


Communication acceptance. Subjects who 
heard the difficult-to-counterargue message 
expressed more agreement with the message 
(X = .68) than subjects hearing the easy-to- 
counterargue message (X = —.72); F(1,48) 
= 14.10, p < .001. Also, the predicted Mes- 
sage X Distraction interaction was significant; 
F(1,48) = 6.89, p < .02 (see bottom panel 
of Figure 1). Pairwise comparisons employing 
the Duncan multiple range procedure indi- 
cated that for subjects hearing the easy-to- 
counterargue message, those under medium 
distraction (X =.01) showed significantly 
more agreement with the speaker than sub- 
jects under low distraction (X = —1.45), % 
< .05. Although not statistically significant, 


for subjects hearing the difficult-to-counter- ‘ 


argue message, the pattern was reversed (for 
low-distraction subjects, X = .95, for medium- 
distraction subjects, X = .42). 

Thought production. A marginally signifi- 
cant Message X Distraction interaction (1, 
48) = 3.10, p < .08, was obtained on the 
number of counterarguments generated. Dun- 
can’s multiple range procedure (Table 1) in- 
dicated that, as predicted, the number of 
counterarguments generated under low dis- 
traction for the easy-to-counterargue message 
was significantly greater than the number 
generated under medium distraction. The 
number of counterarguments generated to the 
difficult message did not differ as a function 
of distraction. As in Experiment 1, increased 
distraction reduced counterargumentation for 
the easy message only. 

A significant Message X Distraction inter- 
action was obtained on the number of favor- 
able thoughts generated F(1,48) = 6.27, p 
< .05. The Duncan multiple range procedure 
(Table 1) indicated that significantly more 
favorable thoughts were generated to the dif- 


5 The reader may wonder why a message advocating 
a proattitudinal position would generate any counter- 
arguments at all. Recall that the easy-to-counter- 
argue message discussed specific consequences that 
would result if the tuition were reduced (e.g., initia- 
tion of bus fares, reduced books in the library, etc.) 
that most students viewed as undesirable. 
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ficult message under low distraction than 
under medium distraction. For the easy-to- 
counterargue message, there was no difference 
as a function of distraction. The patterns of 
differences on the thought measures are ex- 
actly congruent with the thought disruption 
hypothesis. 

Recall. The manipulations produced no sig- 
nificant differences on the number of message 
arguments that could be recalled. The average 
number of arguments recalled across all cells 
was 3.77 out of 5 possible. 


Discussion 


The pattern of means on attitude, counter- 
argument, and favorable thought measures in 
both Experiments 1 and 2 supported the gen- 
eral thought disruption hypothesis over the 
effort justification notion. As the level of dis- 
traction went up, the number of counterargu- 
ments went down for the easy-to-counterargue 
message, but remained unchanged for the 
difficult-to-counterargue message. Likewise, as 
the level of distraction went up, the number 
of favorable thoughts generated went down 
for the difficult-to-counterargue message, and 
remained unchanged for the easy-to-counter- 
argue message. The role of distraction appears 
to be selective. The present evidence suggests 
that distraction inhibits the dominant cogni- 
tive response to a persuasive appeal. If the 
dominant cognitive response is counterargu- 
mentation, then distraction will tend to in- 
hibit this and result in increased persuasion; 
but if the dominant cognitive response is 
favorable thoughts, distraction will tend to 
inhibit this and result in decreased persuasion, 

Following Osterhouse and Brock (1970) 
and Insko et al. (1975), several analyses of 
covariance were conducted using the stand- 
ardized attitude index as the criterion in all 
cases. In Experiment 1 the crucial Message X 
Distraction F of 2.97 is reduced to a nonsig- 
nificant 2.49 when counterarguments are the 
covariate, to a nonsignificant 2.51 when fa- 
vorable thoughts are the covariate, and to a 
nonsignificant 2.28 when both are used as co- 
variates. This same ‘result occurs in Experi- 
ment 2 in that a significant Message X Dis- 
traction F of 6.89 is reduced to a nonsignifi- 
cant 3.85 when counterarguments are co- 
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varied, to 2.03 when favorable thoughts are 
covaried, and is reduced substantially to 1.48 
when both are used as covariates. These re- 
sults are consistent with the notion that the 
subjects’ cognitive responses mediate the in- 
teraction. 


CONCLUSIONS 


Why not conclude that the distraction ef- 
fect is mediated by both thought disruption 
and effort justification? Why insist on a mo- 
nistic explanation? The answer lies in the 
predictive power of the competing explana- 
tions. The effort justification prediction is 
that the more effortful the distraction, the 
more successful it will be in producing ac- 
ceptance. Yet the studies reported here indi- 
cate that the distraction effect appears to be 
dependent not upon the amount of effort pro- 
duced, but on the message content and the 
nature of the thoughts elicited by the mes- 
sage. The thought disruption hypothesis al- 
lows the specific prediction that distraction is 
most likely to lead to enkanced persuasion 
when a message presents poor arguments (i.e., 
arguments that are open to refutation and 
counterargumentation) and to reduced per- 
suasion when a message presents very good 
arguments (i.e., arguments that are likely to 
elicit favorable thoughts). The effort justifica- 
tion hypothesis fails because it is unable to 
make such differential predictions. 

In sum, the results of the present experi- 
ments appear to have extended the role of 
distraction beyond the confines of counter- 
attitudinal persuasion. Not all of the perti- 
nent comparisons were statistically reliable, 
but the replicated patterns of differences make 
it likely that subsequent investigators will be 
able to determine the dominant cognitive re- 
sponse to a communication, estimate the 
cumulative effect of these responses on ac- 
ceptance, and control much of the variance in 
acceptance by systematic manipulations of 
distraction, The current research suggests that 
distraction can enhance or reduce yielding to 
propaganda depending on the nature of the 
message (whether easy or difficult to counter- 
argue) and the extent to which appropriate 
responses (favorable thoughts or counterargu- 
ments) are evoked, 
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REFERENCE NOTE 
‘1, Baron, R. S. Personal communication, October 
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This study examined the impact of some basic exchange-theory variables, (i.e., 
outcome alternatives and outcome value) on multistrategy evaluations in a con- 
flict setting. Using a role-playing simulation method, 320 college students took 
the position of an employee in conflict with an employer and evaluated their 
own likelihood of adopting four strategies suggested by exchange-theory 
schemes: self-enhancernent, threat to leave, coalition with other employees, and 
conflict avoidance. A 2X 2X22 factorial design manipulated self’s (em- 
ployee) outcome alternatives, self’s (employee) outcome value, other’s (em- 
ployer) outcome alternatives, and other’s (employer) outcome value. The re- 
sults showed that outcome alternatives and outcome value have differential 
effects on strategy evaluations. The results generally support predictions implied 
by the Blau and Emerson analyses of power relations and a subjective prob- 
ability of success interpretation suggested by Kipnis’s “treatment of choice” 
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metaphor, 


Conflicting parties in “natural” settings 
typically have a range of strategy options, 
meaning that they must evaluate and select 
one of several strategies. This multistrategy 
decision process prompts two important ques- 
tions: What criteria do parties use to evaluate 
diverse strategies? Are similar or different 
criteria applied to disparate strategies? These 
questions have not been answered in research 
on power tactics, which uses “forced-choice” 
settings to investigate one strategy at a time 
(see reviews by Kipnis, 1974; Michener & 
Suchner, 1972; Tedeschi, Schlenker, & Linds- 
kold, 1972). Such single-strategy research 
yields valuable information on why persons 
decide to use or not use a particular strategy, 
but it ignores the problem of what information 
persons use to simultaneously evaluate diverse 
strategies. Research incorporating a full range 
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of strategy options seems essential to examine 
this issue. 

The present study addresses the multistrat- 
egy issue and is theoretically based on the 
exchange framework of Blau (1964) and Em- 
erson (1962, 1972). In conflict settings, per- 
sons typically evaluate the context and alter- 
natives available to them before undertaking 
some action. A significant aspect of these cog- 
nitive processes appears to be the development 
of “plans of action” (i.e., tentative decisions 
regarding the behavioral alternatives). The 
present research examines these plans of ac- 
tion rather than behavior per se and is, there- 
fore, concerned with cognitive processes that 
precede actual behavior in conflict settings. 

In most conflict settings, there are a large 
number of specific strategies available to ac- 
tors, and exchange theorists have suggested 
various strategy categorizations, The major 
strategies can be identified as self-enhance- 
ment, threatening to leave the relationship, 
coalition formation, and conflict avoidance 
(Blau, 1964; Emerson, 1962, 1972; Michener 
& Suchner, 1972). Consider these strategies in 
a situation where an employee might try to 
extract a pay raise from an employer, The 
self-enhancement strategy involves an attempt 
to persuade the employer that one’s inputs 


885 


886 


(e.g., performance) are high enough to war- 
rant a pay raise. The threat-to-leave strategy 
is a threat to seek employment elsewhere, A 
coalition refers to joint action with other 
employees against the employer. Conflict 
avoidance involves resigning oneself to cur- 
rent pay levels (Blau, 1964; Emerson, 1962, 
1972; Michener & Suchner, 1972). The pres- 
ent study uses an employee-employer context 
to examine how persons evaluate these four 
Strategies, 

Some recent theory and research suggests 
that multistrategy evaluations can depend on 
a number of factors, including expected suc- 
cess, normative constraints, and the approval 
of a third party (Kipnis, 1974; Michener & 
Schwertfeger, 1972; Raven & Kruglanski, 
1970). The basic idea guiding this research is 
that persons will estimate the probable suc- 
cess of alternative strategies and that strategy 
evaluations will reflect these expectations of 
success, This idea has been developed by Kip- 
nis (1974) in his research on how leaders 
Overcome resistance from subordinates, He 
argues that there is a “treatment of choice” 
for each form of subordinate resistance. The 
treatment is adjusted to the situation, mean- 
ing that leaders diagnose the reason for a 
subordinate’s resistance and adopt strategies 
that are most likely to overcome this resist- 
ance (i.e., strategies with the greatest expected 
success). The present study is concerned with 
a subordinate’s attempt to overcome the re- 
sistance of an employer to a pay raise and 
uses expected success to suggest criteria or 
information a subordinate may use to make 
strategy evaluations. 

Exchange-theory treatments of power sug- 
gest that a person might use information re- 
garding the power position of “self” and 
“other” to form success expectations and make 
strategy decisions. To Blau (1964) and Emer- 
son (1962, 1972), outcome alternatives and 
outcome value determine the power each per- 

over the other. Outcome alternatives 

he availability of outcomes from 
and value is generally viewed 
nce of” or “need for” the out- 

, rather than outcome mag- 

t; Emerson, 1962, 1972; 

wt & Kelley, 1959). A 
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person (employee) has power over an other 
(employer) to the extent that the other has 
few alternative sources and highly values the 
outcomes at issue; similarly, the other (em- 
ployer) has power over oneself ( employee) to 
the extent that self has few alternatives and 
highly values the outcomes at issue. A recent 
study shows that these determinants of “ob- 
jective” power also affect a person’s subjec- 
tive estimate of the power of self and other 
(Bacharach & Lawler, 1976). 

Blau (1964) and Emerson (1962, 1972) 
identify four variables that might provide 
persons criteria for making multistrategy eval- 
uations: self’s alternatives, self’s value, other’s 
alternatives, and other’s value. Blau and Em- 
erson do not explicitly predict which criteria 
will affect which strategies, but Emerson 
(1972, p. 68) has recently posed this research 


question and suggested that expected success > 


might provide the foundation for such predic- 
tions. From an expected-success premise, this 
study expects each of the foregoing variables 
to be associated with different strategies be- 
cause (a) different variables give persons 
information on different points of vulnerabil- 
ity in one’s own or the other’s situation, and 
(b) different strategies tend to be directed at 
disparate sources of vulnerability, For exam- 
ple, if one (self) has many alternatives, then 
he may see a threat to leave as more effective 
than tactics that do not use the power re- 
Source provided by such alternatives, A per- 
son may expect greater success from a threat 
to leave under such circumstances because 
this strategy attacks a vulnerable point in the 
other’s situation. With few alternatives, a 
person (self) would be foolish to use this 
strategy because such a threat is buttressed 
by minimal force and unlikely to be viewed as 
credible, 

The implication of the Blau-Emerson anal- 
yses of power and the expected-success prem- 
ise (Kipnis, 1974) is that each alternative- 
value variable will bear most heavily on one 
of the four strategies. Before Presenting the 
specific Predictions, two points should be em- 
Phasized. First, an independent variable may 
affect more than one strategy without under- 
mining the general hypothesis, which suggests 
that each independent variable may be more 
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important to one strategy than to the remain- 
ing strategies. Second, consistent with a cogni- 
tive focus, the dependent variables are a per- 
son’s (self) report of how likely he would be 
to adopt each of the four strategies (i.e., 
strategy plans). 


Self’s Outcome Alternatives 


One’s own alternatives should be most im- 
portant to the threat-to-leave strategy. As 
noted above, the availability of alternatives 
for self is especially relevant to the expected 
success of threats to leave, with greater suc- 
cess anticipated under high as opposed to low 
alternatives. Persons should see such alterna- 
tives as a more relevant criterion for threats 
to leave when compared with other strategies. 
Although several studies document links be- 
tween alternatives and cooperation, prior re- 
search has not determined whether persons 
view self’s alternatives as more relevant to 
threats to leave than to other possible strate- 
gies (e.g., Burgess & Nielsen, 1974; Schellen- 
berg, 1965). 


Other's Outcome Alternatives 


The other’s alternatives should most 
strongly affect coalition evaluations. A coali- 
tion limits the other’s alternatives (e.g., Em- 
erson, 1962; Michener & Suchner, 1972); 
consequently, persons may see a coalition as 
more useful when the other has many alterna- 
tives because the coalition reduces this base 
for the other’s power. If the other has few 
alternatives, then coalitions would be pointless 
and a threat to leave may become a more 
viable strategy. In sum, other’s alternatives 
should be most important to coalitions but 
may also affect threats to leave. Under condi- 
tions of high as opposed to low alternatives, 
persons’ expressed tendency toward coalitions 
should increase, whereas their expressed ten- 
dency toward a threat to leave should de- 
crease. 

The impact of other’s alternatives on coali- 
tions has not been explicitly investigated, but 
this effect is suggested by a study which found 
that the frequency of “weak” coalitions (i.e., 
coalitions between subordinates) was greater 


when the other (i.e., superordinate) has an 
alternative outcome source (Kelley & Arro- 
wood, 1960). However, this finding could be 
due to the superordinate refusing a coalition 
and selecting the alternative rather than to 
the effect of other’s alternatives per Se on 
subordinates’ inclination toward the weak 
coalition, The present study attempts to dem- 
onstrate a link between the other’s alternatives 
and a subordinate’s evaluation of the coalition 
option. 


Self’s Outcome Value 


The value persons attach to the outcomes 
should be the prime criterion for conflict 
avoidance. The conflict-avoidance option 
enables a person to avoid the risk or effort 
associated with “confrontation” strategies, 
such as coalitions, threats to leave, and self- 
enhancement. Persons should report a greater 
tendency toward conflict avoidance when they 
attach low as opposed to high value to the 
outcomes at stake. There are two possible 
reasons for this, both of which suggest that 
the expected success of attempting influence 
will decrease under low value. First, persons 
may not view the salary grievance as legiti- 
mate enough to justify influence attempts 
under conditions of low value. Given that 
legitimate grievances provide a better founda- 
tion for influence, persons may expect less 
success from influence in the low-value condi- 
tion. Second, given that the other has infor- 
mation on self’s value, persons in the low- 
value condition might expect to be viewed by 
the other as lacking the motivation or com- 
mitment to push for a pay raise, and this may 
make the other less responsive to influence 
attempts. 

In sum, low value should reduce the ex- 
pected success of influence attempts and facili- 
tate conflict avoidance. Regarding prior re- 
search, conflict-avoidance responses (e.g., 
“withdrawal”) have been documented in a 
variety of contexts, but research has not exam- 
ined the impact of extant outcome value on 
conflict avoidance or on multistrategy evalu- 
ations (e.g, Hammock & Brehm [cited in 
Brehm, 1966]; Leventhal & Bergman, 1969; 
Michener & Suchner, 1972). 
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Other's Outcome Value 


The importance that other ascribes to the 
outcomes should be strongly associated with 
self-enhancement. Other’s value reflects the 
cost of making concessions to self. If other’s 
value is low, the costs are minimal, and per- 
sons might expect the other to be more recep- 
tive to influence. If other’s value is high, on 
the other hand, persons might expect the other 
to adopt a more intransigent stance because 
the cost of making concessions to self is 
greater. 

Persons may expect greater success from 
self-enhancement when the other attributes 
low value to the outcomes in question. The 
reason is that self-enhancement involves more 
gentle persuasion than the other confronta- 
tion strategies (i.e., threat to leave, coalition), 
and the success of such gentle persuasion 
should be especially sensitive to the other’s 
concession cost. Self-enhancement may be 
fruitless when the other highly values the out- 
comes because of the other’s intransigence; 
the high-value condition suggests a more 
forceful strategy, a coalition. In sum, an in- 
crease in the other’s value should decrease the 
expressed likelihood of self-enhancement and 
increase the likelihood of a coalition strategy. 


METHOD 


Subjects and Procedures 


The four independent variables (i.e., self’s outcome 
value, self’s outcome alternatives, other’s outcome 
value, other’s outcome alternatives) were manipu- 
lated in a 2X 2X 2X 2 factorial design. A total of 
320 undergraduates at Cornell University partici- 
pated in the study and were randomly assigned to 1 
of 16 experimental treatments (i.e., 20 per cell). The 
data were gathered via questionnaires administered in 
large classes, and care was taken to assure that 
classes had not discussed ideas relevant to the study. 

Before responding to a questionnaire, subjects read 
a description of a situation involving conflict between 
an employer and an employee. All subjects adopted 
the role of the employee and imagined themselves as 

sales representative for a large clothing company. 

cifically, the description was as follows: 


re a salesman (saleswoman) for a large com- 
ich produces and sells clothing. Sales per- 

» yourself, work on a salary basis and 

re a commission. Recently, the com- 

pay raises to production workers, 

salary increases to sales personnel 

\. Over the last year, your own 


sales have steadily increased, and you are somewhat 
unhappy about not getting a salary increase. 


Faced with the failure of the company to give 
sales personnel pay raises, you have four alterna- 
tive courses of action: (1) try to persuade your 
Supervisors that you, personally, deserve a pay 
increase by pointing to your sales performance; (2) 
join with other’sales personnel and, as a group, 
pressure the company to give pay raises; (3) indi- 
vidually, threaten to find another sales job and 
leave the company; (4) stay in your present job 
and accept your current salary. Your task is to 
decide which of these options to select. 


Subjects then read information that manipulated 
the independent variables. The availability of alterna- 
tive jobs for self and alternative employees for other 
manipulated self-other alternatives. The self-alterna- 
tives (low vs. high) manipulation stated, “The job 
market for sales people with your qualifications is 
(very tight/very good), and there is (only a 10%/a 
90%) chance that you could find a better job.” The 
manipulation of other’s alternatives (low vs. high) 
stated, “If you quit, the company would find it 
(difficult/easy) to replace you because there is (only 
a 10%/a 90%) chance that they could hire another 
person with your qualifications.” 

Self value and other value were manipulated by 
varying the consequence a pay raise would have for 
one’s own and the employer's financial position. 
Specifically, the self-value (low vs. high) manipula- 
tion stated, “A pay raise is (not at all important/ 
very important) to you because your financial posi- 
tion has (improved/deteriorated) in the last year.” 
Other value (low vs. high) was manipulated by 
stating, “The company is in (good financial condi- 
tion/poor financial condition), and there is (only a 
10%/a 90%) chance that giving salary increases will 
affect the financial security of the company.” 


Dependent Variables 


A questionnaire, containing the dependent mea- 
sures, followed the description of the situation. Sub- 
jects were instructed to read the description of the 
situation twice before proceeding to the question- 
naire. This served to equalize subjects’ understanding 
of the situation and account for differentiation in 
reading-comprehensive ability. Separate questionnaire 
items for each strategy asked subjects how likely 
they would be to adopt the strategy. Subjects re- 
sponded on 9-point scales labeled “not at all likely” 
at the low end and “highly likely” at the opposite 
pole. The specific items were as follows: 


How likely is it that you would decide to accept 
your current salary and not try to get the com- 
pany to change its decisions on pay raises? (con- 
flict avoidance) 


How likely is it that you would try to persuade 
the company to give you a pay raise by pointing 
to your sales performance? (self-enhancement) 
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How likely is it that you would threaten to leave 
the company and get another job? (threat to 
leave) 

How likely is it that you would organize with 

other sales personnel and, as a group, pressure the 

company to give salary increases? (coalition) 

Additional questionnaire items, with similar re- 
sponse formats, asked subjects to evaluate the legiti- 
macy of attempting social influence and estimate the 
success of social influence in general as well as the 
likelihood that particular strategies would success- 
fully extract a pay raise from the other. 

The last item on the questionnaire asked subjects, 
“How confident are you with the answers you gave?” 
Subjects answered on a 9-point scale ranging from 
“not at all confident” to “highly confident.” The 
overall mean across experimental conditions was 6.87, 
reflecting considerable confidence. 


RESULTS 


With multiple dependent variables, sepa- 
rate analyses of variance for each dependent 
variable (strategy) may yield significant ef- 
fects by chance. To avoid capitalizing on 
chance, it is necessary to perform a multi- 
variate analysis of variance to ascertain 
whether each independent variable has an 
overall effect on the multidependent variables. 
If the overall effects are statistically signifi- 
cant, then it is appropriate to examine uni- 
variate Fs from separate four-way analyses of 
variance on each dependent variable (for per- 
tinent discussions of multivariate analysis of 
variance, see Anderson, 1966; Cooley & 
Lohnes, 1971; Jones, 1966). 

A multivariate analysis of variance, includ- 
ing the four strategies, revealed main effects 
for all four independent variables using the 
Wilks-lambda criterion, 7(1,304) = 6.43, p 
< .001, for self’s value; F(1,304) = 20.85, 
p < .001, for self’s alternatives; F(1,304) = 
3.97, p < 004, for other’s value; F(1, 304) 
= 4.55, p< .001, for others alternatives. 
Multivariate tests for interaction effects were 
not statistically significant. Therefore, it is 
appropriate to examine univariate Fs for the 
main effects of each independent variable. 
Table 1 presents the mean likelihood scores 
for each strategy by experimental condition. 

The results are divided into four sections. 
First, the pattern of main effects for each 
independent variable is presented. Second, 
standardized discriminant coefficients from the 
multivariate analysis assess the relative im- 
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MEAN LIKELIHOOD SCORES FOR EACH or Four 
STRATEGIES BY EXPERIMENTAL CONDITION 


Strategy 
Experimental 

condition Self-en- Conflict 

(SV-SA- hance- Threat avoid- 

OV-OA)s ment Coalition to leave ance 
L-L-L-L 6.60 6.40 4.75 3.20 
L-L-L-H 7.00 5.25 5.45 3.15 
L-L-H-L 6.60 4.35 4.20 4.35 
L-L-H-H 6.45 5.15 4.95 4.75 
L-H-L-L 7.25 5.75 2.75 4.05 
L-H-L-H 6.80 5.40 3.60 3.90 
L-H-H-L 6.60 4.35 4.20 4.35 
L-H-H-H 7.35 4.85 3.75 4.25 
H-L-L-L 7.35 6.55 4.35 2.50 
H-L-L-H 7.75 5.95 5.30 2.45 
H-L-H-L 7.50 5.75 6.15 3.05 
H-L-H-H 715 4.00 6.80 2.15 
H-H-L-L 7.35 630 2.90 3,05 
H-H-L-H 7.35 5.55 3.65 4.05 
H-H-H-L 7.25 6.00 2.90 3.30 
H-H-H-H 7.65 5.75 3.95 3.50 


SV _ = self's outcome value; SA = self's 


outcome alternati OV = other's outcome value; OA = 


other's outcome alternatives; L and H refer to low and high 
conditions for the value and alternative variables. 

* Each experimental condition is defined by a combination of 
letters (L and H) that correspond to the variables SV-SA- 
OV-OA, respectively. 


portance of each independent variable to the 
four strategies. Third, the effects of the inde- 
pendent variables on perceptions of strategy 
success are presented. Fourth, the overall pref- 
erence rankings of the strategies are discussed. 


Main Effects 


Self's outcome alternatives. As expected, 
self’s alternatives constituted a criterion for 
threats to leave, Persons reported greater 
inclination toward a threat to leave in the 
high- as opposed to low-alternatives condi- 
tion, F(1, 304) = 64.75, p < .001. High al- 
ternatives for self also decreased the tendency 
of persons to say they would adopt conflict 
avoidance, an effect that was not predicted, 
F(1, 304) = 7.89, p < .005. In sum, persons 
with many alternatives reported less inclina- 
tion toward accepting their current pay and 
more inclination toward a threat-to-leave 
strategy. 

Other’s outcome alternatives. Two main ef- 
fects occur for the other’s alternatives. As 
predicted, persons in the high-alternatives con- 
dition said they were more likely to opt for a 
coalition than persons in the low-alternatives 
condition, F (1,304) = 4.94, p < .03. More- 
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TABLE 2 
ZeERO-ORDER CORRELATIONS BETWEEN 
Four STRATEGIES 
Self-en- Threat to 

Strategy hancement Coalition leave 
Coalition —.02 S 
Threat to leave DIS .05 
Conflict avoidance = .25* aoe = 20 


*p <.001. 


Over, persons reported less tendency toward a 
threat to leave when the other had high rather 
than low alternatives, 7(1,304) = 10.67, p < 
.001. Thus, as expected, where the other had 
many alternatives, persons reported a greater 
tendency toward coalitions and a lower ten- 
dency toward threats to leave. 

Self’s outcome value. The importance of the 
outcomes to self affected two strategies. As 
expected, persons reported greater inclination 
toward conflict avoidance when they (self) 
attached low as opposed to high value to the 
outcomes, F (1,304) = 21.72, p < .001. When 
the outcomes were of low importance, persons 
simply expressed less tendency to attempt 
changing their current outcome position. Low 
value also reduced persons’ tendency toward 
the gentlest confrontation strategy—self-en- 
hancement, F (1,304) = 8.84, p < .003. 

Other's outcome value. The prediction that 
other’s value would affect self-enhancement is 
not supported (F< 1). However, other’s 
value did affect coalition responses, Persons 
reported greater inclination toward a coali- 
tion when the other attributed low as opposed 
to high value to the outcomes, F(1,304) = 
6.71, p < .01. This pattern is the opposite of 
that predicted in the introduction and we 
return to it later. 

To conclude, the main effects indicate that 
different independent variables have somewhat 
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different tactical implications. Three of the 
four hypotheses were supported. As predicted, 
high alternatives for self enhanced the likeli- 
hood of threats to leave, high alternatives for 
the other increased the likelihood of coalitions 
while decreasing the likelihood of threats to 
leave, and low outcome value for self facili- 
tated conflict avoidance. The hypothesized 
effect of other’s value on self-enhancement 
did not emerge. A couple of effects that were 
not predicted also occurred. However, recall 
that the hypotheses do not preclude the pos- 
sibility of additional effects but suggest that 
specific independent variables will be more 
important to some strategies than others. 
Main effects alone are not sufficient because 
they do not allow one to systematically com- 
pare the effects of a given independent vari- 
able on the four strategies. Discriminant 
analysis was used for this purpose. 


Discriminant Analysis 


The use of standardized discriminant func- 
tions has two compelling advantages. First, 
this controls for intercorrelations among 
strategies (see Table 2). Second, and more 
important, standardized discriminant functions 
enable one to compare the sensitivity of dif- 
ferent strategies to variation in a given inde- 
pendent variable by determining the ability of 
each dependent measure (strategy) to dis- 
criminate between the categories of the inde- 
pendent variable (Anderson, 1966; Cooley & 
Lohnes, 1971; Jones, 1966). The equations 
containing the standardized discriminant co- 
efficients are presented in Table 3. In each 
equation, the dependent variable (strategy) 
with the largest score is the most sensitive to 
variation in a given independent variable and, 
by implication, the strategy for which the 
specific independent variable is most impor- 


TABLE 3 
STANDARDIZED DISCRIMINANT COEFFICIENTS FOR EACH INDEPENDENT VARIABLE 


Independent. variable Equation hen elic 
‘if’s alternatives = .43(SE) — .99(TL) + -30(C) + .26(CA) $ < .001 
1Ps value = _ A0(SE) + .03(TL) + .11(C) — .79(CA) $ < .001 
r’s alternatives = —.16(SE) + .91 (TL) — .62(C) + .09(CA) p < .001 
r’s value = —.35(SE) + .71 (TL) — .67(C) + .41 (CA) p < .004 


ient; TL = threat to leave; C = coalition; CA = conflict avoidance. 
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tant (see Anderson, 1966; Cooley & Lohnes, 
1971; Jones, 1966). 

To evaluate the hypotheses, compare the 
size of the coefficients within each equation 
and the coefficients for a given strategy across 
the four equations. The data provide unequiv- 
ocal support for predictions regarding self’s 
alternatives and self’s value. In the first equa- 
tion, a threat to leave was clearly more sensi- 
tive to self’s alternatives than other strategies; 
moreover, contrasting the threat-to-leave 
scores across equations, the threat-to-leave 
score was largest in the first equation. A simi- 
lar pattern was revealed for self’s value 
(Equation 2). Conflict avoidance had the 
largest discriminant score within the second 
equation, as well as across equations. The 
predicted effect of other’s alternatives is par- 
tially supported. Consistent with the predic- 
tion, coalition responses were quite sensitive 
to other’s outcome alternatives, but the co- 
efficient for a threat to leave was larger than 
expected. Other’s alternatives apparently con- 
stituted a major criterion for both threat-to- 
leave and coalition evaluations. Regarding the 
other’s value (fourth equation), self-enhance- 
ment did not attain the highest score, con- 
trary to the prediction. The most important 
implication of Equation 4 is that other’s value 
is important to coalition decisions because the 
coalition coefficient was the highest compared 
to the other equations, Within Equation 4, 
the value of threat-to-leave was slightly 
higher, but other equations contained even 
larger threat-to-leave scores, and the main 
effect of other’s value on threats to leave is 
not statistically significant.* 

In sum, self’s alternatives was the prime 
criterion for threats to leave, and self’s value 
was the prime criterion for conflict avoidance. 
Other’s alternatives was important to both 
threats to leave and coalitions, but of some- 
what greater importance to the former, where- 
as other’s value was important primarily to 
coalitions, 


Perceptions of Success 


“The discriminant functions show what 
strategies are most sensitive to given inde- 
pendent variables. If expected success (Kip- 
nis, 1974) is a viable interpretation, then each 
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independent variable should affect the ex- 
pected success of the strategy that is most 
sensitive to it. 

Data on subjective probabilities of success 
indicate that the observed effects on strategy 
evaluations are consistent with the observed 
effects on expected success. Regarding self’s 
alternatives, persons expected threats to leave 
to be more effective when they had high as 
opposed to low alternatives, F(1,304) = 
28.96, p < .001, but self’s alternatives did not 
affect the success attributed to the other con- 
frontation strategies (self-enhancement and 
coalitions). Based on the discriminant func- 
tions, a threat to leave is also most sensitive 
to other’s alternatives, and persons accordingly 
perceived it as more effective when the other 
had few alternatives, /(1,304) = 107.29, 
$ < .001. Other’s alternatives did not affect 
perceived coalition success but did unexpect- 
edly affect the expected success of self- 
enhancement, F(1,304) = 6.65, p < .01. In 
addition, consistent with the observed effects 
of other’s value, a coalition was perceived as 
more effective when the other attached low 
value to the outcomes, F(1, 304) = 4.33, 
p < .05, whereas other’s value did not affect 
the expected success of threat-to-leave and 
self-enhancement strategies. Finally, self’s 
value did not affect the subjective success of 


1 Note that these data allow one to compare the 
effects of each independent variable on the four 
strategies rather than compare the effects of the 
four independent variables on each strategy sepa- 
rately. This is consistent with the major purpose of 
the research, but one might argue that Blau (1964) 
and Emerson (1962, 1972) also imply that certain 
independent variables will have larger effects on a 
given strategy compared with other independent 
variables. For example, threats to leave should not 
only discriminate the most, compared with other 
strategies, between the low and high self alternatives, 
but among the independent variables, self’s alterna- 
tives should have the largest effect on threats to 
leave. To examine this issue, each strategy was sepa- 
rately regressed on the four independent variables. 
The results of multiple regression complement and 
are generally consistent with the discriminant analy- 
sis. Comparing the independent variables, self’s value 
had the largest effect on conflict avoidance, self’s 
alternatives had the largest effect on threats to 
leave, and other’s value and other’s alternatives 
had the largest (and nearly equal) effects on coali- 
tion evaluations. 
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specific confrontation strategies (threat to 
leave, self-enhancement, and coalition) but 
did affect the success attributed to social in- 
fluence in general. When persons felt the out- 
comes were of low importance, they expected 
social influence to be less effective, F(1, 304) 
= 8.92, p < .03. 

These results suggest that perceptions of 
success played an important role in persons’ 
use of the independent variables to make 
strategy evaluations. The major effects on 
strategy evaluations were generally isomorphic 
with the effects on expected success. 


Strategy Preference Rankings 


The data also provided crude information 
on the ranking of strategy preferences. A 
comparison of the overall strategy likelihood 
scores showed that self-enhancement, coali- 
tions, and threats to leave were the most pre- 
ferred strategies in this particular setting. 
Since these three responses, in contrast to 
conflict avoidance, involved a direct confron- 
tation with the other, the rankings affirmed 
that persons were inclined to exert overt pres- 
sure on the other. Of course, these data should 
be interpreted with considerable care, because 
such overall scores should be affected by fixed 
aspects of the experimental situation, In the 
present study, a conflict situation was inten- 
tionally induced, and it is quite likely that 
these preference rankings were a function of 
this conflict. Of primary interest is whether 
the independent variables have different tacti- 
cal implications, regardless of the preference 
rankings, and this has been addressed in prior 
sections. 


Discussion 


From the Blau (1964) and Emerson (1962, 
1972) analyses of power relations and Kipnis’s 
(1974) “treatment of choice” notion, this re- 
search proposed that outcome alternatives and 
value would provide cues for perceptions of 
strategy success and thereby affect strategy 
evaluations. The effects of different variables 
on persons’ evaluation of different strategies 
lv substantiate this notion, Self’s alter- 
i primarily on threats to leave, 
srimarily on conflict avoid- 

rnatives are important, 
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though not most important, to coalition evalu- 
ations. The only clear negative finding is that 
other’s value does not affect self-enhancement. 
Despite the negative finding, however, the 
observed effects of alternatives and value on 
strategy evaluations are consistent with the ef- 
fects on expected success, supporting expected 
success as an interpretive premise. Overall, 
the results show that persons view different 
power determinants, specified by exchange 
theories, as having different tactical implica- 
tions. The following discussion will elaborate 
the tactical implications of each independent 
variable and relate the major findings to 
prior research, 


Outcome Alternatives 


Research has shown that self’s alternatives 
affect the tendency of persons to maintain co- 
operative relations with an other (Burgess & 
Nielsen, 1974; Schellenberg, 1965; Thibaut 
& Kelley, 1959). Using a role-playing setting, 
the present study reveals a related finding on 
threat-to-leave evaluations. Specifically, this 
research makes two contributions regarding 
this strategy. First, it demonstrates that 
threat-to-leave evaluations are a function of 
the other’s alternatives as well as self’s alter- 
natives. When the other has few alternatives, 
persons believe he is more vulnerable to this 
strategy than when he has many alternatives. 
Second, this study shows that both self’s and 
other’s alternatives are more important to 
threat-to-leave evaluations than to coalitions, 
self-enhancement, and conflict avoidance. 

The effect of other’s alternatives on coali- 
tion evaluations is also an important finding. 
If the other has many alternatives, persons 
report a greater inclination toward a coalition, 
yet at the same time they report less tend- 
ency toward a threat to leave. This link be- 
tween other’s alternatives and coalitions is 
relevant to a finding in Kelley and Arrowood 
(1960) indicating that “weak” coalitions (ies 
coalitions between subordinates) were more 
likely when the superordinate had an alterna- 
tive outcome source. This could have. occurred 
because (a) the superordinate was simply 
more inclined toward the alternative than 
toward a coalition with a subordinate or (b) 
because subordinates felt a ccalition with each 
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other was more useful to contain an other 
who had alternatives. The present study dem- 
onstrates a link between other’s alternatives 
and a subordinate’s expressed tendency toward 
coalitions where only the second interpretation 
(b) could apply. In general, this finding sug- 
gests that persons use the fact that coalitions 
obstruct an other’s alternatives to identify 
cues for making coalition evaluations. The 
level of the other's alternatives is one such cue. 


Outcome Value 


Most research on strategies assumes that 
valued outcomes are at stake and investigates 
strategy behavior in the context of such 
valued outcomes. As a consequence, little re- 
search has explored the role of extant out- 
come importance on strategy evaluations (see 
Michener & Suchner, 1972). The present 
study reveals that the value ascribed to the 
outcomes by self and other affects strategy 
evaluations and that persons see self’s value 
as having different tactical implications than 
other’s value. 

Consider the effect of self’s value on con- 
flict’ avoidance. Persons with low value say 
they are more likely to resign themselves to 
current outcomes, Low value reduces the sub- 
jective probability of extracting any conces- 
sions from the other and enhances one’s 
proclivity toward simply adjusting to current 
outcome levels. This occurs despite the fact 
that persons had some reason to perceive the 
pay decisions of the employer (other) as in- 
equitable, since the other had awarded pay 
raises to production personnel but not to sales 
personnel (such as self). 

Ancillary questionnaire data shed further 
light on why low self value reduces the ex- 
pected success of social influence. First of all, 
persons felt they had a less legitimate griev- 
ance in the low-value condition. A question 
asked persons how legitimate they felt an 
influence attempt would be. Under conditions 
of low self value, persons perceived influence 
attempts as less legitimate than under condi- 
tions of high value F(1,304) = 18.56, p< 
.001. And the lower the perceived legitimacy 
of the grievance, the lower the expected suc- 
cess of influence (r = .25, p < .001). Second, 
persons with low as opposed to high value 
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reported a tendency to expend less effort to 
get a pay raise. A question asked persons how 
much effort they would be willing to put forth, 
and the results indicate that low value re- 
duced the estimated effort, F(1, 304) = 79.29, 
$ < .001. Less effort was also associated with 
lower expected success (r = .35, p < .001). 
Thus, low value undermined the perceived 
legitmacy of the grievance and lowered the 
effort persons expected to make in behalf of 
a pay raise. Persons are apparently aware that 
successful influence requires a legitimate griev- 
and a substantial expenditure of effort, Self’s 
value is one basis for making these judgments. 

In contrast to self’s outcome value, other’s 
value does not affect conflict avoidance but 
does affect evaluations of the most severe 
confrontation tactic—coalitions. In retrospect, 
the unexpected direction of the effect on 
coalition evaluations is understandable. Under 
low other’s value, persons probably did not 
believe the other would be more receptive to 
social influence because, given the situation, 
the other had already implicitly refused to 
award raises, The employer failed to increase 
pay even when it involved minimal cost 
and, therefore, persons probably inferred that 
collective pressure was more essential to win 
concessions where the other attributed low 
(compared with high) value to the outcomes, 
Under high value, persons may have viewed 
the employer as constrained by his financial 
situation, attributed less personal responsibil- 
ity for the lack of a pay raise to the employer 
(e.g., see Jones & Davis, 1965), and thereby 
expressed less proclivity toward coalitions. 

In conclusion, using the Blau-Emerson 
exchange framework, this study undertook 
the first complete investigation of the links 
between certain power determinants (i.e, 
alternatives and value) and strategies. In a 
multistrategy setting, the present research 
demonstrates that outcome alternatives and 
outcome value have differential effects on 
the evaluation of strategies. Future research 
should examine the impact of alternatives 
and value on other cognitive elements of con- 
flict settings, such as the expected response 
(i.e., counterstrategies) of the adversary, and 
relate the cognitive elements of “plans of 
action” to actual behavior. 
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Public Versus Private Reactions to Positive Tnequity 


Alba N. Rivera and James T. Tedeschi 
State University of New York at Albany 


In a worker-supervisor paradigm, opposing predictions of public and private 
reactions to equity situations were tested. In a 3 X2 factorial design, equity 
and positive inequity situations were created by having a supervisor (actually 
a confederate) distribute $1 between two workers: a subject and a second 
confederate. In the equity condition, subjects were paid 50¢, whereas in the 
two levels of positive inequity, subjects received either 75¢ or 90¢. The input 
of the workers was identical throughout the six experimental conditions. The 
way of measuring the dependent variables (pencil-and-paper or bogus pipe- 
line) was manipulated. As predicted by an impression management inter- 
pretation, when a pencil-and-paper test was used, subjects reported feeling 
more guilty and less happy and liking the allocator less, the greater the reward 
received. When a bogus pipeline procedure was used, subjects reported feel- 
ing more happy and liking the allocator more, the greater the reward re- 


ceived; there was no relationship between reward magnitude and guilt. 


Numerous studies have shown that persons 
adhere to a norm of equity in allocating re- 
wards within a group (cf. Pritchard, Dun- 
nette, & Jorgenson, 1972). In accord with 
the expectation that each person should re- 
ceive rewards proportional to his contribu- 
tion to the group, subjects who receive more 
than they should (positive inequity) have 
been shown to give away some of their re- 
wards to reestablish equity in the group (Ross 
& McMillen, 1973), while those who receive 
less than they should (negative inequity) take 
enough of later rewards to restore equity 
between themselves and other group members 
(Austin & Walster, 1975). 

The mechanism proposed as mediating 
equity-restoring behavior is an intrapsychic 
state referred to as distress (Adams, 1965; 
Walster, Berscheid, & Walster, 1973)., The 
particular emotional state accompanying this 
distress may be guilt (positive inequity) or 
anger (negative inequity). Consonant with 
older equilibrium notions in individual psy- 
chology (e.g., Hull, 1952), equity-restoring 
behavior is presumed to reduce distress (or 
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cognitive dissonance). While the evidence re- 
garding this presumed mechanism is sparse, 
pencil-and-paper measures of distress have 
been associated with equity-restoring behavior 
(Austin & Walster, 1975; Ross & McMillen, 
1973). 

The assumption that positive inequity pro- 
duces distress runs counter to the general no- 
tion that the greater the reinforcement, the 
more satisfied the individual will be. Consider 
a situation in which two persons must share 
$10 and each has contributed equally to earn- 
ing the total amount. Suppose that across 
dyads who experience this situation, one per- 
son is given $5, $6, $7, $8, $9, or $10. Equity 
theory states that the more the individual 
receives—that is, the greater the positive 
inequity—the more distress he should experi- 
ence. 

An extrapolation drawn from the distress 
hypothesis is that persons receiving equitable 
outcomes as compared to persons experienc- 
ing positive inequity will like their super- 
visors more. This association of equity with 
interpersonal attraction is apparently incon- 
sistent with a reinforcement theory that as- 
sociates degrees of liking with magnitudes of 
reward. However, the internal states gen- 
erated by adherence to or violations of the 
equity norm could be associated with liking 
in the manner specified by traditional rein- 
forcement theory. Negative and positive in- 
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trapsychic states may be associated with neu- 
tral stimulus persons, who through the process 
of secondary reinforcement (Lott & Lott, 
1972) are then disliked or liked. Partial sup- 
port for the notion that work partners and 
supervisors in an equitable situation will be 
liked more than those associated with an in- 
equitable one was found by Austin and Wal- 
ster (1974a). 

The several studies that have shown rela- 
tionships among inequity, felt distress, and 
interpersonal attraction have relied on pencil- 
and-paper measures of the relevant psycho- 
logical states. It is reasonable to suspect that 
subjects use pencil-and-paper measures in 
psychological experiments as an opportunity 
to fake a good impression of themselves 
(Crowne & Marlowe, 1964; DeFleur & 
Westie, 1958; Sigall & Page, 1971). The 
normative influence (Deutsch & Gerard, 
1955) of a group may dictate that the indi- 
vidual give lip service to standards of equity 
even when he profits from violation of them. 
This interpretation views the individual who 
experiences positive inequity as secretly happy 
and attracted to the allocator (and increas- 
ingly so the more he or she profits), but as 
publicly feigning guilt, unhappiness, and dis- 
like for those responsible for allocating in an 
inequitable manner. This impression manage- 
ment interpretation of the public reactions of 
a person who experiences positive inequity 
proposes a social explanation for equity-re- 
storing behavior as an alternative to intra- 
psychic states of guilt, dissonance, or dis- 
tress. That is, an individual who is overpaid 
relative to others who have done essentially 
the same amount and quality of work pri- 
vately experiences no guilt or distress, but 
feels he or she needs to express such negative 
feelings in order to show strong support for 
the important social norm of equity. 

A 3 X 2 factorial design was contrived to 
test the above impression management hy- 
potheses. Subjects experienced equity or one 
of two degrees of positive inequity in a work 
situation with a partner. Subsequent mea- 
surements of feelings of guilt, happiness, dis- 
the supervisor who allo- 
cated the rewards were taken on the usual 
pencil-and-paper measures of else were ob- 
tained through the bogus pipeline technique 
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(Jones & Sigall, 1971). The response set gen- 
erated by the bogus pipeline procedures is 
that the individual should tell the truth be- 
cause the equipment used by the experimenter 
can allegedly detect lies. 

The basis of the present study was the 
assumption that what people publicly say and 
what they privately feel are not the same anc 
are based on different considerations. Thus 
the more a person benefits from a violation < 
the equity norm, the more he will need by hi: 
verbal expressions to indicate his support fo 
the norm. He may, therefore, express guilt 
distress, and a disliking for the allocator 
however, if the individual is constrained to 
tell the truth about his or her feelings (as he 
or she presumably is under bogus pipeline 
conditions), he or she should not report feel 
ing guilty and, according to reinforcement 
theory, he or she should report greater happi- 
ness and liking for the allocator, the greater 
the amount of reward received. 

Specifically, the following hypotheses were 
tested: 


1. There will be a positive linear function 
of degree of expressed guilt with amount of 
reward received (positive inequity) under 
pencil-and-paper conditions; however, there 
should be no relationship between guilt and 
reward magnitude in the bogus pipeline con- 
ditions. 

2. Expressed happiness will be a negative 
linear function of reward in the pencil-and- 
paper conditions, and a positive linear func- 
tion of reward under bogus pipeline condi- 
tions. 

3. In the pencil-and-paper conditions, the 
subjects should express greater liking for the 
equitable than for the inequitable allocator, 
whereas the opposite pattern should be found 
in the bogus pipeline conditions. 


METHOD 
Subjects and Laboratory Personnel 


Sixty female students fulfilled an introductory psy- 
chology course requirement by their participation. 
They were randomly assigned to the six experi- 
mental conditions (z = 10). Six females served as 
confederates and four males served as experimenters? 


1The authors take this opportunity to thank Joan 
Napoli, Karen Vasto, Barbara Nass, Harrissi Stern, 
Peggy Gould, Sue Becker, William Hummel, William 


PUBLIC VERSUS PRIVATE REACTIONS TO POSITIVE INEQUITY 


The confederates and experimenters were randomized 
across conditions. 


Procedures 


When each subject arrived at the laboratory, she 
found two other female students (confederates) 
already seated in the waiting room. The experimenter 
introduced the students to each other and ushered 
them into the expeirmental room. They were asked 
to complete a five-item opinion survey on 7-point 
scales anchored by agree-disagree. The two confed- 
erates surreptitiously copied the subject’s answers 
for future use during the yerisimilitude procedures 
associated with the bogus pipeline; for control rea- 
sons the same procedure was followed for subjects 
in the pencil-and-paper conditions, Subjects were 
informed that the surveys would be used for another 
study and were asked to fold the answer sheets and 
to prevent the experimenter from seeing their 
answers, 

Subjects were told that the purpose of the study 
was to examine supervisor—-worker relations. A fake 
lottery was held to assign one supervisor and two 
worker roles to the three students and the subject 
always drew a worker assignment. Each worker inde- 
pendently assembled very difficult anagrams. The 
task was described as a test of spatial acuity devel- 
oped for airplane pilots. Subjects were informed of 
a bogus scoring procedure that allegedly weighted 
various aspects of the task for difficulty. Ambiguity 
regarding performance standards allowed the experi- 
menter to present bogus feedback without arousing 
suspicion. Subjects were informed that they would 
be paid for their work.2 The experimenter would 
score their work performance, but the supervisor 
would be given $1 to distribute between the workers 
in any way she wanted. Each worker was then left 
alone to work on the task for 10 minutes, after 
which she was taken to an individual cubicle, in- 
formed of her work score and that of the other 
worker, and then was informed about how the super- 
visor allocated the $1 to the two workers. In every 
case the subject learned that she and her co-worker 
had performed equally well (same inputs), but 
(across conditions of the experiment) that she had 
received SO¢, 75¢, or 90¢. The subject was then given 
the appropriate amount of money. 

Pencil-and-paper conditions. Subjects in the pen- 
cil-and-paper conditions were then asked to respond 
to a Mood Adjective Check List (MACL) as part of 
a survey on research participation by undergraduates 
being carried out by the psychology department. On 
each adjective item they were asked to indicate on 
a 1-4 scale (not at all-very much) the degree of an 
internal state, referred to by a mood adjective, expe- 
rienced by them during the experiment. The MACL 
was the same one used by Austin and Walster 
(1974b) and included guilt, anger, happiness, and 
satisfaction. Subjects were then asked to evaluate 
the supervisor regarding her fairness, efficiency, 
equitableness, attractiveness, and how much they 
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would like to have her as a supervisor again in the 
future. Subjects also rated the experimenter on polar 
adjectives (good-bad and honest-dishonest) yielding 
a measure of the evaluative dimension of the sc- 
mantic differential (Osgood, Suci, & Tannenbaum, 
1957). These interpersonal ratings were made on 7- 
point scales (anchored by +3 and —3 for scoring 
purposes), 

Bogus pipeline conditions. Subjects in the bogus 
pipeline conditions responded to the MACL adjec- 
tives that were presented one by one on index cards 
with the appropriate instructions and scale that 
were identical to the ones used in the pencil-and- 
paper conditions. Subjects made their ratings by 
turning a dial moving a pointer along a meter 
forming two parallel scales (1-4 and +3-—3). Which 
scale was appropriate was indicated by the index 
card presented to the subject. A response set for 
“telling the truth” was induced by a set of decep- 
tion procedures (described in more detail by Jones 
& Sigall, 1971) that leads subjects to believe the 
bogus apparatus can detect their true feelings by 
implicit muscle responses. Presumably, if subjects 
believe the verisimilitude procedures, they will be 
constrained to provide self-belief statements and 
will not engage in feigning attitudes they do not 
privately hold. 

The bogus pipeline verisimilitude procedure was as 
follows. Subjects were tested in a room containing a 
combination of equipment that conspicuously dis- 
played General Electric inventory seals: an electro- 
myograph (EMG), electroencephalograph, and elec- 
trocardiogram. A signal generator fed an audible 
500-Hz input into the equipment. This provided a 
continuous output on the EMG’s chart recorder which 
was visible to the subject. The subjects in these 
conditions were told that the equipment measured 
both the direction and intensity of attitudes and 
feelings people have. It was represented as being like 
a lie detector, only superior, and had been recently 
invented by physiologists at John Hopkins Univer- 
sity along with a team of electronics experts from 
General Electric in Schenectady, New York. Elec- 
trodes were attached to the subjects’ right hand. 

Subjects were asked to manipulate a dial with 
their right hand. The dial was attached to a gray 
metal box which had two meter faces, labeled “in- 
put” and “output.” The EMG electrodes, subjects 
were told, could detect their feelings; thus, the 
equipment could detect and display on the output 
meter the subjects’ answers to items from the opinion 
survey taken at the beginning of the experiment. In 
order to calibrate the equipment, as each item was 
presented, subjects were asked to use the dial in 
their right hand to predict on the input meter what 
answer the equipment would be giving on the output 
meter. They were told that they could try to fool 
the EMG by manipulating the input meter to any 


? Subjects served in the experiment as part of a 
course requirement and the money earned was a 
bonus or extra reward, This was the only study 
during the course of the semester in which money 
was given to subjects. 
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TABLE 1 
Mrans on MACL AND SUPERVISOR RATING ITEMS 


Reward condition 


Response mode 50¢ 15¢ 90¢ 
Guilt* 

Pencil-and-paper 1.2. 17a 2.0, 
Bogus pipeline 1.0. 1.0, 1.0, 
Happiness* 

Pencil-and-paper 2.85 2.95 2A. 
Bogus pipeline 2.8b 2.9 Sila 
Liking for supervisor” 
Pencil-and-paper Lih ha Te 
Bogus pipeline 1.9, 2.Anb 2.65 
Desire to work with supervisor again? 
Pencil-and-paper 1.75, —.8, —.5, 
Bogus pipeline 8p -To 1.3ab 


Note, Means with different subscripts are significantly differ- 
ent from each other according to Duncan's mul ple range tests. 

a Ona 4-point scale (1 = not at all, 4 = very much). 

bOn a 7-point scale (—3 = would dislike very much, 
+3 = would like very much). 


position they wanted, but that the EMG would 
analyze implicit responses and record their true atti- 
tudes on the output meter. The output was con- 
trolled by a confederate in another room and she 
adjusted the output meter a half-point away from 
the subjects’ actual prior answer on the opinion 
survey she had surreptitiously copied. The experi- 
menter turned to the equipment, and while throwing 
some switches (intended to represent his calibration 
of the EMG) the confederate gradually centered the 
output dial exactly on the subject’s rated attitude. 
After reviewing all five items from the initial opinion 
survey, the subject was asked to produce her copy, 
and the experimenter compared her rated attitudes 
on the survey with the EMG’s output readings for 
the same items. They were, of course, identical. 

The machine’s output dial was then covered so 
that only the experimenter could view it, and under 
the pretense that the experimenter was interested in 
how accurately each subject knew her own attitudes, 
the subjects were asked to make accurate predictions 
on the meter dial (input scales) regarding the items 
on the MACL and the supervisor rating scales pre- 
viously described. 

Debriefing regarding the deceptions and purposes 
of the experiment was carried out in classes at the 
end of the semester after all data had been collected. 


REsULTS * 
Man: Check 
The in ~ manipulations had the in- 


“ain effects of this factor were 
gs of the supervisor’s equi- 


tended efie: 
obtained on r: 
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tableness, F(2,54) = 10.21, P< 001, and 
fairness, F(2, 54) = 22.78, p< .001. Sub- 
jects in the equity conditions perceived the 
supervisor as more equitable (M = 1.15) than 
did subjects in either the 75¢, positive iv 
equity (M = —.95) or 90¢, positive inequ 
(M = —.95) conditions. Subjects perceiy 
the supervisor as most fair in the equity co 
dition (M = 2.15), moderately unfair in | 
75¢ positive inequity condition (M=-1.1 
and most unfair in the 90¢ positive inequi 
condition (M = —1.45); all three means we 
different from one another at .05 by Dunc 
multiple range comparisons. No other effec 
were obtained on these dependent variab': 
(ps > .1). 

An indication that the response modes ass: 
ciated with the dependent measures (penci 
and-paper or turning dials on a meter) repre 
sented commensurate scales is indicated by the 
fact that on 47 dependent variables the com 
parisons (Bartlett, 1973; Cochran, 1941) ac 
cording to response mode showed that all of 
the variances were homogeneous. The plausi- 
bility of interpreting the response set in the 
bogus pipeline condition as successful depends 
on how the pattern of results obtained ap- 
pears in the context of the theoretical con- 
siderations that led to the study (Ostrom, 
1973). 


Feeling States in Reaction to Reward 
Allocation 


The major hypotheses of this study involve 
tests of linear trends. The reward magnitudes 
manipulated represent unequal intervals (50¢, 
75¢, and 90¢). Appropriate coefficients were 
used in testing the relevant trends (Kirk, 
1968, p. 513). 

According to Hypothesis 1 there should be 
a positive linear trend between the degree to 
which subjects express feeling guilty and the 
amount of reward they received when their 
responses are measured by a pencil-and-paper 
scale, A test showed that only the linear trend 
was significant, F (2, 54) = 6.87, p < .01, anc 
the means are shown in Table 1. As can be 
seen, there was no relationship between the 
amount of reward received and expressed 
guilt in the bogus pipeline conditions, F < i- 


: 3 The authors thank Gerald G. Gaes for his help 
in performing some of the statistical analyses. 


PUBLIC VERSUS PRIVATE REACTIONS TO POSITIVE INEQUITY 


Furthermore, only a main effect of response 
mode was found on expressed guilt, F(1, 54) 
= 12.64, p < .001. Subjects in the pencil-and- 
paper conditions reported feeling more guilty 
(M = 1.6) than did subjects in the bogus 
pipeline conditions (M = 1.0). 

Hypothesis 2 was also confirmed. In the 
pencil-and-paper conditions, expressed happi- 
ness was a negative linear function of reward 
magnitude, F(2, 54) = 3.34, p< .05, and in 
the bogus pipeline conditions expressed happi- 
ness was a positive linear function of reward 
magnitude, F(2,54) = 5.99, p<.01. The 
means are shown in Table 1. Only a main 
effect of response mode was found on reported 
happiness, F(1, 54) = 7.2, p < .002. Subjects 
in the pencil-and-paper conditions reported 
feeling less happy (M = 2.6) than did sub- 
jects in the bogus pipeline conditions (M = 
33y: 


Attraction for Supervisor 


Hypothesis 3 predicted that subjects would 
express more liking for the supervisor in the 
equity than in the positive inequity conditions 
when the measure of attraction was obtained 
by a pencil-and-paper format, but that the 
opposite relationship should be obtained un- 
der bogus pipeline conditions. An interaction 
contrast was performed comparing equitable 
versus inequitable conditions across the pencil- 
and-paper and bogus pipeline modes of mea- 
surement, and it was significant, F(1, 54) = 
4.01, p < .05.* Cell means are shown in Table 
1. A main effect of response mode was also 
found on the measure of liking for the super- 
visor, F(1, 54) = 30.83, p < .001. Subjects 
expressed more liking for the supervisor in the 
bogus pipeline conditions (M = 2.2) than in 
the pencil-and-paper conditions (M = .57). A 
very similar pattern of results was obtained 
on the measure of the subjects’ rated pref- 
erence for working with the supervisor on a 
future occasion. The interaction contrast was 
significant, F(1,54) = 8.56, p< .01, and 
shows the same pattern as the data on liking 
(see Table 1 for cell means). A main effect 
of response mode was also found on the mea- 
sure of working preference, F(1, 54) = 3.81, 
p < .05. Subjects expressed greater preference 
for working with the same supervisor again in 
the bogus pipeline conditions (M = .91) than 
did subjects in the pencil-and-paper condi- 
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tions (M = .13). This main effect was clearly 
produced by the preference of subjects in the 
bogus pipeline conditions for the inequitable 
supervisor. 

There were no effects of the independent 
variables on the subjects’ rating of the experi- 
menter (all ps > .1). 


Discussion 


The pattern of results obtained in the pres- 
ent study supports an impression-manage- 
ment interpretation of public responses to 
experiencing positive inequity, and is contrary 
to the postulated intrapsychic processes that 
have been proposed as mediating postalloca- 
tion responses. It is difficult to see how postu- 
lated states of distress, guilt, anxiety, or dis- 
sonance, which presumably are aroused by 
the experience of positive inequity, can ex- 
plain why subjects in the bogus pipeline con- 
ditions reported being happier the greater the 
positive inequity experienced. 

It seems clear that subjects do expect re- 
ward allocations to be distributed according to 
a norm of equity. They indicated that the 
supervisor was very fair when she allocated an 
equal split of the reward to the two workers 
and they had performed equally well at the 
task. Furthermore, the subjects considered 
the supervisor to be more unfair the greater 
the inequity. These results were not affected 
by the mode of obtaining the judgments. 

A revision of equity theory is suggested by 
the present findings. Apparently, persons do 
behave in a manner that supports the equity 
norm, at least when they are under surveil- 
lance by others. Persons who benefit from an 
unfair allocation of rewards (positive in- 
equity) make the appropriate public responses 
to indicate their support of the norm. How- 
ever, persons experiencing positive inequity 
are secretly pleased that they have profited 


* Presumably, persons publicly support the equity 
norm by indicating dislike for anyone who violates 
it and hence an equity-inequity difference would be 
expected in the pencil-and-paper conditions, How- 
ever, when subjects must reveal their private feel- 
ings, as they are assumed to do in the bogus pipe- 
line conditions, a direct linear function of liking with 
reward level would be expected. This interaction be- 
tween response mode and the linear component of 
the magnitude of reward on liking for the supervisor 


was obtained, F(1, 54) = 5.67, p < .05. 
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from a violation of the norm. The implication 
is that equity-restoring behaviors displayed in 
laboratory experiments (e-8-, Leventhal, Al- 
len, & Kemelgor, 1969) by subjects who have 
experienced positive inequity are not medi- 
ated by distress or guilt but by concern about 
group acceptance and/or the approval of the 
experimenter. It might be expected in future 
research that the status or attraction of the 
experimenter or the cohesiveness of the group 
would affect the probability or degree of re- 
stitutive behavior. 

In order to appear supportive of the norm 
of equity, persons experiencing positive in- 
equity feign dislike for the unfair allocator. 
In the present study this was shown by the 
fact that in the pencil-and-paper conditions 
(where faking can be easily carried out) sub- 
jects disliked the supervisor when he dis- 
tributed rewards inequitably, but liked him 
when they believed the experimenter could 
detect lies. The positive function of liking 
with rewards under bogus pipeline conditions 
is consistent with both reinforcement and ex- 
pectancy theories of attraction. The latter 
predicts an increment in liking with unex- 
pected rewards (Tedeschi, 1974). 

In summary, the present study reveals a 
social mechanism that locks people into sup- 
port of an equity norm. Presumably, such 
support is manifested by verbal adherence to 
the norm, denial of pleasure at benefiting 
from violations of the norm in the form of 
distress and expressions of guilt, and equity- 
restoring behavior. These supporting responses 
are made because of concern regarding social 
acceptance and approval from others and per- 
haps as a protection against future violations 
of the norm at the expense of the norm-sup- 
porter. As we have seen, impression manage- 
ment strategies, which include feigning of 
guilt and dislike for an unfair allocator, are 
unmasked and the opposite feelings are found 
when persons believe their lies can be de- 
tected. 
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Ego Involvement and Attributions for Success and Failure 
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An experiment was conducted to investigate the impact of task importance on 
the causal attributions for success and failure on a bogus social-perceptiveness 
task. Consistent with previous research, individuals assumed more personal 


responsibility for success than failure. M 
Ç the social-perceptiveness test was presented as being, the more pronounced was 
l 


the effect of outcome on the subjects’ 


as favoring a motivational as opposed t: 


of asymmetries in causal attribution. 


A substantial body of literature indicates 
that individuals do not employ the same 
causal explanations to account for their suc- 
cesses and failures (Fitch, 1970; Luginbuhl, 
Crowe, & Kahan, 1975; Streufert & Streufert, 
1969; Weiner & Kukla, 1970; Wolosin, Sher- 
man, & Till, 1973; Wortman, Costanzo, & 
Witt, 1973). Specifically, individuals tend to 
see themselves as being more “personally” 
responsible for their successes than their fail- 
ures and to view “external” factors such as 
luck or task difficulty as being more responsi- 
ble for their failures. For the most part, in- 
vestigators have been guided by the concept 
of “ego defense” in explaining this phenom- 
enon. Presumably, by taking credit for their 
successes and by denying responsibility for 
failures, individuals can bolster and protect 
their ego or self-esteem. 

The concept of ego defense has historically 
enjoyed a prominent position in personality 
and psychoanalytic theorizing (eg., Adler, 
1956; Allport, 1937; Heider, 1958; Merton, 
1946; Murphy, 1947; Sherif & Cantril, 1947). 
Allport has called the defense of the ego 
“nature’s eldest law” (1937, p. 170). Sherif 
and Cantril describe the varied cognitive 
processes by which individuals defend the ego 
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for their assistance in the experimental phase of this 
| research. 

j Requests for reprints should be sent to Dale T, 
Miller, Department of Psychology, University of 
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oreover, the more valid and important 


attributions, The results are interpreted 
o an information-processing explanation 


as follows: 


If our ego is injured we resort to all kinds of ration- 
alizations, to protective adjustments, to selective 
modes of reasoning . . . in which we manipulate 
things, persons, memories or ideas in a highly selec- 
tive way. (1947, p. 100) 


Adler goes so far as to say that one of the 
major benefits of causalistic thinking to an 
individual is that it “frees him from responsi- 
bility and excuses him from blame” (1956, p. 
270). 

Despite the intuitive acceptability of the 
role of the ego in the attribution process, a 
recent review of the relevant literature (Mil- 
ler & Ross, 1975) has suggested that the 
empirical observation that individuals tend to 
explain success and failure differently may 
not necessarily reflect the influence of motiva- 
tionally based, “self-serving” biases. Instead, 
Miller and Ross (1975) argue that the exist- 
ing data are as easily interpreted within a 
rational, information-processing framework. 
More specifically, they contend that three fac- 
tors operating in isolation or in combination 
may cause individuals to take more personal 
responsibility for success than failure. First, 
individuals are more likely to accept respon- 
sibility for expected outcomes than unex- 
pected outcomes and, in general, people ex- 
pect success and not failure. Second, individu- 
als discern a closer covariation between be- 
havior and outcomes in the case of increasing 
success than in the case of constant failure, 
where changes in behavior are not perceived 
to be associated with changes in outcomes. 
Third, individuals tend to hold an erroneous 
conception of contingency, which leads them 
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to associate control primarily with the occur- 
rence of the desired outcome. 

In the present study, an attempt was made 
to secure more compelling evidence for the 
proposition that the causal explanations of in- 
dividuals are often guided by a motive to 
protect and/or enhance their self-esteem. In 
addition to the performance level obtained by 
individuals at a task (success or failure), the 
degree of self-esteem involvement was also 
manipulated. It is surprising that the level of 
self-esteem involvement has not been directly 
manipulated in previous investigations of 
causal attribution because the construct of 
self-esteem is a pivotal one in the “defensive 
attribution” formulation. This is all the more 
surprising because experimental investigations 
of the phenomenon of repression, which also 
focus on the cognitive dynamics of self-esteem 
maintenance, have made extensive use of this 
manipulation (Holmes, 1974). 

Building upon the assumption that there is 
a greater tendency to protect one’s self-image 
under conditions of high- than low-self-esteem 
involvement, it is hypothesized that the 
valence of a task outcome will affect causal 
attributions more under the former than the 
latter conditions. More precisely, it is pre- 
dicted that individuals will assume more per- 
sonal responsibility for success and less re- 
sponsibility for failure under high- than low- 
self-esteem involvement. 


Method 


Overview. Upon arrival at the experimental set- 
ting, subjects were required to complete a bogus 
social-perceptiveness scale (SPS) consisting of 2 case 
studies and 30 questions. On completion of the SPS, 
subjects were told either that the SPS was a well- 
established and valid test of social perceptiveness or 
that it was an unvalidated, recently devised test of 
social perceptiveness. While subjects were filling out 
a subsequent scale (an introversion-extroversion 
scale), the experimenter allegedly scored their per- 
formance on the SPS. The experimenter returned 
on completion of the second scale and informed 
the subjects that they had done well (80th percen- 

2) ly (20th percentile). While the experi- 
supposedly scoring the second scale, the 

asked to fill out a questionnaire con- 
< and perceptions of the SPS. 
T} ts were 52 male and female 

students who were re- 
o -up sheet circulated in 
1 sub red experimental credit 


cerning 
Subjects 
undergradua 
cruited by m 
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for their participation. Thirteen subjects were ran- 
domly assigned to each of four experimental condi- 
tions. The variables manipulated were nature of test 
outcome (success or failure) and involvement of 
self-esteem in the test outcome (high or low). 

Procedure. Subjects arrived individually at the ex- 
perimental laboratory area. The experimenter intro- 
duced himself and indicated that the subject would 
first be required to complete the SPS. They were 
told that this test examined the extent to which in- 
dividuals, with only a limited amount of information 
pertaining to another individual, could predict that 
individual’s behavior or feelings in a wide range of 
situations. The test employed was bogus and was 
adapted from one used by Wortman et al. (1973). 
The test presented information about two separate 
males. The information included descriptions of the 
men’s social behavior, achievements, and attitudes, 
as well as friends’ descriptions of them. Following 
this information, there was a series of 15 questions. 
The subjects were instructed to read the first case 
study, fill out the 15 questions, then read the second 
case and fill out the 15 questions again. The re- 
sponses to the questions were put on Markex com- 
puter cards because the subjects were told that their 
responses would be scored by computer. The ques- 
tions took a multiple-choice format and included 
inquiries as to the described person’s considerateness 
of others, his likely mode of child discipline, his 
personality, his concern for monetary attainment, and 
his tendency to brag about his accomplishments. 

When the subject had completed the two case 
studies, the experimenter returned to the experi- 
mental area. It was at this point that the experi- 
menter introduced the involvement manipulation. 

High or low involvement. In both conditions, the 
experimenter began by saying that after having com- 
pleted the SPS the subject should perhaps be told a 
little about the scale. In the high-involyement condi- 
tions, the experimenter indicated that the SPS was 
a well-established social perceptiveness test and had 
been administered to well over 100,000 people in 
various walks of life. He went on to say that many 
agencies and businesses use people’s scores on the 
SPS to assist them on client and employee decisions. 
The experimenter then indicated that high scores on 
the SPS were positively correlated with such desira- 
ble characteristics as intelligence, personal and mari- 
tal happiness, and job satisfaction. 

In the low-involvement condition, the experi- 
menter indicated that the SPS had only been devel- 
oped recently by a psychologist at the subject’s uni- 
versity and that its usefulness was still very much 
in doubt as it had only been administered tosabout 
100 people. The experimenter then intimated to 
the subject in a skeptical tone that although it was 
hoped that this test might constitute a good test. of 
social perceptiveness, scores on the test thus far had 
not been found to correlate with any of the other 
factors known to be related to social perceptiveness, 

To augment the involvement manipulation, the 
experimenter brought in an impressive, professionally 
printed folder labeled “Social Perceptiveness Scale”. 
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in the high-involvement condition and a dog-cared 
manilla file folder in the low-involvement condition. 

After describing the history of the SPS to sub- 
jects, the experimenter indicated that the purpose of 
the present experiment was to see if there was any 
relationship between an individual's score on the 
SPS and the degree of introversion or extroversion 
possessed by the individual. For this reason, indi- 
viduals were now being asked to complete an intro- 
version-extroversion scale, which was also to be 
answered on a computer card. Before leaving the 
subject to work on this scale, the experimenter picked 
up the subject's card and said, 


As I indicated earlier, your responses will all be 
analyzed by computer at a later date but since a 
lot of people have wanted to find out how well 
they did, I’ve prepared a key and have been 
scoring the“answers by hand. So, if you'd like I’ll 
score your answers while you're filling out the 
second scale. 


Subjects all assented to this arrangement. The ex- 


.Perimenter then left the subject and went to see 


whether the subject was in the success or failure 
condition. 

If the subject was in the failure condition, the 
experimenter wrote an 8 on the subject’s card and 
a 24 if he (she) was in the success condition. The 
experimenter yeturned with the card when the sub- 
ject had completed the second scale. The subject 
was shown the card and was informed that this was 
the score the subject had received out of a possible 
30. In the failure condition, the experimenter indi- 
cated that approximately 20% of those who have 
taken the SPS did worse than the subject, with 80% 
having done better. In the success condition, the 
subject was informed that only 20% of those who 
had taken the test had done better, while 80% did 
worse, 

After revealing to the subject how well he (she) 
had done on the SPS, the experimenter handed the 
subject a questionnaire that he asked the subject to 


, fill out while he scored the second scale. The purpose 


of the questfonnaire was presented as an attempt to 
collect the subject’s feelings and reactions to the test 


. with the possibility of using these responses to help 


interpret their scores. The questions on this question- 
naire constituted the main dependent variables in 
this study. The questions of most interest were the 
following: (a) “How difficult did you find the 
SPS?” (b) “How accurately do you think your 


* score on the SPS reflects the true degree of social 


— 


perceptiveness you possess?” (c) “How hard did 
you try on the SPS?” (d) “How important do you 
think luck was in determining your score on the 
SPS?” Responses to these questions were collected 
on 7-point bipolar scales. 

When the subject had completed the question- 
naire, the experiment was terminated and the experi- 
menter debriefed the subject. Special care was taken 
to ensure that the subject realized that the SPS was 


a A bogus test and that the scores were prearranged 
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TABLE 1 
MEANS FOR EACH OF THE DEPENDENT MEASURES 


' 
Outcome 


Success Failure 
Measures High Low High Low 
Difficulty 1.54 2.50 3.70 2.85 
Social perceptiveness 477 385 116 2.77 
Effort 4.85 4.93 3.24 4.31 
Luck 147 147 4.24 2,62 


Note. The higher the mean, the more difficult the task; the 
greater the role of social perceptiveness, the more the expended 
effort and the greater the importance of luck. (High and low 
refer to level of involvement.) 


and in no way reflected his (her) true level of social 
perceptiveness, 


Results 


Manipulation checks. After the subjects 
had completed the questionnaire they were 
asked by the experimenter to indicate how 
many correct answers they had on the SPS. 
All subjects correctly reported their alleged 
score. In addition, subjects were asked to in- 
dicate on a 7-point scale in the questionnaire 
how well they thought they had done on the 
SPS. An analysis of variance indicated that 
individuals felt they did much better in the 
success condition than in the failure condi- 
tion, F(1,48) = 28.80, p < .001. It appears, 
therefore, that the manipulation of the out- 
come valence was an effective one. 

Attributions of causal responsibility. The 
major dependent measures in the present ex- 
periment were the questions that asked sub- 
jects the extent to which various causal agents 
were responsible for their performance level 
on the SPS (see Table 1). 

A multivariate analysis was performed in 
which the subjects’ responses to the depen- 
dent measures were combined to yield a single 
F value (Overall & Klett, 1972), The multi- 
variate analysis revealed a significant main 
effect of success-failure, F(4, 45) = 19.67, p 
< .001, and a significant Success—Failure X 
High-Low Involvement interaction, ¥(4, 45) 
= 5.09, p < .002. The main effect for high- 
low involvement did not approach signifi- 
cance,* 


1It should be noted that a preliminary analysis 
was undertaken to determine if the attributional 
pattern was affected by the sex of the subject. No 
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Since the multivariate analysis revealed 
significant effects, univariate analyses of vari- 
ance were conducted on the subjects’ re- 
sponses to each of the dependent measures. 

Task difficulty, Consistent with predictions 
and previous research, subjects indicated that 
the SPS was more difficult when they failed 
than when they succeeded, F(1,48) = 6.13, 
p < .02. Though no main effect of involve- 
ment emerged (F < 1), a significant Outcome 
X Involvement interaction was found, F(1, 
48) = 5.31, p < .05. The pattern of results 
indicated that subjects felt that the task was 
more difficult under high than low involve- 
ment when they failed, but they believed the 
task was more difficult under low than high 
involvement when they were successful. Sim- 
ple effects analyses, however, indicated that 
neither of these differences reached signifi- 
cance. 

Social perceptiveness. On this measure, as 
was the case with task difficulty, a main ef- 
fect of outcome emerged, F(1, 48) = 61.34, 
p< 001, indicating that subjects thought 
their performance more accurately reflected 
the true degree of social perceptiveness they 
possessed when they succeeded than when 
they failed. The main effect of outcome, how- 
ever, was qualified by an Outcome X Involve- 
ment interaction, F (1, 48) = 17.95, p < .001. 
Successful subjects thought their performance 
more accurately reflected their ability under 
high than low involvement, t(48) = 2.44, p 
< .05, whereas failing subjects thought their 
performance more accurately reflected their 
ability under low than high involvement, 
t(48) = 4.15, p < Ol. 

Effort. Outcome valence also had a pro- 
nounced effect on perceived effort expendi- 
ture. Subjects indicated that they worked 
harder when they succeeded than when they 


failed, F (1,48) = 10.80, $ < .001. Although 
the Outcome X Valence interaction did not 
reach ii ynventional level of significance, 
F(1,4 10, p<.15, we find that in- 
volver not affect attribution under 
sex effects cn om this analysis, which is of 
interest as n to find sex differences in 
causal exp! performance on achieve- 


ment tasks (Feather . 
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success (ż < 1) but that subjects indicated 
they expended less effort under high-involve- 
ment failure than low-involvement failure, 
t(48) = 2.34, p < .05. 

Luck. Once again a main effect of outcome 
emerged, F(1,48) = 22.68, p< 001. Sub- 
jects indicated that luck was much more im- 
portant when they failed than when they 
succeeded. The results also produced an Out- 
come X Involvement interaction, F(1,48) = 
4.10, p <.05. Simple effects revealed that 
this interaction was due to the fact that in- 
volvement had no effect under success (¢ < 
1) but produced greater attributions of luck 
under the high-involvement failure condition 
than the low-involvement failure condition, 
t(48) = 2.76, p < .02. 


Discussion 


The results of the present study corrobo- E 


rate earlier findings, which indicated that in- 
dividuals take more personal responsibility 
for successful outcomes than for failing out- 
comes. More than the simple demonstration 
of attributional differences as a function of 


outcome valence, however, the present study , 


investigated the importance of ego involve- 
ment in the outcome, 

The results indicated that not only do dif- 
ferences between success and failure condi- 
tions emerge but these differences are greater 
under conditions that might be expected to 
produce high ego involvement than under ones 
leading to low ego involvement. It was ob- 
served that individuals engaged in more self- 
protective attributions under the high- than 
low-involvement failure conditions. Specifi- 
cally, high-involvement failure subjects at 
tributed more responsibility for their pet 
formance to luck and less to their ability an 
effort than did low-involvement failure su! 
jects. There was also some evidence to in 
cate that high-involvement success subje 
engaged in more self-enhancing attributions 
than did low-involvement success subje-is. 
That is, the former sub‘ects indicated that 
their performance was more affected by their 
ability than did the latter subjects. 

It is interesting that involvement had 
greater impact upon attributions under the 
failure condition than under success. Perhaps 


> it is more difficult to manipulate feelings of 
ego involvement when one succeeds than when 
one fails. Even success on an unvalidated, 
unimportant task may provide an opportunity 
for self-gratification and self-enhancement. On 
the other hand, failure on an unvalidated, un- 
important task does not appear to be nearly 
as threatening to the individual as failure on 
a much more important task. 

The present study appears to have obviated 
the interpretational problems that Miller and 
Ross (1975) argued beset previous experi- 
ments. Neither the phenomenon of greater 
expectancy of success than failure nor the 
misconstrual of contingency posited by Mil- 
ler and Ross as plausible explanations for the 
success-failure difference appears able to ex- 
plain the effect of involvement. Under cer- 
tain circumstances it may be possible that 
. information-processing differences could me- 
diate the impact of involvement. For instance, 
individuals may approach a high-involvement 
activity with different expectations or moti- 
vations than they do a low-involvement ac- 
tivity. In the present experiment, however, 
the involvement manipulation did not occur 
until after the task had been completed. 


Therefore, the task experience for all sub- 
` jects was equivalent. 


Although the perception that external fac- 
tors played a larger part in one’s failure than 
in one’s success may not necessarily reflect 
motivational distortion, the perception that 
external factors played a larger role under 
high- than low-involvement failure would cer- 
tainly seem to indicate a motivationally based 

* attributional distortion. Assuming that the 
present results and earlier studies do reflect 
self-serving attributional biases, why is it 
that the phenomenon is not found in all con- 
texts (Miller & Ross, 1975) and is sometimes 

4 reversed (Ross, Bierbrauer, & Polly, 1974)? 
A lengthy discussion of this issue is clearly 
beyond the scope of the present paper, but a 
few possibilities may be discussed briefly. 
First, doing well in all activities may not be 
equally important. For the majority of indi- 
viduals, a skill like mathematical ability is 
not something they feel they have in abun- 
dance and have adjusted their self-concept 
accordingly. Social perceptiveness on the 


t 


EGO INVOLVEMENT 


905 


other hand, like a sense of humor, is some- 
thing most of us feel we have more than our 
share of. The importance of the task or the 
activity to us, therefore, may be of critical 
importance in determining whether self- 
serving attributional biases will be evoked. 

One study that provided data relevant to 
this hypothesis was reported by Ross et al. 
(1974). The procedure involved having sub- 
jects (undergraduate or experienced school 
teachers) either successfully or unsuccess- 
fully instruct an 11-year-old on the correct 
spelling of commonly misspelled words, The 
results indicated that the teachers were in- 
clined to take more responsibility for failure 
than the undergraduate subjects. Since the 
teachers were presumably more ego involved 
in their teaching competence, this result is 
quite surprising. It is possible, however, that 
the teachers may have had a stronger self- 
concept and found it easier to endure a 
single failure experience, A second possibility 
in the Ross et al. study and other similar 
studies is that other norms and motives are 
operative. In our society, pressure often exists 
to take responsibility for our mistakes and 
failures. To deny personal responsibility for 
failure is often seen as immature and detri- 
mental to a positive public image (Jones & 
Wortman, 1973). It may be the case, there- 
fore, that one occasionally best serves him- 
self by taking public responsibility for his 
failure. This is likely to be particularly the 
case when the individual feels secure about 
the ability in question or when he expects 
that others may anticipate “defensive” be- 
havior from him. 

Future research must examine more closely 
the prevalence and the dynamics of self- 
serving biases with respect to causal attribu- 
tions for one’s own behavior as well as for 
the behavior or character of others. 
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Relationship of Negative Mood State and Helping 
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A study was conducted to provide a means for reconciliation of the conflict- 
ing data on the relationship of negative mood state to altruism, Whereas 
some studies have shown that negative mood leads to increases in altruistic 
action, others have shown the reverse. It was hypothesized that the incon- 
sistency of these results was due to differences in the ages and consequent 
levels of socialization of the subjects employed in the earlier studies. In order 
to test the hypothesis, subjects from three age groups (6-8, 10-12, and 15-18 
years old) were asked to think of either depressing or neutral events and 
were subsequently given the opportunity to be privately generous. Consistent 
with predictions from the negative state relief model of altruism, the youngest, 
least socialized subjects were somewhat less generous in the negative mood con- 
dition, but this relationship progressively reversed itself until in the oldest, 
most socialized group, the negative mood subjects were significantly more gen- 
erous than neutral mood controls. The data were taken as support for a 
hedonistic conception of altruism that views adult benevolence as self-grati- 
fication. It is suggested that the reward character of benevolence derives from 


the socialization experience, 


Both social and developmental psycholo- 
gists have recently shown an increasing in- 
terest in the relationship between one’s mood 
state and the tendency to be helpful. For 
example, Krebs (1970), in his review of the 
altruism literature, was able to cite 16 sepa- 
rate experiments examining this relationship, 
with the first appearing just 4 years prior to 
his paper. At present, the size of such a list 
would be more than tripled. The results of 
these investigations have gained importance in 
that at least two sets of workers in this area 
have used the data to make theoretical state- 
> ments about the nature of altruism (Cialdini, 
Darby, & Vincent, 1973; Rosenhan, Under- 
wood, & Moore, 1974). 

Cialdini et al. (1973) have argued that the 
data from studies of mood state and helping 
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suggest a conception of altruism as hedonism, 
It is contended that benevolence is a rein- 
forcing event not only for its recipient but for 
the benefactor as well. Cialdini et al. (1973) 
have also argued from their data and the 
results of other studies that a U-shaped rela- 
tionship exists between mood state and altru- 
ism with elationlike and depressionlike states 
leading to high levels of altruistic action and 
neutral affective states leading to the least 
such activity. Rosenhan and his co-workers 
(e.g, Rosenhan et al., 1974) have argued 
from the perspective of their data that only 
positive affective states lead to more helping 
than neutral states. These researchers have 
typically found negative mood to impair help- 
ing responses to various degrees. Thus, the 
form of the mood-helping relationship sug- 
gested by Rosenhan and his colleagues is es- 
sentially a linear one, with negative affect 
promoting the least benevolence. 

There is no inconsistency between the two 
proposed models in the instance of positive 
mood, and the data strongly show that over 
a variety of populations and procedures, posi- 
tive mood-inducing operations have led to 
enhanced altruistic activity. Task success 
(e.g., Berkowitz & Connor, 1966; Isen, 1970, 
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Study 3; Kazdin & Bryan, 1971), empathy 
leading to positive mood (Aderman & Berko- 
witz, 1970), humor (Kidd & Berkowitz, 
1976), the receipt of unexpected money (Isen 
& Levin, 1972), or gifts (Isen, Clark, & 
Schwartz, 1976; Isen & Levin, 1972), think- 
ing happy thoughts (Moore, Underwood, & 
Rosenhan, 1973; Rosenhan, Underwood, & 
Moore, 1974)—all have been associated with 
increases in helping behavior.' The evidence 
concerning the relationship between negative 
mood and altruism, however, is considerably 
more equivocal, 

The body of findings supporting the notion 
that negative mood induction procedures fa- 
cilitate altruism is varied and impressive. 
Greater helping has resulted from empathy 
leading to negative moods (Aderman & Berk- 
owitz, 1970), embarrassment (Apsler, 1975), 
receipt of an insult (Steele, 1975), receipt of 
harm (Greenglass, 1969), receipt of a deviant 
score on a personality test battery (Filter & 
Gross, 1975; Freedman & Doob, 1968), cog- 
nitive discomfort (Kidd & Berkowitz, 1976), 
perpetration of another’s harm (e.g., Carl- 
smith & Gross, 1969; Freedman, Wallington, 
& Bless, 1967; McMillen, 1971), and obser- 
vation of another’s harm (Cialdini et al., 
1973; Konecni, 1972; Rawlings, 1968). Cial- 
„dini et al. (1973) sought to explain the in- 
fluence of such procedures upon helping via a 
negative state relief model of altruism. They 
suggested that subjects exposed to operations 
of this sort would incur temporary, negative 
mood states and would seek to relieve these 
states through altruistic activity.? According to 
this account, altruism would serve to alleviate 
a negative affective state because it is a per- 
sonally gratifying and thus, mood-enhancing 
experience for the altruist. Just like any other 
opportunity for mood enhancement, then, the 
opportunity for benevolence would be seized 
by one whose mood was temporarily de- 
pressed. In order to test this negative state 
relie pothesis, Cialdini et al. predicted 
th iving another’s harm, which had 
been to produce noxious affect in the 
, Boyer, Lombardo, & Stich, 

se subjects to attempt to 

e negativity via altruism, 


unless tl had already been re- 


relieve 
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moved through some other mood-enhancing 
event. In confirmation of their prediction, it 
was found that either causing or witnessing 
harm increased helping behavior, except 
among those subjects who received a gratify- 
ing event (unexpected money or praise) be- 
tween the occurrence of harm and the chance 
to help. These results were taken as support 
for the negative state relief model. 

Despite the above evidence, there is a sub- 
stantial amount of data that seem to show an 
inverse relationship between altruistic activ- 
ity and mood-depressing operations. Isen, 
Horn, and Rosenhan (1973) showed in two 
experiments (Studies 1 and 3) that fourth- 
grade children who had been made to fail pri- 
vately on a bowling game task donated to 
charity less, though not significantly less, than 
a control group of children who performed on 
the task but received no score. Employing a 
different mood induction procedure, Moore et 
al. (1973) also found negative mood to inhibit 
helping responses in second- and third-grade 
children, Their subjects who were asked to 
think about sad events were significantly less 
likely to share than were neutral affect con- 
trol subjects. Finally, Rosenhan et al. (1974), 
using similar subjects and procedures, repli- 
cated the shape of the Moore et al. (1973) 
sharing data. 

One outstanding difference between the two 
conflicting sets of studies that we have re- 
viewed is the nature of their subject popula- 
tions. Those results consistent with the view 
that negative affect enhances altruism have 
been obtained with adult subjects while those 
consistent with the opposing view have been 
obtained with young children. Age has been 
shown to be an important determinant of al- 
truistic action, with benevolence increasing 


1 Several studies that have employed mood induc- 
tion procedures but have not employed a no-treat- 
ment or neutral mood control group (e.g., Aderman, 
1972; Isen, 1970, Studies 1 and 2) will not be con- 
sidered in this article. The lack of such a control 
condition precludes the informativeness of their re- 
sults for the questions addressed here. 

2 Cialdini et al. (1973) characterized these nega- 
tive states as depression or sorrow and differentiated 
them from frustration or anger which are standardly 
reduced via aggressive action and thus would not be 
expected to enhance helping behavior. 
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over age levels (cf. Bryan & London, 1970 
for a review). It may be possible to reconcile 
the discrepant findings of this area in terms 
of this age factor, The negative state relief 
model contends that an individual in a state 
of temporary depression tends toward helping 
action in an attempt to relieve that negative 
mood because altruistic behavior is personally 
gratifying. However, there is good evidence 
indicating that the reward character of al- 
truism is conditioned through the socialization 
process (Aronfreed, 1968). That is, the social- 
ization experience provides for the systematic 
pairing of helping responses with approval and 
other positive outcomes. Such a consistent 
association can be expected to cause the help- 
ing act itself to acquire a secondary reinforc- 
ing value. Thus, the negative state relief 
model would suggest that only for adults, who 


~ have completed the socialization experience, 


would negative affect be expected to lead to 
benevolence. Young children, who have not 
yet internalized the reward value of altruism 
would not be expected to find benevolence re- 
warding and, consequently, would not be ex- 
pected to employ it as a gratifier when they 
are depressed. In fact, since pure altruism 
for an unsocialized individual involves only 
the giving away of gratifiers, it might well 
be expected that negative moods would inhibit 
such action. Following from such an argu- 
ment is the prediction that the relationship 
between negative mood and altruism will 
change as a child undergoes socialization. It 
has already been demonstrated (e.g., Under- 
wood, Moore, & Rosenhan, 1973; Rosenhan 
et al., 1974) that even among the young, 
negative mood leads to a strong tendency for 
self-gratification, If, as we have argued, the 
socialization process makes altruism self-re- 
warding, it should be expected that for a 
young unsocialized child, negative affect 
would tend to decrease altruism; but, with 
age, the relationship between depressed mood 
and altruism would become increasingly posi- 
tive until depressed mood significantly en- 
hanced altruistic behavior, as it has been 
shown to do with other self-gratifying behav- 
iors. In order to test this hypothesis, an ex- 
periment was conducted to examine the ef- 
fects of a negative mood-inducing operation 
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upon the charitable behavior of subjects from 
three distinct age groups (6-8, 10-12, and 
15-18 years old). 


METHOD 
Subjects 


Subjects were 100 children and young adults of 
both sexes from an elementary school and a nearby 
high school, attended by lower to lower-middle so- 
cioeconomic status students. 


Procedure 


Subjects were taken individually from a classroom 
or study hall by a male experimenter and brought 
to one of two adjoining rooms in the school. The 
room contained a table and chairs, books, a tape re- 
corder, and a sealed box labeled “Coupons for the 
other students.” The experimenter informed subjects 
that they would be participating in two separate 
tasks, a “hearing task” to be performed initially and 
an “imagination task” to take place subsequently. 
Subjects were also told that as payment they would 
receive a number of coupons that could be ex- 
changed for a prize at the end of the week. The 
nature of the available prizes was not specified, but 
each subject was told, “The more coupons you have, 
the better the prize you can get with them.” At this 
point, the experimenter directed subjects’ attention 
to the sealed box and explained that they could use 
the box to give some of their coupon winnings to 
other students who would not participate in the 
experiments. Specifically, the experimenter said, 


Not all of the students at the school will be able 
to help us with these tasks and, so, they won’t 
get a chance to earn any coupons or get any 
prizes. But when you've finished, you can share 
your coupons with those students by putting some 
in this box here [the experimenter pointed to a 
slot atop the box] if you want to. You don’t 
have to share any if you don’t want to, Just do 
what you want, 


The experimenter then administered the hearing 
test in which pairs of tones were played on the tape 
recorder and the subject was asked to indicate which 
tone of each pair was louder. At the completion of 
the test, the experimenter indicated that the subject 
had done as well as the average student on the test 
and so had earned 10 coupons. The experimenter 
then left and a second experimenter, always the same 
sex as the subject, entered to begin the imagination 
task. Subjects were told that the second task would 
involve trying to imagine certain things, Each was 
given an additional 5 coupons as payment for par- 
ticipation before the task began. 

Mood manipulation. The imagination task em- 
bodied the affect manipulation of the present study. 
Half of the subjects (negative mood) were asked in 
a manner identical to Moore et al. (1973) to gen- 
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NEGATIVE 
‘MOOD 


NEUTRAL 
MOOD 


57 10-12 


GRADE LEVEL 


Ficure 1. Mean number of coupons contributed by subjects at each grade level. (ns per cell 
for neutral and negative mood, respectively, were as follows. Grades 1-3: 28, 25; Grades 5-7; 17, 


16; Grades 10-12: 7, 7.) 


erate and reminisce about sad experiences.* The 
other subjects (neutral mood) were asked to look at 
a chair in the room with a book on it, close their 
eyes, and try to imagine the book and then the 
chair. In each mood condition, the time elapsed was 
similar and the experimenter provided equivalent 
verbal interaction with the subjects concerning their 
imagination tasks. 

Helping opportunity. At the end of the mood 
manipulation, the second experimenter left, and the 
first experimenter, blind to the subject’s mood condi- 
tion, returned to the room. The first experimenter 
indicated that the tasks were done, thanked the 
subject, and said that there would be a short delay 
before the subject could be accompanied back to his 
or her room. The experimenter then left the subject 
alone for 90 sec after again calling attention to the 
“Coupons for other students” box and reminding the 
subject of the option to share coupons with nonpar- 
ticipating students. After 90 sec had elapsed, the sec- 
ond experimenter returned and walked with subjects 
back to their classroom or study hall. Subjects in the 
negative mood condition invariably reported feeling 
“bad,” “sad,” etc., in postexperimental questioning, 
so for ethical reasons, subjects in this condition were 
asked to imagine happy events during the walk. The 
experimenter also asked that subjects not discuss the 
tests with any of their classmates until the testing 
was complete at the week’s end. 

Independent variables. Three factors were em- 
ployed: mood (negative and neutral), grade in school 
(Grades 1-3, Grades 5-7, and Grades 10-12), and 
sex of subject. The design was thus a 2 X 3 X 2 com- 
plete factorial. 

Dependent variable. The dependent measure was 
the number of coupons a subject placed in the 


sealed, sharing box during the 90-sec period when no 
experimenter was in the room. 


RESULTS 


An analysis of variance employing all three 
independent variables as factors was per- 
formed on the coupon sharing data. The sex 
of subject factor produced no significant main 
or interaction effects; consequently, all results 
are presented collapsed over sex. Of the theo- 
retically relevant factors, only grade in school 
produced a reliable main effect, F(2,88) = 
9.52, p < .001. Consistent with a substantial 
body of previous research (see Bryan & Lon- 
don, 1970, for a review), generosity was 
found to increase with age; the mean number 
of coupons shared was .53 for our youngest 
group, 1.97 for the intermediate group, and 


The mood induction technique is based upon 
procedure developed by Mischel, Ebbesen, and Zeiss 
(1972) for use with children. Besides its use in stud- 
ies of children’s altruistic responding (Moore et 
1973; Rosenhan et al., 1974), it has also been shown 
to be effective in producing differences in children's 
self-gratification activity (Mischel et al., 1972; Ros- 
enhan et al., 1974; Underwood et al., 1973). 

4The data of two subjects who gave coupons 
while an experimenter was in the room were dis- 
carded on the possibility that the action constituted 
approval-seeking behavior rather than generosity. 
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2.43 for the oldest group. None of the interac- 
tion effects of the design proved significant, 
except for the predicted Mood x Grade in 
School effect, F (2,88) = 4.61, p < .012. The 
means contributing to that interaction are 
presented in Figure 1. 

As Figure 1 shows, under negative mood, 
generosity was somewhat suppressed for first- 
third graders, was slightly enhanced for fifth- 
seventh graders, and was substantially en- 
hanced for tenth-twelfth graders. Simple com- 
parisons between the negative and neutral 
mood subjects within each of the grade in 
school conditions showed no significant dif- 
ference for either first-third graders, t(88) 
= .58, or fifth-seventh graders, ¢(88) = .79, 
However, there was a highly significant differ- 
ence between the negative and neutral con- 
ditions for our high school subjects, ¢(88) = 


2.96, p < OL. 


In sum, the data supported this study’s 
major prediction that the relationship between 
negative mood state and altruism would 
change from an inverse to a direct one with 
increases in the ages of subjects. Our data 
for first-third graders showed the same ten- 
dency of negative affect to reduce altruism 


“that has been found repeatedly by Rosenhan 


and his co-workers (e.g., Moore et al., 1973; 
Rosenhan et al., 1974), who used subjects of 
comparable age. However, the relationship 
progressively reversed itself among the older 
subjects until among high school students 
negative mood significantly enhanced helping. 
This significant enhancement effect fits well 
with the evidence that among adult popula- 


a tions negative affective states have reliably 


led to increased altruism. 


Discussion 


The results of the present study appear to 
provide the means for a parsimonious recon- 
ciliation of the seemingly contradictory data 
concerning negative affect and altruism. By 
varying the age and consequent degree of 
socialization of our subjects, it was possible to 
obtain the conflicting forms of the negative 
mood-helping relationship that had been found 
in previous work. Consistent with the findings 
of Rosenhan and his associates, who used 
young children as subjects, procedures de- 
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signed to produce negative mood tended to 
Suppress generosity in our first-third graders; 
and consistent with the results of Cialdini et 
al. (1973), who employed adult subjects, such 
procedures enhanced generosity in our high 
school students. 

The question of why the relationship be- 
tween negative mood and altruism reversed 
itself with the age of our subjects is an in- 
teresting one that has implications for a con- 
ception of the nature of altruism. It is our 
contention that this effect is mediated by the 
socialization experience, which works to give 
altruism reward status. Thus, as altruism 
acquires a gratifying function, it is increas- 
ingly employed by individuals in a negative 
affective state. There are several ways by 
which the socialization process arranges for 
the consistent pairing of altruistic activity and 
positive outcomes for the altruist, thereby 
lending a conditioned reinforcing character to 
altruism. The first is that the primary agents 
of socialization (e.g., parents and teachers) 
positively sanction acts of selflessness and 
negatively sanction acts of selfishness in the 
young. Berkowitz and his co-workers (e.g., 
Berkowitz & Daniels, 1963, 1964; Goranson 
& Berkowitz, 1966) have amassed consider- 
able evidence for the workings of a societal 
norm of social responsibility that prescribes 
assistance to others in need. Children in a 
culture that subscribes to such a norm expe- 
rience approval for the emission of helping 
responses and disapproval for the withholding 
of such responses. A second means by which 
aiding responses may acquire secondary rein- 
forcing value can be seen in the work of 
Aronfreed (1968). Aronfreed, in collaboration 
with Paskal (Aronfreed, 1968), has argued 
that the distress of a suffering other and the 
joy or relief of another whose suffering has 
been terminated is empathically experienced 
by an observer. Thus, aid for another in need 
not only removes a negative affective state 
and produces a positive affective state in the 
recipient but in the helper as well, Finally, 
the reciprocation-of-favors relationship may 
be another way in which assistance becomes 
paired with reward during the enculturation 
process. Gouldner (1960) has maintained that 
a rule for reciprocity of benefits exists in all 
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societies such that a recipient of aid is obli- 
gated to return it. It has been repeatedly 
demonstrated (e.g., Goranson & Berkowitz, 
1966; Pruitt, 1968; Regan, 1971) that the 
receipt of a favor is highly related to the per- 
formance of an earlier favor. The perform- 
ance of prosocial acts, then, is likely to have 
a history of association with reward in the 
form of reciprocated favors. 

Weiss, Buchanan, Alstatt, and Lombardo 
(1971) have provided independent evidence 
for the assertion that by the end of the so- 
cialization process, benevolence will have be- 
come its own reward. They showed that for 
college students the opportunity to be helpful 
to another produced task behavior that was 
identical in pattern to that of the opportunity 
to receive standard forms of reward. The op- 
portunity to be of aid to someone in need 
not only resulted in learning on an instru- 
mental task but also led to the characteristic 
delay of reinforcement and partial reinforce- 
ment effects that typically occur with conven- 
tional, nonaltruistic reinforcers. There is addi- 
tional data as well that altruism is gratifying 
for adults. Pisarowicz (Note 1) has demon- 
strated that increasing the altruistic nature 
of a task performance led to a greater sense of 
satisfaction and to a lessened tendency for 
adult subjects to avail themselves of direct 
self-gratification (prize tokens) after task per- 
formance. It seems, then, that altruistic activ- 
ity tended to satisfy these adults’ desire for 
self-reward. A similar study by Masters and 
Pisarowicz (1975) found no such simple rela- 
tionship with second-grade subjects, 

In light of such evidence, the data of the 
present study seem best interpreted within 
the negative state relief framework which 
contends that negative moods lead to a ten- 
dency to self-gratify via altruism. From our 
results, it appears that the direct relationship 
that has been shown to exist between nega- 

tive mood and self-reward is paralleled by a 
comparable relationship between negative 
altruism only among adults, 
is rewarding (Weiss et al., 


mood and priva 
for whom alt 
1971). 
Although 
well with mos 


ta we have presented fit 
prior research on nega- 


tive mood and there is one area of 
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discrepancy. Failure, which has been assumed 
to be a mood-lowering event, has not been 
shown to increase helping in adults (Berko- 
witz & Connor, 1966; Isen, 1970). These 
authors have found that experimentally in- 
duced success increased helping behavior but 
experimentally induced failure had no effect. 
An explanation for this deviation from the 
traditional pattern of adult altruism under 
conditions of negative mood may be found in 
a study by Mischel, Ebbesen, and Zeiss 
(1973). They demonstrated that success in- 
creased self-gratification but failure had no 
effect upon self-gratification among their col- 
lege age subjects. Mischel et al. argued that 
failure was not an effective manipulation for 
their subjects on the basis of evidence indi- 
cating that subjects do not attribute their 
failures on experimental tasks to internal, per- 
sonal causes; rather, they attribute such fail- 
ures to the action of external factors like 
chance and luck (e.g., Fitch, 1970; Frieze & 
Weiner, 1971). Subsequent work (e.g., Wolo- 
sin, Sherman, & Till, 1973; Wortman, Cos- 
tanzo, & Witt, 1973) has further confirmed 
that subjects attribute failure externally (al- 
though they do make internal attributions af- 
ter successful performances). Such data sug- 
gest that failure on an experimental task is 
not viewed by a subject as personally infor- 
mative and hence is unlikely to influence 
mood state. It is not surprising and it is con- 
sistent with the perspective of this article, 
then, that failure has been shown to be iden- 
tically ineffective in affecting altruism and 
self-gratification among adults. 

The conception of altruism as having re- 
ward value that is conditioned through the 
enculturation process aligns well with Camp- 
bell’s recent position (Campbell, 1972, 1975) 
concerning the nature and origin of self-sac- 
rificial behaviors. Campbell has reversed his 
earlier assertion that altruistic action in hu- 
mans is genetically selected for (Campbell, 
1965) and now argues that dispositions 
toward self-sacrifice are products of social 
indoctrination rather than genetic selection, 
This indoctrination process, embodied in the 
socialization experience, appears to provide 
selflessness with reinforcer status and thus 
renders understandable benevolent acts that 


have no likelihood of producing direct, ex- 
trinsic rewards (e.g., money, social approval) 
for their performance. 

REFERENCE NOTES 

1. Pisarowiez, J. A. Selj-reinforcement following 
altruistic behavior. Paper presented at the meeting 


of the Rocky Mountain Psychological Associa- 
tion, Phoenix, May 1976. 
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societies such that a recipient of aid is obli- 
gated to return it. It has been repeatedly 
demonstrated (e.g., Goranson & Berkowitz, 
1966; Pruitt, 1968; Regan, 1971) that the 
receipt of a favor is highly related to the per- 
formance of an earlier favor. The perform- 
ance of prosocial acts, then, is likely to have 
a history of association with reward in the 
form of reciprocated favors. 

Weiss, Buchanan, Alstatt, and Lombardo 
(1971) have provided independent evidence 
for the assertion that by the end of the so- 
cialization process, benevolence will have be- 
come its own reward. They showed that for 
college students the opportunity to be helpful 
to another produced task behavior that was 
identical in pattern to that of the opportunity 
to receive standard forms of reward. The op- 
portunity to be of aid to someone in need 
not only resulted in learning on an instru- 
mental task but also led to the characteristic 
delay of reinforcement and partial reinforce- 
ment effects that typically occur with conven- 
tional, nonaltruistic reinforcers. There is addi- 
tional data as well that altruism is gratifying 
for adults. Pisarowicz (Note 1) has demon- 
strated that increasing the altruistic nature 
of a task performance led to a greater sense of 
satisfaction and to a lessened tendency for 
adult subjects to avail themselves of direct 
self-gratification (prize tokens) after task per- 
formance. It seems, then, that altruistic activ- 
ity tended to satisfy these adults’ desire for 
self-reward. A similar study by Masters and 
Pisarowicz (1975) found no such simple rela- 
tionship with second-grade subjects. 

In light of such evidence, the data of the 
present study seem best interpreted within 
the negative state relief framework which 
contends that negative moods lead to a ten- 
dency to self-gratify via altruism. From our 
results, it appears that the direct relationship 
that has been shown to exist between nega- 
tive mood and self-reward is paralleled by a 
relationship between negative 
altruism only among adults, 

is rewarding (Weiss et al., 
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A two-phase field experiment was conducted with lower socioeconomic pre- 
school children to test the overjustification effect, which proposes that in- 
trinsic motivation will decrease with the addition of contingent, external 
rewards which alone are sufficient to justify performance. Children showing 
high intrinsic motivation on the target activity were exposed to one of six 
conditions. In the three experimental conditions the child received either 
money, an award, or positive verbal reinforcement for his performance on 
the target activity. Comparison groups controlled for time or history, the 
presence of the experimenter, and the personal attention given the child by 
the experimenter, Money and awards, expected to be perceived as sufficient to 
justify performance, reduced subsequent intrinsic motivation during a free-play 
period. Positive verbal reinforcement, predicted to be insufficient to justify 
performance, resulted in increased intrinsic motivation. Intrinsic motivation 
in control groups for time and presence of the experimenter did not change, 
Unexpectedly, a large decline in intrinsic motivation occurred in a control 
group where the child was ignored. This effect was replicated and discussed 


in terms of attribution and learning theory. 


In our everyday lives, we talk of play be- 
coming work, But, commonsense statements 
are often unspecified and contradictory. Who 
would have thought that play could be turned 


“ into work by rewarding people for doing what 


they like to do? The addition of external re- 
wards has been found to decrease intrinsic 
motivation in the form of less liking for the 
activity (Kruglanski, Alon, & Lewis, 1972; 
Kruglanski, Friedman, & Zeevi, 1971) and 
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reduced time performing that activity (Deci, 
1971, 1972) because the individual constantly 
updates his attributions (Bem & McConnell, 
1970). By overjustifying performance, the 
adding of an external reward shifts the at- 
tribution about performance from intrinsic 
to extrinsic motivation. This effect, known as 
the overjustification effect, has now been lim- 
ited to rewards that are expected or contin- 
gent upon performance (Lepper, Greene, & 
Nisbett, 1973), salient (Ross, 1975) and 
irrelevant to the activity’s content (Krug- 
lanski, Riter, Amitai, Margolin, Shabtai, & 
Zaksh, 1975). 

The overjustification effect has been dem- 
onstrated using a wide variety of extrinsic 
rewards, including prizes (Kruglanski et al., 
1972; Ross, 1975), awards (Lepper et al., 
1973), money (Calder & Staw, 1975; Deci, 
1971, 1972; Kruglanski et al., 1975), food 
(Ross, 1975), and special activities (Krug- 
lanski et al., 1971; Lepper & Greene, 1975), 
thus expanding the generalizability of the 
finding. However, an external reward in the 
form of positive verbal reinforcement tended 
to increase subsequent intrinsic motivation 
(Deci, 1972; Deci, Cascio, & Krusell, 1975). 
Unfortunately, no one has yet incorporated a 
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wide variety of rewards into one design with 
appropriate controls. 

The varying effects of these external re- 
wards can be explained by Bem’s (1967) use 
of Skinner’s mand-tact distinction. Tangible 
rewards such as money, awards, or special 
treats may alone motivate behavior. The 
presence of such rewards when performing an 
activity may lead to the perception of the 
reward as payment and, hence, the activity is 
controlled or manded by the reward. Further- 
more, money may have more manding prop- 
erties than awards since it is a more direct 
and unambiguous payment for performance. 
In contrast, verbal reinforcement has more 
tact properties because verbal reinforcement 
is not ordinarily seen as controlling one’s 
behavior. The activity is seen as under the 
discriminative control of the activity itself 
and, if performed, it must be a positive ex- 
perience. Moreover, positive verbal reinforce- 
ment lends the activity additional positive 
associations. Therefore, subsequent intrinsic 
motivation will increase because of the en- 
hanced perception of the task as a pleasant 
one. 

This field experiment attempted to extend 
and specify the generalizability of the over- 
justification effect by using a wide variety of 
external rewards and a population of predomi- 
nately lower socioeconomic and Afro-Ameri- 
can preschool children. Previous studies with 
children (Lepper et al., 1973; Lepper & 
Greene, 1975) have used nursery school chil- 
dren, mostly Caucasian, from middle- to up- 
per-class homes. In Experiment 1 children 
were asked to perform an intrinsically moti- 
vated activity (a) while expecting money or 
- an award, (b) while receiving positive verbal 
reinforcement, or (c) while simply in the 
presence of an experimenter (control group). 
Subsequent intrinsic motivation, measured by 
the amount of time performing the target ac- 
tivity, was expected to decrease after receiv- 
ing money or an award, increase after receiv- 
ing positive verbal reinforcement and remain 
unchanged in the control condition. 


EXPERIMENT 1 
Method 


Subjects and experimental setting. The subjects in 
this experiment were 4- and 5-year-old children who 


R. ANDERSON, S. MANOOGIAN, AND J. REZNICK 


were enrolled in the kindergarten program 

by the Child Development Program in V 
Salem, North Carolina, in the spring of 197 
separate facilities participated in the study. | 
one children were given pretest, posttest, and 
ment; however, nine children were dropped fr: 
data analysis. Of these nine, one was reinforc 
a teacher during the pretest and eight did not 
plete the treatment. Of the eight who did not 
plete the treatment, five asked to leave (usus 
go to the bathroom), two had to leave to get S 
home, and one quit drawing (according to a 


priori criterion described below.) Of these eight lo 


dren, one was in the money condition, two we n 
the symbolic-reward condition, three were in 
verbal-reinforcement condition, and two were in 
control group. Most of the final sample of 72 
dren, 27 males and 45 females, had participated 
previous research projects, but none were motiy 
tional studies. Most of the children were Afr 
American and from low socioeconomic families. 

Experimental activities and materials. The targ: 
activity selected was free-style drawing with multi 
colored, felt-tipped pens. This activity was selected 
for several reasons. First, pretesting had revealed 
that it was intrinsically motivating to this subject 
population. Second, it was amenable to the required 
three time-period design without problems such as 
mastery of the task. Moreover, it provided some 
comparability with the results obtained by Lepper 
et al. (1973). 

Each child was provided with a set of 10 different 
colored felt-tipped pens and a numbered T-shirt for 
their temporary use during the experiment. The T- 
shirts were numbered to facilitate the observers’ 
tracking of the children as they roamed freely about 
the room. 

Experimental design. The three reward conditions 
in this study were money, symbolic reward, and 
positive verbal reinforcement. Of the eight classes 
that participated, four had enrollments greater than 
10 and four had enrollments less than 10. There- 
fore, the four large and small classes were separately 
and randomly assigned to the three treatments and 
one control group to ensure comparable cell size for 
this quasi-experimental design. A pretest/treatment/ y 
posttest paradigm was employed, thus yielding a 
2X4 (Pretest/Posttest X Conditions) design in 
which pretest and posttest measures of intrinsic mo- 
tivation were taken for each of four conditions. In 
both the pretest and posttest, intrinsic motivatior 
was deñned as the amount of time spent on th 
target activity of drawing. 

Procedure. The experimenter was the second 2:- 
thor, a male Caucasian graduate student. For ‘he 
initial pretest observation, he introduced himself 2nd 
told the children that he had brought something 
for them to play with. Once the children were seated 
he dispensed the numbered T-shirts. They were told 
that the T-shirts would help keep them clean. Then, 
each child was given a packet of 10 felt-tipped pens 
and a sheet of 8) X 11-inch (21.59 X 27.94-cm) 
paper. They were told that the experimenter was | 
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“interested in how boys and girls draw pictures.” 
After instructing the children in how to remove the 
caps from the markers and draw a colored line, the 
children were told to “go ahead and draw any kind 
of picture that you want,” A 2-minute period was 
given for the children to acquaint themselves with 
the materials. This also allowed the experimenter to 
check that all of the children were able to remove 
the caps and to be sure that the teachers had left 
the classroom. The experimenter placed additional 
paper in the center of each table where the children 
could reach it. At the end of the 2-minute period 
the experimenter said, “You can draw as many 
tures as you want and there is plenty of paper. 
to write the number that is on your T-shirt on 
each picture that you draw. If you get tired and 
want to do something else in the room, you can.” 

At this point, the two observers who had entered 
with the experimenter began their observation of the 
children’s drawing behavior. The observers remained 
seated, t, and as unobtrusive as possible 
throughout the session. The observations involved a 
time-sampling procedure. At the start of each min- 
ute, the observers marked whether each child was 
drawing or not. A child was defined as drawing if 
he had a marker in his hand, was within an arm’s 
reach of his Paper, and was not doing something 
incompatible with drawing (e.g., hitting someone). 
Thus, each child’s behavior was sampled once per 
minute for 30 minutes, 

At 5-minute intervals, the experimenter re- 
peated, “There is plenty of paper in the middle of 
the table if you want it. Try to write your number 
on each picture. If you get tired and want to do 
something else in the room, you can.” During the 
testing, the experimenter refrained from all possible 
interaction with the children. Questions were an- 
swered with a short “yes” or “no,” and future ques- 
tions were discouraged by the experimenter turning 
away from the children after responding and by 
avoiding eye contact throughout the session. As each 
child completed a picture, his paper was collected to 
avoid cluttering the table. At the end of 30 minutes, 
the experimenter announced that the time was up, 
collected the materials, and left with the observers, 

The .experimental treatments took place the fol- 
lowing week when the experimenter returned to the 
classroom and individually asked each child if he 
would like to come to the “surprise room” with him.” 
The child was led to the experimental room and 
seated at a table containing a packet of felt-tipped 
pens and a sheaf of paper. Up to this point, all chil- 
dren had been treated identically. 

Children in the symbolic-reward group were toid 
the following by the experimenter: 


I am interested in how boys and girls draw pic- 
tures and Pd like you to draw some pictures. If 
you will draw some pictures, T'I] give you a good 
player award like this [the experimenter displayed 
the good player award to the child] with your 
name and center on it to put on the good player 
board [the experimenter pointed to a standard 


O17 


display]. There is plenty of paper if you want it. 
Go ahead and draw whatever you’d like while I 
do some work that I have to do. 


After 8 minutes, the child was told that time was up 
and that “you have earned a good player award,” 
The experimenter wrote the child’s name and day- 
care center on a 3 X 5-inch (7.62 X 12.70-cm) note 
card adorned with a large gold star and a red rib- 
bon. The child chose where his good player award 
was placed on the good player board, a 2 X 3-foot 
(30.48 X 91.44-cm) poster board with four perma- 
nently attached awards. Once this was completed, 
the child was returned to his room. 

Children in the positive verbal-reinforcement and 
money conditions received the same general instruc- 
tions as children in the symbolic-reward group, 
However, in the verbal-reinforcement condition, after 
each 2-minute interval, the experimenter looked over 
and delivered one of the following verbally reinforc- 
ing statements: “That picture is real nice,” “You are 
pretty good at this,” “I like the way you are draw- 
ing these pictures,” and “You really did a good 
job.” Children in the money condition were told, 
“If you will draw some pictures, I'll pay you with 
these stacks of pennies.” (The experimenter pointed 
to four stacks of five pennies.) After each 2-minute 
Segment, the experimenter looked over and said 
some variation of “you have just earned five cents,” 
as he pushed a stack of pennies toward the child. At 
the end of 8 minutes, the child was told, “You have 
just earned five more pennies and it looks like our 
time is up. You have earned 20 cents for drawing 
Pictures for me. Now let’s see if we can find our 
way back to your room.” 

Children in the control group were told the fol- 
lowing by the experimenter: “I am interested in 
how boys and girls draw pictures and Pd like you 
to draw some pictures, There is plenty of paper if 
you want it. Go ahead and draw whatever you'd 
like while I do some work that I have to do.” The 
experimenter did not attend to the child’s drawing. 
At the end of 8 minutes, the child was told that 
“our time is up” and “let’s see if we can find our 
way back to your room,” 

It was apparent to the experimenter that the con- 
trol treatment was uncomfortable for both the ex- 
perimenter and child, It was abnormal to sit in abso- 
lute silence for eight minutes (except for the direc- 
tions) with a child who had been specifically asked 
to accompany you to the “surprise room.” While 
the experimenter was (rather painfully) striving to 
avoid eye contact, conversation, or attending to the 
child’s drawing, the child was striving equally hard 
to elicit some recognition or validation from the ex- 
perimenter for what he was doing. The child often 
showed his drawing or talked to the experimenter, 
The experimenter ignored these attempts by saying, 
“I've got work to do,” and looking away. It was 
clear that ignoring a child’s activity, while in his 
presence, was in itself influential and quite different 
than if the child were drawing alone. 
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In each condition, if the child stopped drawing 
for more than 20 seconds, the experimenter said, 
“There is plenty of paper if you want it.” If the 
child stopped drawing again he was dropped from 
the data analysis on the assumption that he had not 
received the treatment. The only child dropped from 
the experiment was one in the symbolic-reward 
group. 

The following week, the posttest involved the 
identical procedure as in the pretest. 


Results 


Before analyzing the effect of the experi- 
mental treatments on subsequent intrinsic mo- 
tivation, pretest levels of intrinsic motivation 
were tested for initial comparability. The pre- 
test means found in Figure 1, which plots 
mean number of minutes spent drawing dur- 
ing the pretest and posttest by experimental 
treatments, illustrate the pretest equivalence 
of the four groups of children. The pretest 
means range between 20.46 and 22.07 min- 
utes. The one-way analysis of variance per- 
formed on these pretest means was not sig- 


nificant. This result is particularly important 
because intact groups of children were as- 
signed to treatments. 

The effect of treatments on intrinsic moti- 
vation is illustrated in Figure 1 by the solid 
lines. The means indicate that, as expected, 
money and award treatments decreased and 
positive verbal reinforcement increased in- 
trinsic motivation over pretest levels. Sur- 
prisingly, the control group resulted in th: 
largest decrease in intrinsic motivation. Th 
significance of these results was tested in 
4x2 factorial design with treatments ( ) 
as the between variable and pretest and pc -t- 
test observations (2) as the within varia ie. 
As predicted, the effect of experimental tr at- 
ment, time of observation, and the Tim of 
Observation X Experimental Treatment 


in- 

teraction were significant, F(3,68) = 75, 

p < 005; F(3,68) = 26.20, p < .001; F(3, 
68) = 10.84, p < .001. 

Simple effects were performed to te < the 

adequacy of the experimental hypoth: 2s by 


Aon 
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isolating the source of the significant inter- 
action. Importantly, the effect of the treat- 
ments during the posttest was highly signifi- 
cant, F(1, 68) = 37.88, p< .001. Further- 

vore, the mean drawing times between the 
pretest and the posttest for the money, award, 
l-reinforcement groups were signifi- 
i in the predicted direction, F(3, 68) 
, Ż < 05; F(3, 68) = 14.33, p < 001; 
58) = 7.33, p< .01. However, in the 
| group, where the treatment was not 
to affect drawing times, the decline 
nificant, F (3,68) = 43.00, p < .001. 


und 


Discussion 


Fhe unexpected control group results raise 
in interpreting the expected results in 
money, award, and positive verbal-rein- 
reement conditions. Clearly, the attempt to 
replicate the control group of Lepper et al. 
(1973) failed, perhaps due to procedural dif- 
ferences. Lepper et al. (1973) gave the fol- 
lowing description of their control group: 
“Generally, he (experimenter) attempted to 
show interest in, rather than explicit approval 
of, the subject’s performance.” What is the 
meaning of “interest in” or “approval of?” 
How much generalized social reinforcement 
did the child get from the experimenter? In 
interpreting these descriptions, the most in- 
conspicuous (least manding) and reinforcing 
procedure was chosen. In retrospect, the ex- 
perimenter did not show sufficient interest in 
the child’s drawing. Thus, the reduced in- 
trinsic motivation may have resulted from 
the generally aversive atmosphere or the 
rather heavy-handed surveillance by the ex- 
perimenter. 

Experiment 2 was designed to replicate the 
results of the control group in Experiment 1 
and to test the effect of two additional control 
groups. Control 1 was a replication of the 
control group in Experiment 1. Control 2 
again attempted to replicate the control group 
in Lepper et al. (1973) by having the experi- 
menter show more interest or attend more to 
the child’s drawing but otherwise providing 
little social reinforcement. In Control 3, only 
pretest and posttest assessments of intrinsic 
motivation were taken, thus controlling for 
time, history, and so on, Children in Control 


919 


1 were expected to decline in subsequent in- 
trinsic motivation, whereas the children’s in- 
trinsic motivation in Controls 2 and 3 was 
expected to remain stable. 


EXPERIMENT 2 
Method 


Subjects and experimental setting. The subjects in 
Experiment 2 were 4- and 5-year-old children who 
were enrolled in the kindergarten program spon- 
sored by the Department of Social Services of Win- 
ston-Salem, North Carolina, in the fall of 1974. Two 
separate kindergarten facilities participated in the 
study. There were 28 subjects, 14 males and 14 fe- 
males, almost all of whom had participated in pre- 
vious research projects, but none of these were moti- 
vational studies. Most of the children were Afro- 
Americans from families of low socioeconomic status. 
The experimental activities and materials were the 
same as used in Experiment 1. All subjects completed 
the treatment, and therefore none were dropped 
from the data analysis. 

Experimental design. The experimental treatments 
were three types of control procedures. In Control 
1 the experimenter was present but ignored the child 
(a replication of the control group in Experiment 1). 
Control 2 was one in which the experimenter was 
present and attentive but not directly reinforcing the 
child for drawing (the control procedure described 
by Lepper et al, 1973). Control 3 was a time 
control with no treatment between the pretest and 
posttest measurements. One third of the subjects in 
each class were randomly assigned to one of the three 
control groups. 

Procedure. The procedure for pretests and post- 
tests was identical to that described in Experiment 
1. The procedure for Control 1, with the experi- 
menter ignoring the child, was identical to that 
described for the contro] group in Experiment 1, In 
Control 2, the control group procedures used by 
Lepper et al. (1973) were again attempted. Explic- 
itly, the experimenter watched the child’s drawing 
intently but was careful not to show direct ap- 
proval of the drawing. Any questions were answered 
with noncommittal answers. For Control 3, the con- 
trol for time and history, the children were pre- 
tested and posttested but received no intervening 
treatment. As in Experiment 1, the pretest/posttest 
interval was approximately 2 weeks and subjects 
were tested by a male Caucasian psychology student. 


Results 


The results of Experiment 2 are illustrated 
by the dotted lines in Figure 1. As expected, 
intrinsic motivation in Controls 2 and 3 re- 
mained stable, yet it dropped sharply in Con- 
trol 1. The difference between pretest and 
posttest means for Control 2 and Control 3 
were not significantly different, as revealed 
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by ¢ tests. The decline in intrinsic motivation 
for Control 1, however, was significant, t(16) 
= 2.45, p < .05, thereby replicating the re- 
sults of Experiment 1. 

The intent of Experiment 2 was to more 
accurately interpret the results of Experi- 
ment 1 by analyzing the combined data. How- 
ever, since Experiments 1 and 2 were done at 
different times, by different experimenters, 
and on different subject populations, a one- 
way analysis of variance was performed on 
the pretest means for all groups in Experi- 
ments 1 and 2 in order to test for initial 
equivalence in intrinsic motivation. These pre- 
test means ranged between 20.46 and 22.17 
minutes. As expected, the result was not sig- 
niñcant, and an analysis was performed on 
combined data of Experiments 1 and 2. This 
analysis combined the data for Controls 2 
and 3 of Experiment 2, and the control group 
in Experiment 1 with its replication, Control 
1, in Experiment 2 since results for these 
groups were nearly identical. 

A 5X2 factorial with five treatments 
(Controls 2 and 3 combined, Control 1s com- 
bined, money, award, and verbal reinforce- 
ment) as the between variable and pretests 
and posttests (2) as the within variable was 
performed on the number of minutes spent 
drawing. The analysis resulted in significant 
main effects for treatments and time of ob- 
servation, F(4,95) = 3.65, p< .01; F(1, 
95) = 23.76, p< .001, and a significant 
Treatment X Time of Observation interac- 
tion, F(4, 95) = 8.44, p < .001. 

Simple effects were performed to test the 
explicit experimental hypotheses by isolating 
the source of the significant interaction. All 
results fit the predictions. That is, the post- 
test differences in experimental treatments, 
F(1, 95) = 38.34, p < .001; the decreases in 
intrinsic motivation in the money, award, and 

Control 1 conditions, F(4,95) = 13.86, $ < 
001: F(4,95) = 7.09, p< 01; F(4,95) = 
41,57, p <.001; and increases in intrinsic 
ion after positive verbal reinforce- 
(4.05) = 3.36, p < .05, were signifi- 


moti 


ment, 


cant, The pretest and posttest change for Con- 
trols 2 and 3 í ined was not significant, as 
expected 

Scheffé’s a pos paired comparisons on 


the posttest scor < treatments resulted 


R. ANDERSON, S. MANOOGIAN, AND J. REZNICK 


in six significant differences. Tha he chil- 
dren in the positive verbal-r ement 
group were significantly greater sttest 
intrinsic motivation than childr: the 
award, money, or Control 1 condii F(1, 
95) = 6.01, p< .05; F(1,95) = 1 p< 
05; F(1,95) = 25.76, p< 05. The test 
intrinsic motivation in Controls 2 an M- 
bined was significantly greater than the 
money or Control 1 conditions, Fl, = 
5.38, p < .05; F(1,95) = 13.12, $ < 05, rnd 


intrinsic motivation in the award conc) om 
was significantly greater than in Conti 
F(1,95) = 6.01, p < 05. 

Although the relative pretest differen 
between the award, money, and Control 
conditions are in the same order as the pos! 
test, analysis of covariance eliminating pre- 
test differences does not seem appropriate for 
the following reasons. First, the pretest 
means for all seven groups in Experiments 1 
and 2 have a maximum difference of only 1.71 
minutes (although it may appear larger in 
Figure 1) and pretest means are not signifi- 
cantly different. Therefore, a simple factorial 
is probably more conservative. Second, the 
experimental hypotheses are not concerned 
with the relative degree of decreased intrinsic 
motivation due to various rewards but rather 
that decreases do occur. 


Discussion 


Experiments 1 and 2 incorporated three 
types of external rewards and three types of 
control conditions. Past research has employed 
all three rewards separately, but none has 
combined them into one study. As expected, 
positive verbal reinforcement increased sub- 
sequent drawing, whereas money and awards 
decreased subsequent drawing. Theoretically, 
money and awards were seen as sufficient 
justification for drawing, and thus intrinsic 
motivation decreased. Positive verbal rein- 
forcement, however, was not seen as a justifi- 
cation for drawing. Therefore, drawing ac- 
companied by positive verbal reinforcement 
was perceived as intrinsically motivated. In 
addition, drawing in the presence of positive 
verbal reinforcement may have acquired posi- 
tive associations, thereby further increasing 
intrinsic motivation. 

Though not significant, the decrease in in- 
trinsic motivation was greater after money 


ts 
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than after awards. Money is a direct and un- 
ambiguous mand or justification for perform- 
ince, Whereas an award has more tact prop- 
Stated another way, money is an un- 
euous payment, and awards are disguised 
Theoretically, perhaps, the greater 
justification, the more detri- 
ifect on intrinsic motivation. Un- 
this discussion is highly specula- 
se this research (and that of others) 
consider the reinforcement value of 
rewards used in the treatment 

. Mand-—tact properties of a reward may 
not only with qualitative changes in re- 
i but also with quantitative changes with- 

pe of reward. Given that a small 

vard may have effects profoundly 
(han a large monetary reward, an 
eved question is whether the children 
experiment perceived five pennies as a 
i large payment. Furthermore, we 
might ask whether the effects of a trophy 
might have been different from a good player 
ward. At any rate, we must approach a com- 
parison between a good player award and five 
pennies with caution. Our conservative conclu- 
sion is that both money and symbolic reward 
decreased intrinsic motivation, whereas posi- 
tive verbal reinforcement increased intrinsic 
motivation. Furthermore, the apparent dif- 
ference between money and awards is interest- 
ing and deserves further study. 

The search for an adequate control group, 
where level of intrinsic motivation remained 
stable, has extended our knowledge of the 
i(tributions underlying intrinsic motivation. 
Control 3, which merely assessed pretest and 
posttest intrinsic motivation, demonstrates 
that intrinsic motivation maintains itself at a 
relatively constant level over time, if left 
untouched by experimental treatments. This 
expected result permits simpler interpreta- 
tions (i.e., those which do not consider time- 
related influences such as history, matura- 
tion, etc.), a feature not characteristic of 
most of the research in this area. 

In Control 2, the experimenter attempted 
to watch the child’s drawing in a relaxed, but 
nonattentive style. He neither engaged in 
unnecessary conversation or eye contact with 
the child nor did he show approval of the 
child’s drawing. Since Control 2 did not result 
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in any significant change in intrinsic motiva- 
tion, it appears that the procedures were suc- 
cessful in controlling for the child’s drawing 
and the presence of the experimenter during 
the treatment period without interfering with 
the child’s attributions about his motivation. 
It appears then that Control 2 approached a 
replication of the control group in Lepper et 
al. (1973). 

The behavior of the Control 1 experimenter, 
who did not watch the child or his drawing, 
resulted in a large decline in intrinsic motiva- 
tion. If the preschool children were quite ac- 
customed to the presence of pleasant, atten- 
tive adults, the Control 1 experimenter’s 
behavior may have created a very unusual situ- 
ation for the child. The child may have per- 
ceived the experimenter’s behavior as aversive 
or as surveillance. If the experimenter’s be- 
havior was aversive, drawing might come to 
have negative associations. Subsequent de- 
creases in drawing time would not be surpris- 
ing. On the other hand, the awkwardness and 
discomfort of the Control 1 treatment may 
have accentuated the presence of and surveil- 
lance by the experimenter. If so, the child may 
have seen his drawing as externally controlled. 
Subsequent intrinsic motivation would there- 
fore decrease, as Lepper and Greene (1973) 
have demonstrated. 

There are alternative explanations for the 
results. Since the experimenters are adults, it 
is particularly difficult for them to see the 
experiment as the children did. For instance, 
the children may have responded to their per- 
ceptions of the experimenter’s expectations 
or subtle demand characteristics, For exam- 
ple, a child may have decided that the experi- 
menter paid to have the task done because he 
didn’t like the task, Therefore, on the post- 
test the paid child drew less in order to 
please the experimenter. This may be, but it 
imputes considerable cognitive complexity to 
4- and 5-year-olds and certainly is not parsi- 
monious. Furthermore, does verbal reinforce- 
ment suggest that the experimenter likes or 
dislikes the task? 

Since minor changes in the experimenter’s 
behavior result in major differences in the 
child’s subsequent behavior, it is imperative 
that researchers understand the child’s per- 
ception of the situation and the child’s rein- 
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forcement history. For instance, Offenbach 
and Meyer (1964) have found that children 
who have been exposed to rewards interpret 
silence as punishment, whereas children ex- 
posed to punishment interpret silence as re- 
ward. Without knowledge about the meaning- 
fulness of various rewards to particular 
populations, generalizations about their sub- 
sequent impact on behaviors must be limited. 
For instance, the results of the money treat- 
ment in this study might have been different 
if middle-class children had been used. Chil- 
dren accustomed to being ignored may have 
reacted differently to the control treatment 
where the experimenter ignored the children’s 
drawing. In some cases, it might be necessary 
to originate reinforcement histories in the lab- 
oratory before presuming that they are indeed 
sufficient justification for performance, 
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warning and noise was varied 


: 5, 1, 3, or 11 sec) for each subject. The low-socialization group gave sig- 

icantly smaller SCRs to noise than the high-socialization group, revealing a 

ative relationship between level of socialization and amplitude of the SCR 

2 noxious stimulus in a normal population, Previous studies reporting this 
relationship contrasted clinical sociopaths with nonsociopathic control subjects, 
The differences between groups in the SCR were significant at the 0-see (i.e, no 
warning) as well as the remaining warning intervals, indicating that the differ- 
ence between groups is not mediated by “cue-utilization” ability. Control pro- 
cedures permitted excluding differential sensory sensitivity, rates of SCR 
habituation, and general skin conductance responsivity as mediators of the dif- 
ference between groups in the SCR to noxious stimulation. 


The role of punishment in the process of 
uman socialization has been documented by 


_ Aronfreed (1968). The effectiveness of pun- 


ishment in promoting socialization, however, 
is presumably mediated by the aversive state 
it creates, Punishment is unpleasant, and the 
child does what he can to avoid it. An indi- 
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vidual in whom the mechanisms underlying 
this mediating state have a relatively high 
threshold for activation might be predicted, 
therefore, to be little influenced by punish- 
ment. The result of such a higher threshold 
and the consequent reduction in the influence 
of punishment might be the partial failure of 
the socialization process, 

This model of the role of punishment in 
socialization and its mediation by an aversive 
state are in accord with an influential theory 
(Cleckley, 1964; Lykken, 1957) of the de- 
velopment of sociopathic or undersocialized 
(Wiggins, 1968) individuals. According to 
this theory, sociopaths are capable of only a 
weak affective response to punishment (aver- 
sive state), are therefore less influenced by it, 
and are consequently poorly socialized. 

Empirical support for this theory was pre- 
sented by Lykken (1957), who found that 
clinical sociopaths showed significantly weaker 
skin conductance responses (SCRs) in antici- 
pation of and in response to electric shock and 
significantly poorer performance on an inci- 
dental passive-avoidance learning task than 
control subjects. Results of other assessments 
of the SCR in sociopaths have tended to 
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support Lykken’s findings (Hare, 1965a, 
1965b, 1965c; Hare & Craigen, 1974; Hare & 
Quinn, 1971; Schmauk, 1970). 

These studies, however, have dealt exclu- 
sively with the extremely sociopathic end of 
the socialization dimension. They typically 
involve prison inmates, contrasting those diag- 
nosed as sociopathic with other inmates 
diagnosed as nonsociopathic (e.g., socialized 
delinquent) or nonincarcerated, normal con- 
trols. A more general hypothesis implied by 
this research, that is, that in a normal popu- 
lation individuals who are electrodermally 
hyporesponsive to noxious stimulation tend to 
be poorly socialized, has not been tested. One 
purpose of the study reported here was to test 
this hypothesis by examining the relationship 
of level of socialization to amplitude of the 
SCR in a quite different population, college 
students. The Socialization Scale (Gough, 
1964) of the California Psychological Inven- 
tory was administered to obtain an index of 
each subject’s level of socialization. Relevant 
behavioral differences have been reported 
among college students differentiated on the 
basis of the Socialization Scale. Hetherington 
and Feldman (1964), for example, found that 
college students who cheated in a disguised, 
contrived situation scored significantly lower 
on the scale than did students who did not 
cheat. The behavioral domain tapped by the 
scale is also illustrated by normative data 
(Gough, 1964) on its most substantial cor- 
relations with other major personality scales: 
46 with the Restraint Scale of the Guilford- 
Zimmerman Temperament Survey, —.34 with 
Block’s Under-Control scale (derived from 
the Minnesota Multiphasic Personality In- 
ventory [MMPI]), and —.32 with the Pd 
scale of the MMPI. It is only weakly cor- 
related (.17) with a major index of trait 
anxiety, the Taylor Manifest Anxiety Scale. 

Studies reporting smaller SCRs to noxious 
stimulation in sociopaths typically employ a 
fear-conditioning paradigm in which a signal, 


such as a light or tone, precedes the noxious 
stimulus. A warning signal, such as a condi- 


however, reduces the ampli- 
the unconditional stimulus 
069; H. Kimmel, 1966; 
kken, 1962; Lykken, 


tional stimulus 
tude of the >! 
(Grings & Sc! 
E. Kimmel, 19 
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Macindoe, & Tellegen, 1972; Lykken & 
Tellegen, 1974), and it is therefore possible 
that the reduced SCR to a noxious stimulus 
shown by sociopaths could be due to a 
greater efficiency in the sociopath’s utiliza- 
tion of cues of impending noxious events. 
Perhaps the poorly socialized individual is 
less influenced by punishment because he 
has developed a greater than normal 
ability to “get set” for a noxious stimu- 
lus, when its occurrence is signaled, so as 
to reduce its subjective impact. The second 
purpose of this article was to test this hypoth- 
esis. Since the effectiveness of a signal in re- 
ducing the SCR to a noxious stimulus has 
been shown to be dependent upon the dura- 
tion of the warning interval (Lykken, 1962), 
the effects of several different warning inter- 
vals were examined, Finally, I tested the pos- 
sibility that either differential sensory sensi- 
tivity or differential rates of habituation of 
the SCR might mediate any apparent differ- 
ences between subjects in the amplitude of 
the SCR to noxious stimulation. 


METHOD 
Overview 


Subjects scoring high, medium, or low on the 
Socialization Scale were exposed to 30 pairings of 
a light (signal) with a noxious auditory stimulus 
while continuous skin conductance was recorded, For 
each subject, the warning interval varied from 0 sec 
(i.e., no warning) to 11 sec in duration. To preclude 
confounding the effects of a warning signal with 
response interference, the signal was a slight change 
in the illumination of the room, which required no 
visual attention beyond keeping eyes open and 
evoked little if any orienting SCR. 

As a control for group differences in general skin 
conductance responsivity as opposed to reactivity to 


1 Response interference occurs when the SCR 
evoked by one stimulus (e.g., a signal) is underway 
when the SCR to a second stimulus begins, the latter 
being smaller than it would have been had no pre- 
ceding SCR occurred. The principal evidence for 
response interference is that the briefer the interval 
between the onset of the signal and the onset of the 
noxious stimulus, the smaller the SCR to the stimu- 
lus. In addition, the more intense the signal and 
consequently the larger the SCR it evokes, the 
greater the response interference (i.e., the greater 
the reduction in the SCR to the second stimulus). 
As reported elsewhere (Waid, Note 1), the mild 
stimulus used as a signal in the present. experiment 
precludes the occurrence of response interference, 


SKIN CONDUCTANCE AND LEVEL OF SOCIALIZATION 


noxious stimuli, each subject was also exposed to 
a series of 30 signal-stimulus pairings in which the 
auditory stimulus was innocuous. In both series of 
pairings the subject estimated the intensity of the 
auditory stimulus on specific trials via the magnitude 
estimation method (Stevens, 1956, 1975). 


Subjects 


were 48 males and 32 females, ranging 
years of age, who were recruited by 
sements placed on the Rutgers University 
Socialization score was independent of both 
ind age and, therefore, neither variable was 
fered further. 


i pparatus 


cin conductance recordings were obtained using a 
constant .75-V source (Beckman skin conductance 
couplér) applied to a pair of .64-cm? silver/silver 
chloride electrodes attached to the palmar surfaces 
of the first and second phalanges of the index finger 
e right hand. The electrodes were of the well 
ï , and adhesive collars were used with them 
to prevent spread of the electrolyte. Johnson and 
Johnson K-¥ Jelly (Edelberg, 1967) was used as 
an electrode medium. Display of recordings and 
experimental event markings was on a Beckman 
Type-R Dynograph. 

An Archer 6-V de alarm buzzer (Cat. No. 273-049) 
was situated on a wall behind the subject and was 
powered by a variable voltage supply set at approxi- 
mately 5.2 V to produce the noxious auditory 
stimulus. The subject chair was situated so that the 
sound pressure level of the buzzer was approximately 

*98 dB (re .0002 dynes/cm*). Tones at 500 Hz (neu- 

tral auditory stimulus) were delivered over a 7-inch 
‘(17.78-cem) speaker situated on a wall behind the 
subject. The sound_pressure level of the speaker 
output was calibrated to reach 68 dB at the location 
of the subject. 

A light stimulus, serving as the warning stimulus, 
was produced by a standard 100-W soft-white light 
bulb situated on a wall behind the subject. This 
stimulus increased the general illumination of the 
room, so its effectiveness as a signal did not require 
the subject to direct his gaze to a specific site. 

Durations of tones, buzzer, light, and warning 
interval were controlled by solid-state logic circuits, 
and intertrial intervals were programmed on a film 
programmer that activated the logic circuits. 

A Panasonic closed-cicuit television camera and 
monitor were used to present messages to the subject 
during the experiment. 


= 


Procedure 


Upon arrival each of the 80 participants passed 
an auditory screening test indicating that hearing was 
within normal limits bilaterally. Ten other volunteers 
failed the screening test. The subject then completed 
the Socialization Scale and the Eysenck Personality 
Inventory (Eysenck & Eysenck, 1968). 
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Following electrode placement, the subject was 
seated in a padded reclining chair in a sound- 
attenuated chamber and allowed to relax for 5 min- 
utes. The subject was then told that during the fol- 
lowing 20 minutes he would be exposed to sounds 
and lights of varying intensities and durations and 
that he would be asked to estimate the loudness of 
some of the sounds, The magnitude estimation pro- 
cedure was explained and the subject practiced the 
procedure. 

The subject was then reminded that a light would 
precede and/or accompany each sound. Subjects in 
the predictable warning-interval condition were told 
that the duration of the light would “change every 
six trials,” whereas subjects in the unpredictable 
warning-interval condition were told that it would 
“change almost every trial.” Subjects in the predict- 
able warning-interval condition received the six pair- 
ings at each duration of warning interval in succes- 
sion (e.g., 1, 1, 1, 1, 1, 1, 3, 3, 3, 3, 3, etc.), whereas 
the other subjects (unpredictable warning-interval 
condition) received them in irregular sequence (e.g., 
1, 3, 0, 3, 11, .5, 0, etc.). In both cases, the order 
of presentation of the five durations of warning 
interval was randomized. 

Finally, the subject was told that the sounds and 
lights would begin occurring shortly. Thirty presen- 
tations of the tone (neutral condition) or noise 
(noxious condition) then occurred, each preceded 
and/or accompanied by the signal. The intertrial 
interval ranged from 30 sec to 48 sec (M = 39 sec). 
On intensity-estimation trials the experimenter man- 
ually transmitted a printed message stating, “This 
is a loudness estimation trial,” via the closed-circuit 
television system. The message was presented 10-13 
sec following the onset of the preceding stimulus 
and remained visible to the subject until 4-6 sec 
after the stimulus. At this time a second message 
was presented stating, “Report your loudness estima- 
tion.” The second message was removed following 
the subject’s report. This procedure precluded con- 
founding the SCR to the stimulus with any SCR to 
the reporting of the magnitude estimation. 

Following this part of the experiment the subject 
was exposed to the remaining condition, noxious or 
neutral, depending on which he had been exposed to 
previously. The instructions were essentially the same 
for both parts of the experiment, with only the 
description “soft” or “very loud” varying. The sub- 
ject practiced the estimation procedure again with 
tones of appropriate loudness. 


Data Analysis 


Amplitude of the SCR was scored as the change in 
micromhos from onset to peak of the phasic change 
in skin conductance beginning within a 4-sec period 
starting 1 sec following onset of the noise or tone. 

For the purposes of analysis, each subject was 
assigned to one of three groups (high-, medium-, or 
low-socialization) on the basis of his score on the 
Socialization Scale. Subjects with a score of 37 or 
higher were assigned to the high-socialization group, 
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Ficure 1. Mean amplitude of the skin conductance response (SCR) of each socialization 
group (high, medium, low) to two types of stimulus, under two levels of predictability (P) 
of warning (High, Low), as a function of warning interval. 


those with a score of 31-36 to the medium group, 
and those with a score of 30 or lower to the low- 
socialization group. 

The design was a 3X2X2%X5 factorial, with 
groups (high-, medium-, or low-socialization), pre- 
dictability of warning interval (predictable or un- 
predictable), type of stimulus (noxious or mild), and 
warning interval (0, .5, 1, 3, 11 sec) as the indepen- 
dent variables, with repeated measures on the last 
two factors. Because a total of 25 observations of 
magnitude estimation and four observations of the 
SCR were lost due to technical difficulties, analyses 
of variance for missing data were performed. 


RESULTS 
Skin Conductance 


The skin conductance data are presented 
in Figure 1. The main effects of level of 
socialization, type of stimulus, and warning 
interval were each significant, F(2,74) = 
3.48, p < .05; F(1,74) = 41.58, p < .0001; 
F (4,296) = 6.33, p < .001, respectively, The 
main hypotheses of the present study, how- 
ever, must be tested via a priori individual 
comparisons. Relevant comparisons were made 
using Duncan’s Multiple Range Test, adopt- 
ing a .05 confidence level. 


Response to noise. Both the low- and 
medium-socialization groups produced signifi- 
cantly smaller SCRs to noise than did the 
high-socialization group. Further individual 
comparisons showed that these differences 
were significant at the no-warning and at the 
remaining warning intervals, indicating that 
warning utilization does not mediate the dif- 
ferences between socialization groups in the 
SCR. In all three groups the signal produced 
a significant reduction in the SCR to noise, 
at least if the warning interval was longer 
than .5 sec: The SCRs at the 1-, 3-, and 
11-sec warning intervals were each signifi- 
cantly smaller than those at either the 0- or 
-5-sec warning intervals. The effects of the 
warning interval were virtually the same at 
all three levels of socialization. 

Response to tones. There were no signifi- 
cant differences between socialization groups 
in the amplitude of the SCR to tones in the 
predictable warning-interval condition, indi- 
cating that the differences between the groups 
are not due to general differences in skin 
conductance responsivity to stimulation. In 
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the unpredictable warning-interval condition, 
however, the low- and medium-socialization 
groups showed significantly smaller SCRs to 
tones than the high-socialization group, indi- 
cating that they were less reactive to in- 
creased unpredictability of mild stimulation 
as well as to noxious stimulation. 

Habituation. The data were also examined 
for habituation of the SCR and its influence 
on the preceding results. With SCRs grouped 
into blocks of six across trials, there was a 
significant, F (4, 296) = 27.43, p < .001, dec- 
rement in amplitude of the SCR across trials. 
The rate of habituation of the three groups, 
however, was virtually the same, the differ- 
ences in amplitude of the SCR among the 
three groups being fairly consistent across 
blocks, The high-socialization group dis- 
played significantly larger SCRs to the nox- 
ious stimulus than did the low-socialization 
group at all blocks except the third in the 
unpredictable warning condition and second, 
third, and fourth blocks in the predictable 
warning condition. Hence, differential rates 
of habituation cannot be a mediator of the 
differences between socialization groups in 
the amplitude of the SCR. 


Magnitude Estimations 


Table 1 presents the mean magnitude esti- 
mations of the two types of stimuli made by 
each of the three groups. For the sake of 
simplicity these data have been collapsed 
across levels of warning interval and predict- 
ability of warning interval. Although there 
was no significant main effect of level of 
socialization, individual comparisons indicated 
that the medium-socialization group made 
significantly lower estimations of noise inten- 
sity than either of the other two groups. The 
high- and low-socialization groups made vir- 
tually identical intensity estimations. The 
differences between groups were independent 
of warning interval. There were no differences 
between groups in magnitude estimations of 
the intensity of the tones. 


Extraversion, Neuroticism, and Socialization 


Neither extraversion nor neuroticism was 
significantly correlated with socialization 
(r= —.14, p > .21 and r=.11, p > .67, re- 
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TABLE 1 


Mean MAGNITUDE ESTIMATIONS OF Noise AND TONES 
Mave By THREE SOCIALIZATION GROUPS 
(REFERENCE = 100) 


‘Type of stimulus 
Leyel of . 
Noise 


socialization Tone 
High 114.8 98.4 
Medium 106.8 99.2 
Low 112.0 102.6 


spectively), nor were there any significant 
differences among the mean scores of the 
three groups on either scale. The mean scores 
of the high-, medium-, and low-socialization 
groups were, respectively, 12.2, 10.8, and 13.5 
on extrayversion and 9.3, 8.7, and 10.8 on 
neuroticism. 


Discussion 


The present results replicate the finding of © 
a negative relationship between level of so- 
cialization and amplitude of the SCR to a 
noxious stimulus in a normal population. Pre- 
vious studies reporting this relationship con- 
trasted clinical sociopaths with nonsociopathie 
control subjects. 

The present results, however, fail to sup- 
port the hypothesis that the difference be- 
tween groups in the amplitude of the SCR 
to noxious stimulation is mediated by more 
efficient use of a warning by low-socialization 
subjects. Furthermore, these data preclude 
attributing the differences in the SCR to dif- 
ferences between groups in auditory sensitiv- 
ity. The low-socialization group displayed 
SCRs of significantly lesser amplitude than 
those of the high-socialization group regard- 
less of warning interval duration and despite 
loudness estimations virtually identical to 
those of the high-socialization group. If 
warning-signal utilization mediated this dif- 
ference between the two groups, there should 
have been no difference between high- and 
low-socialization groups at the 0-sec warning 
interval (nor perhaps at the .5-sec warning 
interval), but significant differences should 
have been observed at the remaining warning 
intervals. If different levels of sensory sensi- 
tivity were a mediator, then the low-socializa- 
tion groups should have shown significantly 
lower intensity estimations in addition to the 
significantly smaller SCRs. 
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The absence of a difference between highly 
and poorly socialized groups in the SCR to 
neutral stimuli weighs against the interpreta- 
tion that the differences between these groups 
in the amplitude of the SCR to noxious 
stimuli are due to peripheral physiological 
mechanisms of the SCR, to a generalized low 
arousal of the poorly socialized subjects, or 
to lower motivation of these subjects to stay 
alert during the experiment. If any of these 
factors were operative, then the differences 
between groups also should have appeared in 
the predictable warning-interval condition 
with mild stimulation. The differences be- 
tween groups, however, appeared only under 
conditions of either noxious stimulation or 
low predictability of stimulation. This out- 
come is consistent with previous research 
(Goldstein, 1965; Hare & Quinn, 1971; 
Lindner, 1942) reporting no differences be- 
tween sociopaths and nonsociopaths in the 
amplitude of the SCR to simple stimuli. 

Since the present results are based upon a 
large number of SCRs for each subject, the 
differences between groups could be due to a 
faster rate of habituation among low- than 
among high-socialization subjects (i.e., both 
groups might begin with SCRs of the same 
amplitude but appear significantly different 
when responses are pooled across many trials, 
due to a more rapid decrement in magnitude 
of the SCR among low-socialization subjects). 
The results of the blocks analysis, however, 
offer no support for habituation as a mediator 
of the group differences. High-socialization 
subjects displayed significantly larger SCRs 
to noise than did low-socialization subjects in 
the first block of trials at both levels of 
predictability of warning interval, and the 
decrement in amplitude of the SCR across 
blocks was virtually identical in the two 
“roups. 

The preceding discussion focused on the dif- 

ices between the high- and low-socializa- 
ups. The medium-socialization group, 
differed from the high-socialization 

ms of the SCR, in much the same 

the low-socialization group, 

\ ould be expected to display 

am, itude between those of the 
-sociclization groups. Two prin- 
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cipal hypotheses might account for this un- 
expected result. The first is that the apparent 
relationship between level of socialization and 
amplitude of the SCR observed in this sample 
of subjects is secondary to a stronger relation- 
ship between amplitude of the SCR and an- 
other dimension, anxiety. High-socialization 
subjects may produce larger SCRs than sub- 
jects in the other two groups because they 
are significantly more anxious than the other 
subjects. The three socialization groups, how- 
ever, did not differ on the Neuroticism scale 
of the Eysenck Personality Inventory, an 
index of trait anxiety, nor was there a signifi- 
cant correlation between neuroticism and so- 
cialization. Since the medium-socialization 
subjects rated the noise as significantly less 
loud than did the other two groups, a second 
and more likely explanation is that the un- 
expectedly reduced SCRs of the medium- 
socialization group result from these fortui- 
tously higher tolerance levels for intense 
auditory stimuli. In light of the loudness 
estimation data, the present results appear 
consistent with a roughly linear relationship 
between reactivity to noxious stimuli, as 
indexed by the SCR, and level of socialization. 

In conclusion, given the role of punishment 
in human socialization (Aronfreed, 1968) 
and the apparent role of physiological arousal 
as a mediator of the effectiveness of punish- 
ment, it is likely that diminished arousal 
to punishing stimuli partly mediates the rela- 
tive failure of the socialization process, which 
results in an undersocialized individual. Iden- 
tifying the origins of diminished arousal to 
noxious stimuli is, therefore, an important 
problem. The present paper has presented 
evidence against several possible cognitive- 
perceptual sources of this diminished arousal: 
warning-signal utilization, differential sensory 
sensitivity, and differential rate of habitua- 
tion. The sources of reduced arousal may 
ultimately be traced to early experience 
(Thompson & Grusec, 1970) or to genetic 
mechanisms. Block (1967) and Vandenberg 
(1966), for example, have presented evidence 
of substantial inheritance of autonomic re- 
sponse patterns. The possible role of addi- 
tional cognitive-perceptual factors must be 
considered, however. Lazarus (1967), for ex- 
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ample, has found that a stressful film will 
produce less skin conductance activity if the 
appraisal of the harmful events portrayed is 
altered appropriately. The influence on the 
autonomic correlates of socialization of indi- 
vidual differences in such cognitive defense 
mechanisms remains to be explored. 


REFERENCE NOTE 


1, Waid, W. M. Autonomic and psychophysical 
evidence that brief forewarning produces transient 
reduction in sensitivity to noxious stimulation. 
Paper presented at the meeting of the Eastern 
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Order of Consensus, Distinctiveness, and Consistency Information 
and Causal Attributions 


Diane N. Ruble and Nina S. Feldman 
Princeton University 


In two studies, we examined the hypothesis that the order of presenting con- 
sensus, distinctiveness, and consistency information concerning an event may 
affect attributions of causality for the event. In particular, it seemed possible 
that the relative weakness of consensus information in previous studies may 
have been due, in part, to a recency effect because consensus was always pre- 
sented first. Undergraduate subjects, assigned to one of several order conditions, 
were given questionnaires containing descriptions of eight events, according to 
procedures described by McArthur. Each event was followed by consensus, 
distinctiveness, and consistency information arranged in a predetermined order. 
The results showed the predicted recency effect in the use of consensus for 
both person and stimulus attributions, The discussion focuses on methodology 


and interpretational issues. 


In 1967 Kelley first proposed his concep- 
tualization on how the naive person answers 
the question “why,” based on an analysis of 
variance model and earlier theorizing by 
Heider (1958). Kelley (1967) suggested that 
individuals use three types of information 
(consensus, distinctiveness, and consistency) 
when attempting to make an attribution con- 
cerning the cause of an event. Consensus in- 
formation indicates whether or not a particu- 
lar stimulus evokes a similar response in 
different people. Distinctiveness information 
refers to whether a given response is made 
uniquely to a particular stimulus or to other 
stimulus entities as well. Consistency informa- 
tion concerns whether or not the same re- 
sponse occurs whenever the stimulus entity is 
present (i.e., over successive exposures and 
varied sensory or perceptual modalities). Ac- 
cording to Kelley’s (1967) model, attribu- 
tions to the stimulus, the person, or to cir- 
cumstances are related to the configuration 
of the available information, which has typi- 
cally been varied experimentally over two 
(high/low) conditions. 


The authors would like to express appreciation to 
Leslie McArthur for helpful suggestions on an earlier 
draft. 

Requests for reprints should be sent to Diane N. 
Ruble, Department of Psychology, Princeton Uni- 
versity, Princeton, New Jersey 08540. 


Tests of this model have provided reason- 
able support for predictions concerning the 
kinds of information patterns that lead to a 
particular attribution (McArthur, 1972; Or- 
vis, Cunningham, & Kelley, 1975). However, 
one unexpected result in both of these studies 
was the differential impact of the three types 
of information on attribution processes. Over- 
all, consistency information accounted for the 
most variance, whereas consensus information 
accounted for the least. In addition, the rela- 
tive importance of distinctiveness and consis- 
tency shifted, depending on the type of at- 
tribution being analyzed, but consensus infor- 
mation always emerged a distant third (Mc- 
Arthur, 1972). These findings have led some 
investigators to search for the underlying 
psychological reasons for people’s apparent 
disinterest in consensus information (Feld- 
man, Higgins, Karlovac, & Ruble, 1976; Mc- 
Arthur, 1976; Nisbett & Borgida, 1975). 

The contention of the present paper is that 
the relative importance of the three types of 
information in the McArthur (1972) and 
Orvis et al. (1975) attribution paradigms may 
be due, in part, to a very simple information- 
processing variable—an order effect. The 
information has always been presented in the 
same order in previous studies (i.e., consen- 
sus, distinctiveness, consistency), and thus 
memory or attentional variables may have 
affected the subject’s use of the information. 
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“t Thus, an order effect in the direction of 


rey 


o 


\ 


recency may be partly responsible for the 
relative weakness of consensus information 
because consensus was always presented first. 

A number of previous studies support the 
n that order of information presen- 
have a major impact on subjects’ 
responses in situations similar to the ones 
used by McArthur (1972) and Orvis et al. 
. For example, studies on impression 
sation frequently report order effects of 
both primacy and recency (Anderson, 1965; 
Jones & Goethals, 1971). The direction of 
such effects, however, depends on a number 
of factors (Jones & Goethals, 1971). In addi- 
tion, order effects in the direction of recency 
have been observed in judgment studies with 
children, even when only two information 
ples were involved (Feldman, Klosson, 
_Parsons, Rholes, & Ruble, 1976; Kun, Par- 
“sons, & Ruble, 1974). 

The present study was designed to examine 
the possible effects of order on people’s use of 
consensus, distinctiveness, and consistency in- 
formation in making attributions. Subjects 
were presented with eight story themes in one 
of three orders of information presentation. 

“They were then asked to make causal attri- 
1 butions concerning the event according to 
procedures described by McArthur (1972). A 
recency effect was generally predicted based 
on the following line of reasoning. Studies 
inding primacy effects usually involve a long 
list of information variables that are often 
presented verbally. In order to make use of as 
‘vuch of the information as possible, the sub- 
s ject must actively process or rehearse the 
information. If there is too much to remem- 
ber, the earlier information, undergoing re- 
hearsal, probably has an advantage (Jones & 
Goethals, 1971). In support of this suggestion, 
rehearsal processes have been implicated in an 
increase in primacy effects with age in studies 
of children (Cole, Frankel, & Sharp, 1971; 
Ornstein, Naus, & Liberty, 1975). In Mc- 
Arthur’s (1972) attribution study, subjects 
. were presented in written form with only 
three pieces of information. Thus, there was 
no impetus for the subjects to rehearse the 
material, and no primacy effect would be ex- 
pected. Instead, a recency effect might result 
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from the “recall readiness” (Jones & Goethals, 
1971) of the last cue read. 

In sum, in terms of previous findings re- 
garding the use of consensus information, the 
major prediction of the present study was 
that consensus would have the least impact on 
attributions when presented in the first posi- 
tion (i.e., the order used by earlier studies), 
However, the use of consensus was expected 
to increase substantially in different orders, 
especially when presented last. 


Stupy 1 
Method 


Subjects. The subjects were 90 undergraduate stu- 
dent volunteers, recruited from discussion sections 
of the introductory social psychology class at Prince- 
ton University. The subjects were randomly but 
equally divided among the three order conditions. 

Procedure. The subjects completed an eight-item 
questionnaire that consisted of a subset of the same 
items used by McArthur (1972) and Orvis et al. 
(1975) and was distributed across the four verb 
categories described in the McArthur study. Each 
item described a response by a person (e.g, Jim 
translates the sentence incorrectly) and was fol- 
lowed by three statements representing one of eight 
possible combinations of consistency, distinctiveness, 
and consensus information. All questionnaires pre- 
sented the items in a single random order, but the 
information statements were presented in one of 
three orders on each questionnaire. In Order 1, the 
information was presented in the same order as in 
the previous studies (i.e., consensus, distinctiveness, 
consistency). The sequence in Order 2 was con- 
sistency, consensus, and distinctiveness, while Order 
3 was distinctiveness, consistency, and consensus. 
Sample items representing each order are presented 
below: 


Order 1: 
Sue is afraid of the dog. 

a. [Consensus-Low] Hardly anyone else who 
sees the dog is afraid of it. 

b. [Distinctiveness-Low] Sue is also afraid of 
almost every other dog. 

c. [Consistency-Low] In the past, Sue was al- 
most never afraid of the same dog. 


Order 2: 
Jim thinks the building is very tall. 
a. [Consistency-High] In the past, Jim has 
almost always thought this same building was tall. 
b. [Consensus-Low] Hardly anyone else who 
sees the building thinks it is tall, 
c. [Distinctiveness—Low] Jim also thinks that al- 
most every other building is tall, 


Order 3: 
John laughs at the comedian. 
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a. [Distinctiveness-High] John does not laugh 
at almost every other comedian. 

b. [Consistency-High] In the past, John has 
almost always laughed at the same comedian. 

c. [Consensus-High] Almost everyone who hears 
the comedian laughs at him. 


The procedure described by McArthur (1972) was 
followed as closely as possible. Written instructions 
asked the subject to read each statement and decide, 
on the basis of the information given, what prob- 
ably caused the event to occur. The subjects were 
then directed to choose between four alternative 
causes, which took the following form: 


a. Something about the person (e.g., Sue) prob- 
ably caused her to make Response X (e.g., be 
afraid) to Stimulus X (e.g-, the dog). 

b. Something about Stimulus X probably caused 
the person to make Response X to it. 

c. Something about the particular circumstances 
probably caused the person to make Response 
X to Stimulus X. 

d. Some combination of a, b, and c above prob- 
ably caused the person to make Response X to 
Stimulus X. 


For choice d, subjects were asked to specify the 
particular combination of factors that probably 
caused the event. (See McArthur, 1972, for addi- 
tional information concerning the items and the 
dependent variables.) 


Results 


Following McArthur, analyses of variance 
were performed on the frequency data be- 
cause they are less cumbersome and provide 
a better description of the data than chi- 
square analyses. Separate 3X2xX2x2 
(Order X Consensus X Distinctiveness X 
Consistency) analyses of variance were per- 
formed on the dependent variables: attribu- 
tions to person, stimulus, and circumstances.* 
The presentation of the results is organized 
around each dependent variable. First, the 
main effects of each information variable are 
presented and compared with McArthur’s 
(1972) results, Next, effects of order of 
presentation are described and further exam- 
ined by means of within-order analyses. In- 
teractions among the information variables 
are not presented since they are generally 
consistent with McArthur’s (1972) findings 
ad ave not directly relevant to order effects. 

‘tion compares the relative impor- 
ta information variables across the 
three f presentation. 


DIANE N. RUBLE AND NINA S. FELDMAN 


The basic prediction was that the informa- 4 ~> 


tion variables, especially consensus, would 
have the least impact in the primacy position. 
Thus, for example, the means for high versus 
low consensus should be least differentiated 
in Order 1. This prediction would be sup- 
ported by interactions of the information 
variable with order. 

Person attribution. The results of the over- 
all analysis revealed significant main effects _ 
(p < .001) of all three information variables.” 
The frequency of person attribution was 
higher (a) with low than with high consensus 
information, (b) with low than with high dis- 
tinctiveness information, and (c) with high 
than with low consistency information. These 
results are consistent with Kelley’s (1967) 
model and with McArthur’s (1972) findings. 

There was one significant interaction with» 
order, an Order X Consensus interaction, 
F(2,87) = 4.23, p <.05. As predicted, the! 
pattern of the means (see Figure 1) indi- 
cates that the effects of consensus on person 
attributions were much weaker in Order 1 
(when consensus was presented first) than in 
Orders 2 or 3. This conclusion is supported 
by the within-order analyses. Although the , 
consensus main effect was highly significant” 
in Orders 2 and 3 (p < .001), it did not reacht 
significance in Order 1 (p > .10). 

Stimulus attribution. The results of the 
overall analyses of variance on attributions 
to the stimulus entity revealed significant 
main effects of consensus (p < .05) and dis- 
tinctiveness (p < .001) information. Attribu- 
tions to the stimulus were more frequent 
given high than low consensus and distincs 
tiveness information, consistent with Mc-, 
Arthur’s (1972) data. However, in contrast 
to her findings, the main effect of consistency 
did not reach significance (p > .10). 

There were several significant effects of 
order for stimulus attributions. The main 
effect, F(2,87) = 4.33, p< .05, indicates 
that the stimulus attributions were most fre- 
quent in Order 3. This effect seems to be due, 


X 

1The combination attributions are not presented 
because no order effects were found for these varia- 
bles. The proportion of person-stimulus attributions 
R .15 and of other combination attributions was 
«26. 
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to the consensus information that was 
strongly affected by the order of information 
presentation. Consistent with the recency 
hypothesis, an Order X Consensus interaction, 
* (2,87) = 10.97, p<.001, indicates that 
the predicted effects of consensus (high > 
low) occurred only when the consensus infor- 
2 was presented second or third (see 
ure 2). There was also an Order X Con- 
< Distinctiveness interaction, F(2, 87) 
59, ~ <.01, which shows that the pre- 
d effect of consensus information was 
found only with high distinctiveness informa- 
‘ion (consistent with McArthur, 1972), but 
only for Orders 2 and 3. These conclusions 
are supported by the within-order analyses. 
Within both Order 2 and Order 3, the main 
effect of consensus was significant (p < .05) 
» in the predicted direction, as was the Con- 
nsus X Distinctiveness interaction (p < 
001), However, within Order 1, the effect of 
consensus was significant in the direction 
opposite, to that predicted (i.e., low con- 
sensus led to more frequent stimulus attribu- 
tions, $ < .01), and the Consensus X Dis- 
‘inctiveness interaction failed to reach signifi- 
cance (p> .10). 


Circumstance attribution. The results of 


« the analyses of variance on the frequency of 
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Ficure 1. Mean proportion of person attributions as 
a function of consensus and order in Study 1. 
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Ficure 2. Mean proportion of stimulus attributions 
as a function of consensus and order in Study 1, 


circumstance attribution revealed significant 
effects of all three information variables; At- 
tributions to circumstance were more frequent 
with low than with high consensus (p < .05) 
and consistency (p < .001) ‘information and 
with high than with low distinctiveness infor- 
mation ($ <.01). The consistency and dis- 
tinctiveness effects replicate McArthur’s data, 
but the main effect of consensus did not reach 
significance in her study. 

One interaction with order was significant, 
the Order x Consensus X Consistency inter- 
action, F (2,87) = 3.69, p < .05, indicating 
that the effect of the consistency informa- 
tion was relatively weak when it was pre- 
sented last (Order 1), especially with high 
consensus information. Thus, in this case, the 
effect of order did not support the predicted 
recency effect. 

Relative importance of the information 
variables. Table 1 presents the proportion of 
total variance accounted for by each informa- 
tion variable as found in this study and in 
McArthur (1972), An examination of this 
table reveals that the relative importance of 
the three information variables in this study 
was affected substantially by the order of in- 
formation presentation for both person and 
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TABLE 1 


PERCENTAGE OF THE TOTAL VARIANCE IN CAUSAL 
ATTRIBUTIONS ACCOUNTED FOR BY THE THREE 
INFORMATION VARIABLES 


Causal attribution 


Position and name Circum- 
of variable Person Stimulus stance 
Overall 
Consensus 4.42 56 37 
Distinctiveness 5.07 3.24 87 
Consistency 5.07 .02 13.27 
Order 1 
Consensus 96 1.77 46 
Distinctiveness 6.01 4,23 46 
Consistency 3.85 OL 6.83 
Order 2 
Consistency 3.86 06 13.68 
Consensus 7.79 2.16 31 
Distinctiveness 6.68 6.01 2.19 
Order 3 
Distinctiveness 3.08 1.22 KY) 
Consistency 8.13 .09 21.64 
Consensus 5.82 3.64 37 
McArthur (1972) data 
Consensus 6.25 5.17 30 
Distinctiveness 21.72 12.12 7.58 
Consistency 15.76 5.88 41.36 


stimulus attributions. Within Order 1 (the 
order used by McArthur) the relative impor- 
tance of the information variables for person 
attributions is the same as that reported by 
McArthur (distinctiveness > consistency > 
consensus).2 Both Orders 2 and 3 differed 
from the pattern, primarily by increasing the 
importance of consensus information. For 
stimulus attributions, the pattern of results 
within Orders 1 and 2 is similar to Mc- 
Arthur's, except that consistency had virtually 
no impact in the present study. However, for 
Order 3, when distinctiveness information was 
presented first and consensus information was 
presented last, the importance of distinctive- 
ness decreased and that of consensus in- 
creased. Finally, order had little effect on the 
relative position of the information variables 
for circumstance attributions, though the per- 
centage of variance accounted for by con- 
sistency information was substantially larger 
in Order 3 than in Order 1, 


Srupy 2 
The results of Study 1 suggest that the 
order of presenting information does signifi- 


cantly affect the attribution process, as pre- 
dicted. However, because of a failure to repli- 
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cate some aspects of McArthur’s results and 
because the direction of the effects of order 
varied across information variables, an at- 
tempt to replicate the results of Study 1 
seemed warranted and a second study was 
conducted. The basic design remained the 
same, except that all six possible combina- 
tions of the information were examined in 
order to elucidate possible order-effect dif- 
ferences (recency vs. primacy) among the 
three information variables. Also, subjects ” 
were recruited for pay, instead of as volun- 
teers, and additional counterbalancing of item 
presentation was included as part of the pro- 
cedure. 


Method 


Subjects. The subjects were 96 Princeton Univer- 
sity undergraduates recruited for pay. There were 8 
males and 8 females randomly assigned to each of ~ 
the six order conditions. 5 

Procedure. The procedure was identical to Study 
1, with the following exceptions. First, all six possi- 
ble orders of information were examined instead of 
three. Second, all of the eight items included in the 
present study were taken from the McArthur (1972) 
questionnaire. Two items were randomly selected 
from each of the four verb categories included by 
McArthur. Third, two sets of item-information pair- 
ings were used instead of one, as in the first study.. 
These sets were constructed such that each level 
(high/low) of the three information statements 
would be associated with each questionnaire item. 
Fourth, the eight items were presented in several 
random orders to the subjects. 


Results 


The data were analyzed by means of sepa- 
rate 6X2X2X2 (Order X Consensus X 
Distinctiveness X Consistency) analyses of 
variance on person, stimulus, and circum- ” 
stance attributions, as in Study 1. This overall 
analysis revealed significant (p < .001) main 
effects of each information variable in the 
predicted direction for all three attributions. 
To simplify presentation of the effects of 
order, the means are presented in Table 2 as 


‘ 


his 


yá 


Ti 


2The magnitude difference between McArthur’s pæ 


and the present study is probably due to calculation 
procedures specific to each study. McArthur used } 
item-within verb category effects as error terms, 
whereas the present study used subjects-within 
groups effects. Also, differences in experimental pro- 
cedures (e.g., number of stories, volunteer vs. paid 
subjects) may have been contributing factors. 
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resultant scores. These resultant scores indi- 
cate the relative impact of the information on 
attributions as a function of order. In addi- 
tion, since consensus was consistently affected 
by order in the predicted direction in Study 
1, the specific effect of order on consensus in 
this study was examined by means of a priori 
linear trend contrasts, Within-order analyses 
are not presented, because the direction of 
effects for all orders replicates McArthur and 
ches significance for most variables. 

Person attribution. The overall analysis 
revealed no effects for order. To test the spe- 
cific prediction regarding the use of consensus, 
a linear contrast was computed on the re- 
sultant scores as a function of the presenta- 
tion position of consensus. These means were 
as follows: consensus presented first (Orders 
i and 2) =.179; consensus presented second 
(Orders 3 and 4) =.196; consensus pre- 
sented third (Orders 5 and 6) = .258. Al- 
though the means were in the predicted di- 
rection (consensus was used most when pre- 
sented last and least when presented first), 
the linear contrast failed to reach signifi- 
cance, F(1,93) = 1.63, p = .20. 

Stimulus attribution. The linear contrast 
revealed a significant effect of order on con- 
sensus, F(1,93) = 9.35, p < .01, similar in 
direction to that reported for Study 1. That 
is, the lowest proportion of all stimulus at- 


TABLE 2 


MEAN RESULTANT Scores or CONSENSUS (A), 
DISTINCTIVENESS (B), AND CONSISTENCY (C) ror 
THE THREE ATTRIBUTION VARIABLES AS A FUNCTION 


OF ORDER 
Order 
1 2 3 4 5 6 
Variable ABC ACB CAB BAC CBA BCA 

Consensus 

Person „171 187 .281 110 .266 .250 

Stimulus "156 (125.219 .266 .219 .343 

Circumstance .047 .094 .093 .203 .031 .172 
Distinctiveness 

Person .203 .219 .125 .110 .110 «156 

Stimulus 188 .281 .156 .204 4157 .219 

Circumstance 109 1156 .157 .172 .157 .172 
Consistency 

Person ATL 156 157 -172 110,031 

Stimulus i88 (187.156 .078 .187 .125 

Circumstance 422 .375 .407 .266 .281 .266 


Note, The resultant score is the difference between the propor- 
tions for the high versus the low level of each information vari- 
able. In order to have positive numbers in the table, the sub- 
tractions were based on the predicted direction. Thus, for 
example, for person attributions, the consensus means represent 
Jow consensus minus high consensus. 
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m—a CONSENSUS FIRST 
e—e CONSENSUS SECOND 


AL a—a CONSENSUS THIRD 


PROPORTION OF STIMULUS ATTRIBUTION 


at 1 
HIGH LOW 
CONSENSUS 

Ficure 3. Mean proportion of stimulus attribu- 
tions as a function of consensus and the position of 


consensus information (first, second, or third) in 
Study 2. 


tributions was when consensus information 
was presented first, and the highest propor- 
tion was when consensus information was 
presented third. Figure 3 presents these data 
as a function of the position (first, second, or 
third) of the consensus information. As seen 
by the slope of the lines, high versus low con- 
sensus information had a much greater im- 
pact on stimulus attributions when it was 
presented third than when presented first, 
thus supporting a recency prediction. The 
overall analysis revealed no effects of order 
for either distinctiveness or consistency (F < 
1); 

Circumstance attribution, The results re- 
vealed no significant order effects. The three- 
way interaction (Order X Consensus X Con- 
sistency) reported in Study 1 did not ap- 
proach significance, F (5,90) = 1.50, $ > ple), 
nor were the means in the same direction. 

Relative importance of the information 
variables. Table 3 presents the proportion of 
total variance accounted for by each informa- 
tion variable as a function of order. As in 
Study 1, it is clear that the relative impor- 
tance of the three information variables was 
affected substantially by the order of informa- 
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TABLE 3 


PERCENTAGE OF THE TOTAL VARIANCE IN CAUSAL 
ATTRIBUTIONS ACCOUNTED FOR BY THE THREE 
INFORMATION VARIABLES 


Causal attribution 


Position and name Circum- 
of variable Person Stimulus stance 
Overall 
Consensus 7.68 7.68 1.80 
Distinctiveness 4.07 6.30 3.74 
Consistency 3.04 3.70 17.86 
Order 1 
Consensus 5.84 4.29 31 
Distinctiveness 8.16 6.17 1.71 
Consistency 5.84 6.17 25.39 
Order 2 
Consensus 6.67 2.56 1.36 
Consistency 4.63 5.77 21.72 
Distinctiveness 9.07 12.98 3.77 
Order 3 
Consistency 4.01 3.77 22.99 
Consensus 12.98 7.39 1.22 
Distinctiveness 2.56 3.77 3.40 
Order 4 
Distinctiveness 2.60 6.20 5.38 
Consensus 2.60 10.60 7.52 
Consistency 6.41 92 12.86 
Order 5 
Consistency 1.80 5.77 12.98 
Distinctiveness 1.80 4.01 4.01 
Consensus 10.60 7.85 -16 
Order 6 
Distinctiveness 3.57 6.67 5.38 
Consistency (14 2.18 12.86 
Consensus 9.14 16.46 5.38 


tion presentation, primarily for stimulus at- 
tributions. Specifically, for stimulus attribu- 
tions consensus information accounts for the 
highest percentage of variance of the three 
information variables when it is presented 
either second or third. In contrast, when con- 
sensus is presented first (Orders 1 and 2), it 
accounts for the lowest percentage of the vari- 
ance. In addition, overall, the percentage of 
variance accounted for by the three informa- 
tion variables was very similar for both per- 
son and stimulus attributions, in contrast to 
McArthur’s results (see Table 1). 


Discussion 
The results of both studies indicate that, as 


: predicted, the order of presenting consensus, 


distinctiveness, and consistency information 
concerning an event has a significant effect on 


DIANE N. RUBLE AND NINA S. FELDMAN 


people’s judgments of causality for the event. 
As predicted, a recency effect was demon- 
strated in the use of consensus information 
such that high versus low consensus had the 
least impact (in the predicted direction) 
when it was presented as the first information 
statement. This effect reached statistical sig- 
nificance for stimulus attributions in both 
studies and for person attributions in Study 1 
only, though the means were in the same 
direction in Study 2. The impact of order 
was further demonstrated by the comparisons 
of the three information variables in terms of 
percentage of variance accounted for by each~ 
in Study 2. As Table 3 indicates, when order 
was fully counterbalanced, these three varia- 
bles accounted for approximately the same 
percentage of variance. In fact, in absolute 
terms, consensus accounted for the highest 
percentage of variance for person and stimu- 
lus attributions. There was a weak order in- 
teraction with consistency for circumstance 
attributions in Study 1, but this effect was 
not replicated in Study 2, nor were any other 
order effects found for either distinctiveness 
or consistency information, Thus, it seems 
fair to conclude that order of presentation 
affects only consensus information. It is as if 
a person initially learning about how others 
have responded to the event waits until con- 
firmatory or disconfirmatory information is 
received before making an attribution, there- 
by giving greater weight to the later infor- 
mation. 

The results of the present study indicate 
that basic information processing variables, 


such as order of information presentation, g 


must be considered in social-attributional 
paradigms, even when only a few information 
items are involved. Failure to account for 
such variables may lead to erroneous conclu- 
sions concerning the relative impact of such 
items in situations where an individual is 
asked to assess a set of information.’ Assum- 


3 It should be noted that McArthur’s (1976) find- 
ing that consensus information had less impact on 
attributions than distinctiveness information would 
probably hold true even if the order of presenting 
the information were counterbalanced, When high 
consistency information was provided as the third 
item of information, as it had been in McArthur’s 


j 


5 


e 


t bis 
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i ying a “logical” set on the part of the subject 
_ and presuming the careful evaluation of each 
` item presented does not seem to be a viable 
substitute for careful counterbalancing of in- 
formation. In particular, the present study 
suggests that the relative weakness of con- 
sensus information previously found in this 
attribution assessment pardigm may not be 
due totally to the nature of the information 
. As predicted, interactions with order 
nsistently indicated that consensus was 
cast used when it was the first item pre- 
‘ited, In sum, the consistent placement of 
nsensus information in a disadvantaged po- 
‘ition in the past has potentially led to an 
| inaccurate assessment of its importance in 
some contexts. Thus, its role in the attribu- 
, ton process should be reevaluated in future 
research, 


6 


‘study, consensus information had less influence on 
Stimulus attributions than distinctiveness informa- 
tion whether consensus was presented first or second. 
ìn the other hand, only when consensus informa- 
‘on was presented first did it have less influence 
» OP. person attributions than distinctiveness informa- 
k tion. When consensus information was in the second 
>, position, its influence was equal to that of distinc- 
igtiveness. These data suggest that if McArthur 
k (1976) had counterbalanced the order of presenting 
| consensus and distinctiveness information, the ten- 
a for consensus to have less influence than dis- 
inctiveness information on stimulus attributions 
would have remained the same, whereas the ten- 
b dency for consensus to have less influence on person 
uttributions would have been weakened somewhat 
‘but not eliminated. 
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Aggression as a Function of the Interaction of 
Alcohol and Threat 


Stuart P. Taylor, Charles B. Gammon, and Deborah R. Capasso 
Kent State University 


Intoxicated and nonintoxicated subjects competed in a reaction time situation 
against either a potentially threatening opponent or a nonthreatening oppo- 
nent, The results indicated that the intoxicated subjects initiated higher levels 
of attack than the nonintoxicated subjects only in the threatening situation. 
It was concluded that aggression is not just a consequence of the pharma- 
cological action of alcohol. Instead, alcohol-induced aggression appears to 
be a function of the interaction of alcohol consumption and the degree of 
threat or provocation inherent in a particular situation. 


Bennett, Buss, and Carpenter (1969) failed 
to find a relationship between alcohol con- 
sumption and physical aggression. They con- 
cluded that “alcohol as a pharmacological 
agent . . . does not lead to aggression” (p. 
875). In contrast, Shuntich and Taylor (1972) 
reported that subjects who consumed a mod- 
erate dose of alcohol behaved more aggres- 
sively than subjects who ingested a placebo 
cocktail, 

These contradictory findings could be due 
to methodological factors. The procedures 
used in the above studies differed in two im- 
portant respects. First, the subjects in the 
Bennett et al. (1969) study consumed vodka, 
an intoxicant of relatively low congener con- 
tent, while the subjects in the Shuntich and 
Taylor (1972) study ingested bourbon, an 
intoxicant with a relatively high congener 
content. Given the possibility that the dis- 
crepant results could be attributed to the dif- 
ferent types of alcohol used in these studies, 
Taylor and Gammon (1975) conducted a 
study in which subjects received high or low 
doses of either bourbon or vodka, Aggressive 
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Charles Gammon is now at State University of 
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behavior was found to be related mainly to 
the quantity of the intoxicant ingested: High 
doses of vodka and bourbon instigated more 
aggressive responding than did low doses of 
these intoxicants. 

The second methodological difference be- 
tween the Bennett et al. (1969) and Shuntich 
& Taylor (1972) studies concerns the para- 
digms used to assess aggressive behavior, In 
the Bennett et al. study each subject was 
instructed to administer electric shocks to a 
bogus “learner” whenever the learner made 
an incorrect response on a “discrimination- 
learning task.” In this situation, the learner 
(victim) was a helpless recipient of the nox- 
ious stimulation selected by the teacher. He 
could not retaliate against the teacher or 
initiate an aggressive response. Thus, from 
the teacher’s perspective, the situation was 
relatively nonthreatening. In the Shuntich 
and Taylor and Taylor and Gammon (1975) \; 
studies, each subject was informed that he 
would be competing with an opponent in a 
simple perceptual-motor task. He was further 
told that either he or his competitor would 
receive an electric shock, depending upon the 
outcome of the competition, and that they 
could both select the intensity of shock the 
other would receive. Besides enabling the sub- 
ject to shock his opponent, this paradigm 
allowed the opponent to both retaliate against 
the subject and initiate aggressive reponses. 
The opponent in this situation could not 
be considered helpless or passive. Therefore, 
from the subject’s perspective, the situation | 
was relatively threatening. 
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Thus, the discrepancy between the results 
of the Bennett et al. and Taylor and Gammon 
studies could possibly be attributed to the 
greater threat inherent in the competitive, 
reaction time situation. To put it another 
: way, it is conceivable that alcohol enhances 
aggressive behavior only under conditions of 
threat or potential harm. In an attempt to 
investigate this possibility, the subjects in this 

_ experiment were allowed to compete with an 
opponent in the reaction time situation under 
conditions in which there was either some 
potential threat or a minimal probability of 
being harmed. In the nonthreatening condi- 
tion, the subject competed with an opponent 
who indicated that he intended to set mini- 
mal shock intensities, In the threatening 
condition, the subject competed with a silent 
opponent. Thus, the procedure in the threat- 
„ening condition was identical to the procedure 
used by Taylor and Gammon (1975). 

If, on the one hand, the influence of alcohol 
on the expression of aggressive behavior 
toward a target depends upon the presence 
of threatening cues, one would predict that the 
intoxicated subjects would be less aggressive 
in the nonthreatening than threatening condi- 
tion, If, on the other hand, the effects of 

“al Shol on aggressive responding are indepen- 

nt of the degree of threat inherent in the 

“situation, one would expect the intoxicated 


TA subjects to respond in a highly aggressive 


manner in both the threatening and non- 
threatening conditions. 


METHOD 
Subjects 


The subjects were 40 male undergraduates over 
18 years of age who attended Kent State University. 
Subjects were recruited at various buildings on the 
campus by means of folders advertising the need for 
volunteers who would be paid for participating in 
a psychological experiment. The possible use of 
alcohol was not mentioned in the folder. The poten- 
tial subjects were contacted by phone and informed 
that the experiment concerned the influence of alco- 
hol on perceptual-motor skills, that they would 
receive $4 for their participation, and that the 
experiment would be conducted at the university 
health center. 

.* Upon arriving at the health center, the subjects 
were asked to sign a statement acknowledging that 
they had had some previous experience with alcohol, 
that they were aware of the maximum amount of 
alcohol they could consume (8 ounces, or .24 1), and 


939 


that they knew they could terminate their services 
at any time without penalty, The subjects also au- 
thorized the director of the health center to examine 
their health records in order to rule out any medical 
reasons for not participating in the experiment. 


Procedure 


Each subject was randomly assigned to one of four 
conditions: alcohol-threat, alcohol-no threat, no 
alcoho]-threat, and no alcohol-no threat. 

After placing a small amount of crushed ice in his 
mouth, the subject in the alcohol group consumed 
1.5 ounces (.045 1) of 100-proof vodka per 40 
pounds (18 kg) of body weight. The beverage was 
administered in two equal doses, 15 minutes apart. 
Each drink contained 40% vodka, 60% ginger ale, 
and two drops of peppermint oil (United States 
Pharmacopoeia). 

The subject in the no-alcohol group received a 
placebo consisting of a mixture of ginger ale (2 0z./ 
40 lbs., or .06 1/18 kg) and two drops of pepper- 
mint oil in two equal doses. One twentieth of an 
ounce of 100-proof vodka was poured upon the 
surface of the mixture. In both conditions the sub- 
jects were not informed of the amount or type of 
alcoholic beverage they consumed, and it was ob- 
served that the peppermint oil obscured any taste 
differences between the mixtures of the two groups. 

Thirty minutes after the second drink had been 
consumed, each subject was led to a small cubicle 
that contained the task board (a modification of the 
task board described by Taylor, 1967). The subject 
was seated by the experimenter, and a concentric 
shock electrode was attached to his left wrist. Fol- 
lowing a 5-minute delay, during which the experi- 
menter was presumably communicating to the oppo- 
nent, the subject’s and confederate’s unpleasantness 
threshold for shock was determined. During this 
phase, all subjects overheard, through an intercom, 
the opponent inform the experimenter (a) when he 
felt the first shock and (b) when the shock was 
“definitely unpleasant.” 

Each subject was then informed by means of a 
tape-recorded message that he was competing in a 
task involving reaction time with the other “subject” 
in an adjoining room. At the beginning of each trial 
he was instructed to select (by pressing 1 of 10 
buttons) any of the 10 intensities of shock he wished 
his opponent to receive. He was informed that the 
shock would be administered to his opponent at the 
end of a trial if he were faster than his opponent 
and that he would receive the shock his opponent 
set for him if his opponent were faster. Thus, he 
realized that either he or his opponent would receive 
a shock, depending upon the outcome of the com- 
petition, and that both could select the intensity of 
the shock the other would receive. 

The experimenter then asked the subject and the 
opponent if they had any questions concerning the 
task. The subject overheard, through the intercom, 
one of two tape-recorded messages communicated to 
the experimenter by the opponent. In the threat con- 
dition, the opponent simply replied, “No questions.” 
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In the no-threat condition, the interchange proceeded 
as follows: 


Yes, I have a question. I have 
some strong convictions about 
hurting people and I'd feel more 
comfortable about this thing if I 
just set the one button all the 
time. Would that bother the 
experiment? 

[Experimenter] What you do with the shock but- 
tons is up to you, as long as you 
set something and do the reaction 
time part as fast as you can. 
Okay, thank you. 


[Opponent] 


[Opponent] 


The task was then begun. All subjects received 19 
trials. These consisted of three six-trial blocks in 
which the opponent behaved in a passive manner 
and a trial to measure the subject’s reaction to the 
feedback of the 18th trial. Each of the three blocks 
consisted of six trials in which the opponent always 
used the one shock. The frequency of wins and 
losses (50%) and the amount of shock received were 
programmed by the expeximenter. 

In establishing the intensities of shock to be 
administered to each subject, the intensity judged 
as definitely unpleasant was designated as Number 
10; 9 was set at 95% of maximum; 8 at 90% of 
maximum; 7 at 85% of maximum, and so forth. 


RESULTS 


An analysis of variance was first per- 
formed on the subjects’ initial shock settings. 
According to this analysis, the intoxicated 
subjects initiated higher levels of attack than 
the nonintoxicated subjects, F(1,36) = 9.4, 
p < 01. The mean shock intensities set by 
the subjects in the alcohol and no-alcohol 
conditions were 3.45 and 2.20, respectively. 
The main effect of threat also proved to be 
highly significant, F(1,36) = 33.3, with the 
subjects in the threat condition setting more 
intense initial shocks (M = 4.0) than the sub- 
jects in the no-threat condition (M = 1.65). 

The Alcohol x Threat interaction was sig- 
nificant at the .01 level, F(1,36) = 8.0. As 
can be seen in Figure 1, under threatening 
conditions, the intoxicated subjects set more 
intense initial shocks than the nonintoxicated 
subjects. Under nonthreatening conditions, 
however, the intoxicated and nonintoxicated 
subjects set similar shock levels. Newman- 
Keuls analyses indicated that the difference 
in mean shock settings between the alcohol 
and no-alcohol groups in the threatening 
condition was significant at the .05 level. The 
mean shock intensities set by the intoxicated 
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and nonintoxicated subjects in the non- 
threatening condition were not significantly 
different. Finally, the post hoc analysis dem- 
onstrated that both the alcohol and no-alcohol 
groups set significantly higher shock inten- 
sities in the threat condition than in the 
no-threat condition (p < .05). 

The long-term effects of the opponents’ 
nonaggressive behavior were assessed by per- 
forming a 2 X 2 X 3 analysis of variance on 
the mean shocks set by the subjects on the 
three blocks of trials. According to this analy- 
sis, the main effects of alcohol and threat 
were significant at the .01 level, F(1,36) = 
10.6 and 14.1, respectively. The mean shock 
intensities set by the subjects in the alcohol 
and no-alcohol conditions were 2.67 and 1.52, 
respectively, The mean shocks set by the sub- 
jects in the threat and no-threat conditions 
were 2.75 and 1.44, respectively, The Threat ., 
Xx Alcohol interaction also proved to be 
highly significant in the blocks analysis, F(1, 
36) = 8.4. As in the analysis of the initial 
settings, in the threat condition, the intoxi- 
cated subjects set significantly more intense 
shocks (M = 3.83) than the nonintoxicated 
subjects (M = 1.67). In the no-threat condi- 
tion, there was no significant difference in the 
average shock intensities set by the intoxi- 
cated (1.50) and nonintoxicated (1.37) sub- 
jects. Neither the block effect nor the inter- 
actions of blocks with the other variables were 
significant at the .05 level. It would appear 
that within each condition, mean shock-set- 
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Ficure 1. Subjects’ initial shock settings as a function 
of alcohol and threat. 


ting behavior remained rebatively stable over 
the blocks of trials, 


Discussion 
n > 

There are two current theoretical perspec- 
~ tives concerning the relationship between al- 
cohol and aggression. Some researchers (Tay- 
lor & Gammon, 1975) have theorized that 
` the acting-out behavior evidenced by intoxi- 
cated persons may be due to hyperactivity of 
various subcortical areas of the brain, Pre- 
sumably, this “pseudostirgulation” occurs 
when relatively primitive areas of the brain 
are freed from the inhibitory control of the 
cortex. Other investigators have proposed 
that alcohol only modifies the expression of 
aggressive behavior that has been instigated 
by appropriate stimulating conditions (Car- 

' penter & Armenti, 1972). 

The behavior of the subjects in the threat- 
ening condition could be interpreted as sup- 
porting the pharmacological disinhibition 
hypothesis. Instead of setting low-shock in- 
tensities, in accorda™ ~ with the traditional 
norm of reciprocity, ` 2 intoxicated subjects 
instigated relatively intense attacks against 

» the opponent. Such behavior is, of course, so- 
cially inappropriate, since the competitor did 
not give the subject any reason to believe 
that he was going to be harmed. Furthermore, 
the alcohol subjects continued to attack the 
opponent who adopted a passive strategy. 
They did not evidenée an appreciable reduc- 
tion in their shock settings for approximately 
17 trials. 5 ah, 

The ‘physiological disinhibition hypothesis 
canrfot easily account for the behavior of the 
intoxicated subjects in the nonthreatening 
condition, where the subject overheard a 
complex message suggesting that the oppo- 
nent had pacifistic intentions. Their response 
to this communication was neither malevolent, 
irrational, nor insensitive. Instead, they re- 
sponded in a highly appropriate manner, set- 

_ ting low-intensity shocks initially as well as 
during the competitive session. Furthermore, 
they evaluated the pacifistic opponent in a 
favorable manner. One intoxicated subject, 
for example, stated that he thought the op- 
ponent was “straight and very compassion- 
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ate.” Another noted, “My competitor was an 
extremely fair person who treated me better 
than I treated him.” 

In this experiment, alcohol led to increased 
levels of aggression only under conditions in 
which there was some degree of threat. Thus, 
the results suggest that physical aggression is 
a function of the interaction of alcohol con- 
sumption and the degree of threat inherent in 
a particular situation. Obviously, these find- 
ings do not support the “pseudostimulation” 
or simple physiological disinhibition hypothe- 
sis. However, why the intoxicated subjects in 
the threat condition evidenced such a strong 
tendency to initiate attacks against a peer 
who had not communicated a specific intent 
to harm remains to be answered. One possi- 
bility is that by disrupting various percep- 
tual and cognitive processes, the alcohol dis- 
turbed the subject’s ability to judge the de- 
gree to which he was being threatened. An 
intoxicated person may attribute more ag- 
gressive intent to another person’s actions 
than a nonintoxicated person. Currently, re- 
search is being conducted in our laboratory 
to determine the influence of alcohol on the 
attribution process. Another possibility, which 
should be investigated, is that alcohol may 
influence a person’s ability to regulate his own 
retaliatory response tendencies. 
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Substantive and Structural Elements of Effective Plans 


for Self-Control 
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This study compared the effects of different kinds of self-instructional plans 
on preschool children’s resistance to temptation, In the experimental situation, 
subjects worked on a lengthy task in the face of tempting distractions in order 
to win attractive rewards. One plan directed the inhibition of attention to the 
temptation (temptation-inhibiting plan). Another focused on the rewarding 
consequences of continued work (reward-oriented plan). A third plan empha- 
sized continued attention to the task itself (task-facilitating plan). Each plan 
was presented by the experimenter in either elaborated or unelaborated form. 
In the elaborated form, both the nature and actual wording of the plan were 
specified by the experimenter. In the unelaborated form, the experimenter speci- 
fied the attentional focus but not the wording of the plan so that subjects 
could generate their own versions, Compared to the no-plan and irrelevant 
plan control conditions, resistance to temptation was facilitated only in the 
temptation-inhibiting and reward-oriented plan conditions, and only when 
these plans were presented in the elaborated form. These findings suggest the 
importance of both substantive and structural factors in determining the efficacy 


of plans for self-control. 


The idea that in order to understand the 
patterning of a person’s behavior one must 
study the nature of his plans has been one 
which has received considerable theoretical 
emphasis in recent years (e.g., Miller, Gal- 
anter, & Pribram, 1960). Although the im- 
plications of this idea have been pursued ex- 
tensively in cognitive psychology and psycho- 
linguistics (Mandler, 1968; Neisser, 1967), 
its implications for personality and social be- 
havior are only beginning to be explored in 
studies of self-instructions (e.g., Hartig & 
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Kanfer, 1973; Kanfer, Cox, Greiner, & Ka- 
roly, 1974; Kanfer & Zich, 1974) and of 
plans (Patterson & Mischel, 1975, 1976). 

In a systematic analysis of the influence of 
planning on social behavior, a distinction may 
be made between the organization or struc- 
ture of plans on the one hand and their con- 
tent or substance on the other. From a struc- 
tural point of view, an important issue con- 
cerns how the level of organization or degree 
of structure of plans affects their ability to 
direct or influence behavior in desired direc- 
tions. In studying the substance of plans, a 
question of central interest concerns ways in 


which the contents of plans (i.e., their sub- 


stantive nature) affect their ability to provide 
effective guides for behavior. The present 
study was conducted in an effort to begin an 
analysis of plans for children’s self-control 
from these twin perspectives, 

One potentially important structural char- 
acteristic of plans is the level of detail with 
which they are stated. Plans may range from 


unelaborated statements of intention (eg? 


“TIL study later”) to highly articulated sets 

of specific steps and contingencies for reach- 

ing a goal (e.g., “At eight o’clock, TIL open 

my French notes, then . . ”), Our general 
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question here was the following: To what 
extent does the success of plans in facilitating 
self-control depend upon their being elab- 
orated in great detail before the occasion for 
heir execution? Is it essential that many de- 
tails of a plan for self-control be specified in 
advance if it is to exert beneficial effects on 


. subsequent performance, or will it suffice to 


provide a relatively unelaborated plan for 
vhich children may generate their own ver- 


< sions 


Although work with adults has indicated 
that they are often able to employ relatively 
sketchy plans to good advantage in a wide 
variety of otherwise problematic situations 
(Meichenbaum & Cameron, 1974; Paivio, 
1971), research with children has yielded less 
consistent results. Meichenbaum and Good- 
) 71) gave impulsive children exposure 
: a verbalizing model and practice in gen- 


+ evating their own verbalizations but did not 


fy what the exact substance of the chil- 
dren’s verbalizations should be in performing 
on a variety of cognitive tasks. The children 
who had received this treatment exhibited 
onsiderable improvement of performance on 
a later test when compared with untreated 
ontrol subjects. On the other hand, Flavell 


, and his colleagues reported that although 


young children will use verbal plans to good 
advantage in memory tasks if directly in- 
structed to do so, they do not employ such 
strategies spontaneously or in response to less 
‘han completely formulated suggestions to do 
so (Keeney, Cannizzo, & Flavell, 1967; 
Moely, Olson, Halwes, & Flavell, 1969). 
Thus, whether young children are able to 


_ employ less than completely specified plans 


usefully seems to depend in part on the par- 
ticular nature of their training and the spe- 
cifics of the testing situation. One aim of the 
present study was to compare children’s abil- 
ity to use plans of varying degrees of elab- 
oration in a resistance-to-temptation situation. 
Accordingly, some children were offered plans 
which specified exactly what verbalizations 


, they should employ to facilitate their efforts 


at self-control in a subsequent test situation, 
while others were offered plans which speci- 
fied the nature but not the specific contents of 
such verbalizations. 
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Tn an analysis of the effects of the substan- 
tive nature of plans on their ability to facili- 
tate self-control, an important issue concerns 
the relative efficacy of plans that direct atten- 
tion to different aspects of the self-control 
situation. In earlier work, we found that chil- 
dren resisted temptation effectively when 
they used a verbal plan which directed them 
to inhibit attention to the temptation (temp- 
tation-inhibiting plan), but not when they 
used a plan which directed them to attend to 
the task which they were attempting to com- 
plete (task-facilitating plan; Patterson & 
Mischel, 1976). In that study, the plans con- 
tained both verbal and motor components; 
that is, they told the subject both what to do 
and what to say out loud to himself while 
involved in the self-control situation. In the 
present study, we sought to provide an ex- 
tended replication of these findings by com- 
paring the effects of temptation-inhibiting and 
task-facilitating plans that include only a 
verbal component—that is, that tell the sub- 
ject what to say to himself, but not what to 
do—on children’s self-control. 

Another main issue relating to the role of 
the substantive nature of plans concerns the 
efficacy of plans which focus on the reward- 
ing consequences of successful self-control. In 
previous work, the effects of reward-oriented 
cognitive strategies on children’s self-control 
have been mixed. For example, Mischel, Eb- 
besen, and Zeiss (1972) found that subjects * “ 
ability to sustain waiting in a voluntary delay 
of reward situation was impaired when the 
subjects were instructed to think about the 
reward objects. On the other hand, when 
ideation about the rewards was structured as 
an instrumental activity that would shorten 
the delay interval, instructions to think abou! 
the rewards facilitated subjects’ ability to sus- 
tain waiting behavior (Mischel & Underwood, 
1974), These results suggest that when the 


subject is helpless to affect the lene). sf the 
delay period, ideation about the r: ds im- 
pairs his ability to delay gratifi u If, on 
the other hand, the subject be! that his 
ideation can have an instru; ‘| function 


(ie., shortening the delay pe 
ing about the rewards { 
control efforts, 


, then 
ates hi 


N 
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In the resistance-to-temptation situation 
studied here (Patterson & Mischel, 1976), the 
subject is motivated to work on a long task 
in the face of tempting distractions. Whether 
or not he will receive the preferred reward 
depends upon how much work he completes— 
in other words, upon his own instrumental 
activities. Based on the results from earlier 
work, then, it was expected that use of a 
plan which would remind the subject of the 
rewards for successful self-control would fa- 
cilitate resistance to temptation relative to a 
no-plan control. No prediction was made 
about the relative efficacy of the reward- 

| oriented and temptation-inhibiting plans. 

In sum, the effects of three kinds of verbal 
plans—temptation-inhibiting plans, reward- 
oriented plans, and task-facilitating plans— 
on children’s resistance to temptation were 
studied. Each plan was studied in two forms. 
In the elaborated form, both the nature and 
actual contents of the plan were specified by 
the experimenter; in the unelaborated form, 
the experimenter specified the nature of the 
plan, but it was left to the subject to generate 
the particular words with which to execute the 
plan. Two additional groups were run as con- 
trols. In one of these groups subjects were 
given an irrelevant verbal plan (i.e., a nursery 

hyme to recite during the resistance-to-temp- 

ee test), and in the other they were not 
‘given plans of any kind. Dependent measures 
‘assessed the degree to which these plans fa- 
cilitated subjects’ continued work on a task in 
the face of attractive temptations to stop 
working. 


METHOD 
Design 


The experiment employed a 3X2 design that 
yaried the attentional focus (temptation-inhibiting, 
reward-oriented, and task-facilitating) and degree 
of elaboration (elaborated vs. unelaborated) of plans, 
and assessed the effects of these variations on sub- 
jects’ self-control in a subsequent resistance to temp- 
tation test situation. Two additional groups, one 
that was instructed to verbalize irrelevant material 
and one with no instructions about verbalization, 
were also run as controls. 


Subjects 


The subjects were 70 children, 35 girls and 35 
boys, enrolled in the Bing Nursery School on the 
Stanford University campus. They ranged in age 
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from 3 years 6 months to 5 years 3 months, with a 
mean age of 4 years 6 months. Subjects were as- 
signed on a random basis, 10 to each cell of the 
3 X 2 design, and 5 to each of the control conditions. 


Procedure 


Since the procedure was very similar to that em- 
ployed by Patterson and Mischel (1976), it will be 
described only briefly here. 

Upon entering the experimental room, a subject 
was first shown two sets of toys, the “fun toys” and 
the “broken toys,” and asked which set he would 
prefer to play with. After he had chosen (as all * 
subjects did) the “fun toys,” he was led to the other 
side of the room and seated at a table. 

The experimenter then introduced a pegboard task 
that required the subject to put a peg into every 
hole on the pegboard. The experimenter declared 
that she would have to leave the room, but that 
the child could work on the pegboard task while 
she was gone. 

The child was then introduced to Mr. Clown Box, 

a large wooden box painted to resemble a clown (see 
Mischel, 1976, p. 439). In a standard sequence, Mr. 
Clown Box (via a tape recording played through a 
hidden speaker) engaged the experimenter and child 
in a brief, lively conversation and demonstrated that 
attractive toys could sometimes be seen in the 
Clown Box’s windows. 

At this point the experimenter explained the con- 
tingency to the child. The subject was told that if 
he completed the pegboard task in the allotted time, 
then he would be allowed to play with the “fun 
toys” and with Mr. Clown Box when the experi- 
menter returned. If the subject did not finish the 
task, he was told, then he would be allowed to play 
with only the “broken toys” upon the experimenter’s 
return. Subjects’ understanding of this contingency 
was carefully assessed; all subjects for whom data 
are presented were able to indicate their understand- 
ing. 

To ensure that all children would be maximally 
motivated to work on the pegboard task and to 
ignore Mr. Clown Box, the experimenter. asked the 
child to promise both to work hard and to resist 
Mr. Clown Box, should he tempt the chi 
working. All subjects agreed to both 

If the subject had been assigneg/ to one of the 
plans conditions, the experimente gave a standard 
introduction to the plan, which was always identical, 
If the subject had been assigned to the elaborated 
temptation-inhibiting plan condition, she continued: 


When Mr. Clown Box says to look at him and 
play with him, then you can just say, “No, I’m 
not going to look at Mr. Clown Box.” So that’s 
what you can do. He says, “Look,” and you say, 
“No, I'm not going to look at Mr. Clown Box.” 


The plan was introduced in the same way for 
subjects in the elaborated reward-oriented grou] 
except that they were instructed to say, “I want a 
play with the fun toys and Mr. Clown Box later.” 
Again, the introduction was the same for those ie 
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the elaborated task-facilitating plan condition, except 
that they were told to say, “I’m going to look at 
my work.” 

If the subject had been assigned to the unelab- 
orated temptation-inhibiting plan condition, he was 
told, 


When Mr. Clown Box says to look at him and 
play with him, then you can just think of some- 
thing to say out loud that will help you not to 
look at Mr. Clown Box. You can say anything 
you want that will help you not to look at Mr. 
Clown Box. So that’s what you can do. He says, 
“Look,” and you say something to keep yourself 
from looking at Mr. Clown Box. 


In the unelaborated reward-oriented plan condi- 
tion, the plans were introduced in the same way, 
except that the subject was told to “think of some- 
thing to say out loud that will remind you of what 
you get to play with if you, finish your work.” 
Again, the introduction of plans was the same for 
those in the unelaborated task-facilitating plan con- 
dition, except that they were instructed to “think of 
‘something to say out loud that will help you to keep 
looking at your work.” 

For subjects in the irrelevant plan condition, the 
experimenter said, 


While I’m gone out of the room, you can say to 
yourself, “Hickory Dickory Dock, the mouse ran 
up the clock.” So that’s what you can do. While 
I’m gone out of the room, you can say { Your- 
self, “Hickory Dickory Dock, the mouse| An up 
the clock.” 


Finally, in the no-plan condition, the experimenter 
gave no instructions about what to say, but instead 
went directly on- with the concluding instructions. 

The concluding instructions were the same for all 
subjects. The experimenter indicated that she would 
be leaving the room, reiterated the contingency, and 
told the child that she hoped he would work hard. 

The test phase began as the experimenter left the 
room (allowing the child to begin work on the 
pegboard task) and it lasted for 6 minutes. During 


this time the Clown Box performed a standardized . 


routine designed to tempt the child to stop working 
(see Patterson & Mischel, 1975, 1976, for more de- 
tails). 


Dependent Measures 


The dependent measures were recorded by two 
observers, using a multichannel Rustrak event re- 
corder. The data are based on proportion of the 
test phase spent working on the task, amount of 
work completed (number of pegs inserted into the 
pegboard), the number of separate departures from 
working on the task, and the mean length of each 
such departure. 

Tape recordings of the experimental sessions were 
transcribed to yield records of the children’s verbali- 
zations during the.test phase. Two judges, blind to 

subjects’ conditions, scored all of the protocols for 


¢ 
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the occurrence of verbalized plans similar to those 
suggested by the experimenter. The judges em- 
ployed five task-relevant categories (temptation- 
inhibiting, task-facilitating, reward-oriented, irrele- 
vant, and self-generated plans) and achieved nearly 
perfect agreement in their scoring. The single in- 
stance of disagreement between the scorers was re- 
solved by discussion. The verbalization data for five 
children—two in the elaborated task-facilitating, one 
in the unelaborated ,temptation- ~inhibiting, one in the 
unelaborated task- -facilitating, and one in the no- 
plan control condition—were lost as.a result of 
equipment failure, 


RESULTS 
Use of Self-Instructional Plans 


The number of children who employed any 
task-relevant verbalizations in each condition 
was calculated. As expected, more subjects in 
the elaborated plan conditions than in the 
control conditions used task-relevant verbali- 
zations (19 of 28 versus 2 of 9; Fisher Exact 
p = .02). The number of children in the un- 
elaborated plan conditions who used task- 
relevant verbalizations did not differ signifi- 
cantly from that in the control conditions (11 
of 28 versus 2 of 9; p = .22). A further conl- 
parison indicated that more children in the 
elaborated than in the unelaborated plan con- 
ditions employed task-relevant verbalizations, 
x2(1) = 4.60, p <.05. 

The number of children in each condition 
who employed a verbal plan like that sug- 
gested by the experimenter was also calcu- 
lated. Of the 28 children in elaborated plan 
conditions 17 employed a plan like that sug- 
gested to them by the experimenter, whereas 
only 5 of the 28 in the unelaborated plan con- 
ditions did so,  *(1)'= 10.76, p< .005. 
Within the elaborated plan conditions, 7 of 
10 in the temptation-inhibiting, 9 of 10 in the 
reward-oriented, and 1 of 8 in the task-facili- 
tating plan conditions employed verbaliza- 
tions like those suggested by the experimenter. 
In the unelaborated plan conditions, 3 of 9 in 
the temptation-inhibiting, O of 10 in the re- 
ward-oriented, and 2 of 9 in the task-facili- 
tating plan conditions used the verbal plans 
which the experimenter had suggested to 
them. In the irrelevant plan condition, 1 of 
the 5 subjects employed the suggested verbal- 
ization. 
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Ficure 1, Time spent working as a function of elaboration and attentional focus of plans, 


Also of interest is the number of subjects 
in each condition who generated verbal plans 
of their own, apart from any suggested by the 
experimenter. Of the 28 children in the elab- 
orated plan conditions, 13 generated their 
own plans, whereas only 6 of 28 in the un- 
elaborated plan conditions and 2 of 9 in the 
control conditions did so. The proportion of 
children who verbalized a self-generated plan 
in the elaborated plan conditions did not dif- 
fer from that in the control conditions (Fisher 
Exact test p= .14), but was greater than 
the proportion of subjects in the unelaborated 
plan conditions who did so, x*(1) = 3.90, 
p <.05. 

In summary, a greater proportion of chil- 
dren in the elaborated plan conditions em- 
ployed task-relevant verbalizations than did 
those in the unelaborated plan conditions, 
This was reflected in the elaborated plan sub- 
jects’ greater tendency to employ the plans 
that had been suggested to them and also in 
their greater tendency to generate additional 
task-relevant verbalizations of their own. 


Time Spent Working 


The mean proportions of time spent work- 
ing by subjects in each condition are shown 
in Figure 1, These data did not meet the 
homogeneity of variances assumption of the 


analysis of variance, Fmax = 11.62; p< .05. 
Since the means were proportional to the 
variances, a square root transformation was 
applied, and the resulting scores were sub- 
mitted to a 2 X 3 analysis of variance. This 
analysis demonstrated a main effect for elab- 
oration of plans, F(1,54) = 4.15, p <.05, 
and a main effect for the attentional focus of 
plans, F(2, 54) = 6.44, p < .01. The interac- 
tion did not approach significance (F = 1.18, 
ns). 

Within the 2 X 3 design, subjects in the 
elaborated temptation-inhibiting and elab- 
orated reward-oriented plans conditions worked 
for a greater proportion of the test phase 
than those in other plans conditions, F(1, 54) 
= 13.18, p < .01. Within both the tempta- 
tion-inhibiting and reward-oriented plans con- 
ditions, those in the elaborated plans condi- 
tions spent more time working than those in 
the unelaborated plans conditions, #(18) = 
2.81 and 1.94, respectively, both ps < .05. In 
the task-facilitating condition, however, 
amount of time spent working did not differ 
between elaborated and unelaborated plans 
conditions (£< 1). As Figure 1 shows, re- 
gardless of elaboration, subjects in the task- 
facilitating plan conditions spent relatively 
little time working. There were no significant 


< 
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differences in time spent working between 
the three unelaborated plans conditions. 
Performance in the two control groups did 
not differ (ż < 1), so they were combined to 
form a single control group for all further 
comparisons. A £ test confirmed that children 


in the elaborated temptation-inhibiting and 


elaborated reward-oriented plan conditions 
spent more time working than control sub- 
jects, #(18) = 3.84 and 3.71, respectively, 


* both ps < .01. No other comparisons with the 


control group approached significance (all ts 
Ta 

Scores for amount of work completed 
(number of pegs inserted into the pegboard) 
were strongly correlated with the scores for 
proportion of time spent working (r= .87; 
n= 70; p< .0001). Since the two scores 
were essentially redundant with each other, 
the analyses of scores for amount of work 


~ done are not considered in detail. It should 


be noted, however, that paralleling the find- 
ing for time spent working, children in the 
elaborated temptation-inhibiting and elab- 
orated reward-oriented plan conditions com- 
pleted significantly more work than those in 
other cells of the 2 X 3 design, F(1,54) = 
13.50, p < .01, and than those in the control 
condition, #(28) = 1.77, p < .05. 

A 2 X 3 analysis of variance on the scores 
for the speed with which subjects put in pegs 
during the time that they were working re- 
vealed no significant differences among con- 
ditions. 


Number and Length of Departures From 
Work 


Although analyses of the data for total 
amount of time spent working demonstrated 
that, overall, subjects in the elaborated temp- 
tation-inhibiting and elaborated reward-ori- 
ented plans conditions spent more time work- 
ing (or, conversely, less time attending to the 
Clown Box) than other subjects did, these 
analyses did not reveal the source of this ef- 
fect. The large proportions of time spent 
working in these conditions might reflect 
fewer separate departures from work (i.e., 
smaller frequencies of departures), shorter 
departures (i.e., departures of shorter dura- 
tion), or some combination of these factors. 
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TABLE 1 
MEAN NUMBER OF SEPARATE DEPARTURES FROM 


Work As A FUNCTION OF ELABORATION AND 
ATTENTIONAL Focus OF PLANS 


Attentional focus of plan 


Tempta- Task- 
tion- Reward- facili- 
Plan inhibiting oriented tating Control 
Elaborated 
M 10.1 9.8 4,7 
SD 5.9 5.6 3.6 
Unelaborated 
M 9.6 5.0 6.3 
SD 5.6 3.9 5.6 
Control 
M 9.5 
SD 5.0 


To examine these possibilities, both the num- 
ber and average duration of separate depar- 
tures from work were calculated for each 
subject. 

Each occasion on which the subject looked 
at the Clown Box for at least 1 sec during the 
test phase was scored as a separate “depar- 
ture” from work; two or more such looks 
separated in time by fewer than 5 sec were 
scored as a single departure. The event re- 
corder tape for one subject in the elaborated 
reward-oriented condition was inadvertently 
destroyed prior to the performance of this 
analysis; hence his data are not included. 

The mean numbers of separate departures 
from work across conditions are shown in 
Table 1. A 2 X3 analysis of variance on 
these data demonstrated a main effect of the 
attentional focus of plans, F (2,53) = 3.67, 
p < 05; neither the main effect for elabora- 
tion of plans nor the interaction approached 
significance. Subsequent comparisons indi- 
cated that subjects in the task-facilitating 
plan conditions left their work less frequently 
than those in the temptation-inhibiting plan 
condition, F(1,53) = 7.24, p< .01. The 
fewer departures from work in the task-facili- 
tating plan conditions reflect the fact that 
these subjects, rather than moving often be- 
tween the work and temptation situations, 
simply spent most of their time attending to 
the temptation. Those in the reward-oriented 
plan conditions did not differ significantly 
from subjects in either of the other two plans 
conditions. The fact that the number of de- 


H SEPARATE 
oTION OF 
ONAL 


DEPARTURE FROM WORK AS A 
ELABORATION AND ATT) 
Focus or Pi 


Attentional focus of plan 


Tempta- 
tion- 


Reward- facili- 
tating 


inhibiting oriented Control 
44 5.7 46.9 
3.4 3.8 40.2 
Unelaborated 
M 20.8 54.0 614 
SD 27.9 88.6 96.1 
Control 
M 53.1 


SD 95.8 


partures from work exhibited in the tempta- 
tion-inhibiting and reward-oriented plans con- 
ditions was not significantly different than in 
the control groups suggests that the facilitat- 
ing effects of plans on time spent working 
must be reflected in briefer departures from 
work rather than in less frequent ones. 

The mean lengths (in seconds) of each 
separate departure from work across condi- 
tions are shown in Table 2. As is evident from 
the table, the homogeneity of variances as- 
sumption of the analysis of variance was not 
met by these data (Fmax = 973.3, p < .001). 
Since the n s were proportional to the vari- 
ances, 4 are root transformation was ap- 
plied, anc resulting scores were submitted 
to a 2 X 3 analysis of variance. Paralleling the 
results for the time spent working data, the 
analysis revealed significant main effects for 
both elaboration of plans, F(1, 53) = 4.10, 
p < .05, and attentional focus of plans, F (2, 
53) = 4.12, p < .05; the interaction did not 
approach significance. The mean lengths of 
each separate departure from work were sig- 
nificantly shorter for subjects in the elab- 
orated temptation-inhibiting and elaborated 
reward-oriented plan conditions than for those 
in any of the other plan conditions (fs rang- 
ing in size from 1.97 to 3.30; df= 17 or 18; 
all ps < .05) and for subjects in the control 
condition (¢s = 2.32 and 2.04, respectively; 
dj = 18 and 17, respectively; both ps < .05). 
No other comparisons between groups were 


significant. 
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The significant differences between vati- 
ances noted above are also of interest in their 
own right. The variances for the mean lengths 
of each departure from work in the elaborated 
temptation-inhibiting and elaborated reward- 
oriented plan conditions were significantly 
smaller than those for all of the other plans 
groups and for the control group (Fs ranging 
from 54.1 to 973.3; all ps < .01), indicating 
that these plans not only affected the mean 
length of each departure from work but also 
served to limit individual variability around 
the means. 

In sum, the facilitating effect of plans was 
due to the fact that the mean length of each 
departure from work was significantly shorter 
for those in elaborated temptation-inhibiting 
and elaborated reward-oriented plan condi- 
tions than for all other conditions. Thus, the 
factor responsible for the success of the elab- 
orated temptation-inhibiting and elaborated 
reward-oriented plans in facilitating children’s 
ability to work in the face of temptation was 
the ability of these plans to help the subject 
limit to a few seconds the length of each sepa- 
rate departure from the main task. 


Effects of Age and Sex 


The correlation of age with proportion of 
time spent working during the test phase was 
a modest .206 for the whole sample (n = 70; 
ns). Consistent with the findings of earlier 
research (Patterson & Mischel, 1975, 1976), 
this correlation was higher in the control con- 
dition (r= 518; m= 10; ns) than in the 
plans conditions (r = .182; n = 60; ns). The 
differences for sex of subject were more pro- 
nounced, with girls working for a greater pro- 
portion of the test phase, on the average, than 
boys, £(68) = 3.65, p< .01; no interactions 
between sex of subject and efficacy of plans 
were observed, 


Discussion 


The results of this study provide clear eyi- 
dence that the efficacy of plans for self-con- 
trol may depend upon both the substance and 
the structure of those plans. Children’s ability 
to sustain continued work in the face of 
temptation was significantly facilitated b 
elaborated plans which focused on the is 
warding consequences of Successful self-con- 
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trol or on the inhibition of attention to the 
temptation, but not by elaborated plans which 
focused attention on the task to be accomp- 
lished. Unelaborated plans, which required 
the subject to generate the specific wording 
of verbalizations, did not facilitate self-control 
performance over that observed in irrelevant 
verbalization or no-plan control conditions. 

An important finding of this study was 
that the efficacy of elaborated plans depended 
on their substantive nature or content. The 
success of the elaborated temptation-inhibit- 
ing plan and the failure of the elaborated 
task-facilitating plan in facilitating subjects’ 
self-control performances provides an extended 
replication of the findings reported by Pat- 
terson and Mischel (1976). Whereas the ver- 
sion of these plans employed in the earlier 
study had specified both the subjects’ verbal 
and motor behavior, those used in the prezent 
study contained only instructions about what 
the subject should say to himself. Thus, the 
present findings provide additional support 
for the notion that one kind of cognitive 
activity underlying an ability to sustain goal- 
directed work in the face of tempting distrac- 
tions is the active suppression of attention to 
the tempting stimuli. One cannot be certain 
why the plans that focus attention on the 
work itself (task-facilitating plans) proved 
consistently ineffective, but it seems reason- 
able to speculate that their failure occurred 
because the incentive or reward value of the 
work activity was weak when compared to 
the motivating power of the temptation. 

We also found that an elaborated plan 
which focused the subject’s attention on re- 
warding consequences for continued work was 
successful in facilitating children’s resistance 
to temptation. In the present study, as in the 
Mischel and Underwood (1974) one, subjects 
believed that they were in an instrumental 
situation, that is, that their success in the 
situation depended on their performance of 
certain active, instrumental responses. In this 


` kind of situation, ideation about the rewards 


for successful self-control facilitated children’s 
performances. In the Mischel, Ebbesen, and 


a . Zeiss (1972) situation, in contrast, the sub- 
“jects were essentially helpless to affect the 


length of the delay required to win the pre- 


ferred reward; in this passive situation, at- 
tention to the reward objects seemed to im- 
pair the children’s self-control performances. 
These results would seem to suggest that 
whether a cognitive focus on rewards will 
help or hinder children’s efforts to achieve 
self-control depends upon whether successful 
performance requires active, instrumental ac- 
tivity or passive waiting on the part of the 
subject. In any event, the use of a plan which 
focused attention on rewards for working cer- 
tainly facilitated self-control in the present 
situation. Further research is needed to clar- 
ify the exact role of ideation about rewards 
in this and other instrumental self-control 
situations. 

The results of this study also point to the 
importance of structural factors in determin- 
ing whether or not a given plan will facili- 
tate children’s self-control. Although the chil- 
dren in this study made good use of fully 
elaborated plans, they seemed quite unable to 
employ the unelaborated variants of these 
plans to any advantage. Our findings suggest 
that the details of a plan must be specified if 
the plan is to be successful in facilitating 
young children’s self-control in this situation. 

These findings raise more general questions 
with respect to the efficacy of plans formu- 
lated at varying levels of abstraction. On the 
one hand, it seems reasonable to assume that 
cognitive economy would be served by stating 
one’s plans at relatively high levels of abstrac- 
tion, since a generalized plan will certainly 
be applicable to a wider variety of situations 
than any of its fully detailed variants. On the 
other hand, successful use of a generalized 
plan requires that the subject be capable of 
filling in appropriate details at the moment 
when implementation of the plan becomes 
necessary. Our findings suggest that when a 
plan is less fully elaborated than a particular 
set of cognitive and situational conditions 
requires, the plan loses whatever value it 
might have possessed to facilitate perform- 
ance. Hence, it would seem that efficient use 
of plans requires that a balance be main- 
tained between the level of generality and of 
specificity. The nature of this balance will, 
of course, vary with the age and cognitive 
sophistication of the subject as well as with 
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the pressures and demands of the situations 
in which performance is studied. Our current 
findings demonstrate how surprisingly de- 
tailed and specific plans must sometimes be- 
come in order to achieve their desired ends. 

Finally, it is of interest to note that the 
manner in which successful plans facilitated 
self-control performance in the present study 
was similar to that observed in earlier work 
(Patterson & Mischel, 1976). Successful plans 
facilitated continued work in the face of 
temptation not by limiting the number of 
departures from work but rather by decreas- 
ing the average length of each such departure. 
In other words, the plans that succeeded in 
facilitating resistance did so by ensuring the 
subjects’ rapid return to goal-directed work 
following brief interruptions. 

In summary, self-control plans were effec- 
tive when they either directed the subject to 
inhibit attention to the temptation or to focus 
on the rewards for continued work but were 
e when they focused attention on the 
li. The success of a plan also de- 
its having been elaborated fully 

e. Thus, whether or not a 

tructional plan will facilitate 
young children’s self-control evidently de- 
pend not only on the substantive nature or 
the plan but also on its level of 
on or structure. 
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choice in determining reward or loss in 
situational dimensions that significantly i 


and different situational dimensions. The 
They strongly suggest the value of com 


or risk taking. 


Studies of chance-determined activities have 
traditionally emphasized either the influence 
of situational or individual-difference varia- 
bles. Very few studies have provided a simul- 
taneous treatment of both types of variables 
and an examination of their interactions. The 
present study considers two situational varia- 
bles, (a) probabilities of gambling outcomes 
and (b) the way in which the situation was 
presented; and two individual-difference vari- 
ables, (a) achievement orientation and (b) 
locus of control, These permit us to examine 
a number of explanations of risk-taking be- 
havior. 


Studies of Situational Variables 


Studies emphasizing situational factors can 
be found within both a decision theory frame- 
‘ work and a social psychological framework. 

Unfortunately, in some areas of decision the- 
ory (e.g., algebraic decision theories) experi- 
mentation has not caught up with theorizing 

(Rapoport & Wallsten, 1972). In other areas, 

decision theories have been evaluated by test- 
ing conditions which are only necessary rather 
than necessary and sufficient for its existence, 


. The’ authors wish to thank Herbert Lefcourt for 
~ his helpful suggestions on an earlier draft of this 


article. ` 
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Situational and Individual Difference Variables 
in Chance-Determined Activities 


M. E. El-Gazzar, S. D. Saleh, and D. W. Conrath 
Department of Management Sciences, University of Waterloo 
Ontario, Canada 


The study investigated behavioral responses to chance-determined activities in 
terms of an approach-avoidance tendency, using both situational and individual 
difference variables. The study showed that the probability of winning and the 


a gambling situation were important 
nfluenced the outcome. The study also 


showed an interaction between achievement motivation and locus of control, 


results were mainly explained in terms 


of attribution of responsibility and that risk taking is intrinsically rewarding. 


bining both situational and individual 


difference variables in attempting to understand the determinants of gambling 


as has been the case in most studies of sub- 
jectively expected utility (SEU). This may 
lead to the theory’s rejection but not to its 
acceptance (Tuersky, 1967). 

In general, decision theory models, for dif- 
ferent reasons, have not received conclusive 
experimental support for predicting risk-tak- 
ing or betting behavior. Moreover, very little 
has been said about the influence of the situa- 
tion of the decision maker, In one exception, 
Conrath (1973), using clinical experimental 
evidence, found that the format in which 
probabilistic data was presented to a group 
of managers affected their choice behavior, 

A number of general behavioral tendencies 
related to some situational factors have also 
been revealed through experimentation on de- 
cision making. For instance, it is often found 
that when a range of objective probabilities 
are inferred from decisions, the average sub- 
jective probability of lower probability everts 
are larger (overestimation), whereas the av- 
erage subjective probabilities of higher proba- 
bility events are smaller than objective proba- 
bilities (underestimation) (Lee, 1971, p. 61). 
There is also evidence that the probability 
of losing is the most important single deter- 
miner of perceived riskiness of a gamble 
(Conrath, 1973; Slovic, 1967). 

Decision-theoretic models typically fail to 
explain the above and similar findings. One 
way of examining such behavioral risk-taking 
tendencies is to use multiple-subject designs 
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and psychologically relevant individual-differ- 
ence variables, Such designs should further 
the understanding of the situational and per- 
sonality variables under which such risk-tak- 
ing tendencies can be found, and the psycho- 
logical processes that underlie them. 

From the social psychological perspective, 
many recent studies have investigated the ef- 
fects of different dimensions of betting and 
chance situations, Langer (1975) for instance, 
found subjects more confident of success when 
competing against a nervous, awkward con- 
federate than against a confident confederate. 
She also found that providing the opportunity 
of choice created a greater illusion of suc- 
cess potential. Wortman (1975) showed that 
subjects who caused their own outcome per- 
ceived themselves to have more control over 
the outcome, more choice about which out- 
come they received, and more responsibility 
for their outcome than subjects who did not. 
It was also shown that there was a tendency 
of greater restraint in chance-taking behavior 
when subjects were forced to wager after the 
outcome-determining physical event as com- 
pared with subjects who wagered before the 
event (Strickland, Lewicki, & Katz, 1966). 
These and other social psychological studies 
have shown the importance of studying the 
situational dimensions of a betting or gam- 
bling situation and their effects on behavior. 


Studies of Individual Difference Variables 


With regard to individual-difference varia- 
bles, there has been a growing interest in 
studying the concepts of locus of control and 
ichievement motivation in relation to both 

ill-determined activities (Atkinson & 

ither, 1966; Brody, 1963; Feather, 1967; 

-khausen, 1968; Lefcourt, 1966; Litwin, 
1. 5; McClelland & Watson, 1973; Rotter, 
19. © Rotter & Mulry, 1965; Lichtman & 
Julio . Note 1) and chance-determined activ- 
ities  ittig, 1966; Liverant & Scodel, 1960; 
McClei: ind & Watson, 1973; Ryckman & 
Rodda, 1971). 

Studies of  skill-determined activities 
showed that internally controlled and high- 
need-achievement subjects prefer intermediate 
degrees of tark difficulty, overestimate their 


chances of suc learn faster, perform more 


uickly, recall their errors better, and per- 
a jes under conditions of difficulty. The 
above findings were mainly explained on the 
basis of attribution of responsibility (Weiner, 
Freize, Kukla, Reed, Rest, & Rosenbaum, 
1971). 

There are a few studies which examined 
chance-determined activities in terms of locus 
of control and achievement motivation, In a 
dice-throwing situation Liverant and Scodel 
(1960), for instance, found that internals 
chose significantly more bets of intermediate 
probability and wagered more money on cau- 
tious than risky bets, compared to externals. 

In other words, internals revealed a greater 
tendency toward self-regulation in their pref- 
erence for bets, Liverant and Scodel suggested 
that internals attempt to exert control in a 
chance (externally controlled) situation by 
being cautious, while externals seem to make 
selections on the basis of hunches. 

In a recent study McClelland and Watson 
(1973) contrasted in a roulette game the in- 
fluence of n Achievement with that of n 
Power and n Affiliation. The results showed 
that those strongly motivated for Achieve- 
ment selected significantly more moderate 
risks (p= 1/11) compared to those moti- 
vated for Power or Affiliation, 

Another group of studies investigated the 
interaction between locus of control and 
achievement motivation (Fanelli, 1973; Wolk 
& DuCette, 1973). Wolk and DuCette, for 
instance, using measurements of (a) perform- 
ance in course exams, (b) estimation of 
chances of success, and (c) preference for 
level of difficulty in a perceptual-motor skill 
task, found that the data supported the pre- 
dictions of the achievement motivation theory 
only for internals, Fanelli ( 1973) found 
similar results on two simple clerical skill 
tasks. He found the performance of internals 
with high n Ach was better than that of ex- 
ternals with lower n Ach scores, Correlations 
between performance and each of locus of 
control and of n Ach were not significant, 

It should be noted that studies investigating 
the interaction between the above mentioned 
variables dealt mainly with skilled tasks. One 
of the main objectives of the present study 
is to investigate this interaction in a chance 
situation—a gambling task, 


p" 


CHANCE-DETERMINED ACTIVITIES 


Studies of individual differences are gen- 
erally criticized for their lack of a clear iden- 
tification of situational variables and usually 
fail to include them in their predictive models. 
Moreover, the behavioral measures frequently 
used in such studies (e.g., number of high-, 
medium-, or low-probability bets in a gam- 
bling task) are highly dependent on the situa- 
tion. This, of course, is adequate for predict- 
ing preferences in specific situations, but 
would not be useful for comparison purposes 
among different situations. The present study 
was designed to overcome these problems. 

Five hypotheses are presented. The first two 
test single-sided situational explanations. The 
third hypothesis tests two individual-differ- 
ence explanations. The last two hypotheses 
test predictions based on combinations of 
situational and individual-difference explana- 
tions. In these hypotheses the dependent 
variable is a general behavioral measure of 
risk taking, namely “approach—avoidance ten- 
dency”—the extent to which a given risk 
situation is perceived as attractive (approach) 
or unattractive (avoidance) compared to a 
“fair gamble.” 


Hypotheses 


Hypothesis 1, probability of winning: Sub- 
jects will exhibit a greater approach tendency 
toward situations posing a low probability of 
winning compared to situations posing a high 
probability of winning. 

This hypothesis is based on what is known 
in the utility theory literature as “utility of 
gambling.” It has been long recognized in 
this literature that many people enjoy par- 
ticipating in gambling (Davidson, Suppes, & 
Siegel, 1957; Lee, 1971; Royden, Suppes, & 
Walsh, 1959). Lee explained that this vague 
entity “utility of gambling” would seem to 
relate to the ideas of “variance preference” 
and “risk.” Based on Liverant and Scodel 
(1960), we would expect this hypothesis to 
be particularly relevant for externals. More- 
over, it is expected that a strong risk-taking 
tendency would characterize young under- 
graduate university students like those who 
participated in the present study. 

Hypothesis 2 situation proposition: Sub- 
jects will exhibit a greater approach ten- 
dency toward situations where they determine 
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their own reward, if they win, compared to 
situations where they determine their own 
loss, if they lose. 

This hypothesis is based on several assump- 
tions. People are motivated to see themselves 
as causal agents responsible for their suc- 
cesses (DeCharms, 1968, p. 269; Lefcourt, 
1973), but blame chance or other external 
factors for their failures. Moreover, when 
people see themselves as causal agents for 
their outcomes, they perceive themselves to 
have more control over the situation (Wort- 
man, 1975). Furthermore, it is expected that 
individuals will tend to have a higher ap- 
proach tendency toward a situation which 
they feel they can control than toward one 
which they cannot control. Since the choice 
of reward reflects a success situation in which 
the individual will feel responsible (even in 
a chance situation) and therefore in control, 
the approach tendency should be greater than 
in the loss situation in which the individual 
does not feel he is responsible or has control, 
This hypothesis would also appear to be more 
relevant for internals than for externals, 

The hypothesis may also be explained on 
the basis that anticipation of postdecision dis- 
sonance influences predecision behavior 
(Braden & Walster, 1964). 

Hypothesis 3, interaction between achieve- 
ment motivation and locus of control: High- 
achievement external subjects will exhibit the 
highest approach tendency, whereas the low- 
achievement internal subjects will exhibit the 
lowest approach tendency. 

This hypothesis is based on attribution of 
responsibility, even in a chance determined 
situation; and on the assumption that risk 
taking in a gambling situation can be intrin- 
sically rewarding. It may be noted that attri- 
bution of responsibility is directly related to 
locus of control and intrinsic reward is di- 
rectly related to achievement motivation. 

Since different lines of research have pro- 
vided support for the position that people take 
responsibility for chance outcomes (Jenkins 
& Ward, 1965; Langer, 1975; Lerner, 1965; 
Wortman, 1975), it is expected that locus of 
control would be operative in a gambling 
situation as it is in a skill situation, Internally 
controlled subjects would feel more respon- 
sible for the outcomes than externally con- 
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trolled subjects and attempt to control the 
situation. Internals attempt to exert control 
over the gambling situation by being more 
cautious (Liverant & Scodel, 1960). As a 
result, the high-achievement externals for 
whom gambling, as a risk-taking behavior, is 
intrinsically stimulating, and for whom the 
approach tendency is not inhibited by being 
cautious, would show the highest approach 
tendency. The low-achievement internals, who 
would be more cautious and at the same time 
not stimulated by gambling or risk taking, 
are expected to show the lowest approach 
tendency. 

Hypothesis 4, interaction between probabil- 
ity of winning and achievement, motivation, 
and locus of control: High-achievement exter- 
nals will indicate a greater difference in ap- 
proach tendency between a low probability 
of winning and high probability of winning 
situations than will low-achievement internals. 

This hypothesis tests the assumption that 
the differential approach tendency exhibited 
in situations posing different probabilities of 
winning is at least partially motivational. In 
other words, the hypothesis examines the as- 
sumption that risk taking is intrinsically re- 
warding, 

According to Hypothesis 3, high-achieve- 
ment externals would be more motivationally 
disposed toward risk taking than low-achieve- 
ment internals. Now if the cause of the dif- 
ferential strength of approach tendencies to- 
ward situations posing varying degrees of risk 
is primarily motivational, then two trends 
should be observed in the data. First, the dif- 
ference betwen high-achievement externals 
and low-achievement internals across situa- 
tions should be wider in high-risk than low- 
risk situations. Secondly, the differences be- 
tween high- and low-risk situations should 
be wider for high-achievement externals than 
for low-achievement internals. If, however, 
the differential strength of approach tenden- 

toward different situations is primarily 
perceptual, no significant variance would be 
‘d by the motivational variables (the 


nu ^esis), 

O ht still argue that motivational 
orien contribute to perceptual over- or 
underes: ion of probabilities of winning. 


However, the basic value of testing the moti- 
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vational assertion is not primarily to rule out ` 


the perceptual assertion, but rather to bring 
attention to the relevance of a motivational 
orientation. r 

Hypothesis 5, interaction between situation 
proposition and achievement motivation and 
locus of control: High-achievement externals 
will exhibit a stronger approach tendency to- 
ward situations where they determine their 
own loss compared to low-achievement in- 
ternals. 

The basis of the above contention is that 
externals as compared to internals would tend 
more to attribute their loss (failure) to ex- 
ternal sources. Externals „would thereby an- 


ticipate less postdecision dissonance in gam- ° 


bling situations (i.e., less inhibition and less 
avoidance). Moreover, the difference between 
externals and internals would be further 
widened by the strength of achievement moti- 
vation—the urge to win. The difference should 
also be more observable in high-dissonance- 
evoking situations. In sum, this hypothesis 
tests the contention that the differential re- 
action to different situation propositions can 
be explained by anticipations of postdecision 
dissonance, in addition to the assertions of at- 
tribution of responsibility and risk taking as 
intrinsically rewarding. 


METHOD 
Subjects > 
Sixty-five male undergraduate students participated 
in the experiment, Their average age was 20.7 years 
(SD = 2.42). Only 61 complete responses were ob- 
tained. 


Experimental Task 


Each subject responded to 35 two-outcome gam- 
bling situations. Each situation involved a blind- 
folded subject drawing a chip from a box containing 
100 well-mixed green (winning) and red (losing) 
chips. Each situation is completely defined in terms 
of three independent dimensions: probability of 
winning ~(W), amount of winning prize r(W), and 
amount of losing payment r(L). 

The following example illustrates the way in which 
one of the situations is presented: 


The box contains: 75 GREEN and 25 RED chips; 
and you will lose 100¢ if a RED is drawn. What 
is the minimum winning amount (if a CREEN is 
drawn) that makes you gamble in this situation 


Ce ene 


In each situation, two of the dimensions were given 
and the subject was asked to indicate the minimum 
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condition of the third dimension, which would in- 
duce him to gamble. 

The subjects responded to the 35 questions know- 
ing the following: (a) They would be competing in 
small randomly selected groups (four members) to 
get the right to gamble; (b) only the subjects with 
the highest two bids in any situation would be given 
that right; (c) the subjects selected would stand the 
chance of gaining or losing actual money according 
to their own specifications of the gambling situations 
—but an amount not to exceed the sum of their 
participation remuneration and accumulated gains; 
(d) only four gambles, randomly selected out of the 
total number of situations, were allowed for each 
group; (e) the selection of groups, questions and 
the actual gambling would start after all subjects 
had responded to all questions. In this way the par- 
ticipants were induced to indicate their indifference 
conditions since (a) if they overbid there would 
be a good chance of being introduced to a gamble 
they did not like; (b) if they underbid, there would 
be a good chance of losing the right to gamble. 


Situational Variables 


Three types of situation propositions were used: 
(a) reward: the subject indicates minimum accepta- 
ble reward, (b) probability: the subject indicates 
minimum acceptable probability of winning, and 
(c) loss: the subject indicates maximum acceptable 
wager, 

Within each type of situation proposition those 
levels of proposed or implied (fair gambling equiva- 
lent) probabilities of winning were used: (a) low: 
03 < p(W) < .33. At this level there were 5 rewards, 
5 probabilities, and 3 loss situations. (b) intermedi- 
ate: p(W) =.50 where there were three situations 
for each of reward, probability and loss. (c) high: 
-67 < p(W) < .97. At this level there were 3 rewards, 
5 probabilities and 5 loss situations. The level of re- 
wards and losses ranged from 10¢ to 300¢. 


Approach-Avoidance Tendency 


The dependent variable in the present study is 
designed to be a general behavioral measure of risk 
taking which can be used across a variety of situa- 
tions. What we have sought is a single measure which 
incorporated the utility for various outcomes, sub- 
jective/objective probability distortion, and the util- 


, .ity for gambling, One behavioral index which appears 


to satisfy this requirement is the tendency to ap- 
-proacl or avoid a risk-taking situation. 
Operationally, this tendency is assessed using stated 
probability or reward preferences for gambling out- 
comes, essentially in comparison with probubilities 
or rewards entailed by “fair gambling,” in which 
the expected value of the gamble is zero. For in- 
stance, if the gambling situation requires the re- 
spondent to specify his maximum wager for a given 
reward obtainable at a given probability of win- 
ning, then the greater the indicated wager the greater 
the inferred approach tendency. The wager entailed 
by fair gambling should constitute a yardstick for 
assessing the strength of approach tendency in such 


a case. A similar procedure can be devised for 
assessing approach tendency in situations where the 
individual has to indicate his minimum probability 
of winning or his minimum reward. 

Conceptually, approach tendency is defined as the 
individual’s perceptual distortion of winning d(W) 
and losing d(Z) under conditions of indifference be- 
tween approaching and avoiding a gamble. The 
tendency is termed approach-avoidance tendency 
(T), and is explicitly defined in terms of the four 
dimensions of a two-outcome gamble: $(W), P(L), 
and r(L), 


T= d(W)/d(L) = p(L) r(L)/p(W) 1(W). 


According to the above definition, T is both a 
general (i.e., applied to a wide variety of situational 
specifications) and a comprehensive (i.e, incorpo- 
rates a transformation of all given or stated outcome 
probabilities and rewards) behavioral measure of 
risk-taking tendency. A value of T which is equal to 
unity corresponds to fair gambling, whereas lower 
values indicate low riskiness (avoidance tendency), 
and higher values indicate high riskiness (approach 
tendency). 


Measurement of Individual-Difference 
Variables 


Locus of control was assessed by the I-E scale 
(Rotter, 1966), The instrument is a 29-item test, 
including 13 forced-choice items and six filler items. 
The I-E scale has been used in over 200 studies and 
possesses acceptable construct and discriminant 
validity (Rotter, 1971; Reid & Ware, Note 2). 
Achievement orientation was measured by a subscale 
of Jackson’s (1967) Personality Research Form. The 
subscale consists of 20 true-or-false questions. Jack- 
son (1967) reported test-retest and parallel-form 
reliability ranging from .72 to .86, convergent valid- 
ity ranging from .42 to .56, and a factor-analytic 
study supporting the discriminant validity of the 
subscale. On the author’s recommendation, the 20 
achievement items were imbedded in 30 more filler 
items, extracted from the same inventory. 


Experimental Procedure 


The subjects were given three sets of question- 
naires in two separate sessions. In the first session, 
the decision-making questionnaire was administered. 
In the second session, the individual-difference ques- 
tionnaires and a questionnaire measuring a number 
of control variables (age, annual income, marital 
status, attitude toward gambling experience) were 
administered. 

The administration took place in a group setting 
of at least 16 participants. After filling out the ques- 
tionnaires, the subjects were divided into randomly 
selected four-member groups, and four gambling 
situations were randomly selected. Gambling took 
place after bids were compared, and the two highest 
bids were identified. It is to be noted, however, 
that the experimental measurements ended with 
the completion of the questionnaires, 
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TABLE 1 
MEANS OF APPROACH-AVOIDANCE DISPOSITION 
Situational variables r; 
Probability of winning Situation proposition 
Individual- Prob- 
are, Was A prey Overall 
HAE n 2,05%** 110 AA ES E Ior 
HAL n 1,14 90 91 1,23* 99 89 1.03 
LAE n 3 1.25% 1.02 92 1:29. 1.06 81 1.03, 
LAI n= 12 1,05 81 .66* 1.02 .16* .17* 85 
Overall 1,31** 96 .92 1.22* 1.03 97 
Note, HAE = high-a chievement internals, LAE = low-achievement eens ARINE 


Analysis 


The subjects were divided at the mean into four 
groups according to their scores on the two indi- 
vidual-difference scales. A four-way analysis of vari- 
ance was calculated. Trends within significant ef- 
fects were further examined using appropriate ¢ tests. 


RESULTS 


None of the control variables (cf. Experi- 
mental Procedure, above) provided a signifi- 
cant explanation of the variance in the data. 

The results showed that the main effects of 
both situational variables were significant. In 
particular, the level of probability of winning 
accounted for a significant portion of the 
between-situations variance, F(2, 26) = 6.44, 
p < .01. Approach toward situations posing a 
low probability of winning was significantly 
stronger than those posing a high probability 
of winning (p < .001) (see Table 1). Thus, 

pothesis 1 was supported. Compared to 

ambling, low p(W) situations resulted in 
tly stronger approach tendencies for 
sample (p < 01) and for the ex- 
chievement (p < .001) and low- 
< .05) groups (Table 1). 

4 xe approach tendency was 
aced by the way in which 
posed: F(2, 26) = 3.18, 
2). Reward situations 
cantly stronger than 
The interaction be- 
variables was also 

7 3, p < 001. 


wert 

loss sit 
tween the 
significant, / 


p <.001; \ 
HAL 2 < .05; LAE, p < .01; overall, p < .05, Sig- 
i high probability of winning, # < :001; reward, 


The analysis of variance showed that the 
achievement main effect was significant, E 
2,073) = 10.31, p<.0l. High achievers 
showed more approach tendency than low 
achievers. Also, the main effect due to locus 
of control was significant, F (1, 2,073) = 7.87, 
p < .01. This result supports earlier research 
findings (cf. Liverant & Scodel, 1960), which 
showed that internals were more cautious in 
gambling than externals. 

The interaction between the two person- 
ality variables was significant (Hypothesis 
3), F(1, 2,073) = 31.80, p<.001. Table 1 
provides a detailed examination of the nature 
of this interaction, The difference between 
high-achievement externals and low-achieve- 
ment internals were significant in five out of 
six comparisons. The high-achievement ex- 
ternals consistently showed the highest ap- 
proach tendency, whereas the low-achieve- 
ment internals consistently showed the lowest 
approach tendency. 

In further analysis of responses relative to 
objectively fair gambling, high-achievement 
externals were generally found to be risk seek- 
ers (p < .001), while low-achievement inter- 
nals tended toward risk aversion (avoidance) 
(p < .05). The other two groups tended to 
take “fair” chances. 


The interaction between probability of- 


winning and the two individual-difference 
variables was significant, F(2, 2,073) = 3.00, 
p < .05. In line with Hypothesis 4, the dif- 
ference in approach tendency between low 
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and high #(W) situations was twice as large 
for high-achieyement externals than low- 
achievement internals. Both differences, how- 
ever, were significant ( < .001 and p< .01, 
respectively). The high-achievement externals 
also showed a significantly higher approach 
tendency toward low probability of winning 


“# situations compared to low-achievement in- 


ternals (p < .001). High-achievement exter- 
nals also significantly exceeded the remaining 
two groups (low-achievement externals and 
high-achievement internals) in their approach 
tendency at low (W) situations, 

The interaction between the situation 
proposition and the two individual-difference 
variables was significant, F(2, 2,073) = SES, 
$ < .05. The largest difference is found in 
the loss situations. High-achievement exter- 
nals significantly exceeded low-achievement 
internals at loss (b < .001) situations (sup- 
porting Hypothesis 5). A similar difference 
was also observed in probability (p < .01) 
and reward (p< .01) situations. Unexpect- 
edly, high-achievement externals showed a 
different pattern of relationship with situa- 
tion proposition compared to the other three 
groups, where approach tendency at loss situ- 
ations was numerically (though not signifi- 
cantly) higher than at reward situations. 


Discussion 


Consistent with earlier studies, the ap- 
proach tendency toward a gambling situation 
is found to be universally influenced by the 
probability of winning where higher risks are 
approached with stronger vigor. This was 
based on the assumption of the “utility of a 
gamble” or that risk taking is intrinsically 
rewarding, 

Tf risk taking is an adequate explanation, 
motivational orientation as measured in the 
present study should make a difference. In 
the low-probability of winning situations (high 


‘risk), the results showed that high achieve- 


ment externals had a stronger approach ten- 
dency than low-achievement internals, The 


‘former are expected to be motivationally dis- 
` posed toward risk taking while the latter are 


expected, at least in a comparative sense, to be 
disposed toward risk aversion, 


as vi 
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It should be noted that the results, in a 
general sense, support DeCharms’s (1968) 
assertion that risk taking as intrinsically re- 
warding explains the relationship between 
achievement motivation and gambling behay- 
ior, The relationship, however, may be mod- 
erated by other personality variables such as 
internal-external locus of control, 

The fact that locus of control influences 
the result in a chance-taking situation sup- 
ports the recent findings based on studies con- 
cerned with the judgment of contingency, the 
“just world hypothesis,” or the attribution of 
responsibility for an outcome, It would seem 
that at least some people deny the operation 
of chance even in a gambling situation. Many 
studies, however, which concluded that there 
are feelings of control over chance determined 
situations, have presented the conclusions in 
general terms. It would be more fruitful to 
consider relevant individual-difference varia- 
bles such as the locus of control and the male- 
female variable. For instance, there is an indi- 
cation of attribution differences between the 
sexes which shows women to invoke luck more 
often, and suggests that good and bad luck 
are more potent experiences for women than 
for men (Deaux, White, & Farris, 1975). 

Both attribution of responsibility and the 
intrinsic value of risk taking explain the inter- 
action between locus of control and achieve- 
ment motivation. Internals attempt to exert 
control in a gambling situation by being more 
cautious and calculative than externals, while 
high achievers value risk taking, even in a 
gambling situation, more than low achievers, 
Accordingly, the high-achievement externals 
would place the highest value on winning and 
therefore exhibit the highest approach ten- 
dency. This is also evidenced by their seeking 
risks significantly higher than what is expected 
if they have acted in accordance with fair 
gambling. Furthermore, the low-achievement 
internals not only showed the lowest ap- 
proach tendency of the four groups, but also 
this tendency was significantly lower than fair 
gambling. 

The present study emphasizes what has 
been indicated in several recent studies lie; 
Langer, 1975; Langer & Roth, 1975, Wort- 
man, 1975) that the dimensions of the chance 
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determined situation influence the outcome. 
Factors like competition, choice, involvement, 
stimulus familiarity, and response familiarity 
influence choice behavior when introduced into 
chance settings where they did not objec- 
tively influence the outcome (Langer, 1975). 
The present study identified the importance of 
other situational dimensions, namely the level 
of probability of winning, and determining 
one’s own reward if one wins as compared to 
determining one’s own loss if he loses. These 
findings call for the reexamination of earlier 
studies in terms of the types of situation di- 
mensions used. Notably, experimental deci- 
sion theory studies have asked subjects to 
stipulate their own wagers more often than 
stipulating their own rewards or probabilities 
of winning. The finding may also have some 
important implications for the design of re- 
ward systems in real-life risk-taking situa- 
tions. 

The major contribution of the present study 
has been to develop a measure of risk-taking 
behavior that is sensitive to differences be- 
tween individuals and situations. The measure 
assesses overall decision-making bias and is 
termed approach-avoidance tendency. Used as 
a dependent variable in the present study, 
approach-ayoidance tendency exhibited sig- 
nificant relationships with relevant individual- 
difference variables, namely locus of control 
and achievement orientation. The measure was 
also significantly related to the tested inter- 
actions between the two variables. It is not 
unreasonable, therefore, to suggest that ap- 
proach-avoidance tendency is a psychologi- 
cally meaningful variable. The measure was 
also useful in depicting the impact of situa- 
tional variables. 

Finally, the present study attempted to 
nine how subjects with different person- 

characteristics interact with different di- 

of a chance situation. While studies 
; same line have been frequently 
ci ery few attempts have been re- 
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Misattribution and Humor Appreciation: A Comment 
on “Enhancement of Humor Appreciation by 
Transferred Excitation” 


Jerry M. Suls 
State University of New York at Albany 


A study by Cantor, 
transfer account of 
results can also be 
that Cantor et al.’ 
explanations raise 


of humor appreciation is an appealing notion, 


Provide unambiguous support. 


The most widely tested hypothesis in the 
field of humor research is Freud’s thesis that 
humor is appreciated because it permits the 
momentary gratification of forbidden or hid- 
den wishes. According to this theory, the ten- 
dentious nature of humorous material is dis- 
guised by the use of “joke-work” (puns, in- 
congruities, exaggerations) and thereby facili- 
tates catharsis and the release of tension, The 
research testing this hypothesis typically in- 
volves the experimental arousal of drive (sex- 
ual or aggressive) and the measurement of the 
subject's subsequent appreciation of motive- 
relevant humor, Studies by Davis and Farina 
(1970), Dworkin and Efran (1967), Lamb 
(1968), and Strickland (1959), among oth- 
ers, have reported support for the psychody- 
namic hypothesis, 

Recently, Cantor, Bryant, and Zillmann 
(1974) advanced an alternative explanation 
for these results and presented an experiment 
in support of their thesis. These authors ar- 
gued that if a subject has been exposed to 
experiences associated with a high level of 
excitation, this sympathetic arousal will dissi- 
pate relatively slowly, and residues of the 
excitation should transfer to the period of 
exposure to the humor stimulus. Cantor et al. 


contended that 


I would like to thank Jeffrey Goldstein, James 
Tedeschi, and especially Fred Tesch for their com- 
ments on early drafts of this paper. 

Requests for reprints should be sent to Jerry M. 
Suls, Department of Psychology, State University of 
New York at Albany, 1400 Washington Avenue, 
Albany, New York 12222 


Bryant, and Zillmann testing a misattribution — excitation 
humor appreciation is reviewed. It is concluded that the 
explained by Berlyne’s theory of collative motivation and 
s interpretation of their data and discounting of alternative 
some critical problems. Although a misattribution account 


Cantor et al.’s study does not 


if it is assumed that the residual excitation is poten- 
tially specific to a particular inducing stimulus but 
that the individual is provided with inadequate, non- 
specific interoceptive feedback of such excitation, 
which prevents him from linking it with its actual 
source, it can be expected that he will (mis)attribute 
its induction to the stimuli to which he is being 
exposed. (1974, pp. 812-813) 

Accordingly, the subject, unaware of the re- 
sidual nature of his excitation, misattributes 
its source to the humor stimulus and so ex- 
periences an enhancement of the humor Te- 
sponse (i.e., facilitates mirth). 

To test their hypothesis, Cantor et al. in- 
duced high or low levels of emotional arousal 
in subjects before exposing them to humorous 
communications. The hedonic tone (positive, 
negative) of the excitatory stimuli was also 
varied, to avoid alternative explanations of 
the data based upon assimilation or contrast 
effects. The subjects were randomly assigned 
to one of four conditions resulting from the 
2 (Levels of Excitatory Potential) x 2 (Ley- 
els of Hedonic Tone) factorial design. Written 
excerpts were employed to induce these ma- 
nipulations. The particular manipulations are 
listed here because they are important for 
later discussion: 

(a) positive hedonic, low excitation—pas- 
sage describing the good works of the beaver; 

(b) positive hedonic, high excitation—pas- 
sage describing an erotic encounter; 

(c) negative hedonic, low excitation—pas- 
sage describing the adverse effects of snow- 
mobiles; 

(d) negative hedonic, high excitation—pas- 
sage describing the brutal torture and muti- 
lation of a young boy. 
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tə After reading these excerpts, the subjects read 


and rated for funniness a series of cartoons 
and jokes. These humor stimuli were devoid 
of sexual or aggressive content, which is an 
important aspect of the experiment because 
the psychodynamic thesis predicts no facili- 
tation of humor appreciation after motive 
arousal if the humor material is motive irrele- 
vant. 

Consistent with Cantor et al.’s hypothesis 
and contrary to the psychodynamic thesis, 
prior exposure to the highly arousing passages 
resulted in greater appreciation of the hu- 
morous communications than did prior ex- 
posure to the less arousing passages. These 
results suggest that psychodynamically in- 
spired research may be reinterpreted by 
the transferred excitation — misattribution ap- 
proach. In fact, these data challenge any in- 
vestigator who examines the Freudian thesis 
because it is difficult to conceive of an experi- 
ment yielding data that could not be rein- 
terpreted, at least in part, along the lines 
suggested by Cantor et al. Because the psy- 
chodynamic approach has been dominant in 
humor research, the Cantor et al, study has 
considerable theoretical import. 


THEORY OF COLLATIVE MOTIVATION 


We begin by considering whether any other 
theories of humor or emotional response can 
account for the pattern of results reported by 
Cantor et al. One possibility is Berlyne’s 
theory of collative motivation (1960, 1967), 
in which Berlyne argued that changes in the 
momentary level of arousal determine the 
quality and quantity of affective responding. 
Applying the theory to humor appreciation, 
Berlyne proposed two processes that may 
mediate the pleasure generated by humorous 
material. The first account stipulates that hu- 
mor contains collative properties (structural 
variables such as novelty, surprise, complex- 
ity, and ambiguity) that increase and then 
suddenly reduce arousal. Berlyne called this 
sequence of events the “arousal jag” and 
argued that a rapid increase-decrease is plea- 
surable. Although there are many properties 
of the humor stimulus that may contribute to 
the arousal jag, two are most relevant to our 
thesis. The punch line of a joke is viewed by 
most students of humor as the prime source of 
surprise, inducing a sudden increase in the 


respondent’s level of arousal or activation (see 
Godkewitsch, 1976, for supportive evidence). 
Subsequently, the humor respondent seeks 
further information to resolve or make sense 
of the surprise (Jones, 1970, Shultz, 1972, 
and Suls, 1972, outlined this two-stage pro- 
cess and reported supportive evidence). The 
resolution may initiate arousal reduction, 
thus producing an arousal jag. 

Berlyne’s second account of humor focused 
only on the arousal-increasing properties of 
humor and suggested that moderate increases 
in arousal are pleasurable because they ac- 
tivate the “primary reward system.” Accord- 
ing to this view, humor provides moderate 
arousal increases and hence provides pleasure 
to the respondent, but reduction of the 
arousal, as long as it is in the moderate 
range, is unnecessary. In support of this con- 
tention, Godkewitsch (1976) examined the 
relationship between perceived funniness and 
changes in physiological arousal and found 
that funniness is associated with moderate 
arousal increments. 

It is proposed that either one of Berlyne’s 
accounts of humor appreciation offers an al- 
ternative interpretation to Cantor et al.’s 
results. Either account requires only one as- 
sumption, and it is the same assumption made 
by Cantor et al., that is, that the excitation 
produced by the stimuli preceding the hu- 
morous communications carries over and adds. 
to the arousal induced by the excitatory prop- 
erties of the humorous communications them- 
selves. For the arousal jag hypothesis, a hu- 
mor respondent exposed to highly excitatory 
communications prior to receiving the humor 
stimuli should experience two sharp increases 
in his arousal because of the joint and addi- 
tive impact of the two sources of arousal. 
Thus, when this respondent comprehends 
(“gets”) the joke, the arousal jag should be 
steeper, sharper, and according to Berlyne’s 
theory, more rewarding and pleasurable than 
if only one of the two sources of arousal had 
operated on the respondent or if the first 
source had very low excitatory value. 

The “moderate increases” version of Ber- 
lyne’s theory provides a simpler interpreta- 
tion. The residual excitation stemming from 
the prior communication combines with the 
moderate arousal produced by the humor 
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stimuli, and this moderate arousal increase 
produces pleasure and enhances humor ap- 
preciation. It is, of course, assumed that the 
respondent’s arousal increase does not exceed 
the safe level and thus activates only the 
primary reward system without triggering the 
more extreme “aversion system” (Berlyne, 
1960, 1967). If the optimal arousal is ex- 
ceeded, of course, an unpleasant reaction is 
expected, Because Cantor et al.’s excitatory 
manipulations were presented as written pas- 
sages and because humor is a nonserious stim- 
ulus, it is reasonable to make the assumption 
that the optimal level was not exceeded. 
(Note, however, that up to a point, the 
arousal-reducing elements of the humor stim- 
ulus might bring the organism back to the 
safe level.) 

According to the Berlynian account, an 
individual may derive more pleasure from the 
humor stimulus after an initial arousal manip- 
ulation because of the actual effect of the 
prior arousal on the individual’s arousal and 
primary reward system, not because of a 
misattribution of arousal. It is assumed, of 
course, that arousal increases or arousal jags 
are pleasurable as long as arousal stays within 
an optimal level. Hence, this analysis posits 
that the pleasure prompted by humor follows 
an inverted-U function. 

The Berlynian theory was considered by 
Cantor et al. in the introduction to their 
paper, but they soon disposed of it. They 
argued that “without a precise empirical 
specification of optimal arousal, any obtained 
linear trend can be interpreted post hoc as 
consistent with the inverted-U notion by plac- 
ing the optimal level at an arbitrary point 
along the arousal continuum” (pp. 813-814). 
Also, “since it cannot be known whether the 
degree of arousal produced in the present 
study would be below, at, or above Berlyne’s 
postulated optimal level, Berlyne’s notion 
cannot be used to predict particular results” 
(p. 814). 

Cantor et al. raised an important issue 
here, and it is well taken. A precise a priori 
specification of arousal level is required, and 
researchers have not adequately dealt with 
this problem. But even conceding this point, 
the Berlynian theory still remains one plausi- 
ble alternative to their findings. One would 
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have reason to dispose of the Berlynian ac- 
count only if the misattribution approach 
possessed no similar problems of imprecision 
or ambiguity. The next section examines this 
problem. 


PROBLEMS WITH THE MISATTRIBUTION 
APPROACH 


Emotional states such as pain, attraction, 
hostility, and love have been examined in mis- 
attribution terms, and it is an appealing no- 
tion that humor may also be explicable along 
these lines. There have, however, been ques- 
tions raised about the misattribution approach 
in general. Calvert-Boyanowsky and Leven- 
thal (1975) noted that empirical studies of 
attribution either have lacked manipulation 
checks to ascertain if in fact the subjects mis- 
perceived the cause of their arousal, as in- 
tended, or have reported equivocal results, In 
this regard, although Cantor et al. checked on 
the effects of the kind and degree of manipu- 
lated arousal, there was no manipulation 
check to ascertain if residual arousal was in 
fact misattributed (to the humor stimuli) in 
the manner they suggested. 

In a second argument, Calvert-Boyanowsky 
and Leventhal (1975) discussed what appears 
to be a curious assumption of much misattri- 
bution research, namely, that subjects are 
“careful assessors of their internal arousal 
states yet are willing to regard a complex 
external environment in a unidimensional 
fashion” (p. 215). These authors were refer- 
ring to the fact that in many studies, it is 
unclear why a variety of causes or loci were 
not considered by the subject to attribute to 
his arousal. A case in point is a study by 
Ross, Rodin, and Zimbardo (1969), which 
showed fear reduction in a puzzle-solving sit- 
uation through the misattribution of negative 
arousal to either shock or noise. But as Cal- 
vert-Boyanowsky and Leventhal noted, it is 
unclear why additional sources—such as worry 
about performance, uncertainty, frustration 
over the insolvable puzzle task, or time pres- 
sure—were not also considered by the sub- 
jects as possible causes for their arousal. A 
similar imprecision arises in the Cantor et al. 
study. Assuming that the subjects were 
aroused and unaware of the actual arousal 
source, why did they misattribute their 
arousal to the humorous material and not to 
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ther situational elements (e.g., participating 


to be explained why some entities should be 
perceived as potential arousal contributors and 
others not. It would appear then that if Ber- 
lyne’s theory is somewhat ambiguous in de- 
termining the optimal level of arousal, Can- 
tor et al.’s approach also suffers from an 
imprecision: It does not spell out what the 
salient cues for misattribution are and how 
they are selected. This argument leads us to 
another consideration. 

If it is not totally clear in these studies 
why arousal was misattributed specifically to 
the humorous material, it is even less obvious 
why misattribution should have occurred at 
all. Why should the subjects have been un- 
aware of the residual excitation stemming 
from their exposure to the highly arousing 
passages? For one thing, the subjects rated 
how arousing they found the passages. Cantor 
et al, argued that “the importance of the ex- 
citatory potential of the passages was not 
made apparent to the subjects, because the 
arousingness scale [a manipulation check] 
was just one of the seven scales on which they 
rated all three passages they read” (p. 820). 
Interestingly, one of the other six scales as- 
sessed “how disturbing” the subjects found 
the passages (p. 819), But the key question to 
be asked is: Since misattribution is a cogni- 
tive process, how could the subjects have mis- 
attributed their arousal if they were not 
aware of it? 

Cantor et al. also offered two other argu- 
ments against subject awareness as a crucial 
event, They argued that “even if the rating 
procedure made subjects momentarily aware 
of their arousal state, it seems doubtful that 
subjects would know that a residual portion 
of their excitatory response was still present 
after they had switched cognitively to the 
humorous stimuli” (p. 820), and that even if 
they were aware of the residual arousal as 
they were exposed to the humor stimuli, “such 
.an awareness should have inhibited transfer 
rather than cause it” (p. 820) and the humor 
enhancement should not have been observed. 
The first of these arguments seems to beg the 
point. We fail to understand why it is “doubt- 
ful” that the subjects would know that the 
excitatory response was still present. The 
highly arousing passages were intended to 
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stimulate and must have been rather novel to 
the subjects in the experimental context. It is 
unlikely that the subjects would forget them 
in the space of a few minutes, It is possible, 
of course, that the subjects were unaware of 
their residual arousal, but the experiment did 
not preclude awareness, as the authors con- 
tended.* This leads directly to their point that 
awareness should have reduced humor ap- 
preciation. Because the results were as pre- 
dicted, the authors discounted the awareness 
argument. This argument suggests that mis- 
attribution must have occurred because humor 
appreciation was enhanced by prior arousal. 
This argument would be more cogent if their 
results were not also explicable in terms of a 
theory that does not require a misattribution 
mechanism, namely, Berlyne’s theory of colla- 
tive motivation. 


DISCOMFORT RELIEF THEORY 


Cantor et al. did consider an alternative in- 
terpretation of their results, which was similar 
to the arousal jag notion. The authors rea- 
soned that if the subjects exposed to a highly 
arousing passage experienced discomfort, then 
they may have shown enhanced humor ap- 
preciation because the humor turned their at- 
tention away from the source of their discom- 
fort and thereby produced relief, The authors 
discounted this possibility because manipula- 
tion checks indicated that the subjects did 
not find the sexual passage disturbing or 
noxious. Because there was no discomfort re- 
ported in this condition, the humor stimulus 
could not have served as a source of relief. 
(Of course, the discomfort relief hypothesis 
could apply to the negative hedonic, high-ex- 
citation passage, which was rated as somewhat 
disturbing. In light of the apparent lack of 
success of this explanation for the positive 


iJn a more recent study, Cantor, Zillmann, and 
Bryant (1975) reported results indicating that physio- 
logical excitation (in this case produced by physical 
exercise) lingered for a substantial period after the 
subjects felt they had returned to baseline. However, 
it is somewhat surprising, given the extensive lit- 
erature on emotional misattribution, that few inves- 
tigators have tested the basic contention that arousal 
continues even though the subject is unaware of it 
at a conscious level. This excellent study by Cantor 
et al. is the only one in the literature we have been 
able to find that has examined this crucial proposi- 
tion. 
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hedonic condition, the discomfort relief hy- 
pothesis was put aside.) 

A difficulty with Cantor et al.’s argument 
against the discomfort relief idea is that it 
was based on the subjects’ ratings of how 
pleasant they found the excitatory passages. 
We might question whether college students 
would report finding a description of an erotic 
encounter unpleasant, although some discom- 
fort might be produced while reading such 
material in the context of a psychological ex- 
periment during a class period. Even outside 
the experimental context, individuals would 
probably rarely report erotic material to be 
unpleasant or disturbing although the desire 
for relief may be present (of course in the 
case of viewing or reading about perverse acts, 
this could be the case). 


LrvENTHAL’s THEORY OF EMOTIONAL 
RESPONSE 


Thus far we have discussed drive, or 
arousal-related, approaches, but a recent the- 
oretical suggestion by Leventhal may also 
offer an alternative interpretation to Cantor 
et als results. In brief, Leventhal (1974; 
Johnson & Leventhal, 1974) hypothesized that 
emotion may result when an individual’s ex- 
pectations are disconfirmed by subsequent 
events. It is probable that the subjects read- 
ing the high-excitation passages graphically 
describing erotic or violent episodes found it 
surprising that jokes and cartoons followed 
these materials, In accord with Leventhal, we 
might suggest that this surprising sequence of 
events prompted the reported increase in af- 
fect. Cantor et al. might reply that the sub- 
jects were told before the study began that 
they would be reading and rating both serious 
and humorous material, so that no surprise 
was involved. The intensity of the high-ex- 
citation passages, however, probably rendered 
the humorous stimuli surprising and perhaps 
even inappropriate, especially since the hu- 
morous material followed the description of a 
brutal torture and mutilation or a description 
of a couple having sexual intercourse. 


RELEVANT RESEARCH 


Although Cantor et al. have proposed a 
novel thesis, we might consider whether any 
other relevant data are available. Two studies 
do appear to have relevance. 
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Strickland (1959), in an attempt to test the 
psychodynamic hypothesis, randomly assigned 
his subjects to a hostility-arousing, sexually 
arousing, or nonarousal condition and then 
had them rate aggressive, sexual, and non- 
sense cartoons. Although Strickland performed 
his experiment to test the Freudian theory of 
humor, the design allows for a test of the 
transferred excitation—misattribution model. 
The latter theory, of course, predicts that 
aroused subjects show enhancement of both 
motive-relevant and motive-irrelevant humor. 
Strickland found that motive arousal en- 
hanced appreciation of only motive-relevant 
humor: Within the hostile group, aggressive 
cartoons were preferred to both sexual and 
nonsense cartoons, and within the sexual 
group, the sexual cartoons were preferred to 
the aggressive and nonsense cartoons, This 
pattern of data appears not to confirm the 
misattribution hypothesis. There was, how- 
ever, an indication that sexual arousal pro- 
duced enhanced appreciation of the nonsense 
humor, but the misattribution approach can- 
not explain either why the appreciation of 
hostile humor was not similarly enhanced or 
why hostility arousal did not produce an 
overall enhancement. Strickland’s results sug- 
gest a complex interaction between arousal 
and humor type that is not explicable from 
any current theoretical formulation, 

Another relevant study was reported by 
Dworkin and Efran (1967), who aroused 
anger in one half of their subjects and ex- 
amined responses to hostile and nonhostile 
humor. These investigators found that anger 
enhanced the appreciation of hostile humor 
but did not make the subjects more apprecia- 
tive of humor in general. Although Cantor et 
al. did cite these two humor studies, they 
failed to mention that these findings are dis- 
crepant with the misattribution approach. 


A FINAL CONSIDERATION 


In this paper we have suggested and dis- 
cussed some alternative interpretations to 
Cantor et al.’s results and have cited data un- 
supportive of the misattribution approach. In 
this final section we wish to present a more 
general criticism of their procedure. In recent 
years psychologists have become increasingly 
aware of the possibilities of subject suspicious- 
ness and of demand characteristics (Rosenthal 
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& Rosnow, 1969). Studies testing the psycho- 
dynamic hypothesis or the misattribution hy- 
pothesis entail a novel sequence of events, and 
it would indeed be surprising if the subjects 
did not ask themselves what the experimenter 
was testing. Although a cover story may be 
accepted by most subjects, it is necessary to 
assess suspiciousness to eliminate those who 
do not accept the cover story or who respond 
in a contrived fashion. In this perspective, it 
is disconcerting to note that Cantor et al., as 
well as most of the psychodynamic research- 
ers, did not report whether assessments of 
subject suspiciousness were made, Because the 
theoretical implications of Cantor et al.’s 
study are very important for the psychological 
analysis of humor, the question of demand 
characteristics and subject suspiciousness is 
critical and deserves more attention. 


CONCLUSION 


This paper has suggested some problems for 
and alternative explanations to a study testing 
the possibility of misattribution effects in 
humor appreciation. It may have occurred to 
the reader that several of the problems raised 
here may also be applicable to other research 
on emotional misattribution. Some of these is- 
sues have been raised earlier by Leventhal 
(1974). In addition, recent disconfirmations 
of the Schachterian theory of emotion, upon 
which the misattribution-of-arousal phenome- 
non is based, have been reported (Calvert- 
Boyanowsky & Leventhal, 1975; Kellogg & 
Baron, 1975; Rogers & Deckner, 1975). These 
developments suggest that there is a need to 
examine the basic tenets of misattribution 
theory more carefully than has thus far 
been done. 
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Two pleasant and two unpleasant videotaped scenes were presented to 60 sub- 
jects (“senders”). Unknown to the senders, their facial expressions were video- 
taped when they watched the scenes (spontaneous encoding) and when they 
talked about their reactions to the scenes (talking encoding). Later, senders 
were again videotaped while posing the appropriate expressions for each of the 
four scenes (posed encoding). The videotaped facial expressions were presented 
for decoding to the senders. Results were as follows: (a) accuracy of commu- 
nication varied according to mode of encoding, the scene, and the mode by 
scene interaction; (b) females were significantly more accurate decoders than 
were the males; (c) there were large positive and significant correlations be- 
tween the ability to communicate (that is, to decode or to encode) via spon- 
taneous and posed cues and positive but lower correlations between these two 
modes of communication and the talking mode; (d) the more extreme the 
emotional experience of the sender the more accurate his encoding score, espe- 
cially in the spontaneous mode; and (e) there were low positive correlations 
between total encoding and total decoding scores and low negative correlations 
between encoding and decoding of the same scene. 


& Rogers, Note 1). The relationship between 
the ability to encode or decode posed cues and 
a number of personal characteristics, including 


The large majority of studies on nonverbal 
communication of facial expressions have used 
posing as the means of generating nonverbal 


cues. In posing, subjects (senders) are di- 
rected to encode or transmit nonverbal cues, 
which are recorded and then decoded or 
interpreted by judges. A review of posing 
behavior (Ekman, Friesen, & Ellsworth, 1972) 
showed that the accuracy of communicating 
facial cues of emotion from senders to judges 
is greater than chance. It has also been found 
that individuals vary in their ability to encode 
recognizable facial cues (Thompson & 
Meltzer, 1964) and in their ability to judge 
correctly the cues transmitted by others 
(Rosenthal, Archer, Koivumaki, DiMatteo, 
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sex of the subjects, has also been examined 
(Zaidel & Mehrabian, 1969), 

It is questionable, however, whether results 
of research on posing can be generalized to 
spontaneous nonverbal behavior, For exam- 
ple, Hunt (1941) argued that posed facial 
cues constitute a socially learned code that is 
unrelated to spontaneously occurring cues. In 
contrast, Ekman et al. (1972) suggested that, 
although the eliciting circumstances for posed 
and spontaneous cues are different, the cues 
themselves are similar if not identical. Ekman 
et al. based their argument on the finding > 
that posed facial cues were judged similarly 
across cultures (Ekman, 1972). The cross- 
cultural similarity conflicts with the view 
that posed cues are an arbitrarily learned code 
because a learned code implies cultural speci- 
ficity. However, the evidence from cross-cul- 
tural studies is only indirectly related to the 
problem of the relationship between posed and 
spontaneous cues. 
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ENCODING AND DECODING 


To our knowledge, only three studies 
(Buck, 1975; Coleman, 1949; Landis, 1924, 
1929) examined posed and spontaneous facial 
expressions as they occur within the same 
person. Landis (1924, 1929) photographed 
his subjects in a variety of emotion-arousing 
situations and again when subjects were asked 
to remember each situation and pose the ap- 
propriate expression. Only three subjects par- 
ticipated in both the spontaneous and the 
posing parts of the study. Landis reported 
that observers were unable to correctly iden- 
tify either the spontaneous or the posed ex- 
pressions. However, in addition to the small 
number of subjects, Landis’ study has been 
repeatedly criticized on various methodologi- 
cal grounds (for a review, see Ekman et al., 
1972). 

Coleman (1949) did find that judges were 
able to identify both acted and spontaneous 
facial expressions. He also observed that the 
posed expressions did not appear qualitatively 
different from the spontaneous ones. However, 
Coleman used only two subjects and obvi- 
ously could not say much about the general 
relationship between posed and spontaneous 
behavior. 

Buck (1975) had 5- to 6-year-old children 
(N = 14) send both spontaneous and posed 
facial expressions. Results showed that posed 
sending was correlated with various measures 
of spontaneous sending (median 7 = .35). 
Furthermore, the two types of sending were 
similarly related to teachers’ ratings of the 
children’s personality and behavior. However, 
since judges had known in advance the 
particular emotions that each child was 
instructed to send, they rated the appropri- 
ateness of the child’s response rather than 
identifying the emotion enacted. Thus, it is 
impossible to tell whether the accuracy of 
posed sending was above chance. Moreover, 
since the judges rated each child’s posed 
expressions after they had already rated his 
spontaneous expressions, it is possible that the 
first impressions influenced the later judg- 
ments, which, in turn, might have inflated the 
posed-spontaneous correlation, Consequently, 
Buck’s evidence, in addition to being based on 
a small number of subjects from a particular 
age level, seems inconclusive. 
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The purpose of the present study was to 
explore the relationship between posed and 
spontaneous nonverbal cues as they occurred 
within the same individual. Encoding of posed 
cues was compared with the encoding of spon- 
taneous cues; decoding of posed cues was 
compared with the decoding of spontaneous 
cues; the relationship of encoding and decod- 
ing abilities with several personal character- 
istics including sex of the subject was also 
examined, 

As was previously mentioned, the circum- 
stances leading to the encoding of posed and 
spontaneous cues are quite different. When a 
subject is placed in a situation designed to 
arouse a specific emotion, his facial expression 
is determined by several factors including (a) 
the emotion actually aroused, (b) the display 
rules (Ekman et al., 1972) managing expres- 
sion of the emotion in that particular situa- 
tion, and (c) the ability or the knowledge to 
communicate the emotion aroused. It is im- 
portant to realize that spontaneous cues may 
be encoded when the person is alone and may 
convey information although they were not 
intended to do so. Thus, spontaneous encoding 
may be defined as nonverbal behavior rather 
than as nonverbal communication (Weiner, 
Devoe, Rubinow, & Geller, 1972) and as in- 
formative rather than as communicative (Ek- 
man & Friesen, 1969). On the other hand, 
when a subject is asked to pose an emotion, 
his response is largely determined by his 
knowledge of the appropriate cues and how 
to encode them. Posing does not require the 
experience of the actual emotion, is less con- 
trolled by display rules, and, by definition, is 
intentional. In addition, a subject who poses 
transmits the cues to a present or future audi- 
ence. Since both spontaneous and posed en- 
coding depend on the ability or the knowledge 
to encode, they should be related. However, 
such a relationship may not be very strong 
because spontaneous encoding depends on 
other factors that seem irrelevant to posing. 
On the other hand, if posed and spontaneous 
cues were similar, the abilities to decode posed 
and spontaneous cues might be strongly re- 
lated, 

The present study also examined the rela- 
tionship between encoding and decoding abil- 
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ity. Although several studies have examined 
this relationship, their results are not con- 
sistent. On one hand, Knower (1945), Levy 
(1964), and Zuckerman, Lipets, Koivumaki, 
and Rosenthal (1975) found positive and 
significant correlations between the ability to 
encode and decode vocal and facial expres- 
sions. While Zaidel and Mehrabian (1969) 
found nonsignificant encoding-decoding cor- 
relations, their findings may have been the 
result of a small sample size (V = 12). The 
encoding-decoding relationship gave rise to 
the idea of a general communication factor 
(Levy, 1964) which may account for the abil- 
ity to send and receive vocal and possibly 
other modes of expressions. 

Other studies reported no correlations of 
encoding and decoding (Osgood, 1966; Wolf, 
Gorski, & Peters, 1972) or even negative 
correlations (Lanzetta & Kleck, 1970; Miller, 
Levine, & Mirsky, 1973). In the study by 
Lanzetta and Kleck (1970), subjects were 
unknowingly videotaped after they had been 
informed by a signal light that they were or 
were not going to be shocked. Later, the 
videotapes were shown to the same subjects, 
who were asked to discriminate between the 
shock and the nonshock trials solely on the 
basis of the cotaped facial expressions. It 
was found tl. -ending ability was negatively 
correlated (7 .80) with receiving ability. 


When monkeys were placed in a situation 
similar to the one used by Lanzetta and Kleck, 
the negative encoding-decoding relationship 


was again ob (Miller et al., 1973). It 
should be emphasized that Lanzetta and 
Kleck examined the spontaneous encoding of 
a single emotion, while all studies reporting a 
positive encoding-decoding relationship exam- 
ined the posing of several emotions. Interest- 
ingly, in another study of posing (Zuckerman 
et al., 1975) the correlations between encod- 
ing and decoding of the same emotion were 
low or negative. Zuckerman et al. hypothe- 
sized that when one emotion is spontaneously 
sent the encoding—decoding relationship is 
negative, and when one emotion is sent via 
posing the negative relationship is weaker or 
zero. When several emotions are involved, the 
encoding—decoding relationship was hypothe- 


sized to be positive. This hypothesis was 
tested in the present study. 

To examine spontaneous encoding, the pres- 
ent experiment utilized a paradigm that was 
first developed for studying nonverbal com- 
munication in monkeys (Miller, 1967; Miller, 
Caul, & Mirsky, 1967) and subsequently 
adapted to the study of nonverbal communi- 
cation in humans (Buck, 1975; Buck, Miller, 
& Caul, 1974; Buck, Savin, Miller, & Caul, 
1972; Gubar, 1966; Lanzetta & Kleck, 1970). 
This paradigm involves the presentation of 
affect-producing stimuli to a subject who 
serves in the role of a sender. Unknown to the 
sender, his face is televised to an observer who 
is required to identify the affect-producing 
stimuli (e.g., Lanzetta & Kleck, 1970) and/or 
the sender’s reported reaction to the stimuli 
(e.g., Buck et al., 1972). Miller et al. (1967), 
Gubar (1966), and Lanzetta and Kleck 
(1970) used electric shocks to produce the 
necessary affect; Buck and his colleagues 
(Buck, 1975; Buck et al., 1972; Buck et al., 
1974) used color slides; the present study 
used various videotaped scenes. 

Buck (Note 2) found that there was no 
significant communication of affect when 
senders were looking at the slides. In other 
words, judges watching the sender’s face could 
not identify above chance the slide that the 
sender was watching. It was only when the 
sender was talking about his emotional re- 
sponse to the slide that his face was decoded 
above chance. Conceptually, it is hard to 
classify this talking mode as either spontane- 
ous or as posed, although it is probably closer 
to the former than to the latter. In view of 
the results reported by Buck, it was decided 
to examine the amount of communication 
occurring (a) when the sender was watching a 
videotaped scene (spontaneous sending), (b) 
when the sender was describing his reaction to 
the scene (talking sending), and (c) when 
the sender was posing (posed sending). 


METHOD 


The subjects were 30 male and 30 female under- 
graduates enrolled in an introductory psychology 
course. They were recruited to particinate in a four- 
session experiment in fulfillment of the course re- 
quirement. The first of the four sessions was an 
encoding session and the remaining three were decod- 


k 


| 
l 
! 


ing sessions. Two male experimenters, Experimenter 
” A and Experimenter B, conducted the entire study. 


. Encoding Session: Apparatus 


Two rooms separated by a one-way mirror were 
used in this session. In one, the subject sat near a 
2.5%5 ft. (.76 X 1.52 m) table, facing a videotape 
monitor and the one-way mirror. On the table were 

æa videotape deck, an audio tape recorder, and a 
dummy galvanic skin response (GSR) recorder. Ex- 
perimenter A sat on the other side of the table. Be- 
cause of the equipment, it was impossible for the 
experimenter and the subject to see each other. 

‘xperimenter B was in the adjacent room, which 

contained a signal lamp and the camera, recorder, 
and monitor of a videotape system. The signal lamp 
was connected to the first room and controlled by 

Experimenter A. 

Four 30-second videotaped scenes were used as 
stimuli. They were chosen by the experimenters and 
other faculty and graduate students on the basis of 
their ability to arouse different degrees of pleasant- 
ness. The scenes, the sources from which they were 
taken, and the degree of pleasantness that they were 
supposed to arouse, were as follows: (a) the comedy 
scene was taken from the Carol Burnett Show and 
designed to be extremely pleasant; (b) the child 
scene was taken from a short videotaped film of an 


~“ interaction between an adult and a child and designed 


to be moderately pleasant; (c) the murder scene 
was taken from the film Psycho by Alfred Hitchcock 
and designed to be moderately unpleasant; and (d) 
the accident scene was taken from an educational 
film on traffic accidents and designed to be extremely 
unpleasant, In addition to the above four scenes, a 
fifth neutral scene depicting wild animals in Aus- 
tralia was used for practice purposes. 


Encoding Session: Procedure 


All subjects were run individually by Experimenter 
A. Experimenter B videotaped the subjects’ faces 
and did not have any direct contact with the sub- 
jects. When a subject arrived at the laboratory, he 
or she was asked to be seated and dummy GSR 
clectrodes were attached to his or her fingers. Ex- 
perimenter A then read the following instructions, 
«part of which were taken from Buck et al. (1964): 


This is a study of physiological correlates of emo- 
tions. This can be done in a variety of ways and 
that’s why the experiment consists of four dif- 
«< ferent sessions. In the first session, we will com- 
pare your physiological reactions with your sub- 
jective reactions as you look at a series of five 
videotaped scenes. We will record your skin con- 
ductance while you are watching the scenes. This 
recording will start as soon as I finish reading the 
instructions. In addition to the physiological re- 
cording, we would like to have your description of 
your feelings about the scenes. After each scene, 
I will tell you the number of the scene you have 
watched, switch on the tape recorder, say the 
number again (this is for identification) and then 
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you will describe the emotional reaction you had 
when you watched the scene. You will have 15 
seconds to describe your reaction and then I will 
say stop. Try to describe your subjective emo- 
tional reaction as accurately as you can. Feel free 
to say anything you like about your feelings. Try 
to talk during the entire 15 seconds. When I say 
stop, you should stop talking because at this 
time I will switch off the tape recorder. I will then 
ask you to rate your reactions on two scales: 
pleasantness and intensity. After that we will 
proceed to the next scene. In between scenes there 
will be a rest period of 3 to 4 minutes. During 
this time, I will prepare the next scene to be 
shown and you should try to relax as much as 
possible. 

Except for describing your reactions and making 
the ratings you have nothing to do but to con- 
centrate on the videotaped segments. We would 
like you to be as relaxed as possible. It is very 
important to us that you react to the segments as 
naturally as you can. The only limitation is that 
you should not move your hands and remain 
seated facing the monitor. In particular, do not 
pay attention to my activity here and do not 
turn to me because that may distract you. Do 
you have any questions? 


The neutral scene was shown first with the re- 
maining four scenes following in a random order 
determined separately for each subject. For each of 
the four showings the procedure was as follows: 
Experimenter A located the appropriate scene on the 
videotape recorder and started it. After 15 seconds, 
he activated the signal lamp. This signaled Experi- 
menter B to begin videotaping the subject's face. 
Fifteen seconds later Experimenter A stopped the 
videotape recorder and again activated the signal 
light. This signaled Experimenter B to stop video- 
taping the subject's face. Experimenter A then read 
the scene’s number to the subject, started the audio- 
tape recorder, again read the scene's number, and 
activated the signal lamp. This signaled the subject 
to verbally describe his or her reaction to the scene 
and Experimenter B to start videotaping the sub- 
ject’s face. After 15 seconds, Experimenter A stopped 
the audiotape recorder and again activated the sig- 
nal light. This signaled the subject to stop talking 
and Experimenter B to stop videotaping the sub- 
ject’s face, Experimenter A then asked the subject 
to rate the pleasantness and intensity of his or her 
reaction to the scene on two 9-point scales. After 
the subject had completed these ratings, Experi- 
menter A proceeded to locate the next scene. Thus, 
when all five scenes had been shown, each subject 
had been videotaped eight times, four times in the 
spontaneous mode and four times in the talking 
mode, 

After the five segments had been presented, Ex- 
perimenter A removed the electrodes from the sub- 
ject and asked for comments. The experimenter then 
proceeded to explain the true design of the study 
and the necessity and nature of the deception in- 
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volved. For 1 or 2 seconds, Experimenter B turned 
on the lights in the adjacent room so that the 
subject could see the camera through the one-way 
mirror. Experimenter A made an effort not to arouse 
antagonism in the subject and to maintain a cordial 
atmosphere, After the experiment had been ex- 
plained, the subject was asked whether he or she 
had held any corresponding suspicions. None of the 
subjects suspected that videotaping had gone on 
through the one-way mirror. 

Experimenter A then explained that the second part 
of the experimental session concerned intentional 
or posed sending, and he read the following instruc- 
tions: 


I will go over the four scenes with you. The 
scenes will not be shown again, but for each scene 
I will tell you in which part you were videotaped. 
Then I will ask you if you are ready and when 
you say you are, I will say start, and signal Ex- 
perimenter B to videotape your face. You will be 
posing for 15 seconds and then I will say stop 
and signal Experimenter B to stop videotaping 
your face. This procedure will be repeated for 
each of the four scenes. 


The second part of the posing instruction varied 
according to the type of acting required from each 
subject. Half of the subjects were instructed to pose 
in a way that would inform future observers as to 
which scene they pretended to be watching; the 
remaining subjects were instructed to pose in a way 
that would inform future observers as to what their 
reactions were to the scene. Since the data showed 
that the type of posing instructions did not affect 
the accuracy of the posed encoding, they will not 
be discussed further in this paper. 

After the posed encoding had been completed, the 
subject filled out three personality scales: the Self- 
Monitoring Scale (Snyder, 1974), the Facilitation- 
Inhibition Scale (Ullman, 1962), and a shortened 
version of Lykken’s Anxiety-Reactivity Scale (Lyk- 
ken, 1957). 


Decoding Sessions 


With 60 senders, each sending responses to four 
videotaped scenes in three modes of sending, there 
were 720 (60X4X3) 15-second videotaped seg- 
ments of the subjects’ facial expressions. (In the 
talking mode, there was no sound, only the face 
with lips moving.) Because of the large number of 
stimuli, the decoding part of the experiment was 
divided into two parallel parts. All 60 senders were 
divided into Groups A and B, each of which had 15 
males and 15 females. Subjects in Group A de- 
coded the 360 facial segments encoded by Group A 
and subjects in Group B decoded the 360 facial seg- 
ments encoded by Group B. Furthermore, the 360 
segments were presented to each subject in two 
separate decoding sessions, each comprised of the 
180 facial segments encoded by 15 senders (15 X 4 
X 3). These 180 segments were edited to randomize 
the order of presentation but blocking for senders 
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and mode of encoding so that a block of 15 seg- 
ments in the new order included all the 15 senders 
and an equal number of facial segments for each 
mode of sending. The editing cut down the length 
of each segment from 15 seconds to approximately 
10 seconds. Also, because of technical problems, 7 
facial segments of a male sender in Group A were 
completely eliminated. Thus, the analysis of the 
encoding data was based on 59 subjects, while the 
analysis of the decoding data was based on 60 sub- ` 
jects. 

Each facial segment was judged first on a cate- 
gorization measure and second on a pleasantness 
measure. The categorization measure required sub- 
jects to choose from among the names of the four 
videotaped scenes (comedy, child, murder, and acci- 
dent) the one which they thought the sender was 
watching, talking about, or pretending to be watch- 
ing. The pleasantness rating required subjects to 
judge how pleasant they thought a sender’s reaction 
was along a 9-point pleasant-unpleasant scale. For 
each facial segment the experimenter waited until 
all subjects completed the two measures before 
proceeding to the next facial expression. The time 
required for rating a single segment was about 10 
seconds at the beginning of a decoding session and 
less than 5 seconds at the end of the session, as sub- 
jects became more familiar with the procedure. 
Group A’s decoding sessions were conducted by Ex- 
perimenter A and Group B’s decoding sessions were 
conducted by Experimenter B. 

In addition to the two decoding sessions, there 
was a third session in which a group-administered 
test of decoding abilities was given to all subjects. 
This test, the Profile of Nonverbal Sensitivity 
(PONS; Rosenthal et al., Note 1) was administered 
to Group A after the completion of the two decod- 
ing sessions and to Group B before the two decod- 
ing sessions. The administration of the PONS was 
conducted by Experimenter A. 


Dependent Variables and Data Analysis 


As was previously mentioned, both categorization 
and pleasantness measures were used in the decod- 
ing sessions. To categorize the scene that was pre- 
sented to a sender, subjects had to view the expres- 
sion on the sender's face and then identify the 
scene that would arouse such an expression. It fol- 
lows that accurate categorization depends on the 
assumption that the four scenes aroused different 
types or different levels of emotional experience in 
the senders. Table 1 presents mean scores of the 
senders’ ratings of the pleasantness and intensity of 
their reactions to the scenes. Potential ratings 
ranged from 1 to 9 with high scores indicating a 
high level of pleasantness and/or intensity. Gen- 
erally, it seems that the four scenes formed a di- 
chotomy in which the child and comedy scenes 
aroused more pleasant and less intense experiences 
than did the murder and accident scenes. Since the 
senders’ self-ratings did not differentiate well between 
the child and comedy scenes and between the murder 
and accident scenes, perhaps accuracy of communica- 


ENCODING AND DECODING 971 


tion should also be based only on the ability to dif- 
ferentiate between the two pleasant and the two 
unpleasant scenes, Thus, it was decided to develop 
two scoring systems. According to the first, categori- 
zation was accurate if the judge succeeded in identi- 
fying the exact scene presented to the sender (exact 
categorization). According to the second, the cate- 
gorization was accurate if the judge succeeded in 
identifying whether the sender was responding to one 
of two pleasant scenes or to one of the two 
t scenes (pleasantness categorization). For 
, if a sender who was responding to the 

ene was decoded as responding to the comedy 
decoding was defined as incorrect according to 

exact categorization system, but as correct 
ording to the pleasantness categorization system. 

The exact categorization scores and the pleasant- 
ness categorization scores were computed for both 
encoding and decoding. Encoding scores were the 
proportion of judges who correctly categorized the 
expressions encoded by each of the senders. Decoding 
scores were the proportion of correct categorizations 
made by each of the judges. Both encoding and 
decoding scores were computed for each of the 12 
Mode X Scene combinations (3 Modes X 4 Scenes) 
and summed across modes and scenes to create en- 
coding and decoding scores for scenes, modes, and 
total. 

Unlike the two categorization measures, the 
pleasantness measure required the judge to predict 
the sender’s reaction from his or her facial expres- 
sion but not to match this reaction with a par- 
ticular scene, To compute how well judges could 
predict the senders’ reactions, the absolute dif- 
ferences between judges’ ratings of the senders’ 
pleasantness and senders’ self-ratings of pleasantness 
were computed. To facilitate presentation of the 
analysis, the absolute difference scores were trans- 
formed so that high scores indicate better com- 
munication, As before, absolute difference scores 
were computed for encoding and decoding each of 
the 12 Mode X Scene combinations and summed 
across modes and scenes to create encoding and 
decoding scores for scenes, modes, and total. 

An additional measure of nonverbal communica- 
tion, similar to the criterion used by Buck et al. 
(1972), was the correlation between a sender’s self- 
ratings of pleasantness and the mean scores of judges’ 
ratings of the senders’ pleasantness across the four 


TABLE 1 


MEAN SENDERS’ SELF-RATINGS OF PLEASANTNESS 
AND INTENSITY FOR FOUR SCENES 


Scene Pleasantness Intensity 
Comedy 6,83 5.03 
Child 6.70 3.98 
Murder 3.48 6.95 
Accident 2,20 6.97 


Nole. Differences between the first two and last two scenes 
account for 95% of the variance among the four means in the 
case of pleasantness and 92% in the case of intensity. 


TABLE 2 


MEAN INTERCORRELATIONS AMONG 
DEPENDENT VARIABLES 


Encoding Decoding 


Dependent variables (n = 59) (n = 60) M 
Pleasantness categorization St 72 62 
Exact categorization AB 64 56 
Absolute difference 12 65 38, 
Agreement index 39 = ~~ 


scenes. This measure, the agreement index, was com- 
puted only for sending in the talking mode. 

All the decoding scores excluded the responses that 
subjects made to their own facial expressions. 
Product-moment correlations were used to examine 
the relationships among the four encoding and 
among the three decoding measures. Table 2 pre- 
sents the means of the correlations between each 
measure and the remaining measures. Results showed 
that the pleasantness categorization was more highly 
correlated with the other dependent variables than 
any other measure. Unless stated differently, there- 
fore, the analysis of the data in the Results section is 
based upon the pleasantness categorization scores. 

The senders’ descriptions of their reactions to the 
four scenes were tape-recorded during talking send- 
ing. Two judges, one male and one female, rated the 
recorded statements on the extent of the sender’s 
“involvement” with the scene. Following Buck et 
al. (1974), a high level of involvement implied that 
the sender described a personal emotional experience, 
while a low level of involvement implied that the 
sender described the content of the scene rather than 
his or her reaction to it. The involvement ratings 
were made on 9-point scales with high scores indi- 
cating a high level of involvement. Interjudge reli- 
abilities were .64 for child, 46 for comedy, .65 for 
murder, and .60 for accident (all ps<.001). On the 
basis of the high interjudge agreement, the ratings 
of the two judges were combined. The combined 
ratings for the four scenes were intercorrelated, with 
the median r=.38 (p < .01). On this basis, it was 
decided to combine the ratings of the two judges 
across all four scenes into a single index of sender's 
degree of verbal involvement. 

To test for the effects of sex of subject, mode of 
encoding, and type of scene on the accuracy of non- 
verbal communication, the pleasantness categoriza- 
tion scores were treated in two unweighted-means 
analyses of variance, one for the encoding scores 
(n= 59) and one for the decoding scores (n = 60). 
In these analyses, sex of subject (male/female), group 
of subjects (A/B), and level of anxiety (above/ 
below the median of the anxiety-reactivity scores) 
were between-units factors; mode of sending (spon- 
taneous/talking/posed) and scene (comedy/child/ 
murder/accident) were the within-units (repeated 
measures) factors. Level of anxiety was included in 
the analysis of variance to test for possible interac- 
tions between anxiety and other independent varia- 
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TABLE 3 


Mean Communication Accuracy (PROPORTIONS) 
or Four SCENES UNDER THREE 
Moves or SENDING 


Scenes 
Modes Comedy Child Murder Ai M 
Spontaneous ANT 504 657 -795 668, 
Talking 657 582 442 571 563 
Posed 795 «716 -741 «818 «167 
M 6123 «600 613 +728 666 
Note. n = 60. 


bles. Since no interactions were found, the relation- 
ship between anxiety and encoding and decoding 
scores will be reported in a correlational format. 

Generally, it should be realized that if not for 
the fact that the analysis of variance of the decod- 
ing scores had one more subject than the analysis 
of variance of the encoding scores, mode of com- 
munication and type of scene would have shown 
identical effects on both the encoding and the de- 
coding scores. Since all the senders in the study also 
served as judges, to say that a certain scene is more 
accurately encoded also means that this scene is 
more accurately decoded. However, sex of subject 
and level of anxiety did not have to have identical 
effects on the encoding and decoding scores. For 
example, if females are more accurate decoders, they 
are not necessarily more accurate encoders. 


RESULTS 


Table 3 presents the mean communication 
scores (in proportions) for all four scenes in 
each of the three modes of encoding. Table 4 
presents the analysis of variance of these 12 
means broken down into the 2 degrees of 
freedom effect of mode and the remaining 
orthogonal contrasts involving scene effects 
and Scene X Mode interaction effects. All the 
effects were significant at the .001 level. Con- 
trasts not shown in Table 4 were pooled into 
a residual which accounted for less than 1% 
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of the variance altogether. The error term 
mean square of .012 was based on the meth- 
ods of aggregation and pooling described by 
Green and Tukey (1960). 

Table 5 presents the mean communication 
scores for the effects presented in Table 4. 
The mode effect shows that posing produced 
a higher level of accuracy than spontaneous 
sending and that spontaneous sending pro- 
duced a higher level of accuracy than the 
talking mode. The extremity effect shows that 
the scenes chosen to be extremely pleasant and 
unpleasant (comedy and accident) produced 
a higher level of accuracy than the scenes 
chosen to be more moderate (child and 
murder). The quadratic component of the 
Mode X Pleasantness interaction (as pre- 
sented after the significant mode effects were 
removed) shows that the talking mode pro- 
duced the highest level of accuracy among the 
pleasant scenes (comedy and child) and the 
lowest level of accuracy among the unpleasant 
scenes (accident and murder). The linear 
trend component of the Mode X Extremity 
interaction (as presented after the significant 
mode effects were removed) shows that for 
the extremely pleasant and unpleasant scenes, 
accuracy increased as spontaneity increased, 
while for the moderate scenes, accuracy in- 
creased as “posedness” increased. 

The analysis of variance of the decoding 
scores showed that females were significantly 
more accurate decoders than males, M females 
=.677, M males = .655; F(1,52) = 5.54, 
p = .023, effect size = .65o. When females 
and males were compared on their decoding 
ability as defined by PONS total score (pro- 
portion of items correctly judged), females 
were again found to be significantly better 


TABLE 4 
SUMMARY OF ANALysts OF VARIANCE OF DECODING ACCURACY 


urce df MS F 7? $ 

Mode 2 2.309 192.42 38 62 

Extremity* 1 2.335 194,58 24 49 

Mode (quadratic) X Pleasantness 1 1.234 102.84 14 38 

Mode (linear) X Extremity 1 0.265 22.06 03 18 

Residual between 6 0.064 5.35 <01 09 
Pooled error 624 0.012 


Note, SS total = 16,326; SS error = 7. 
^ Extremity contrasts the extreme p 
and child). 


; SS conditions = 8.838, 
ant and unpleasant scenes (accident and comedy) with the moderate scenes (murder 
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TABLE 5 


MEAN Accuracy SCORES FOR EFFECTS PRESENTED IN TABLE 4 


Source 


Mode X Pleasantness* Mode X Extremityb 


Mode Extremity Pleasant Unpleasant Pleasant Unpleasant 
"Spontaneous 668 Extreme scenes 726 Spontaneous .610 (608) „726 (.724) +580 (578) «756 (.754) 
Talking 563 Moderate scenes .606 Talking -620 (.723) .506 (.609) +512 (615) 614 (.717) 
Posed «167 Posed -756 (.655) 780 (.679) +728 (627) „806 (705) 


* Proportions in parentheses present the Mode X Pleasantness accuracy scores after subtracting the significant mode effects, 
Proportions in parentheses present the Mode X Extremity accuracy scores after subtracting the significant mode effects, 


decoders than were the males, M females = 
-820, M males = .800; F(1, 58) = 4.65, p< 
.05, effect size = .570, However, the analysis 
of variance of the encoding scores showed 
that females were not significantly more 
accurate encoders than were males, M females 
= .677, M males = .658; F(1,5) =.70, p< 
-41, effect size = .23c. There were no signifi- 
cant interactions between sex of either de- 
coder or encoder and any other independent 


“ variable. 


Table 6 presents the intercorrelations 
among the three modes of communication for 
decoding and encoding scores. The results 
showed significant correlations between the 
encoding of posed and spontaneous cues and 
between the decoding of posed and spontane- 
ous cues. Additional correlational analysis 
showed that total decoding was significantly 
correlated (r = .31, p < .05) with decoding as 
defined by PONS total score. Correlations be- 
tween the three modes of decoding (spon- 
taneous, talking, and posed) and PONS total 
Score were not appreciably different (.15, 34, 
and .24, respectively). 

As shown in Table 6, correlations between 
the talking mode and the two other modes 
were positive but low. The reason for such low 


, correlations may have been that the variances 
“of the talking mode were smaller than the 


variances of the spontaneous and the posed 
modes for both encoding (.127 vs. .318 and 
-288) and decoding (,035 vs. .055 and .044), 

Table 7 presents the correlations of encod- 
ing accuracy with senders’ self-ratings of 
pleasantness, and Table 8 presents the corre- 
lations of encoding accuracy with senders? 
self-ratings of intensity and judges’ ratings of 
yerbal involvement, Table 7 shows that the 


relationship between encoding accuracy and 
self-ratings of pleasantness was moderated 
by the level of pleasantness of the encoded 
scene. Thus, for the two pleasant scenes, the 
more pleasant a scene was judged by the send- 
ers, the higher their encoding scores; for the 
two unpleasant scenes, the more unpleasant 
4 scene was judged by the senders, the higher 
their encoding scores. As presented in Table 
8, self-rated intensity and judged verbal in- 
volvement were positively related to level of 
encoding accuracy. Interestingly, the senders’ 
judged verbal involvement was also related to 
the sender’s decoding accuracy as defined by 
pleasantness categorization (r = 27, p < .05) 
and by the PONS (r = .28, p < LOD) 

Of the three personality measures used in 
the present study, only the Anxiety-Reactiv- 
ity Scale was correlated (r= .27, p < .05) 
with total encoding scores, showing that peo- 
ple with higher levels of anxiety were better 
encoders. None of: the three scales was corre- 
lated with decoding scores, The lack of sig- 
nificant relationship between the Self-Moni- 


TABLE 6 


INTERCORRELATIONS AMONG MODES FOR 
DECODING AND ENCODING 


Mode Spontaneous Talking Posed 
Spontaneous — .24 Bald 
Talking 04 — Bir 
Posed .46**a AL oa 


Note, Correlations above the diagonal are of decoding; those 
below the diagonal are of encoding. 

* The analogous correlation for the absolute difference and 
Gast jgateworization measures were .68, p <.001, and .33, 
b <.05, 

b The analogous correlation for the absolute difference and 
exact categorization measures were .77, p <,001, and .45, 
$ <.001. 

* p <05, 

p <.001, 
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TABLE 7 


CORRELATIONS OF ENCODING ACCURACY WITH 
SELF-RATINGS OF PLEASANTNESS 


Mode 


— 


Scene Spontaneous Talking Posed Total 
Comedy +4.49%  +.20 Ast 4.484 
Child +.04 —.09 ++.30%# +418 
Murder —.25 =.15 —.33* —.32* 
Accident —.26* —.17 AL —.26* 

*p <.05. 
tp <01, 

w p < 001, 


toring Scale and posed sending is in contrast 
to the findings reported by Snyder (1974). 

The correlation of total encoding with total 
decoding was .20 when decoding was defined 
by pleasantness categorization and .20 when 
decoding was defined by PONS total score. 
PONS total score was correlated .24 with en- 
coding as defined by exact categorization, .16 
with encoding as defined by the absolute dif- 
ference, and .41 with encoding as defined by 
the agreement index, 

The relationship between encoding and de- 
coding scores of the same scene was com- 
pared to the relationship between encoding 
and decoding scores of different scenes. Table 
9 presents the relevant correlations. The re- 
sults showed that for each of the four scenes 
the correlation between the ability to encode 
and decode the same scene was lower than the 
median of the three correlations between the 
ability to encode this same scene and abilities 
to decode the three other scenes. Furthermore, 
all four -orrelations between encoding and 
decod f the same scene were negative 
whi! - correlations between encoding 

lifferent scenes were positive. 
rəplicate the findings reported 
by . et al. (1975). Furthermore, 
of differences between the 


the 
encoj ag correlations of same scene 
and ing-decoding correlations of 
diffe: is almost identical to the cor- 
respondi ifferences reported by Zuckerman 
et at 


of examining the relationship 
y to send and receive the 
emotions. is to correlate 
of encoding pleasant 


the relative 
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scenes with the relative accuracy of decoding 
pleasant scenes. Relative accuracy of encod- 
ing pleasant scenes was computed by sub- 
tracting the encoding scores of the two un- 
pleasant scenes from the encoding scores of 
the two pleasant scenes. Relative accuracy of 
decoding pleasant scenes was computed by _ 
subtracting the decoding scores of the two | 
unpleasant scenes from the decoding scores 
of the two pleasant scenes. The relative en- 
coding-relative decoding correlation was 
—.275 (df =59, p < .05, two-tailed), indi- 
cating that people who send positive affect 
relatively well (relative to negative affect) 
judge positive affect relatively poorly and vice . 
versa. 


DISCUSSION 


The high level of communication accuracy 
in the posed mode (see Table 5) was not sur- 
prising. First, posing is intended to transmit 
information, while the spontaneous and talk- 


ing modes, at least as defined in the present “» 


study, are not. Second, according to Ekman 
et al. (1972), posing is not controlled by dis- 
play rules whose function is to deintensify, 
mask, or neutralize the emotion. On the other 
hand, the low level of communication accu- 
racy in the talking mode as compared to the 
relatively high level of accuracy in the spon- 
taneous mode was not expected. As was pre- 
viously mentioned, Buck and his colleagues +3 
(Note 2) found the opposite: Senders’ facial 
expressions were decoded above chance when 
senders were talking about their reactions to 
the slides (talking mode) but not when send- 
ers were watching the slides (spontaneous 
mode). It is possible, however, that video- 


TABLE 8 


CorRELATIONS OF ENCODING ACCURACY WITH dy 
SELF-RATINGS OF INTENSITY AND JUDGES’ 
RATINGS OF INVOLVEMENT 


Mode 


Intensity Involvement 

Spontaneous 413 38** 

Talking xe 16 

Posed .26* -28* 

Total 29% .39%* 

Note. Intensity and involvement are correlated .39, p < .01. 
*p <05. 
Fp <0: 


\ 
į 
i 


@ 


Ae 


ENCODING AND DECODING 


taped scenes produce more intense emotions 
than do slides which, in turn, may account 
for the greater accuracy of spontaneous com- 
munication in the present study. As will be 
- discussed later, it-is also possible that the 
difference in content between the stimuli used 


i the” Buck et al. studies and the stimuli 


used in the present study may account for 
the difference in communication accuracy of 
the talking mode. In any case, in the present 
study, the talking mode was not a good indi- 
` cator of either encoding or decoding ability: 
(a) level of accuracy of communication in the 
talking mode was .56—only 6% above chance; 
(b) variances of the talking mode were rela- 
tively small for both encoding and decoding; 
and (c) correlations between the talking mode 
and the two other modes of communication 
were positive but low. E 

The extremity contrast in Table 5 and the 
correlations presented in Tables 7 and 8 sug- 
gest that more intense emotional experiences 
produced more accuraté encoding in all three 
modes of sending. However, superimposed on 
the effects of the sender’s emotional experi- 
ence, there were interaction effects of mode 
by pleasantness (quadratic) and mode by 
extremity (linear) (see Tables 4 and 5). The 
mode by pleasantness interaction showed that 
in talking about pleasant scenes the sender’s 
face encoded better than would be expected, 
whereas in talking about unpleasant scenes 
the sender’s face encoded worse than would 
be expected. It is possible that in talking (but 
not in spontaneous or posed sending) display 
rules deintensify the expression of unpleasant 
emotions and intensify the expression of 
pleasant emotions. Because a larger propor- 
tion of unpleasant stimuli was used in the 
present study than in the Buck et al. studies 
(50% versus 20%), the operation of display 

z rules may also explain why the talking mode 
was a better means of communication in the 
Buck et al. studies. : 

The linear component of the mode by ex- 
tremity interaction indicates that when the 
emotional stimulus is moderate, relative to the 
situation when the stimulus is extreme, send- 
ers are able to encode better if allowed to 
participate actively in the encoding process 
(such as in talking and especially in posing). 
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TABLE 9 


CORRELATIONS BETWEEN (A) ENCODING AND DECODING 
SAME SCENES AND (B) ENCODING AND 
DECODING DIFFERENT SCENES 


Encoding Encoding 
and and 
decoding decoding 
r same different 
Scene scene sceness Difference 
Comedy —.09 +.12 
Child —.24 +.26 
Murder —.09 +.12 
Accident —.00 +17 +17 
Median = ,09 


+.14 +.21 


a Correlations in this column represent the median of three 
correlations between encoding scores of a specific scene and 
decoding scores of three other scenes, 


It is as if by acting the senders compensated 
for the lack of spontaneously occurring emo- 
tion. 

Females were found to be significantly 
better decoders but not significantly better 
encoders than were the males. Other studies 
reported similar sex differences. Zaidel and 
Mehrabian (1969), Zuckerman et al. (1975), 
and Rosenthal et al. (Note 1) have all found 
that females are better decoders than are 
males. On the other hand, the encoding abil- 
ity of females was sometimes significantly 
better, sometimes significantly worse, and 
sometimes no different from the encoding 
ability of males (Thompson & Meltzer, 1964; 
Zaidel & Mehrabian, 1969; Zuckerman et al., 
1975). 

The ability to decode posed cues was sig- 
nificantly correlated with the ability to decode 
spontaneous cues, Furthermore, the total de- 
coding score, which was based on the ability 
to decode spontaneous, talking, and posed 
cues, was correlated with PONS total score 
which was based on the ability to decode 
filmed posed cues consisting of face, body, and 
voice tone cues separately and in combina- 
tion. Interestingly, the correlation between 
total decoding score and PONS total score in 
the present study (r = .31) is identical to a 
reported correlation (Zuckerman et al., 1975) 
between PONS total score and the ability to 
decode posed facial cues from slides. These 
correlations are consistent with the suggestion 
(Ekman et al., 1972) that posed and spon- 
taneous cues resemble each other substan- 
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tially. These correlations also encourage us to 
think that posed and spentaneous cues can be 
used interchangeably in decoding tasks. 

The significant relationship between encod- 
ing posed and spontaneous cues was surpris- 
ing because the two types of cues were en- 
coded under extremely different conditions. In 
the spontaneous condition, senders were not 
aware that their faces were being videotaped, 
and there was no one to whom they were 
communicating. In the posed situation, how- 
ever, senders were intentionally communicat- 
ing a message to a future audience. It seems, 
though, that the spontaneous and posed modes 
of sending involved similar skills, though we 
do not know to what extent the very same 
cues were utilized in the two modes. In both, 
common factors, such as a high level of anxi- 
ety on the part of the sender or the intensity 
of the emotion that was elicited from the en- 
coder by the scene, may have contributed to 
accuracy of encoding, Posed behavior, it 
seems, is not merely an arbitrary convention- 
alized set of cues, and, contrary to Hunt 
(1941), it is related to spontaneous behavior. 

The correlations between total encoding 
and total decoding scores were positive and 
low. The low magnitude of these correlations 
may be explained by the fact that (a) the 
encoding—decoding correlations for the same 
scene were all negative, and (b) total encod- 
ing scores (from which the total encoding- 
decoding correlations were derived) were 
based on only four scenes. Perhaps when en- 
coding and decoding are based on a larger 
number of scenes or emotions, as they were in 
the Zuckerman et al. (1975) study, the 
{otal encoding-decoding relationship appears 
onger. The encoding—decoding correlation 

cial expressions in the Zuckerman et al. 
stul 55 (p < .01). 

Th esis that spontaneous behavior 
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n does posing was not sup- 
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attempt to explain the reason for such a rela- ~ 
tionship, Zuckerman et al. (1975) suggested 
two possible explanations: (a) the skill to 
encode a particular emotion requires skills to 
decode complementary emotions rather than 
the skill to decode the same emotion, and (b) 
if the environment (such as the family) does _ 
not allow the expression of a specific emo- $ 
tion, the individuał may develop the skill to 
detect this emotion in other people but not 
the skill to encode it himself. At present, none 

of these explanations has been put to empiri- 
cal test. 

The negative expression-judgment correla- 
tion for same emotions in the present study 
was low and needs to be replicated. Further- i 
more, the actual patterns of socialization that 
may encourage or discourage the expression 
of certain emotions ought to be investigated. 
Future research may focus on the antecedents 
of encoding and decoding of specific emo- 
tions rather than on encoding and decoding as 
general skills. ‘ 
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Outcome Dependency: Attention, Attribution, and Attraction 
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Theoretical and empirical work on the processes by which we attribute dis- 
positional characteristics to others has focused almost exclusively on how such 
processes proceed once the perceiver has been motivated to initiate them. The 
problem of identifying the factors which prompt the perceiver to engage in an 
attributional analysis in the first place has been relatively ignored, even though 
the influence of such factors may extend beyond the initiation of the causal 
analysis to affect the manner in which it unfolds and, ultimately, the form 
and substance of its conclusion. From the assumption that the function of an 
attributional analysis is effective control of the social environment, it was 
hypothesized that high outcome dependency upon another, under conditions 
of high unfamiliarity, is associated with the initiation of an attributional anal- 
ysis as evidenced by increased attention to the other, better memory of the 
other’s characteristics and behavior, more extreme and confidently given evalua- 
tions of the other on a variety of dispositional trait dimensions, and increased 
attraction to the other, These hypotheses were tested within the context of a 
study of heterosexual dating relationships in which men and women volunteers 
anticipated varying degrees of dependence upon another for their dating out- 
comes. The findings support the view that the data processing operations of 
the social perceiver—from attention to memory to attribution—are part of a 
unified whole and may be viewed as manifestations of an underlying motivation 


to predict and control the social environment. 


Virtually all of the empirical examination of 
the processes by which we attribute disposi- 
tional characteristics to others, as well as most 
of the theoretical work, has been directed to- 
ward an understanding of the “machinery” of 
the attribution process, how such processes 
proceed once the perceiver has been motivated 
to initiate them; the question of when he 
fires up his attribution apparatus has been 
relatively ignored. 

That the study of attribution processes has 
not begun at the beginning is not without 
consequence. We are, of course, unable to 
predict when a perceiver will try to make at- 
tributions about a specific other; thus, at best, 

the emerging blueprint of the attribution 
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machinery is incomplete. In addition, and at 
worst, it may be misleading in relation to its 
usual operation in daily life; the very factors 
which influence whether attribution processes 
are initiated may also steer the manner in 
which these processes unfold and determine, 
as well, the point at which they cease. 

Where do attribution processes begin? Per- 
ceivers, according to attribution theorists, are 
rational constructors of causal schemata who 
analyze two types of data, the behavior of 
another and the context in which his behavior 
occurred, in order to extract such invariances 
from these data as the other’s dispositional 
characteristics (e.g., Jones & Davis, 19.5; 
Kelley, 1972b). It follows, then, that just as 
in any other analysis of data, the attribution 
process must necessarily start with the col- 
lection of data about the object of analysis. 
The beginning of the attribution process 
may be presumed to be, therefore, some- 
where around the point at which the per- 
ceiver attends to a particular other, observes 
his behavior and observes its context (or at- 
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“tends to such artifacts of the other as the 
other’s responses to a questionnaire, an ac- 


counting of his behavior and its context by a 
third person, etc.). 

When do attribution processes begin? Why 
is the perceiver sometimes willing to spend 
his time and energy in an effort to arrive at-a 
causal understanding of another’s behavior? 
On the perceiver’s motivation for performing 
an attributional analysis, theorists agree; 
they endorse the Brunswickian-Heiderian view 
that perceptual acts function to stabilize and 
make predictable the phenomenal world. Kel- 
ley (1972a), for example, observes, “The at- 
tributor is not simply an attributor, a seeker 
after knowledge. His latent goal in gaining 
knowledge is that of effective management of 
himself and his environment” (p. 22). He 
comments further, in fact, that “The attribu- 
tional process can be well understood only in 
the context of a comprehensive analysis of 
the exercise of control” (p. 22). It follows, 
then, that the attribution machinery is most 
likely to be activated for a person in the per- 
ceiver’s social environment who represents 
a potential control problem to him; a person 
who does not present the perceiver with prob- 
lems of control should be unlikely to be the 
subject of an attributional analysis and 
should, therefore, be unlikely to invite atten- 
tion in the service of such an analysis. 

Confidence in the designation of attention 


~ to another as the beginning point of the at- 


tribution process is increased by the apparent 
similarity between the factors which reflect 
the degree to which a person in the perceiver’s 
social environment may represent a potential 
control problem to him and those which are 
known to influence attentional processes. The 
factors perception theorists and researchers 
have identified as governing attentional pro- 
cesses generally may be subsumed under the 
two general principles of novelty and impor- 
tance (cf. Berlyne, 1960, 1974; Lanzetta, 
1971). 

In terms of the perceiver’s exercise of con- 
trol over his social environment, it seems rea- 
sonable to assume that, other things being 
equal, novel or unfamiliar people are more 
likely to represent potential problems of con- 
trol than are familiar (and predictable) others. 
While the determinants of importance have 


not been strictly specified, the degree of impor- 
tance of social stimuli should increase with 
another’s ability to control the perceiver’s re- 
wards and punishments. Thus, the importance 
of a social stimulus to the perceiver should be 
a function of the perceiver’s outcome de- 
pendency upon that other (cf. Thibaut & 
Kelley, 1959). 

Thus, we can predict that, other things 
being equal (particularly the degree of “nov- 
elty” of persons in the perceiver’s social en- 
vironment), persons on whom the perceiver’s 
outcomes are dependent should be especially 
likely to trigger the attribution machinery; as 
the perceiver’s outcome dependency upon an- 
other increases, there should be an increased 
tendency for the perceiver to be attentive to 
the other, to collect data about the other’s 
behavior and the context in which the be- 
havior occurred, and to perform an attribu- 
tional analysis upon these data. 

We speculated earlier that the same factors 
which influence whether attribution processes 
are initiated in the first place may also in- 
fluence the manner in which the attributional 
analysis subsequently unfolds, If it is true 
that outcome dependency is one determinant 
of whether an attributional analysis is begun, 
then we may make at least two predictions 
about how this particular factor might reign 
over the whole causal analysis, 

First, we may predict that as his outcome 
dependency on the person who is the object of 
his analysis increases, so does the perceiver’s 
preference that the analysis conclude in a dis- 
positional rather than a situational attribu- 
tion. Knowledge of another’s personal and 
internal predispositions to behave in certain 
ways would seem to be a more useful bit of 
information to the perceiver who hopes to ex- 
ercise control over that part of his social en- 
vironment than an attribution of the other’s 
behavior to nonpersonal situational factors 
would be (cf. Shaver, 1975). Thus, the factor 
triggering the analysis may influence the form 
of the attribution arrived at. 

Second, when the triggering factor is out- 
come dependency, the social perception lit- 
erature allows us to predict that it will in- 
fluence the substance of the attribution as 
well. The results of several studies investigat- 
ing the influence of perceiver need states upon 
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social perception may be interpreted to sup- 
port the thesis that persons who are outcome 
dependent upon another tend to arrive at 
wish-fulfilling conclusions about the other’s 
dispositions (e.g., Berscheid, Boye, & Darley, 
1968; Darley & Berscheid, 1967; Levine, 
Chein, & Murphy, 1942; Pepitone, 1950; 
Stephan, Berscheid, & Walster, 1971). Thus, 
as the perceiver’s outcome dependency upon 
another increases, the dispositional attribu- 
tions made about the other should be in the 
direction of wish fulfillment (even despite the 
fact that dependency should lead the perceiver 
to pay close attention to the other’s behavior 
and the context in which it occurred), 

We can make, therefore, the following pre- 
dictions: As the perceiver’s outcome depen- 
dency on another in his social environment 
increases, the perceiver should (a) attend 
more closely to the other; (b) remember and 
recognize more details about the other’s be- 
havior; (c) be more likely to make dis posi- 
tional attributions about the other (as evi- 
denced by more extreme and more confident 
responses on trait scales, Jones & Davis, 
1965); and (d) ascribe more positive dis- 
positional characteristics to the other and, 
thus, (e) like the other more, 


METHOD 
Overview 


To investigate the hypotheses outlined above, it 
was necessary to devise a situation in which (a) the 
degree to which persons in the perceiver’s social en- 
vironment represented potential problems of control 
to the perceiver could be manipulated; specifically, 
the perceiver’s outcome dependency on each could be 
varied, and novelty could be held constant; (b) 
the degree of attention the perceiver subsequently 
awarded to each of these others; as well as (c) the 
extent to which the perceiver arrived at dispositional 
inferences about each could be ascertained; and fi- 
nally, (d) the degree to which these dispositional at- 
tribu s reflected wish-fulfilling distortion could be 


specif priori. 
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to characterize perceivers scoring high on this in: i 
vidual difference measure resemble the behaviors 
hypothesize will characterize perceivers who 
themselves in a situation in which their outcomes 
highly dependent on another. 

All of these conditions were satisfied, for our c 
lege age population, by a situation in which the p 
ceiver agreed to turn over his (or her) heterosex:. 
dating life to us, the experimenters, for a period o! 
weeks, promising that he would date whenever, an 
whomever, we instructed, fully aware that it coul 
be his lot to date no one for the period, the san 
person many times over the period, or many differe) 
people over the period. We were thus able to situ: 
tionally manipulate the perceiver’s outcome depe: 
dency upon a person by manipulating his expectati« 
that he or she would be unlikely to date that pa 
ticular person but rather would be dating others í: 
the 5-week period (zero exclusiveness); would d 
that person once and also others during the peri 
(low exclusiveness) ; would date that person exch 
sively over the entire 5-week period (high exclusive 
ness), 


Participants 


Twenty-nine males and 28 females who were 
between 18 and 22 years of age volunteered for an 
advertised experimental study of “heterosexual dating 
relationships.” These individuals also had indicated 
that they were not currently romantically involved 
with any one person (i.e., married, engaged, going 
steady, etc.) and were seriously interested in meeting 
persons of the other sex. All but three were willing to 
commit themselves to the dating terms described 
below, leaving 27 males and 27 females who sub- 
sequently participated in the study. 


Procedure 


Each participant was interviewed individually and 
asked if he or she would agree to three stipulations: 
(a) to participate in a dating study to last 5 weeks 
and to date only those individuals assigned to them 
on the basis of a random lottery and to date ab- 
solately no one else; (b) to report their impressions 
of their assigned date following each contact; and 
(c) to engage in a short videotaped discussion of 
“dating problems on campus” with two ether par- 
ticipants of the same sex and to view a videotape of 
a similar discussion made by three discussants of the 
other sex. 

To assure commitment to these procedures and 
conditions, each Participant was asked to sign a 
formal guarantee to the above Stipulations, 

Exclusiveness manipulation. Participants were ran- 
domly assigned to either the high-, the low-, or the 
zero-exclusiveness condition, In the high-exclusive- 
ness condition, the participant was given the prospec- 
tive date’s name and phone number, and was in- 
formed that he or she would be dating this person 
exclusively—once a week—for the entire 5-week pe- 
riod. In the low-exclusiveness condition the pro- 
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cedure was the same, except that the participant was 
informed that he or she would be dating this person 
only once, and that a different person would be dated 
for each of the remaining 4 weeks. The procedure for 
the zero-exclusiveness condition was identical to the 
low-exclusiveness condition, except that the name 
given as the prospective date was not among the 
names of the discussants whom the participant would 
shortly view on the videotape. Since none of the 
videotaped discussants in the zero-exclusiveness condi- 
tion would be a date, this condition was not included 
as part of the factorial design, but was included to 
provide base-rate levels of response on the principle 
dependent measures (cf. Himmelfarb, 1975). 

Stimulus materials and apparatus. After the par- 
ticipant received the date’s name and number, the 
experimenter asked whether he or she presently had 
enough time to view a videotaped discussion of some 
participants of the other sex for the purpose of gain- 
ing perspective on dating problems, 

Participants viewed the discussion over a specially 
designed system which consisted, in part, of a tele- 
ision monitor and three switches, The experimenter 
explained that due to the limitations of the recording 
equipment, it would be possible to view only one of 
the three discussants on the videotape at a time. He 
indicated that when any one of the switches was de- 
pressed, only the corresponding discussant would 
appear on the monitor; for example, if the left switch 
were depressed, only the participant who had been 
sitting in that location of the discussion room would 
appear. The audio track was common to all three 
switches, If no switch was depressed, there was 
neither video nor audio. If more than one switch was 
depressed, the video was scrambled. For a switch to 
be operative, it had to be actively held down; re- 
lease resulted in loss of both video and audio. Con- 
nected to each switch was a remote cumulative timer 
which measured the time a participant attended to 
each of the discussants. The experimenter was not 
present while the participant viewed the videotape. 

Male participants viewed a videotape of three fe- 
male confederates who engaged in an apparently 
spontaneous 15-minute discussion of dating problems; 
female participants viewed a tape of three male con- 
federates. The confederates were selected, and the 
tapes were edited, on the basis of extensive pretesting, 
such that (a) the stimulus persons within each tape 
all appeared to be slightly above average in physical 
attractiveness and equivalent to each other on this 
dimension; (b) male confederates were as attractive 
to females as female confederates were to males; (c) 
each person spoke for about the same length of time; 
and (d) in general, they appeared to be of about 
equivalent dating desirability. 

Each stimulus person discussant was dressed dis- 
tinctively yet stylishly, and each mentioned his or 
her interests, hobby, academic major, etc., in the 
course of the dialogue. For the first 7.5 minutes of 
conversation, however, the discussants did not men- 
tion their names, Thereafter, the discussants intro- 
duced themselves to each other using their “names.” 
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Participants in the high- and the low-exclusiveness 
conditions anticipated dating one of the three stimu- 
lus persons on the videotape. The prospective date’s 
name had been randomly selected for each participant. 
Each of the stimulus persons on each tape was desig- 
nated a prospective date for an equal number of 
participants in each of the high- and low-exclusive- 
ness conditions. In the zero-exclusiveness condition, 
participants expected to date a person whose name 
was given to them, but that person was not among 
the discussants on the videotape viewed. 

Questionnaires. After the participant had viewed 
the videotape, the experimenter returned and asked 
the participant to complete an “impression formation 
questionnaire.” This measure was ostensibly intro- 
duced as a means of determining if men and women 
differed in the way they responded to videotaped dis- 
cussions and the impressions they derived from them. 

The first items on the questionnaire were memory 
items, both recall and recognition, for each of the 
three stimulus persons. They required the participant 
to answer questions pertaining to specific things said 
or articles of clothing worn by each of the stimulus 
persons. 

Next, participants were asked to complete 15 bi- 
polar trait scales (e.g., warm-cold, strong-weak) for 
each of the three stimulus persons, Immediately be- 
neath each trait scale was a bipolar “confidence” 
scale, Thus, after each trait rating was made, par- 
ticipants rated their confidence in the rating they had 
made. According to Jones and Davis (1965), attribu- 
tions of dispositions to a stimulus person are evi- 
denced by perceiver ratings of the stimulus person 
which are extreme on trait dimensions, and ratings in 
which the perceiver indicates confidence. 

In addition, participants were asked to rate their 
liking for each stimulus person on a 41-point bipolar 
liking scale, And finally, participants were asked to 
complete the self-monitoring scale (Snyder, 1974), 

After participants had completed all questionnaires, 
they were interviewed extensively. No participant re- 
ported suspicion, nor were any able to guess the true 
nature of the experiment. Participants were debriefed 
and thanked for participating in the experiment, 
which was terminated at this point. To fulfill the 
promise of a date, however, each male was randomly 
assigned to a female participant. 


RESULTS 
Independent Variables and Design 
The independent variables were: 


1, Date versus nondate. Since there were 
three stimulus persons, and only one of these 
was the anticipated date, the remaining two 
were designated as “nondates.” For each de- 
pendent measure, scores for the two nondates 
were summed and divided by two, yielding an 
average nondate score. Thus, each participant 
produced two scores for each dependent mea- 
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sure: a score for the anticipated date and a 
Score representing the average of the two 
nondates, For the analyses, this within-sub- 
jects variable was nested within the other two 
independent variables for two reasons: (a) It 
was predicted that the other two independent 
variables would affect responses to the an- 
ticipated date, but not responses to the non- 
date; and (b) on some dependent variables, 
there was a direct dependency of certain date 
Scores on nondate scores; for example, if a 
participant attended to his anticipated date, 
he could not simultaneously attend to his 
nondate, 

2, Exclusiveness of relationship, Although 
there were three levels of exclusiveness in the 
study (zero exclusiveness, 0-dates; low exclu- 
~ siveness, 1-date; and high exclusiveness, 5- 
= dates), only low and high exclusiveness were 
included in the major analyses, Since there 
could be no date-versus-nondate factor in the 
___zero-exclusiveness condition (all three stimu- 

lus persons were nondates), data obtained 
from the zero-exclusiveness condition were 
analyzed in separate auxiliary comparisons, 
3. Self-monitoring. Each of the participants 
in the low-exclusiveness and high-exclusive- 
ness conditions were classified as either high 
in self-monitoring or low in self-monitoring on 
the basis of a median split of scores (Mdn = 
12) on the Snyder Self-Monitoring Scale.t 
4. Sex of participant. Due to the multiple 
dependent measures and the likelihood of 
substantial intercorrelations between these 
measures, the data were analyzed in a single 
multivariate analysis of variance (MAN OVA). 
Since there were no systematic differences 
(main effects or interactions) between male 
and female participants on any of the de- 
pendent variables (all MANOVA Fs < 1.70), 
data for male and female participants were 
combined for all subsequent analyses. Thus, 
subsequent MANOVAs were based on a 2X 
(2 X 2) design and were followed by separate 
univariate ANOVAs for each dependent mea- 


sure (Hummel & Sligo, 1971). 
Dependent Vi S 
Each partici; roduced seven dependent 


icinated date and seven 
average nondate. 


measures for tl 
dependent measu 
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These were: 


1. Attention. The attention dependent mea 
sure was the percentage of time the antici 
pated date and the average nondate wer 
watched after the stimulus persons had men 
tioned their names. 

2. Recall. The recall dependent measure 
consisted of “fill-in-the-blank” items, such as 
“What was the academic major of the person 
on the left (on the videotape)?” 

3. Recognition. The recognition dependent 
measure consisted of multiple choice items re- 
ferring to details of the videotaped discussion 
of the stimulus persons. For example, par- 
ticipants were asked, “Who said he liked to g 
to sports events on a first date?” Choices wer 
the person on the right (on the videotape) 
person on the left, person in the middle, and 
no one. Since males and females saw different 
videotapes, items drawn from the one tape 
may have been more difficult than items 
drawn from the other tape. For this reason, 
recognition items were converted to stan- 
dardized z scores before the recognition scores 
were analyzed, By converting to z scores, 
mean difficulty across items and across tapes 
was equated. 

4. Extremity of trait ratings. This depen- 
dent measure was the sum of the absolute de- 
viation from the midpoint of 21 on each of the 
15 bipolar 41-point trait scales, 

5. Confidence of trait ratings. After par- 
ticipants had completed each rating of a stim- 
ulus person on a bipolar trait scale, they were 
asked to indicate their confidence in that rat- 
ing on a 41-point scale which was placed im- 
mediately beneath the trait scale, Scores on 
the 15 confidence scales were summed. 

6. Positivity of trait ratings. Each of the 15 
trait scales had a positive pole (e.g., “warm,” 
“sensitive”) and a negative pole (e.g, “cold,” 


1 The range of self-monitoring scores in the low- 
exclusiveness condition was 5 to 18, with 11 partici- 
pants at or above the median of 12 and 7 below the 
median of 12. The range of self-monitoring scores in 
the high-exclusiveness condition was 5 to 22, with 11 
at or above the median of 12 and 7 below the me- 
dian. Thus, cell frequencies are proportional. The 
correlation between the total range of self-monitoring 
scores and each of the dependent variables is reported 
in Table 3, 


Ca 


| 
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TABLE 1 
MANOVA anv ANOVA Summary TABLE ror THREE OUTCOME DEPENDENCY VARIABLES 
Exclusiveness condition Self-monitoring 
valent EAE, pee Nendita Pee ACA 
MANOVA Agee 2.20* 2.50* 2.49* 1.48 
Univariate ANOVAs 
Attention 58.66**** 8,96"** 8.43" 2.02 1.97 
Memory 
Recall 259,5405** -00 08 6.77" 1.03 
Recognition 31,8384 05 2,23 62 01 
Attribution 
Extremity 4.32* 2.46 9.137 10,31*** 8.22" 
Confidence 5353* 27 1.86 4,38" 1.11 
Attraction 
Liking 6,43* 3,50* 1.07 4.51" 28 
Positivity 12.63°*** 4,51* 4.11" 5.67* 81 


base : 01, 
wet b < .005, 

wk p < 001, 

“insensitive”). Each trait scale was scored 
such that a response nearest the negative pole 
received a score of 1, and a response nearest 
the positive pole received a score of 41. Scores 
on all 15 scales were summed, 

7. Attraction. The liking dependent measure 
consisted of the response to the question 
“How much did you like this person?” on a 
41-point bipolar scale. 


MANOVA and Univariate Analyses 


No significant interactions were found be- 
tween the exclusiveness variable and the self- 
monitoring variable on any of the dependent 
measures for either the anticipated date or 
the nondate (all Fs < 1.14). Thus, the analy- 
ses reported below describe the main effects 
which were evident for the three outcome de- 
pendency factors: date/nondate, high/low ex- 
clusiveness, high/low self-monitoring. These 
analyses are presented in Table 1. 

First, there was a_ highly significant 
MANOVA date-versus-nondate effect, F(7, 
26) = 44.71, p< .00001. This result sup- 
ported our prediction that an anticipated date 
would elicit different responses on the de- 
pendent measures than would a nondate. 

Second, there was a significant MANOVA 
exclusiveness effect for the responses to the 
anticipated date, F(7,26) = 2.20, p < .06. 
This finding supported our prediction that a 
person who was expected to be a date five 


{ANOVA effects, df = 7, 26; for univariate ANOVAs, df = 1, 32. 


times was responded to differently than was a 
stimulus person who was expected to be a 
date only one time. Contrary to our predic- 
tions, however, there was also a significant 
MANOVA exclusiveness effect for the non- 
date, F (7,26) = 2.60, p < .04. Exclusiveness 
of the relationship with the date affected reac- 
tions to the nondate as well. 

There was a significant MANOVA self- 
monitoring effect for the anticipated date, 
F(7,26)= 2.49, p< .04. This finding sup- 
ported our prediction that participants high 
in self-monitoring would respond to an an- 
ticipated date differently than low self-moni- 
tors would. As predicted, there was no signifi- 
cant MANOVA self-monitoring effect for the 
nondate, F(7, 26) = 1.48, ns. 

The univariate ANOVAs provide more spe- 


cific tests of our hypotheses of participants? — 


responses on each of the dependent measures, 


Altention 


A significant univariate date-nondate ef- 
fect for attention, F(1, 32) = 58.66, p< 
.000001, indicated that the anticipated date 
was awarded more of the participant’s atten- 
tion (M = 40.53%) than was the nondate 
(M = 29.69%). 

As predicted, there also was a significant 
univariate exclusiveness effect for the antici- 
pated date on the attention dependent mea- 
sure, F(1,32) = 8.96, p < .005; an antici- 
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pated date was given more attention in the 
high-exclusiveness condition (M = 43.33%) 
than in the low-exclusiveness condition (M = 
37.72%). 

The univariate self-monitoring effect for the 
anticipated date was not significant on this 
dependent measure, F (1,32) = 2.02, p < 16; 
men and women high in self-monitoring did 
not spend more of their time watching their 
anticipated date than those low in self-moni- 
toring did. 


Memory 


A highly significant univariate date-versus- 
nondate effect for recall, F(1,32) = 259.54, 
P < .000001, was found; this finding indicated 
that, as predicted, participants recalled more 
details about their anticipated date (M = 
2.83) than about the nondate (M = 1.33). 

There was a highly significant date-non- 
date effect for recognition as well, F(1,32) = 
31.83, p < .00003, indicating that participants 
recognized more details pertaining to their 
anticipated date (M = +.66) than to the 

nondate (M = —,17). To investigate whether 
accuracy in recognition was due to a response 
bias toward attributing all statements to the 
date, analyses of “false positive” scores were 
conducted. The results of these analyses did 
not support this interpretation of the recogni- 
tion finding; false positive scores did not vary 
with condition (all Fs < 1.26), 

The univariate exclusiveness effect for re- 
call for the anticipated date was not signifi- 
cant, F(1,32) = .00, nor was the univariate 
exclusiveness effect for recognition for the 
anticipated date, F(1,32) = .05. 

The univariate self-monitoring effect for re- 
call for the anticipated date was significant, 
F (1,32) = 6.77, p < .01; those high in self- 
monitoring recalled more about their antici- 
pated date (M = 3.00) than did those low in 
self-monitoring (M = 2.57). 


Trait Ratings 


Extremity. There was a significant uni- 
variate date-nondate effect for extremity of 
trait rating, F(1,32) = 4.32, p < .05; par- 


ticipants rated their anticipated date more ex- 
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tremely (M = 132.06) than they did their 
nondates (M = 120.72). 

Although the means were ordered as pre- 
dicted, the univariate exclusiveness effect for 
the anticipated date was not significant on 
the extremity of trait rating measure, F (1, 32) 
= 2.46, p < .13. The univariate exclusiveness 
effect for extremity for the nondate was sig- 
nificant, however, F(1,32) = 9.13, p < .005; 
this finding indicated that a nondate was rated 
more extremely in the high-exclusiveness con- 
dition (M = 135.83) than in the low-exclu- 
siveness condition (M = 105.61). 

There was a significant univariate self- 
monitoring effect for extremity for the an- 
ticipated date, F(1, 32) = 10.31, p < .003, 
indicating that those high in self-monitoring 
rated their anticipated date more extremely 
(M = 150.36) than did persons low in self- 
monitoring (M = 103.29), High self-monitor- 
ing individuals were also more extreme in 
rating their nondates (M = 132.16) than were 
the low self-monitoring participants (M = 
102.75); F(1, 32) = 8.22, p < .007, 

Confidence, For the confidence in trait rat- 
ing dependent measure, there was a significant 
univariate date-nondate effect, F(1,32) = 
5.53, p < .03; this finding indicated that the 
anticipated date was rated more confidently 
(M = 393.67) than were the nondates (M = 
372,86). 

The univariate exclusiveness effect for con- 
fidence in rating the anticipated date was not 
significant, F(1,32) = .27. The univariate 
self-monitoring effect for confidence in trait 
rating of an anticipated date was significant, 
F(1, 32) = 4.38, p < .04, indicating that per- 
sons high in self-monitoring were more con- 
fident in rating an anticipated date (M = 
418.19) than were persons low in self-monitor- 
ing (M = 354,00). 

Dispositional attributions. Following Jones 
and Davis’ operational definition of disposi- 
tional attributions, when a perceiver rated a 
stimulus person both extremely and with con- 
fidence on trait rating scales, we assumed he 
had made dispositional attributions to the per- 
son. It will be recalled that the extremity and 
confidence data analyzed in the two separate 
ANOVAs indicated that perceivers were both 
more extreme and more confident in rating 
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rsons they anticipated dating than they 

e in rating persons they did not expect to 
‘ate. These analyses do not tell us, however, 
whether extremity and confidence covaried in 
ratings of the same stimulus person. To in- 
vestigate whether this was indeed the case, 
within each exclusiveness condition, we cor- 
related the extremity score with the con- 
‘idence score for the ratings of the date, and 
ompared this correlation with the correlation 

tween average nondate extremity scores and 
confidence scores. 

These analyses revealed that within the 
low-exclusiveness (1-date) condition, extrem- 
ity and confidence were correlated +.77 (df = 
16) for the date, and +.41 (df = 16) for the 
average nondate. Within the high-exclusive- 
ness (5-date) condition, extremity and con- 
fidence were correlated +.85 (df = 16) for 
the date, and +.63 (df = 16) for the average 
nondate. Thus, extremity and confidence were 
more highly correlated for dates than they 
were for the average nondates. In both ex- 
clusiveness conditions, the correlation between 
extremity and confidence of trait rating was 
significantly higher for the date than for the 
nondate.2 (A similar pattern was found for 
both those high and those low in self-monitor- 
ing in both conditions.) 

Extremity and confidence ratings from the 
zero-exclusiveness (0-date) condition provided 
especially valuable information relevant to 
this issue. Since all three stimulus persons in 
this condition were nondates, dispositional at- 
tributions should have had relatively less 
utility for the perceivers in this condition. As 
a consequence, we expected the correlation be- 
tween extremity and confidence scores to be 
lower for all stimulus persons in the zero ex- 
clusiveness condition than for the anticipated 
date in the other conditions, As predicted, 
there was no significant correlation (r= 
+.07, df = 16) between extremity and con- 
fidence of trait ratings in the zero exclusive- 
ness condition. 

It might be argued that the lack of correla- 
tion between extremity and confidence in the 
zero-exclusiveness (base-rate) condition merely 
reflects a restriction in range since we had ex- 
pected ratings in this condition to be less 
extreme and less confidently given than in a 
condition in which a date was anticipated. To 
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investigate this possibility, the variances as- 
sociated with the extremity and confidence of 
trait ratings in the zero exclusiveness condi- 
tion were compared with the variances associ- 
ated with the extremity and confidence rat- 
ings of an anticipated date, collapsing across 
the high- and low-exclusiveness conditions. 
There was no reliable evidence of differential 
variability on the extremity ratings between 
the zero-exclusiveness condition (SD = 45.65) 
and the conditions in which a date was an- 
ticipated (SD = 48.96), F(35,17) = 1.15, 
ns; nor was there differential variability on the 
confidence ratings between the zero exclusive- 
ness condition (SD = 75.68) and the an- 
ticipated date conditions (SD = 93.62), F(35, 
17) = Sms: 

It should be noted that, contrary to our 
expectation, the stimulus persons in the zero- 
exclusiveness condition were not rated less 
extremely than were anticipated dates. In ad- 
dition, the stimulus persons were not rated 
any less confidently in the zero-exclusiveness 
condition than in the anticipated date con- 
ditions, The ANOVA summary and marginal 
means for extremity, confidence, and the other 
dependent variables are presented in Table 2. 


2Due to the dependency between ratings for the 
date and ratings for the nondate, these correlations 
could not be directly contrasted. We would like to 
thank Auke Tellegen for suggesting a test based on 
the following reasoning: When two variables x and y 
are perfectly correlated (rzy = +1.00) and when each 
pair of x and y scores are converted to z scores, 
within each pair, Zs =zy; the smaller the relationship 
between x and y, the greater discrepancy within pairs 
of z scores. Therefore, within both high- and low- 
exclusiveness conditions, extremity scores and con- 
fidence scores were converted to z scores for the date 
and for the nondate, The s scores for extremity were 
subtracted from the z scores for confidence for the 
date, producing a date discrepancy score. A similar 
discrepancy score was produced for the nondates, It 
was hypothesized that if there were a higher correla- 
tion between extremity and confidence for the date 
than for the nondate, the discrepancy score for the 
date should be significantly smaller than for the non- 
date. This hypothesis was tested using £ tests for cor- 
related measures. There was a significantly smaller 
discrepancy score for the date than for the nondate 
in both the low-exclusiveness condition, ¢(17) = 2.36, 
p <.05, and in the high-exclusiveness condition, ¢(17) 
=2.96, p < .01. (Both ¢ tests were two-tailed.) 
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TABLE 2 


Summary TABLE AND MEANS FOR AN ANTICIPATED 
Date (CoLLAPSING Across HIGH- AND 
Low-Exctusivengss CONDITIONS) VERSUS 
NONDATE IN ZERO-EXCLUSIVENESS CONDITION 


Anticipated 


date Nondate 
ANOVA effect M M F 
Attention 40.53% 33.33% 24.,30*** 
Memory 
Recall 2.83 1.61 60,70*** 
Recognition +66 — 21 4,89* 
Attribution 
Extremity 132,06 132.46 .00 
A Confidence 393.67 400.43 .07 
É: Attraction 
s Liking 29.22 25.54 5.63** 
Positivity 401,33 366.98 4.82* 


y In summary, there is no evidence that par- 
-ticipants responded less extremely or less con- 
fidently in the zero-exclusiveness condition., 
This suggests that the lack of correlation be- 
tween extremity and confidence in the zero-ex- 
clusiveness condition might better be attrib- 
uted to less coherence and correspondence in 
the inference process in this condition than to 
a restriction in range artifact, 


Attraction 


To investigate our hypotheses on the at- 
traction dependent measure, we return now to 
the ANOVA analyses. There was a significant 
univariate date-nondate effect for liking, 
F(1,32) = 6.43, p < .02. As predicted, par- 
ticipants liked their anticipated date signifi- 
cantly more (M = 26.22) than they liked 
nondates (M = 25.44). 

The univariate exclusiveness effect for lik- 


ing was i uly significant, F(1, 32) = 
3.50, p uggested that an anticipated 
date wa e under high-exclusiveness 
(M under low-exclusiveness 


(M 1S. 
T] ‘monitoring effect for 
i ignificent, F(1,32) = 4.51, 
hat ‘hose high in self- 
vated dates more 
w in self-moni- 


WH 30.68) tha ose 
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When we examine the positivity of trait 
ratings, we find a significant univariate date— 
nondate effect, F(1, 32) = 12.63, p < .001, 
indicating that the anticipated date was per- 
ceived more positively (M = 401.33) than 
was the nondate (M = 363.11). 

The univariate exclusiveness effect for posi- 
tivity of trait ratings for the anticipated date 
was also significant, F(1, 32) =451, p< 
-04, indicating that an anticipated date was 
rated more positively under high-exclusive- 
ness (M = 421.50) than under low-exclu- 
siveness (M = 381.77) conditions. Unexpect- 
edly, but in accord with some results on other 
dependent measures, the univariate exclusive- 
ness effect for the nondate was also significant, 
F(1,32) = 4.11, p < 05, indicating nondates 
in the high-exclusiveness condition were rated 
more positively (M = 380.06) than were non- 
dates in the low-exclusiveness condition (M 
= 346.17). 

The univariate self-monitoring effect for the 
anticipated date was also significant, F(1, 32) 
= 5.67, p < .02; those high in self-monitoring 
rated their anticipated date more positively 
(M = 419.36) than did those low in self- 
monitoring (M = 373.00). 

It will be recalled that we hypothesized 
that in general, under conditions of high out- 
come dependency, a ‘person should be highly 
motivated to predict the behavioral predisposi- 
tions of the person on whom he or she is de- 
pendent. We expected this motivation to be 
reflected in high intercorrelations among all 
the dependent variables under conditions of 
high outcome dependency, but low intercor- 
relations under low outcome dependency, 
Since each of the independent variables in the 
MANOVA analyses we conducted may be 
viewed as convergent conceptualizations of 
outcome dependency (cf. Messick, 1975), we 
may interpret our finding that each of these 
MANOVA effects were significant as suggest- 
ing that a single factor, outcome dependency, 
was underlying participants’ responses on the 
dependent measures. In addition, Table 3 


presents the intercorrelations among the seven - 


dependent measures, as well as their correla- 
tions with self-monitoring scale scores, for the 
anticipated date and for the nondate (col- 
lapsing across the 1-date and 5-date condi- 
tions) and the zero-date stimulus persons as 
well. It can be seen that responses on the de- 


Y 


ss. 
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R TABLE 3 
INTERCORRELATION MATRIX FOR DEPENDENT VARIABLES FOR THE ANTICIPATED DATE, NONDATE, 
AND NONDATE FROM ZERO-EXCLUSIVENESS CONDITION (“Zero DATE”) 
Recall Recognition Extremity Confidence Liking POA E eae a 
Attention 
Date A7 —.01 35* 34 22 wt 3 
mma Nondate 10 =.23 =.19 =17 20 .23 — 18 
O-date — 06 .08 — 18 — 01 21 AL .06 
4 mory 
ste —.06 .38* 28 at) 28 38* 
Nondate —.10 02 06 A2 03 At 
O-date .06 —.18 —.05 —.14 — 22 27 
ecognition 
Date — 24 —.19 .02 —.12 —.01 
Nondate —.20 —.10 AL —.09 04 
f 0-date 27 21 =AZ 02 13 
2 
F r 
Date ` .80*** Grete 84h AS** 
Nondate Thad AT Ase 3t 
0-date 07 —.04 — 32 05 
Confidence 
Date 50** A) han 36* 
Nondate 20 32 26 
s 0-date 02 OL 08 
me $ 
Attraction 
o) 
> Liking 
Date 590" 35 
Nondate 70S 05 
0-date 69** =.11 
Positivity 
Date .36* 
Nondate ll 
0-date 08 


Note, Correlations for Date are based on 36 observations (df 
the correlations of the two nondate stimulus persons and are base 


= 34). Correlations for Nondate are derived from the average of 
d on 36 observations (df = 34). Correlations for 0-date are derived 


from the average of the correlations of the three O-date stimulus persons and thus are based on 18 observations (d/ = 16). 


“Dp < 205. 
** p < 0l 
** p <.001, 


pendent measures tended to be more highly 
intercorrelated for the anticipated date than 
for the nondate and zero-date stimulus per- 
sons, 


x Discussion AND CONCLUSIONS 


i We hypothesized, it will be recalled, that 
the same factors which prompt the initiation 
of the attribution process may guide, in pre- 
dictable ways, how the process unfolds, in- 
fluencing both the form and the substance of 
its conclusion. Following the view that all 
perceptual acts are performed in the service 
of rendering the environment stable and pre- 

X dictable, and particularly Kelley’s (1972a) 

a view that “The purpose of causal analysis . . . 


£ 


o” 


is effective control” (p. 22), we reasoned that 
outcome dependency upon another, under con- 
ditions of high novelty, may be one such initi- 
ating factor. 

With divergences only in some particulars, 
our results generally support the specific hy- 
potheses deduced from the above reasoning. 
With increases in a person’s dependency upon 
another (for his or her dating outcomes, in 
this case), he or she tended to award more 
attention to the other, remember more about 
the other, evaluate the other more extremely 
and more confidently on a variety of disposi- 
tional traits, and be more attracted to the 
other as indicated by the positivity of trait 
ratings of the other and by self-reported liking 
of the other. 
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These last findings do not appear to be due 
solely to the fact that perceivers paid more 
attention to, and thus should have had more 
information about, the other on whom they 
anticipated dependence. Several lines of evi- 
dence argue against a purely informational in- 
terpretation of the memory and attraction re- 
sults, First, there were few significant cor- 
relations between attention time and the other 
dependent variables (see Table 3). Second, 
while scores on the self-monitoring scale 
proved to be associated with attribution, they 
were not related to attention. Third, nondates 
in the high-exclusiveness condition were rated 
more extremely and more confidently than 
were nondates in the low-exclusiveness condi- 
tion even though they were awarded less at- 
tention. 

Consideration of the putative function of 
attribution processes to make the social en- 
vironment more predictable, and hence more 
susceptible to control, led us to hypothesize 
that there may be stable individual differences 
in the desire for such predictability. We hoped 
that scores on Snyder’s Self-Monitoring Scale 
might be indicants of such differences; we rea- 
soned that the evidence which demonstrates 
that high self-monitoring perceivers depend 
more upon cues provided by others to guide 

their own social behavior than do low self- 
monitoring perceivers suggests that high 
scorers on this scale may be habitually more 
concerned with making accurate predictions 
about their social environment. 

The positive and significant relationships 
which were found between an individual’s self- 
monitoring score and the attention he or she 
awarded the prospective date, the extremity 
and the confidence of ratings of that other, as 
well as memory of and attraction to the other, 
are consistent with our reasoning. It might be 
noted, too, that these results suggest that high 
self-monitoring individuals do not straight- 
forwardly and simple-mindedly copy the cues 

provided by another’s behavior to guide their 
own; rather, our findings indicate that these 
verceptual/cognitive time 


individuals expend 
contingencies which 


and effort to discer: 
underlie the other's D: or. 
While the attention 
generally straightforw 
attribution results invit 


emory results are 
s expected, the 
- comment, It 
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will be recalled that while neither the | - 
tremity ratings nor the confidence rating 
the zero-exclusiveness condition differed fr 
those of the other conditions, within the ze 
exclusiveness condition extremity and cı 
fidence of trait ratings were not systematica 
related to each other. By comparison, extrei 
ity and confidence of trait ratings for ti 
prospective date were highly correlated in tl 
other conditions. 

These findings suggest that extremity an: 
confidence of trait ratings served differen 
functions in the two situations in which rat 
ings were made. In particular, it suggests tha 
those perceivers who were dependent upon th: 
other for their dating outcomes tried to con 
struct a coherent perception of that other (cf 
Neisser, 1966) as evidenced by the kind o 
cognitive constructions (i.e., dispositional at 
tributions) they produced, but that those per 
ceivers who did not anticipate dependence on 
the other did not attempt to construct such 
a systematic picture. 

Perhaps more importantly, the results of 
these internal analyses investigating the rela- 
tionship between extremity and confidence 
support the primary findings of this study and 
the conclusion that schemata of the attribu- 
tion process which are developed from in vitro 
observations of the laboratory, and which 
neglect the motivational bases of these pro- 
cesses, may be at variance with how these 
processes operate in vivo. It seems fair to say» 
that our zero-exclusiveness condition closely 
approximates the typical person perception 
laboratory situation. In such situations, the 
perceiver typically observes, or is given in- 
formation about, stimulus persons with whom 
he anticipates little or no future interaction 
(and, in some cases, he is even aware that the 
stimulus person does not exist). In addition, it 
is typical of these situations that the per- 
ceiver might not even notice the stimulus per-» 
son but for the experimenter’s request that he 
read about or watch the stimulus person to 
evaluate him on a questionnaire. If the typical 
person perception study participant performs 
similarly to our zero-exclusiveness condition 
perceivers, their ratings of the stimulus per- 
son may be completed unsystematically with 
respect to confidence and extremity, Thus, 
whether their evaluations are representative 
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4 of the type of evaluations they would spon- 
taneously make under more naturalistic con- 
ditions is in doubt. At the least, the present 
findings suggest that the terms “dispositional 
attribution” and “rating” should not be used 
interchangeably as they often are in the per- 
son perception literature. 

Finally, we might comment upon the gen- 
eral role of motivational factors in the at- 
tribution process. In his discussion of attribu- 
tion processes in the context of control, Kel- 
ley (1972a) states, “We might simply leave 
our analysis at the point to which we have 
brought it—with the recognition that at- 
_ tributions are often reasonable but that they 

are also often influenced by personal interest, 
w emotional considerations, and such” (p. 22). 
The results of the present study allow us to 
entertain the thesis that the conditions which 
provide the motivation for the perceiver to 
initiate the attribution process in first place 
may be the very conditions under which he is 
likely to invest “personal interest, emotional 
===>" considerations, and such” in the outcome of 
dey that val analysis (cf. Berscheid & Gra- 
f ziano, i press, for further discussion of, and 
evidence relating to, this proposition). 

In addition, and in accord with contempo- 
rary recognition that perception, memory, and 
cognition are so inextricably interwoven that 
it is difficult to study each separately (e.g., 
Flavell, 1970; Neisser, 1966), the present 
study demonstrates that all of the data-pro- 
cessing operations of the social perceiver—the 
collection of data (attention), the selective 
storage and retrieval of data (memory), as 
well as ` “erential extrapolation on the basis 
of the dota (attribution)—are part of a 
unified whole and may be viewed as mani- 
festations of an underlying motivation to 
predict and control the social environment. 
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From newspaper accounts it appears that when women were attacked and F 
bystanders did not intervene, frequently the bystanders justified their inactivity 
by stating that they thought it was a “lovers? quarrel.” In the first study sub- l 
jects witnessed a violent fight between a man and a woman. Subjects inter- l 
vened more frequently in the fight between strangers than between a married 
couple. In additional experiments it was found that the stranger condition was l 
perceived as being more damaging to the woman although the fights them- 
’ selves were identical. The woman in the stranger condition was seen as want- 
; ing help more frequently than the married woman. In addition the attacker 


was perceived as likely to run in the stranger condition and to stay and fight 


in the married condition, If subjects know 
likely to perceive the protagonists as dates, 


little about the conflict, they are 
lovers, or married couples rather 


than as strangers, acquaintances, or friends, The results were discussed in 
terms of their implication for social control. 


From time to time newspapers report that 
bystanders have witnessed an attack by a man 
on a woman without adequately helping the 
victim. For example, in one such case, 25 em- 
ployees of a company within a block of the 
police station observed the rape of a woman 
and did nothing but watch (Associated Press, 
1973). 

Borofsky, Stollak, and Messé (1971) in- 
vestigated bystanders’ responses to attacks by 
a man on a woman, a man on a man, a woman 
on a man, and a woman on a woman by stag- 
ing fights in the context of a psychodrama that 
“got out of hand.” Subjects observed all of 
these fights with confederates of both sexes. 
The authors found that men helped more 
than women in all but one condition. When a 

man attacked a woman no male subject out of 
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six intervened, while they intervened in the 
other fights approximately two thirds of the 
time. The authors tentatively suggested that 
male bystanders derived “vicarious sexual 
and/or hostile gratification” from viewing the 
incident. The authors also suggested that the 
four fights were seen as very different phe- 
nomena, although they reported no evidence 
that this was the case. At any rate, the in- 
terpretation of these results is at best ex- 
tremely uncertain, 

Research on bystander behavior suggests 


that part of the explanation for some of the ~~ 


accounts of why no help was given to the 
assaulted women is bystanders are less likely 
to help when others are present than when 
they are alone, This occurs because (a) by- 
standers take cues from others concerning the 
appropriate response, and (b) bystanders feel 
a diffusion of responsibility (Latané & Darley, 
1969). However, it appears that other factors 
may also be involved in inhibiting the re- 
sponse of bystanders when a man assaults a 
woman. 

Bystanders stated that they did not help 
Kitty Genovese because they thought it was 
“a lovers’ quarrel” (Rosenthal, 1964), or, as in 
the previous example, “it might turn out to be 
her boy friend . . .” (Associated Press). At 


Shotland, Department hology, The Pennsyl- ; 3 i 
vania State University, | ty Park, Pennsylvania Other times, even when witnesses did not men- 
16802, tion the possibility of a relationship between 


990 


BYSTANDER RESPONSE TO ASSAULT 


he victim and attacker, the descriptions they 
provided suggest that they might have be- 
eved there was a relationship (McFadden, 


The set of experiments we describe explores 
the implications and effects of a perceived re- 
lationship between a victim and her attacker 
ping behavior. Specifically, the hypoth- 
sis for Experiment 1 was: If bystanders per- 
i relationship between a woman and her 
sr, they will be less likely to intervene 
van if the victim and attacker are perceived 
strangers, 


EXPERIMENT 1 
Method 


he first procedure attempted was similar to the 

» used by Borofsky et al. (1971) in which a male 

ttacked a female during a psychodrama. The fight 
was actually videotaped and played for each subject, 
who was told he or she had been arbitrarily selected 
fo act in the next psychodrama pairing of the three 
varticipants and was asked to leave the other two 
participants (the confederates) so as not to inhibit 
them and to watch their psychodrama on television 
in the next room. This procedure proved unworkable 
as approximately 40% of the subjects thought the 
fight was staged. 

A second procedure was needed that was more be- 
lievable, The local police department was made aware 
of the impending experiment and their advice was 
sought. Three of the first author’s colleagues in the 
Department of Psychology were asked to review the 
procedure for ethical considerations. 


Subjects 


The subjects were 51 male and female students who 
participated in order to receive course credit for 
either an introductory psychology, or Man-Environ- 
ment Relations course during the summer of 1974 
at The Pennsylvania State University. 


Procedure 


Two couples with theatrical training were re- 
cruited through the cooperation of a faculty member 
in the Drama Department. One couple (Team 1) con- 
sisted of a man who was 1.70 m, 63.50 kg (5 feet 7 
inches, 140 pounds) with brown hair conventionally 
cut, paired with a woman who was 1.57 m, 49.90 kg 
(5 feet 2 inches, 110 pounds) with shoulder-length 
black hair. The second couple (Team 2) consisted of 
a man who was 1.85 m, 90.72 kg (6 feet 1 inch, 200 
pounds) with light brown hair conventionally cut, 
paired with a 1.70 m, 54.43 kg (5 feet 7 inches, 120 
pounds) woman with long blonde hair. All of the 
college-age actors wore the usual jeans and shirts, 
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Each couple worked on alternate nights with sub- 
jects being randomly assigned to conditions, the only 
provision being that the conditions contained an 
equal number of male and female subjects. The sub- 
ject came to take part in a study of attitudes in 
which he or she expected to answer a questionnaire 
and be interviewed. The subject arrived at the ap- 
pointed room and found a note explaining that the 
experimenter would arrive later to interview him or 
her and asking that the subject begin filling out the 
questionnaire, On the table with the questionnaires 
were what appeared to be a university phone with 
a standard sticker giving emergency telephone num- 
bers, including the police, The telephone was not 
functioning and was rigged to provide a dial tone 
when it was appropriate, and to record any telephone 
number that the subject might try to call. 

Other rooms on the floor had apparent activity in 
them. From the room across the hall emanated the 
sound of a computer printer, and in the room next to 
the experimental room a radio played. 

After the subject entered the room, 5 minutes 
passed, during which time the subject proceeded to 
fill out the questionnaire. The confederates, on a dif- 
ferent floor of the building, received a signal by 
walkie-talkie and took the elevator to the proper 
floor. Before the doors opened, a loud verbal argu- 
ment could be heard. The woman was accused by the 
man of picking up a dollar he claimed to have 
dropped, After approximately 15 sec of heated dis- 
cussion, the man physically attacked the woman, 
violently shaking her while she struggled, resisting 
and screaming, The screams were loud piercing 
shrieks, interspersed with pleas to “get away from 
me.” Along with the shrieks one of two conditions 
was introduced and then repeated several times. In 
the stranger condition the woman screamed, “I don’t 


know you,” and in the married condition, “I don't 


know why I ever married you.” From the first desig- 
nation of the condition the physical fight lasted a 
maximum of 45 sec, with the entire incident not last- 
ing more than a minute. 

The fight was immediately terminated with the 
attacker fleeing at any attempt by the subject to 
intervene. If no one intervened, the fight was termi- 
nated by a third confederate (male), who came out 
of a nearby room and demanded in a loud voice that 
the attacker stop beating up the woman. The attacker 
then ran away while the false intervener consoled the 
woman. Every effort was made to keep a direct in- 
tervener away from the actors to insure their safety, 
Two confederates were placed in concealed locations 
between the actors and the subject, one to falsely 
intervene, the other to inform the subject it was an 
experiment, if necessary. 


1 Approximately 6 to 9 months prior to the ex- 
periment there were a number of assaults, on campus 
and in town, on women of both a sexual and non- 
sexual nature. The attacks had caused much concern 
and received a great deal of publicity. 
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TABLE 1 


PERCENTAGE OF INTERVENERS BY CONDITION, SEX, 
AND ACTING TEAM; EXPERIMENT | 


Married Strangers 
Acting team Males Females Males Females 

1 

% 20 20 80 80 

n 5 5 5 5 
2 

% 17 20 60 40 

5 5 


A subject was considered an intervener if he or she 
called the police on our phone, asked a person in 
the computer room to help stop the fight, shouted at 
the attacker to stop, or acted to intervene directly. 
The fight was staged 15.85-16.15 m (52-53 feet) 
away from the room containing the subject, If the 
subject came down the hall toward the fight some 
9.14 m (30 feet) and did not turn into an intersecting 
hallway, he or she was counted as intervening, 

The subjects were interviewed immediately follow- 
ing the experiment and fully debriefed by the first 
author, who spent up to 45 minutes with each sub- 
ject, The interviews were unstructured except for sev- 
eral standard questions. First, the subjects were asked 
to describe what they had seen, If they did not ade- 
quately describe the experimental condition to which 
they were assigned, they were asked: (a) What was 
the relationship between the two people fighting? 
(b) How do you know? Subjects were asked (c) 
What caused the fight? and (d) Did you think it was 
a rape? They were then asked to describe their 
thoughts and behavior in the sequence in which they 
had occurred while witnessing the fight. After the 
debriefing each subject was specifically asked: (e) Do 
you think we have done any harm to you by having 
you go through that experience? and (f) Do you 
think that the potential results and implications of 
the research are worth the experience you went 
through? Each subject was urged to call the first 
author if he or she wished to discuss any aspect of 


the experiment in the future. 
Results 
Subjects indicated during the interview that 
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man’s technique for the assessment of ini 
action effects in multivariate continge: 
tables (Bishop, Fienberg, & Holland, 197: 
As one can see from Table 1, neither i 
three-way y*(1) = .12 nor any of the two-w: 
interactions y*(3) = .95 are significant, Ho 
ever, there is significant variation in the mai 
effects x?(3) = 10.13, p < .025. There is 1 
difference in the rate of intervention for men 
(42.9%) or women (40%) y?(1) = .09. Noi 
was any difference found between the teams 
of actors (33% vs, 50%) y*(1) = 1.26. There 
is, however, a sizable main effect: Interven- 
tion occurred much more frequently when 
subjects perceived the attacker and victim as 
Strangers (65%) rather than married (19% ) 
x*(1) = 8.45, p < .005). Surprisingly, all ex- 
cept four interveners tried to stop the fight 
directly, with only one person using the phone 
to call the police (a female in the married 
condition) and three knocking on the door of 
the computer room and asking the people in- 
side to help stop the fight (two males and one 
female in the stranger condition), In addition 
to intervention behavior, 30% of women who 
had witnessed the stranger condition had 
taken protective precautions by shutting the 
door to the room, turning the lights out, and 
in one case locking the door. During the sub- 
sequent interview these women stated that 
they had taken these actions for self-protec- 
tion. This type of behavior never occurred in 
the married condition, 

Ten subjects were eliminated from the anal- 
ysis because they failed to perceive the as- 
signed condition or had preknowledge of some 
aspect of the experiment (e.g., one subject 
was an acquaintance of the actress), These 
subjects would not have changed the results 
of the above analysis as five out of eight who 
witnessed the stranger condition intervened, 
while neither of the two subjects assigned to 
the married condition did. 

Most subjects, during the interviews, indi- 
cated they had listened to the fight from the 
beginning and did not think it was very seri- 
ous until they heard the actress scream. The 
Screams were arousing. (Subjects reported 
sensations such as their hearts racing and 
blood rushing to their heads.) Most subjects 
then left the room to observe the fight. They 
described the fight as severe and any suspicion 
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hat it was part of an experiment was dis- 
ssed upon seeing that two live people were 
olved in an intense fight. 

The subjects who witnessed the married 
ondition said they hesitated to take any ac- 
tion because they weren’t sure their help was 
wanted, Virtually every subject who did not 
intervene said he felt the fight was “none of 

business.” Those subjects who witnessed 
iranger condition felt they should take 
active role but were not sure how to in- 
ervene, They decided that if the attacker 
iew of their presence it might stop the fight 
ind so they slowly approached the fighters. 

Only two subjects took direct action ex- 
tremely quickly. Both had made plans con- 
cerning exactly what to do if an occasion such 

; arose, The man (stranger condition) 
ıs a weight lifter and studied a martial art 
ind wanted to try it. The woman (stranger 
condition) had been told at a woman’s meet- 
ing that she should spread keys between her 
fingers and punch an assailant. She had no 
keys so she planned to stab the attacker with 
a pencil. 

Subjects indicated that they did not use the 
telephone because (a) they did not notice it, 
or (b) during the excitement they forgot 
about it, A few subjects thought (c) the 
police would never get there in time. Only one 
woman subject (stranger condition) admitted 
to thinking of the incident as a rape. 

Subjects indicated that they did not feel 
that we did them any harm by having them 
go through the experiment. No subject 
thought that we should halt the experiment 
and in fact every subject thought it was im- 
portant and interesting. They were willing to 
discuss their experience including what they 
perceived as causative factors of their behav- 
ior and were interested in the possible results 
and implications of the experiment. 


EXPERIMENT 2 


Although the fights in the stranger and mar- 
ried conditions appeared to us to be identical, 
the subjects exposed to the married condition 
thought the fight did not seriously threaten 
the victim. Several subjects in explaining why 
they didn’t intervene in the married condition 
said: “He wasn’t really hurting her anyway.” 
Perhaps they were just trying to justify their 
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nonintervention by modifying their percep- 
tions of the incident. If so, no differences 
should appear in ratings of the two fight con- 
ditions in situations where intervention is not 
possible, However, if a subject’s perception of 
the potential injury to the woman victim de- 
pends upon the relationship between the vic- 
tim and attacker rather than upon whether 
or not he or she intervened, then one should 
be able to find a significant difference in his 
or her perception of the two situations. In the 
event that differences were found in a sub- 
ject’s perception of the two different fights, 
then we would have to show that the fights 
themselves were equal, 


Method 


Eight fights, equally divided between the stranger 
and married condition, were recorded on black and — 
white 16-mm sound film. The recorded incidents 
were not edited and all of them were used. They 
mirrored the variance in the fights in Experiment 1 
even to the extent that during one of the struggles 
the actress fell to the ground, One film sequence con- 
tains an identical incident. i 

Unfortunately, one of the actors (Team 2) dropped 
out of school as Experiment 1 was finishing so only — 
Team 1 fights were filmed. While this was unfortu- 
nate, it is not fatal as the main effect found in the — 
first study was significant for Team 1 by itself, z= 
3.35, p < .001. 


Subjects 


The subjects were 46 males and 42 females taking 
part in the experiment to receive extra credit for an — 
introductory psychology course during the winter 
term of 1975 at The Pennsylvania State University. 
Subjects were randomly assigned to the conditions. 


Procedure 


Subjects were asked to watch a very short film of — 
a fight and pay very careful attention to it. After the — 
film had ended they were asked to complete a short 
questionnaire. 

The questionnaire contained five questions to be 4 
rated on 4-point scales measuring (a) the intensity 
of the fight; (b) the degree of physical injury likely 
to result from the fight to the woman, and (c) to 
the man; the degree of emotional injury to (d) the 
woman, and to (e) the man. For example, subjects 
were asked to rate the fight on a scale from 1 (not 
likely to result in any physical injury to the woman) 
to 4 (likely to lead to the woman’s death). In addi- 
tion, the questionnaire contained two binary items 
which measured the subjects’ perceptions of who was 
aggressing and their perceptions of the relationship 
between the man and woman. 


Results 


Subjects were clearly able to indicate 
whether a sequence belonged to the stranger 
condition or married condition y?(1) = 63.25, 
P<.001. There was virtual unanimity of 
agreement that the male was the attacker, 
with 100% of males and 95% of females indi- 
cating so across both conditions. However, 
there was no difference in the subjects’ rating 
of the intensity of the fight F(1, 72) = 41, 
nor the amount of emotional trauma likely to 
= result to the woman, F(1, 72) = .04, or man, 

CY, 72) = 03. 

As we expected from the interview, how- 
ever, subjects did perceive the woman in the 
stranger condition (M = 2.29) to suffer more 
__ injuries than the woman in the married condi- 
tion (M = 2.01), F(1,72) = 5.31, p < .032 
In addition, women perceived a greater de- 
gree of injury to both the woman (M = 
2.31), F(1, 72) = 7.54, p < .05, and the man 
(M = 1.50), F(1,72) = 7.00, p <.01, than 
did men (M = 1.98 and M = 1.21, respec- 
tively). 

‘There was no effect due to the film episode 
_ nor were there any significant interactions. 


EXPERIMENT 3 


The next question that needs to be an- 
swered is whether the difference found in 
Experiment 2 is strictly a matter of the sub- 
ject’s perception or if the two conditions 
really differ as to the amount of damage 
apparently being done to the woman. If the 
two conditions are really dissimilar, no con- 
clusions could be drawn from Experiment 1. 


Method 
Another print was made of the eight fights, only 
this time the sound tracks were altered. The tracks 


were edited sc 
(stranger conditio: 
married you” (ma 
their place additiona 
other respects the 
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ıt the lines: “I don’t know you” 
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Results 


Subjects could not discriminate between t} 
two sets of films as to the relationship be 
tween the victim and attacker. They re 
sponded in one of six categories (strangers, 
acquaintances, friends, dates, lovers, mar- 
ried), and the distribution of responses within 
the two conditions were extremely similar 
x°(5) = 4.41, ns with no difference between 
men and women y*(5) = 2.83. The modal re- 
sponse for both the stranger and married con- 
ditions was “acquainted with each other” 
(62.5%), with 87.5% of the population indi- 
cating some form of familiarity between the 
attacker and the victim. 

Without the verbal cues the two condi- 
tions were not discriminable in terms of the 
physical injury to the woman as found in 
Experiment 2, F(1, 64) = 2.66, ms. The mean 
for the stranger condition was 2.00, with 2.25 
in the married condition, 

Surprisingly, the effects found in Experi- 
ment 2 indicating that women judged the 
injury likely to result to the woman and the 
man as more severe than did men were not 
replicated in the current study, F(1, 64) = 
-75 and F(1, 64) = .00, respectively. 


Discussion of Experiments 1, 2, and 3 


The perception of a greater likelihood of 
injury to the woman in the stranger condition 
in Experiment 2, in part, may explain the 
higher rate of intervention in this condition 
in Experiment 1. Shotland and Johnson (Note 
1), West and Brown (1975), and Staub and 
Baer (1974) concluded that victims are more 
likely to obtain help in a severe emergency 
than in a milder one, hence the woman at- 
tacked by a stranger receives more help. Why 
is an attack by a stranger seen as more se- 
vere? 

An attack by a total stranger is a frighten- 
ing event to witness, as the three women who 
took self-protective precautions indicated. 
This type of attack is one in which the victim 
appears to be chosen at random, without 


?Similar results were attained by Rossi, Waite, 
Bose, & Berk (1974) who used survey data and found 
that crimes between strangers were regarded as more 
severe than crimes between acquaintances, 
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© )ome or reason. Under this condition it is 


diL'cult to place the blame for the attack on 
tim, but this is not to say that it is not 
d e.g., rape victims). On the other hand, 
ed fight is one in which a bystander 
that the wife’s past behavior caused 
nd’s attack, and if he beats her why 
stay with him anyway? Thus, in 
with the just-world hypothesis the 
woman is seen as partially causing her 
ortune, The married situation is seen 
rational and thus less fearful. On the 
hand, the stranger condition is not con- 
ved and any woman could have been the 
ctim, as the three women who took self- 

£ protective precautions indicated. 

In addition to the subject’s perception of 
the stranger condition as more severe than 
he married condition, are there other causes 
w the higher intervention rate in the 
tranger condition? 


EXPERIMENT 4 


in Experiment 1 nonintervening subjects 
who viewed the married condition suggested 
that the quarrel was a private matter between 

he husband and wife and should be settled 
without external interference. This suggests 
that bystanders may be afraid of embarrassing 
a husband and wife but not strangers who are 
struggling. Likewise, bystanders may perceive 
that the woman being beaten by a stranger 
wants help, while they are uncertain whether 
the married woman wishes aid. Thus, the 
responsibility norm may not be as salient in 
ihe married condition as in the stranger con- 
dition, 

When bystanders consider “butting into” a 
conflict, one might expect them to weigh the 
possibility of physical retaliation by the pro- 
tagonists. Piliavin, Rodin, and Piliavin (1969) 


A suggested that bystanders balance the benefits 
“and costs of a situation before they decide to 


intervene. 

In Experiment 4 five hypotheses were 
tested concerning bystanders’ differential per- 
ceptions of fights between strangers and mar- 
ried couples: 


1. A married couple who are fighting will 
be expected to be more embarrassed by inter- 
vention than strangers fighting. 
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2. A woman who is attacked by a strange 
man will be expected to want help more than 
a woman attacked by her husband. 

3. Subjects will feel a greater sense of re- 
sponsibility to help the woman attacked by a 
stranger than the one attacked by her hus- 
band. 

4. The strange man will be seen as more 
likely to run away if someone intervenes, 
while the married man will be expected to 
stay and fight. 

5. The woman attacked by a stranger will 
be expected to stay and help fight her at- 
tacker more often than the woman who is 
attacked by her husband. 


Method 
Subjects 


The subjects were 40 men and 16 women enrolled 
in a social psychology course during winter term 
1976 at The Pennsylvania State University. 


Procedure 


The subjects were asked to watch an extremely 
short film of a fight between strangers (or a married 
couple) and to answer a questionnaire after the 
viewing. The two films were the first take of each 
condition and were part of the film used in Experi- 
ment 2. 

The questionnaire asked: “If someone tried to stop 
the fight I had just seen, the woman would (be)”.... 
This question was followed by five 5-point scales 
asking how embarrassed, surprised, angry, and frus- 
trated she should feel, as well as whether she would 
want help, The surprised, angry, and frustrated scales 
were filler items. The same sequence also was pre- 
sented for the male attacker. Subjects were also 
asked: “If I tried to stop the fight, chances are 
(check one) (1) the man would (a) run away, (b) 
stay and fight me, (c) I don’t know, and (2) the 
woman would (a) run away, (b) stay and fight the 
attacker, (c) I don’t know.” Subjects were then 
asked to indicate on a 5-point scale: “If I was 
present and had witnessed the incident,” what degree 
of responsibility to intervene would be felt and 
whether or not they would try to help the woman. 


Results 


Subjects did not perceive any difference in 
the levels of embarrassment that the male 
attackers would experience if they were inter- 
rupted, F(1,41) = .42, ns. There was, how- 
ever, a tendency for subjects to see the mar- 
ried woman (M = 2.76) as being more embar- 
rassed than the woman attacked by a stranger 
(M = 3.57), FU, 41) = 3.10, p < .09. 


996 


TABLE 


PERCENTAGE OF SUBJECTS BY CONDITION AND RES 
ATTACKER AND Victim WouLD Tak 


R. LANCE SHOTLAND AND 


MARGRET K. STRAW 


Course of action expected by subjects in 


Stranger condition (u = 26) Married condition (» = 30) 

E Fight me Fight me 

(attacker) (attacker) 

or fight or fight 

Expected course Don't attacker Don’t attacker 

of action of Run know (victim) Run know (victim) 
Attacker 58 4 38 10 17 73 
Victim 27 19 54 37 33 30 


A woman attacked by a stranger is more 
likely to be seen as wanting help (M = 1.11) 
than a woman who is attacked by her hus- 
band (M = 1.72), F(1,41) = 5.90, p < .02. 
Likewise, subjects indicate they would feel 
more responsibility to intervene in the stranger 
condition (M = 1.32) than in the married 
condition (M = 1.99), F(1,41) = 7.24, p 
< .01, and also indicate a greater willingness 
to help the woman in the stranger condition 
(M = 1.39) than the married condition (M 
= 2.27), F(1,41) = 9.42, p < .005. There is 
a tendency for women to feel more responsi- 
bility (M = 1.43) than do men (M = 1.89), 
F(1,41) =3.35, p<.08. No interaction 
reached statistical significance. 

As Table 2 illustrates (dropping the “I 
don’t know” responses and_reconditionaliz- 
ing), subjects who see fights among strangers 
expect the attacker to run away if an inter- 
vener arrives to a much greater extent than 
do subjects who see married couples fighting, 
when the husband attacker may stay and 
fight the intervener, y?(1) = 12.77, p < .001, 
As Table 3 indicates, women are more likely 
to expect the attacker to run when an inter- 
vener appears, while males are more likely to 
anticipate a fight with the attacker if they 
intervene, y°(1) = 7.66, p < .01. There was 
no interaction between sex and condition, 
x1) = .82, ns. 

Table 2 als 
ceive that the 
fight the attacke 
tacked by a stra 

is her husband , 
dition, Table 3 sh: 
from the victim 1 
xX) = 5.20, p< 


licates that subjects per- 
ı will help the intervener 
‘re often when being at- 
‘han when the accoster 

- 3.32, p < 10. In ad- 

ut women expect help 
often than do men 
"here was no inter- 


action between sex and condition y*(1) = ,1 
ns. 


| 


Discussion 


Tt appears that there is a slightly greate 
focus on the woman as a cue in making ¢ 
decision to intervene or not. From Experi 
ments 2 and 3 it appears that subjects expect 
greater injury to the woman in the stranger 
condition. From Experiment 4 it appears that 
subjects expect the woman in the stranger 
condition to be more appreciative of help and 
thus subjects are more likely to feel responsi- 
ble to help. They even expect the woman to 
contribute to her defense. Making it still 
easier to intervene in the stranger condition, 
there is a tendency to believe that the 
stranger, upon seeing a bystander, will run 
away. Hence it seems perfectly reasonable for 
a bystander to make the attacker aware of 
his presence. This was the course most inter- 
veners took in Experiment 1. 

Women more often than men think that 
an attacker will run away and that the vic- 
tim will assist them if they intervene. Women 
thus believe the task of intervention will be 
safer than do men and possibly as a result 
intervene as often as men in contradiction to 
some previous literature (e.g., Borofsky et 
al., 1971; Piliavin et al., 1969) which indi- 
cates that men intervene more frequently 
than women.? 


8 Women may also empathize more often with a 
female victim. Borofsky et al. had all their female in- 
terventions when a female was being attacked. In 
addition, the present study ran subjects alone rather 
than in mixed-sex groups, in which an unequal dif- 
fusion of responsibility might have taken place. 


BYSTANDER RESPONSE TO ASSAULT 997 
TABLE 3 


PERCENTAGE OF SUBJECTS BY SEX AND RESPONSE AS TO THE POSSIBLE COURSE OF ACTION THE 
ATTACKER AND VICTIM WOULD TAKE UPON INTERVENTION: EXPERIMENT 4 


Course of action expected by 
Male subjects (mn = 40) Female subjects (n = 16) 


Fight me Fight me 
(attacker) (attacker) 
; : or fight or fight 
Expected course Don't attacker Don't attacker 
of action of Run know (victim) Run know (victim) 
Attacker 23 13 65 56 6 38 
Victim 38 33 30 19 13 69 


EXPERIMENT 5 


From newspaper accounts of assaults pre- 
viously cited, it would appear that bystanders 
rarely have a total picture of the incident and 
hence do not know what precipitated the as- 
sault or the relationship between the victim 
and attacker. Under this condition what do 
they assume? From newspaper interviews of 
witnesses they appear to believe there is a 
Jose relationship between the victim and 
attacker. However, data from Experiment 3 
indicate that approximately 60% of our sub- 
jects assumed the victim and attacker were 
acquaintances and approximately 10% as- 
sumed no relationship. It seems possible that 
subjects’ perceptions of the relationship be- 
tween the protagonists may have been influ- 
enced by an assumption that people involved 
in a close relationship won’t fight over a 
“dropped” dollar. Thus in this experiment 
information concerning the relationship be- 
tween the man and the woman will be kept 
at a minimum by eliminating auditory cues. 
Additionally, one would expect attributions 
about the relationship between the man and 
the woman to be related to other aspects of 
the fight such as the fight’s intensity. 


Method 
Subjects 


Subjects were 40 male and 41 female introductory 
psychology students earning supplementary credit 
during spring term 1975 at The Pennsylvania State 
University. 


Procedure 
Four new film episodes were produced using the 


same actor and actress as in the previous films. The 
initial episode started with a mild discussion with 


each additional episode becoming more aggressive 
than the last until the final reached the level of the 
fights used in previous studics. The filmed argu- 


ment was, as in the past, concerned with found | 


money. 

Subjects were randomly assigned to a single film 
sequence with the only provision being that an 
approximately equal number of male and female sub- 
jects were assigned to each condition. They were 


shown the sequence without the sound track, given — 


the same instructions as in Experiments 2 and 3 and 
were asked to fill out the same questionnaire used in 
Experiment 3 with one exception. On the first page 
(blank), subjects were asked to describe what had 
occurred and why. 


Results 


It appears that the four filmed sequences 
can be differentiated along what may be re- 
garded as a severity dimension. The likeli- 
hood of physical injury to the woman, F(3, 
73) = 17.13, p < .001, discriminated between 
the film sequences as Table 4 indicates. As 
the level of severity increased with each ad- 
ditional sequence, the Newman-Keuls pro- 
cedure indicated that subjects rated a greater 
likelihood of injury to the woman (p < .05) 


TABLE 4 


Supyect’s MEAN RATINGS OF THE PROBABLE PHYSICAL 
DAMAGE TO THE WOMAN BY SEX: EXPERIMENT 5 


Film sequence 


Subjects 1 2 3 4 

Males 1.00 1,00 1.20 1.80 

Females 1.00 1.10 1,50 1,90 
Total 1,00, 1,05, 1.35a,b,0 1.855,b,0 


2 : eet 

_ Ratings were made on a 4-point scale ranging from 

(aot heels to result in any physical injury to the woman) to 4 

(likely to lead to the woman's death). Each shared subscript 

indicates a p < .05 difference between the means as indicated 
by the Newman-Keuls post hoc comparisons. 


nas, * 
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with the only exception being a lack of sig- 

nificance between the first and second film 

sequences. Subjects’ ratings of the intensity of 

each sequence also differentiated between the 

films, F(3,73) = 7.52, p < .001. The New- 

_ man-Keuls procedure indicated that the first 
Sequence was different from the second, third, 
and fourth. 

Similar to the results in Experiment 3, but 
unlike the results in Experiment 2, (a) women 
were not found to perceive a significantly 
greater likelihood of damage of the woman 
(M = 1.38) than were men (M = 1.25), F(1, 

- 73 = 1.77, ns, and (b) women did not rate a 
greater likelihood of damage to the man (M 
= 1.12) than did men (M = 1.12), F(1, 73) 
= 0, ns. 

The aggressor role was not held constant 
across the four film sequences, x?(3) = 41.9, 
p < .001. In the first two sequences 69% of 

~ the subjects believed the woman was the 
aggressor, while in the last two sequences 
97.5% thought the man was the aggressor, 
Thus it appears that severity is confounded 
with the role of the aggressor. Hence, an anal- 
ysis of the attributions of the relationship 
between the victim and assailant as a func- 
tion of severity would be meaningless. 
However, the data do provide an oppor- 
tunity to study what bystanders perceive is 
happening when they are provided with little 
information concerning a fight between a man 
and a woman across a variety of severity and 


TABLE 5 


FREQUENCIES OF THE PERCEPTIONS OF THE RELATIONSHIP BETWEEN A VICTIM AND AGGRESSOR 
ACROSS SEVERITY AND AGGRESSOR CONDITIONS: EXPERIMENT 5 


R. LANCE SHOTLAND AND MARGRET K. STRAW 


two aggressor conditions. Table 5 illustre es 
these results. 

The modal relationship attributed to ` e 
couple was that the people dated. The le 
chosen category was that the protagoni: 
were strangers. When the three categories « 
the closest relationships (dates, lovers, ma 
ried) are combined it appears that a clos: 
relationship is attributed to the couple more 
highly (69.1%) than by chance, Z = 3.80, 
p< 014 


GENERAL DISCUSSION 


The assumption that a man and a womar 
who are fighting have a close relationship is 
logical. People who know each other hav 
more common ground than strangers from 
which to quarrel. Hence, unless a fight 
clearly concerns some event where knowledge 
of the other is not necessary, for example, 
theft and rape, the bystander assumes a rela- 
tionship. 

Is it true then that most assaults occur 
between people that know each other? It ap- 
pears not. Criminal Victimization in the 
United States (1974) indicates that 60% of 


4 Analyzing the subject’s unstructured descriptions 
of what they believed to be happening indicates much 
the same thing. Only 1 subject out of 81 suggested 
that the man and the woman were strangers, Ap- 
proximately 67% suggested some type of relationship, 
while the remainder did not specifically mention any- 
thing about the relationship between the couple. 


Film sequences Strangers Acquaintances Friends Dates Lovers Married 
1 
Male 2 4 3 1 
Female 2 2 5 1 
2 : 
Male 1 1 2 4 2 ae 
Female 2 3 3 2 7 
3 j 
Male 1 5 4 
Female 5 4 1 f 
4 
Male 2 5 2 1 
Female 2 2 2 2 1 
Total 10 12 32 19 5 | 
% 12.4 148 39.5 23.5 61 | 


ule is the aggressor, in the latter two the male is the aggressor. 


i 


BYSTANDER RESPONSE TO ASSAULT 


its occur among strangers. In urban 
a: ıs bystanders rarely are in a position to 
k- wv who is involved in an attack and have 
s. In the face of ignorance about the 
\uonship, the assumption is likely to be 
ong and thus the bystander will most 
x: intervene when he or she might 
: if a correct assumption had been made. 
I ss reasonably broad implications for 
control, 
eople recognized the individuals within 
community and knew basic points of in- 
nation, such as who their wives, husbands, 
id children are, then no assumption concern- 
ing the relationship between a victim and her 
attacker would be necessary. By knowing the 
people involved in an assault or other crime, a 
bystander has the necessary information to 
w something is amiss. Rothman (1971) 
ribed colonial America as a group of 
ages, each composed of a small population 
‘hat had intimate knowledge of each other 
nd were distrustful of outsiders. This knowl- 
dge of one’s neighbors is the basis of social 
control. Thus, if we could obtain this control 
in terms of a man beating up a woman, we 
night be able to restrict the victimization of 
women to their husbands or close associates.” 


5 If bystanders and, one would guess, society do not 
regard wife beating seriously, this act cannot be 
ontrolled, 
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Reactions to Male and Female Disclosure at Two Levels 


Gordon J. Chelune 
University of Nevada, Reno 


Reactions of male and female observers to high and low behavioral samples 
of male and female disclosure to a stranger were examined for two factors: 
perceived emotional health and perceived likability, Male speakers were rated 
as significantly more healthy than female speakers irrespective of disclosure 
level. There was also a tendency to see high disclosure to a stranger as less 
emotionally healthy than low disclosure, Significant differences in likability 
between male and female speakers were observed as a function of their dis- 


closure level. The male speaker was most liked when he was a low discloser, 


whereas the female speaker was least liked when 
Female observers were found to be more discriminating between these con- 


ditions than were the male observers. 


social expectations and norms, 


Self-disclosing behavior has generally been 
considered .a positively valued activity (Cul- 
bert, 1970). Jourard (1964) has even argued 
that an optimal amount of disclosure under 
“specific conditions is synonymous with men- 
tal health” (p. 15). Research, however, aimed 
at demonstrating this relationship between 
self-disclosure and personality adjustment has 
_ produced inconclusive results (Cozby, 1973). 
Altman and Taylor (1973) and Cozby (1973) 
have noted that a major difficulty with this 
research has been the tendency to view self- 
disclosure as a trait or global personality pat- 
tern rather than acknowledging the importance 
of situational factors in determining the level 
of behavioral disclosure. Chelune (1975a) has 
suggested that the degree to which an indi- 
vidual can adequately differentiate interper- 
sonal variables (i.e., social situation, target 
person, etc.) and adapt his or her disclosures 
accordingly is a basic parameter of self-dis- 
closing behavior. 

Goffman (1967) has written extensively 
about the social rules and norms governing 
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The results are discussed in terms of 


she was a low discloser, 


social interactions. Several studies have am- 
ined the social-situational factors which ect 
the perceived appropriateness of Self-disclo- 
sure. Nonreciprocal disclosure was rated as 
more inappropriate than the same disclosure 
when it was reciprocal (Chaikin & Derlega, 
1974a) and intimate disclosure to a strancer bo 
was seen as less appropriate by observers tha 
nondisclosure (Chaikin & Derlega, 197 
The sex of the discloser has also been 

to affect observers’ ratings of appropriati 
Kleinke, Lenga, and Beach (Note 1) d 
that females were rated about the san m 
consideration, politeness, and attentiveness 
whether they monopolized 20%, 50%, or 80% 
of the time in a conversation. Males, how- 
ever, were rated less favorably the more time 
they spent talking. 

The present study was designed to exami 
the reactions of male and female observer 
high and low behavioral samples of male 
female disclosure to a stranger. Thus, 
present study represents both a replicatio 
past research using behavioral samples of 
self-disclosure as stimuli and also an ex4- 
sion in that it examines the effects of 
differences for both the observer and disci ser 
variables, 


METHOD 
Observers 


The observers (V=40) were 20 male (median 
age = 20.7 years) and 20 female (median age = 19.9 
years) students enrolled in undergraduate psychology 
classes at the University of Nevada, Reno, 


REACTIONS TO DISCLOSURE AT TWO LEVELS 


»ulus Material 


-minute tape-recorded interviews from two 

es and two females were used as disclosure stim- 

material. These four recordings were selected 

m a pool of 74 interviews obtained in a previous 

dy helune, 1975b) in which the recordings were 

ured ng objective criteria for the following 

riabli total number of statements; number of 

lf-descriptive statements; intimacy of content; and 

ailective manner of presentation. In order to create 

r: male-female and high-low disclosure manipula- 

the present study, the tape recordings were 

d on all variables except the number of self- 

iptive statements; that is, rate of verbalization, 

and affective manner were held constant for 

| four tapes. On the tapes, 92.5% of the high- 

lisclosing male and female stimulus persons’ state- 

ments were escriptive, while 72.75% of the low- 

disc] and female stimulus persons’ state- 
ments were self-descriptive. 


u 


Person Perception Scales 


The person perception scales were specifically 
ned for use in this study. The scales asked ob- 
vers to rate tape-recorded speakers on an 8-point 
scale for the following four bipolar qualities: likable- 
ot likable; emotionally unstable-emotionally stable; 
xciting-dull; and weak personality-strong person- 
ality. 


Procedure 


The observers were informed that the tape record- 
ings of the four speakers were obtained by an inter- 
viewer who was a stranger to the speaker. The ob- 
servers Were then instructed to evaluate each speaker 
using the person perception scales following the end 
of each recording. The four stimulus tapes were 
played in the following order: low female; high male; 
high female; and low male. 


RESULTS 
Manipulation Check 


In order to assess the effectiveness of the 
high-low disclosure manipulation of the stimu- 
lus tapes, the observers were asked to rate 
each speaker on an 8-point scale for two 
aspects of self-disclosure: amount and speci- 
ficity. Amount refers to the number of items 
of  self-descriptive information revealed, 
whereas specificity refers to how specific the 
information disclosed is. The scales were an- 
chored at each end. The observers rated the 
high-disclosing speakers higher on both 
amount, F(1,38) = 5.02, p < .05, and speci- 
ficity of disclosure, F(1,38) = 15.13, p< 
001, than the low-disclosing speakers, thus 
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verifying the effectiveness of the high-low 
disclosure manipulation. 


Dependent Measures 


Correlations were computed among the four 
dimensions of the person perception scales, 
Scores for the emotionally unstable-emotion- 
ally stable and weak personality-strong per- 
sonality dimensions were highly correlated (r 
= .63) and therefore combined to form a 
single emotional health factor, with high scores 
indicating high perceived emotional health, 
Likewise, the likable-not likable and exciting- 
dull dimensions correlated highly (r= .61) 
and were also combined to form a single lik- 
ing factor, with igh scores indicating low lik- 
ing. Separate 2 X 2 X 2 (Sex X Level of Dis- 
closure X Speaker Sex) analyses of variance 
with repeated measures on the last two factors 
were computed for emotional health and lik- 
ing. 

The results of the analysis for the perceived 
emotional health of the speakers revealed only 
one significant main effect. The observers 
rated the male speakers as more emotionally 


healthy than the female speakers irrespective — 


of disclosure level, F(1,38) = 11.94, p< 
.005. There was also a nonsignificant trend for 
the observers to rate the high-disclosing speak- 
ers as less healthy than the low-disclosing 
speakers, F(1,38) = 3.45, p<.10; that is, 
the observers rated high disclosure to a 
stranger as less emotionally healthy than low 
disclosure. 

The results of the analysis for perceived 
liking revealed no significant main effects. The 
Disclosure Level X Speaker Sex interaction, 
however, was highly significant, F(1, 38) = 
44.91, p < .001. The likability of the speakers 
in this interaction is depicted in Figure 1. 
Significant differences between the male and 
female speakers were observed as a function of 
their disclosure level. The male speaker was 
most liked when he was a low discloser, 
whereas the female speaker was least liked 
when she was a low discloser. 

In addition to the Disclosure Level x 
Speaker Sex interaction on the measure of 
liking, a higher order interaction involving 
observer sex, disclosure level, and speaker sex 
was also found, F(1, 38) = 9.72, p < .005, 


Male Speaker #-——# 
Ss Female Speaker 0----9 
z 4.0 
~} ae 
Bas. paste y 


High 
DISCLOSURE LEVEL 
Ficure 1, Mean ratings of perceived liking for 
male and female speakers at two disclosure levels. 


(Liking scores run from high to low along an 8-point 
scale.) 


The means associated with this interaction 
are presented in Table 1 and show that while 
both male and female observers displayed the 
pattern of results shown in Figure 1, female 
| observers were more discriminating between 
conditions than males. Females saw the high- 
disclosing male speaker as much less attrac- 
tive than did males, and they saw the high- 
disclosing female speaker more likable than 
did male observers. 


Discussion 


The results of the present study have dem- 
onstrated some interesting differences in the 
reactions to male and female disclosure. There 
was a tendency to see speakers who made a 
high percentage of self-descriptive statements 
to a stranger as less emotionally stable and 
strong than those speakers who revealed less 


information about themselves. This finding is 
in general agreement with past research which 
found that disclosure of highly intimate in- 
formation to a nger was seen as more 
maladjusted t ndisclosure (Chaikin & 
Derlega, > relationship between 


self-disclosu: 

be complex, d 
ferentiate con 
when, and wi 
one’s disclosure 


‘tal health appears to 
on the ability to dif- 
‘iables, that is, who, 
he ability to adapt 
iccordingly (Chelune, 
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1975a). High disclosure to a stranger vio! es 
the norms of social penetration which sus; st 
that relationships proceed from nonintin 

to intimate areas of exchange via verbal « 
closure, activities engaged in, and nonver 
communication (Cozby, 1973). Chaikin a: 
Derlega (1974b) have suggested that viol 
tion of such social norms “may reflect inad 
quate socialization or maladjustment” (p 
588). 

Striking differences between male and fe- 
male speakers on measures of liking were ob 
served as a function of the speaker’s disclosuré 
level. The major difference in ratings of likin 
between the male and female speakers was i 
relation to low disclosure. Low male disclosur 
was seen as significantly more appropriate oi 
likable to a stranger than was low femal 
disclosure. As in Chaikin and Derlega’s study 
(1974b), female observers were more dis- 
criminating between conditions than were 
male observers. 

The sex differences observed in liking are 
consistent with social learning theory. Nu- 
merous studies have shown that females are 
generally higher disclosers than males (Cozby, 
1973). Jourard (1964) has suggested that our 
culture sanctions the disclosure of personal 
information in women, but censors the same 
in males. The results of the present study sup- 
port the notion that our culture differentially 
reinforces the expression of personal informa- 
tion in males and females. Males are liked 
more when they adhere to the “strong silent” 
role than when they are personally open, 
whereas females are more favorably evaluated 
when they meet the definition of what Jourard 
(1964) has called the “expressive” role. 

The results of the present study are in 
contrast with Jourard’s (1964) position that 


TABLE 1 


Means ASSOCIATED WITH THE OBSERVER 
Sex X SPEAKER SEX X DISCLOSURE 
LEVEL INTERACTION FOR 


LIKABILITY 
Male speaker Female speaker 
Observer — 
sex High Low High Low 
Male 4.05 3.57 4.32 442 
Female 482 3.57 3.85 


Note. Scores run from high to low along an 8-point scale, 


j 
? 


REACTIONS TO DISCLOSURE AT TWO LEVELS 


bo rental health and liking are positively 

rel to disclosure. The relationships appear 

t much more complex, Self-disclosure, be- 

` basic medium for social interaction, is 

ct to social norms and rules which gov- 

interactional behavior. To break from the 

ial expectations and norms governing male 

' emale interactions in a given context, 

ne ruu: the risk of being seen as inappropri- 

d negatively evaluated (Goffman, 

further research is needed to deter- 

what conditions govern the perceived 
opriateness of self-disclosure. 
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Dramatic Events and Attitude Change 


Robert T. Riley and Thomas F. Pettigrew 
Harvard University 


Two quasi-experimental field studies tested the potentiality of dramatic events 
to change the racial attitudes of white Texas adults. The first investigated 
the effects of the 1957 desegregation crisis in Little Rock, Arkansas, Some 
overall change toward greater compliance with desegregation occurred. But 
there was also a sharp counterceiling effect resulting in greater opinion 
polarization, as those types of respondents initially most compliant provided 
the largest shifts. The second study used a multiple time-series design before 
and after the 1968 assassination of Martin Luther King, Jr. The counter- 
ceiling effect operated again both for types of respondents and for types 
of racial attitudes. Formal contact attitudes shifted the most favorably, 
informal contact attitudes moderately, and intimate contact attitudes the 
least. Three months later, there was no change among the formal and in- 
formal contact attitudes and a ripple effect of delayed change among intimate 
contact attitudes. These results relate closely to attitude change theory and 
laboratory research, The counterceiling effect is explained in terms of social 


judgment, 
explained in consistency terms. 


Social psychology has evinced surpris- 
ingly little research interest in the attitude 
change effects of dramatic events. Yet such 
events are commonly thought to shape, even 
shift, public opinion. Is this the case? And 
if so, are these effects systematic and predict- 
able, and do they have implications for 
attitude change theory and research? 

Relevant studies to date have reached 
conflicting conclusions largely because their 
definitions of “effect” have varied as to time 
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project on public school desegregation funded under 
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norm mediation, and relabeling processes; 


the ripple effect is 


period and stability of change. Social psychol- 
ogists and sociologists tend to emphasize slow 
long-term shifts, with sudden shifts generated 
by dramatic events seen as minor deviations 
that soon revert back to the dominant 
trend (e.g., Bellisfield, 1972; Dillehay, 1964; 
Schwartz, 1967; Sheatsley, 1966; Thistle- 
thwaite, 1974). Political scientists, by con- 
trast, often focus on sharp swings in opinion 
over short time periods related to salient 


international events (e.g., Deutsch & Merritt, 


1965). 

We propose a middle-ground position. If 
dramatic events are defined roughly as highly 
publicized occurrences of world significance, 
we believe that they have considerable poten- 
tial for lasting attitude change. Shifts in atti- 
tudes generated by such events occur over 
short periods of time and are superimposed 
upon long-term trends. There are wide indi- 
vidual differences, to be sure, though many 
studies have concentrated exclusively on total 
aggregate effects. Two longitudinal and de- 
tailed studies of physicians’ responses to 
Medicare and educators’ responses to the 
Schempp ruling against religious exercises in 
the public schools, for example, found lasting 
attitude changes even without prior behav- 
ioral change (Colombotos, 1969; Muir, 1967). 
Mann and Janis (1968) provide a pointed 
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ay 


ç laboratory demonstration of this effect. They 
note that the widely discussed Surgeon Gen- 
eral’s Report linking smoking to cancer in 
1964 had a pronounced influence on their 
subjects that rivaled in magnitude and was 
= superimposed upon the experimental effects 
rendered by their anti-smoking, role-playing 
treatments. 

This paper reports two quasi-experimental 
field studies on differential changes in the 
racial attitudes of white Texans over time 
| periods marked by dramatic racial events. 

The pilot study shaped the hypotheses and 
method of a second and more comprehensive 
study. Both studies used survey data from the 
Texas polls of Belden Associates of Dallas, 
Texas, and both exploited fortuitous events 
as the experimental stimuli. But the two 
studies employed different quasi-experimental 
designs: The first study relied on a simple, 
separate sample, pre-post design, while the 
second relied on a multiple time-series design 
' with separate samples. Repeated measures on 
the same samples were not possible; rather, 
independent samples selected by the same 
procedures from the same sampling points 
were used for each survey. The pilot study 
design included only one preevent and one 
postevent sample and thus permitted just one 
| comparison without a control group. Campbell 
and Stanley (1963) rightly criticize this de- 
» Sign extensively. But our second and basic 
study employed four separate samples, two 
prior to and two following the event. This 
second design allowed us to simulate a con- 
“trol and to test for regression-to-the-mean 
_ effects. 


Quast-ExPERIMENT 1 


In September of 1957, then-President 
Dwight D. Eisenhower ordered federal troops 
to Little Rock, Arkansas, to implement a 
federal court order to desegregate that city’s 
ij Central High School (Campbell & Pettigrew, 
1959). This rare action was in response to 
the calling out of the State National Guard 
. by Arkansas Governor Orval Faubus to pre- 
| vent black students from entering the school. 
, ,Eisenhower’s decision occasioned a historic 
onfrontation of federal and state power, re- 
ceived enormous world-wide attention through- 
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out the fall of 1957, and constitutes the 
dramatic event for our pilot study. 


Method 
Survey Samples and Questions 


In August, 1957, 686 white Texans were inter- 
viewed individually to form the preevent sample. 
And in December 1957, a different group of 682 
white Texans were interviewed individually to form 
the postevent sample.1 Each respondent was asked 
a question that assessed resistance to the racial 
desegregation of the public schools: 


Suppose you were on the school board here, and 
they asked you to give your frank opinion. Which 
one of the four statements on this card comes 
closest to the way you feel about it? 

1, Keep the races separate even if I have to 
disobey the law. 

2. Find a way to keep the races separate by 
getting around the law. 

3. Begin mixing the races gradually, starting where 
there is least opposition. 

4. Obey the Jaw even if I have to let all races 
go to the same school immediately. 


With a score of 1 representing extreme resistance 
and 4 representing full compliance, this resistance- 
compliance question directly measured respondents’ 
attitudes toward school desegregation in their own 
communities and constituted the dependent variable, 


Demographic Types 


A dramatic event may not exert the same impact 
on all respondents. Individual characteristics, such 
as age and social class, and residential factors, such 
as city size and region of Texas, may mediate the 
event’s effect. Differences in age and social class 
relate strongly to racial attitudes (Campbell, 1971), 
and differences in the city size and region of resi- 
dence often point to contrasting racial traditions 
within Texas (Pettigrew & Riley, 1972). For this 
initial analysis of variance, age, social class, city size, 
and region of Texas constituted four separate factors 
categorized as follows: age—younger (20-34 years), 
middle-aged (35-50), and older (51+); social class— 
low, working, and middle (based equally on the 
occupation of the respondent and on the inter- 
viewer's rating of the family socioeconomic status) ; 
city sise—rural (under 10,000 total population in 
1960 United States Census) and urban (10,000+) ; 
and region of Texas—East (Lamar, Kaufman, 
Grimes, Chambers, and all counties to the east), 


1A three-stage sample design was employed on 
all of the Belden surveys reported in this paper: 
(a) a stratified probability sample of sampling 
points; (b) a probability sample of areas within 
sampling points; and (c) quota sampling of respon- 
dents by age, sex, and race within areas, 
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TABLE 1 


ANALYSIS OF VARIANCE FoR ATTITUDES ON 
DESEGREGATION, PRE- AND Post- 
Litrre Rock Crisis, 
EXPERIMENT 1 


Source af MS F 
Time (A) 1 10.69  13.04* 
AX Age (B) 2 1.39 1.69 
A X City Size (C) 1 1.03 1.26 
A X Social Class (D) 2 7.81 9,52* 
A X Region (E) 2 13.16  16.03* 
AXBXC 2 -80 97 
AXBXD 4 1.09 1.32 
AXBXE 4 1.72 2.09 
AXCXD 2 1.41 1.72 
AXCXE 2 49 59 
AXDXE 4 55 67 
AXBXCXD 4 54 .65 
AXBXCXE 4 1.29 1.57 
AXBXDXE 8 67 83 
AXCXDXE 4 33 40 
AXBXCXDXE 8 J 93 
Error 1, 314 82 
*p <.001, 


Central, and West (Hardeman, McCulloch, Zavala, 
and all counties to the west). 

Crossing these four factors yielded 54 cells in our 
analysis of variance design. We call each of these 
unique combinations of background and residential 
characteristics a demographic type, similar in mean- 
ing and purpose to Pool, Abelson, and Popkin’s 
(1965) use of the term voter type. To assess the 
overall impact of the dramatic event, preevent and 
postevent means on the attitude item are compared; 
and to assess the differential impact of the event, pre- 
event and postevent means within each demographic 
type or combination of types are compared. The 
interactions of the demographic variables with time 
point to the existence of a patterned differential ef- 
fect of the dramatic event. For conservative tests, 
we poo rror all effects not involving time; 
that is, all main effects of age, social class, 
city size n, as well as their interactions 
without ooled with error, 


Results 


T} can on the attitude item 
I nee from 2.01 to 2.14 
a ad December 1957 
.001; see Table 1). 
isks the differential 
episode apparently 

ion attitudes of 


1 wit? 


samy ( 

But this c 
impact that 
exerted on ihe 


white Texans. The ntial effects are 
signaled in Table two significant 
interactions, Time X Soc ass (F = 9.52, 

and Time x R 1 of the state 


p <-001) 


(F = 16.03. p < 001). Table 2 provides the 
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pre- and post-Little Rock attitude means for 
these two demographic variables. Note that 
the positive shifts toward greater compliance 
are concentrated among respondent types| 
from the middle class and West Texas. But 
substantial negative shifts toward greater re- 
sistance to school desegregation took place ` 
among respondent types from the lower class — 
and East Texas. Table 2 also reveals a phe- 
nomenon that we shall focus upon in two 
ways: opinion polarization and a counter- 
ceiling effect. / 

Opinion polarization. One apparent result 
of the Little Rock conflict upon white Texans ` 
was to heighten their group differences in» 
opinions toward school desegregation. Prior to . 
the event in August, lower-class and middle- 
class respondents had a mean difference of. 
.66; but by December their differente had | 
widened to 1.26. Similarly, differences be-, 
tween West and East Texans rose from .78,~ 
to 1.44. 

A counterceiling effect. Opinion polariza- 
tion in Table 2 resulted from the operation, 
of a counterceiling effect. This effect occurs ` 
when the greatest change, either positive or — 
negative, occurs for demographic types with 
the least range to exhibit change. Thus, the 
initially most compliant demographic types’ — 
provide the greatest amount of positive 
change, and the initially most resistant types. 
the greatest amount of negative change; 
Table 2 reveals the effect. The two groups” 
with the most compliant attitudes toward 
school desegregation initially, the middle class 
and West Texans, are the groups that evinced 
the largest shifts toward becoming still m4 


TABLE 2 


ATTITUDE MEAN ON DESEGREGATION COMPLIANCE | 
BY SOCIAL CLASS AND REGION, PRE- AND 
Post-LittLe RocK Crisis, 
EXPERIMENT 1 


Pre- Post- 
Factor Little Rock Little Rock 

Social class 

Lower class (219) 1.70 1.49 

Working class (260) 1.99 217 

Middle class (206) 2.36 2.75 
Region 

East Texas (130) 1,64 

Central Texas (288) 1.84 1.92 

West Texas (267) 2.42 2.79 


Nole. Number in parentheses is number of respondents. 


g Xe compliant. Likewise, the two groups with the 
“most resistant attitudes initially, the lower 
class and East Texans, are the groups that 
evinced the largest shifts toward becoming 
still more resistant, 

A more refined correlational test for the 
counterceiling effect can be made. Using the 
54 demographic types as the units of analysis, 
|) a positive correlation between preevent means 
“4 and the mean changes over time reveals the 

existence and extent of the effect. For these 

pilot data, there is a moderately positive rela- 
tionship (r = 42, p < .01). This pattern is 
further delineated by dividing the 54 demo- 
graphic types into three groups on the basis 
of their preevent attitude means: the 18 most 
compliant types, the 18 middle types, and the 

18 most resistant types. Change toward com- 

splianZe was greatest among the initially most 
, compliant third, with an average mean change 

of +.50. Virtually no change occurred among 
the midway types (—.01). And change toward 
resistance was greatest among the initially 
most resistant third, with an average mean 
change of —.35. Likewise, the correlations 
between the preevent means and pre-post 
change are significant only among the two 
extreme thirds of the demographic types 

(+.50, p < .05, for the initially most com- 

pliant third; +-.17, ns, for the middle third; 

and +.67, p < .01, for the initially most re- 

ų sistant third). All of these results are consist- 
ent with the operation of a definite counter- 

) ceiling effect among those demographic types 
^ that prior to the Little Rock episode harbored 
e more extreme views for or against the 
racial desegregation of the public schools, 

Were the basic change scores corrected for 

ceiling and floor effects, the apparent effect 

of the dramatic Little Rock events would be 
considerably more striking. 


Discussion 


No firm conclusion can be drawn from 
these pilot results. Data from only two points 
in time do not allow a determination of the 
already operating attitude trend. Without an 
uneventful period for comparison, we do not 
know if the changes uncovered would have 
occurred without the events surrounding 
| Little Rock’s school desegregation. Moreover, 
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only one survey question was employed as the 
dependent variable; this prevents any at- 
tempt to gauge differential effects across 
various attitudes. Though inadequate to es- 
tablish the effects of a dramatic effect, these 
Suggestive results of a pattern of differential 
responses did serve to pique our interest in 
the problem and to lead us to propose for 
our second study the following hypotheses: 
Dramatic racial events in American race rela- 
tions produce differential effects upon the 
racial attitudes of whites as a direct function 
of preevent attitudes. (a) Thus, those types 
of respondents initially most favorable to 
racial change will change the most toward 
becoming still more favorable. (b) And those 
forms of racial change initially most accepted 
by whites will yield the greatest change 
toward becoming still more accepted. 


QUASI-EXPERIMENT 2 


The opportunity to test these ideas pre- 
sented itself tragically on April 4, 1968, when 
the nation’s foremost civil rights leader, Dr, 
Martin Luther King, Jr., was assassinated in 
Memphis. Violent race riots and lesser dis- 
turbances erupted in many American cities 
following the murder. And just prior to the 
murder, the National Advisory Commission 
on Civil Disorders (1968)—popularly known 
as the Kerner Commission—released its con- 
troversial report in which it condemned Amer- 
ican society as “racist.” This tumultuous 
period constitutes the intervening dramatic 
event. Though the effects of these three 
highly publicized happenings cannot be sepa- 
rated from each other, the survey data indi- 
cate that the assassination of Dr, King was 
by far the most salient and influential of 
these events. 


2 Our interest was furthered by a crude confirma- 
tion of the counterceiling effect in Belden’s Texas 
polls of August 1963 and August 1964. This period, 
marked by the assassination of President John Ken- 
nedy in Dallas as well as the sweeping Civil Rights 
Act of 1964, also witnessed much-larger-than-usual 
increases in the acceptance by white Texans of racial 
desegregation. And these increases were greatest both 
among those types of respondents initially more 
favorable as well as for those areas of desegregation 
initially most accepted. 
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Method 
Survey Samples 


Four Texas polls were used in the second study, 
each with a fresh probability sample drawn from 
the same sampling points. Two were conducted 
during December of 1967 and March of 1968, prior 
to the events, and two were conducted during May 
and August of 1968, soon after and four months 
after the events. Approximately 750 white Texans 
are included in each of these samples. The use of 
four surveys, instead of two, strengthens the design 
considerably. By comparing the December and 
March sample differences with those of March and 
May, the effects of the dramatic event can be largely 
separated from the effects of history, maturation, 
or continuing trends. By comparing the March and 
May sample differences with those of May and 
August, the persistence of short-term changes and 
the delayed appearance of additional changes can be 
determined. 


Demographic Types 


The three-way categorizations of age, social class, 
and region of Texas listed above for the first study 
were retained for this analysis. City size, however, 
was expanded to three categories: under 10,000; 
10,000 to 99,999; and 100,000+ population in the 
1960 census. This would have led to 81 distinctive 
demographic types (34), but the absence of any 
communities over 100,000 in population in East 
Texas reduced this number to 72. These missing nine 
cells also necessitated an incomplete factorial analy- 
sis design, and made it impossible to obtain uncon- 
founded estimates for the interaction terms involving 
city size and region. Once again, all effects not in- 
volving time were pooled with error for conservative 
tests: 


Altitude Questions and Scales 


Belden Associates asked on each of the four surveys 

a series of questions concerning attitudes toward ra- 
cial desegregation in a dozen different realms ranging 
from hotels to college roommates. Responses to these 
questions were recorded on an 1l-category scale 
varying from 1 (most favorable to racial desegrega- 
tion) to 6 (neutral or no opinion) to 11 (most 
unfavorable). Less than 3% of the respondents re- 
fused to answer any one item; nonresponses were 
assigned the middle value (6). To sharpen the analy- 
sis, these 12 items were combined into three scales 
differentiated by the intimacy of the interracial con- 
tact—formal, informal, and intimate. These scales 
were formed empirically by considering the item 
intercorrelations, and conceptually by considering the 
work of Triandis and Davis (1965). Pooling all of 
the respondents from the four samples, the three 
scales had Kuder-Richardson 20 reliabilities above .70. 
The formal contact scale. This scale was formed 
by taking the mean score of the four items tapping 
approval of both races using the “same (public) 
buses,” the “same restaurants,” the “same hotels,” 
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and the “same jobs.” These contact situations in- » 
volve the most defined roles and norms, and their 
desegregation has long been the focus of legislative 
and judicial action. These interracial situations also 
tend to provoke the least threat for whites; indeed, 

a majority of white Texans already accepted inter- 
racial contact in these realms in an August 1964 
Texas poll and 60% did so by December of 1967. / 
Following the analysis of Triandis and Davis (1965), $ 
these four situations should invoke belief dissimi- 
larity more heavily than race dissimilarity as a 
source of prejudice. 

The informal contact scale, This scale employed 
three items that tapped approval of both races using 
the “same (public) schools,” the “same churches,” 
and both black and white public school teachers. 
Interracial contact in schools and churches was still 
rare in Texas in 1968, unlike widespread contact and 
desegregation in restaurants, hotels, and public buses. 
It involves somewhat more threat to whites and 
did not achieve a majority acceptance among white 
Texans until the December 1967 survey. Again, fol- 
lowing Triandis and Davis (1965), we would expect 
these situations to invoke both belief and race dis- 
similarity as sources of prejudice and would expect. 
to witness a greater degree of individual variance. 

The intimate contact scale. This scale used the 
remaining five items that measured approval of the 
races sharing the “same swimming pools,” the “same 
social gatherings,” and the “same house parties,” as 
well as the races living “next door” to one another 
and living together as college roommates. Unlike 
formal and informal contact, interracial contact in 
these situations requires high levels of social accept- 
ance and cross-racial interaction. And it is precisely 
this type of intimate interracial contact that tends 
to invoke the greatest threat among whites, Legis- 
lative and judicial attempts to bring about social 
change in these realms have faced the greatest 
resistance, and an average of only 29% of white 
Texans approved of this interaction in the December 
1967 survey. Race dissimilarity, according to Tri- 
andis and Davis (1965), should predominate in these 
situations as the primary source of prejudice. 


Results 
Short-Term Effects of King’s Assassination 


Table 3 summarizes for each of the three 
scales the short-term changes that occurred 
during the time of the King assassination. 
The survey data for December 1967 and 
March 1968 are combined in this table to 
obtain the preevent means, which are com- 
pared with the postevent means of May 1968; 
this procedure provides more stable preevent 
estimates and is justified by the fact that 
none of the December and March means are , 
significantly different, 
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TABLE 3 


ANALYSES OF VARIANCE FOR ATTITUDES TOWARD THREE FORMS or INTERRACIAL Contact, 
PRE- AND IMMEDIATE Post-Kinc ASSASSINATION, EXPERIMENT 2 


Formal contact 


Informal contact Intimate contact 


Source df MS F MS F MS F 
ime (A) 1 141.00 13.69" 48,73 3.95* 02 00 
< Age (B) 2 5.25 SL 15.36 1.24 12.28 1.05 
\ X Social Class (C) 2 47 05 12.03 ‘98 T aa} 
A 2 92.79 9.01"** 162.34  13.14*** 10139  8.65*%** 
\XBXC 4 3.42 33 441 1281 1.09 
A xD 4 9.84 96 12.77 5.59 48 
XD 4 1449 141 18.16 7.09 61 
iXBXCXD 8 2291 2.228 26.56 24.32 2.01" 
\ X B X City Size (E) 4 8.59 84 6.54 16.69 1.43 
\ x I 4 1264 1.23 17.84 ENEE] 
\ 2 49.95 4.85% 52.08 27.33 233 
Error 2, 207 10.30 12.35 11.71 
f 
= hr <.05 


p< .01 
004 


\s predicted, the shifts vary sharply across 
the three types of interracial contact. Atti- 
toward formal contact, initially the 
nost accepted realm, had the largest overall 
vorability shift (from 4.39 to 3.86) ;* atti- 
udes toward informal contact had a signifi- 
cant but more moderate overall shift (from 
+34 to 4.56); and attitudes toward intimate 
contact, initially the least accepted realm, 
were unaffected overall (7.41 to 7.42). This 
form of counterceiling effect is also indicated 
by the near-perfect rank-order relationship 
across the 12 interracial situations between 
preevent means and the amount of change by 
May 1968 (p = +.965, p < .001). 
he significant interactions between time 
i region and between time and city size in 
iable 3 point to the differential shifts across 
demographic types. In general, these inter- 
actions signal mean changes that are also con- 
sistent with our counterceiling hypothesis. 
Thus, East Texans evinced the smallest favor- 
able shifts, and metropolitan dwellers evinced 
the largest favorable shifts. (No groups 
i. evinced negative shifts toward greater opposi- 
) tion to desegregation.) But there were some 
(Ñ discrepancies with the hypothesis; rural re- 
j spondents changed more favorably than pre- 
dicted, while West Texans changed less than 
predicted. A closer look reveals that the coun- 
‘terceiling effect across demographic types is 
_ mediated by the intimacy of the domain. 
« There are significant correlations across the 


tudes 


72 demographic types between initial attitude 
means and the degree of change by May 
1968 for attitudes concerning formal contact 
(r= +.30, ~<.01) and intimate contact 
(r= +.69, p < .001), but not for attitudes 
concerning informal contact (r= —.04, ns), 
A control comparison. Analyses of variance 
directly comparable to those shown in Table 3 
were also performed for the December 1967 
to March 1968 period prior to King’s assas- 
sination. This simulated control comparison 
tests the possibility that the substantial 
changes recorded by May 1968 were merely 
continuations of ongoing trends that predated 
the dramatic event. Neither the main effect 
of time nor any of the interactions involving 
time attained statistical significance for any 
of the three contact scales in the tests that 
compare the December—March differences. 


Delayed Effect of the King Assassination 


Did the changes in attitudes toward formal 
and informal contact that were recorded in 
May 1968 last until August 1968? And were 
there any delayed effects that appeared 4 
months after the dramatic event? Table 4 
summarizes these results by comparing the 
May 1968 with the August 1968 means. First, 


8It should be noted that a downward shift now 
reflects a greater acceptance of racial desegregation, 
unlike the results from the pilot study, 
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TABLE 4 


ANALYSES OF VARIANCE FOR ATTITUDES TOWARD THREE FORMS OF INTERRACIAL CONTACT, 
IMMEDIATE AND DELAYED Post-KING ASSASSINATION, EXPERIMENT 2 


Formal contact 


Informal contact Intimate t 


Source df MS F MS F MS 

Time (A) 1 2.50 25 4.00 32 53,80 

A X Age (B) 2 11.70 1.16 28.47 2.29 36.65 

A X Social Class (C) 2 8.11 80 13.41 1.08 22.87 

A X Region (D) 2 8.63 85 11.40 92 2.58 
AXBXC + 3.89 39 10.57 85 25.74 2 
AXBXD 4 14.24 1.41 5.64 AS 4.53 
AXCxD 4 2.26 22 4.93 40 12.32 1\ 
AXBXCXD 8 13.07 1.29 13.19 1.06 20.02 1.7 
A X B X City Size (E) 4 3.04 .30 3.99 -32 4.03 } 
AXCXE 4 13.08 1.29 18.24 1.47 10.37 
AXE 2 26.64 2.64 18.62 1.50 2953 2 

Error 1, 506 10.11 12.45 11.79 

*p <.05, 


we note that the only significant effects in- 
volve intimate contact. The substantial shifts 
in attitudes toward formal and informal con- 
tact that occurred at the time of the King 
assassination persisted at least until August, 
for no regression to the mean is evident either 
for formal contact (3.86 to 3.88) or informal 
contact (actually slightly more favorable, 
from 4.56 to 4.46). 

But, surprisingly, a significant favorability 
shift now occurred for attitudes toward inti- 
mate interracial contact (7.42 to 7.05), the 
realm that did not reveal an overall shift im- 
mediately after the event.t The significant 
interaction between time and age specifies 
this unexpected result further. Between May 
and August 1968, the attitudes of white 
Texans 50 years of age and younger became 
substantially more favorable toward intimate 
interracial contact (under 35, 6.86 to 6.20; 
and 35-50, 7.70 to 6.95). But respondents 
over 50 became less favorable to such con- 
tact (7.72 to 7.97). These findings are con- 
sistent with a counterceiling effect across 
demographic types; we checked on this fur- 
ther by dividing the 72 types into thirds 
according to the intimate contact means in 

May. Once again, the least favorable third 
changed the least by August (—.16), the 
middle third changed more (—.26), and the 
most favorable third the most (—.79; F = 
2.89, p < .05). Note, too, that polarization 
across the “generation gap” also resulted from 
these counterceiling trends. While those under 


35 differed from those over 50 by .86 in Ma , 
they differed by 1.77 in August. 

Exploring the possible operation of a strai» 
toward cognitive consistency underlying th 
delayed effect, correlations were obtaine: 
across the 72 demographic types between th: 
amount of May to August change in intimate 
contact attitudes and the March to May 


change in other attitude realms. The delayed | 


changes in intimate contact attitudes cor- 
relate positively and significantly with the 
initial change in both formal contact atti 
tudes (7 = +.413, p < .001) and informai 
contact attitudes (r = +.489, p < .001); and 
regressing the delayed changes in intimate 
contact attitudes upon the initial changes in 
both formal and informal contact attitudes 
yields a multiple correlation of +,542. This 
link between the initial and the delayed shifts 
is specified further by obtaining the correla 
tions separately for the 48 demographic type 
that include respondents 50 years of age o 
younger and for the 24 demographic type 
that include respondents over 50 years of age. 
As one might expect from the age interaction 


*When an analysis of covariance for attitudes 
toward intimate contact was carried out for the May 
to August shifts, with attitudes toward formal and 
informal contact as covariants, a sharp reduction in 
the error term resulted and the F test for the de- 
layed change in intimate contact attitudes became 
correspondingly more significant. Thus, the F ratios 
reported here are conservative. 


mt ee 
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discussed above, the correlations are signifi- 


* cant only for the younger demographic types 


(formal contact, r= +.521, p< .001; in- 
mal contact, r = +.591, p < .001) and not 
{or the older types (formal contact, r= 
090, ns; informal contact, r = .183, ms). ` 
le 5 explores further the possibility of 
sistency strain involved with the delayed 
in intimate contact attitudes by pro- 
the interscale correlations across indi- 
juals for each of the four samples. While 
coefficients are quite comparable across 
December, March, and August samples, 
« for May are uniformly and significantly 
wer. Observe, too, that this reduction in 
the May correlations is especially great for 
the iwo interscale combinations that involve 
att :oward intimate contact. 


Discussion 


vo interesting effects emerge from these 
field results, both of which relate to attitude 
chenge theory and laboratory research. 


The Counterceiling Effect 


The strength of this effect is striking in 
the results of both studies and across demo- 
graphic types as well as attitude realms. It 
operates strongest at the extremes of either 
the demographic types’ initial scores or the 
three attitude realms (toward formal and 
intimate contact but not the intermediate 
realm of informal contact). Were these data 
corrected for ceiling and floor constraints, 
they would constitute far more impressive 
opinion shifts than generally uncovered in 
field research. Hovland (1959) noted in a 
classic paper seven reasons that field studies 
rarely obtained as large attitude changes as 
those commonly found in the laboratory. 
Naturalistic situations, he noted, are more 
typified by (a) greater self-selection of those 
who expose themselves to the communication, 
(b) breadth and complexity of communi- 
cations, (c) longer time interval between the 
communication exposure and the change mea- 
sure, (d) more remote and potentially less 
credible communicators, (e) familiar com- 
munication settings without incentives for 
change as well as social support for not 
changing, (f) random samples instead of col- 
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TABLE 5 
INTERSCALE CORRELATIONS Across INDIVIDUAL 


RESPONDENTS IN Four SURVEYS, 
EXPERIMENT 2 

December March May August 

sample sample sample sample 

Scale (750) (756) (738) (782) 
Formal and informal 

contact -67 70 62 76 
Formal and intimate 

contact 62 66 50 104 
Informal and intimate 


contact -80 84 62 82 


lege sophomores, and (g) more socially sig- 
nificant attitudes rather than attitudes selected 
for their modifiability. 

Many of these characteristics describe the 
King assassination situation, but there are im- 
portant differences, involving the first five, 
that help explain why the present attitude 
changes were as large as they were and why 
these changes typically took a counterceiling 
direction. First, dramatic events are likely to 
be communicated to extremely wide audi- 
ences, thereby reducing the degree of self- 
selection involved with exposure. Second, the 
basic facts of this dramatic event are rela- 
tively simple and direct, even if their meaning 
may be complex. Third, relative to most field 
studies, the present research measured for 
change soon after the event. Fourth, the 
credibility of communicators reporting this 
dramatic event may have been enhanced by 
virtue of the widespread agreement among a 
variety of sources as to the facts involved. 
Finally, and perhaps most important, dra- 
matic events upset perceived reality and 
thereby may have invoked an incentive to 
modify social opinions in order to reestablish 
perceptual stability. 

Likewise, social support for maintaining or 
altering racial opinions as a function of the 
King assassination undoubtedly varied greatly 
among different segments of white Texans. 
We have noted strong interactions, for ex- 
ample, by region of the state and by rural- 
urban communities. Small urban areas in East 
Texas, such as Marshall, evinced the smallest 
increases in favor of interracial contact; 
Dallas, Houston, and other metropolitan cen- 
ters in Central Texas, by contrast, evinced 
the largest increases in favor of interracial 
contact. Considerable research has demon- 
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strated how the historical and cultural dif- 
ferences of these areas relate strongly to cur- 
rent racial and political phenomena (Key, 
1950; Matthews & Prothro, 1966; Pettigrew 
& Cramer, 1959; Pettigrew & Riley, 1972). 
Thus, regional and local norms reflect cultural 
differences and political climates; these norms 
mediate the effects of a dramatic event as a 
source of racial attitude change. 

We believe that this process of norm medi- 
ation partly fostered the counterceiling effect. 
If we assume that diverse racial norms across 
Texas in 1968 influenced the receipt and 
interpretation of the stimulus in roughly direct 
relation to the preevent racial attitudes held 
in various areas, counterceiling shifts would 
result. Such shifts, we have noted, tended to 
polarize white Texans along such dividing 
lines as region. Is opinion polarization, then, 
a necessary result of dramatic events? We 
suspect it often is a result, but not always. 
Negative cases come readily to mind: For 
instance, the December 7, 1941, attack on 
Pearl Harbor was certainly a dramatic event, 
but it served largely to unite rather than 
polarize the American public. There are, of 
course, obvious differences between the Pearl 
Harbor bombing and the King assassination, 
and that is the point. Polarization is a likely 
result of certain types of events, but we need 
an effective categorization of dramatic events 
to specify the relevant conditions—a task 
beyond the scope of this paper. 

At the individual cognitive level, the 
counterceiling effect is consistent with the 
nonmonotonic discrepancy predictions of so- 
cial judgment theory (Sherif & Hovland, 
1961). Of course, like other field studies, this 
research can shed no direct light on the dis- 
crepancy issue in attitude change, since the 
measure of discrepancy is across subjects and 
not across messages. This confounding of 
initial position with discrepancy, then, renders 
these data inconclusive. Yet it is still worth 
noting that this situation matches a number 
of the conditions that have been demonstrated 
in the laboratory to lead to a nonmonotonic 
relationship between discrepancy and attitude 

change: an issue of high involvement, high 
attitude confidence, and a wide range of pre- 
event attitudes (Freedman, 1964; Koslin, 
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Stoops, & Loh, 1967; Miller, 1965; herif, 
Sherif, & Nebergall, 1965).° Relabeli' » and 
the establishment of new anchors, as = pha- 
sized by Ostrom and Upshaw (1968) i: their 
perspective model of attitude change may 
also have operated for many responden > and 


contributed to the counterceiling fect. 
Hence, King may have been perceiv by 
many white Texans prior to his assa ina- 
tion as a trouble-making militant. Bui nce 


struck down by an assassin’s bullet, ing 
shifted for many to the status of a ma ‘yr. 
Indeed, national media coverage of the e ent 
actively aided such a reanchoring process 


The Ripple Effect * 


The delayed change in the attitudes of 
white Texans toward intimate interracial © n- 
tact can be variously explained. The nost 
parsimonious explanation involves additic al 
dramatic events that occurred between ne 
critical months of May and August of 19°38, 
such as the assassination of Robert Kennedy 
in June. This explanation requires that there 
was some perceived link for many white 
Texans between racial contact in intimate 
situations and Robert Kennedy. Or it could 
be argued that this second assassination of 
1968 affected white Texans’ opinions toward 
all interracial contact, and thus it led to a 
shift in the previously unchanged intimate 
contact realm while preventing a regression- 
to-the-mean shift in the formal and informal 
contact realms. We feel that these possi.\\'r 
are unlikely, since the perceived link between 
race relations and Kennedy was not nearly 
so direct as that with King. But we have no 
direct test to discount these possibilities. 

We prefer, however, a consistency explan- 
tion for this delayed shift in intimate contact 
attitudes. Thus, it is held that the abrupt 


5 However, one dimension of this situation— 
reasonably high credibility of the sources—would 
predict a monotonic function between discrepancy 
and attitude change (Aronson, Turner, & Carlsmith, 
1963; Bochner & Insko, 1966). 

6 We purposely avoid the more familiar term of 
sleeper effect because of its identification with a 
differential forgetting explanation, for rather dif- — 
ferent processes appear to operate in this case 
(Hovland & Weiss, 1951). 
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in attitudes toward contact with 
in both formal and informal contexts 
eated for many of our respondents sharp 
»nsistencies with their generally unchanged 
itudes toward intimate contact that in turn 
d to the delayed shift. We know that racial 
titudes in the United States are generally 
of early socialization (Porter, 1971), 
ially those racial attitudes involved with 
ate concerns. Intimate contact, as noted 
Triandis and Davis (1965), therefore in- 
s components of racial similarity far 
» than those of belief similarity. These 
rations help explain two of our find- 
There was no significant shift in inti- 
attitudes soon after King’s 
cose these attitudes were the most 
affectively rooted in early social- 
snd when these attitudes did register 

‘icant if belated shift by August, they 

> as a function of age. Older white 

s were socialized during the early dec- 

of this century when intimate interracial 
tact under southern norms was stigma- 
ed as sinful behavior (Myrdal, 1944; 
ith, 1949). It is not surprising, then, that 
nge toward acceptance of intimate inter- 
cial interaction was concentrated among the 
ounger respondents. Moreover, older respon- 
dents had changed less initially on their atti- 
tudes toward formal and informal contact 
consequently had less inconsistency to 


cor 


ind 


‘ounger respondents faced a greater prob- 
of racial attitude inconsistency fol- 
ag their significant moves toward ac- 
ceptance of both formal and informal inter- 
action. In the terms of our items, this could 
take such a concrete form as “If I now believe 
that it is all right for blacks and whites to 
school and church together, what is so 
wrong about their attending social gatherings 
together?” This inconsistency could have been 
resolved in other ways. Attitudes toward 
formal and informal contact could have re- 
gressed back to their preassassination levels. 
But in the context of ongoing structural 
change in Texas race relations in 1968, our 
respondents tended to resolve the inconsist- 
ency by a delayed modification of their most 
fundamental racial opinions. 
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Two lines of evidence are presented in sup- 
port of this consistency explanation. First, 
those kinds of respondents who changed the 
most toward acceptance of formal and in- 
formal contact soon after the event tended to 
be those that later changed the most toward 
acceptance of intimate contact. Second, low- 
ered interscale correlations were noted in May 
with a return to preevent levels by August as 
required by the consistency explanation. 

This delayed effect and consistency inter- 
pretation recall the work of McGuire (1960a, 
1960b) on syllogistic attitude change. Among 
other findings, McGuire noted changes on 
logically connected but unmentioned attitude 
“conclusions” following induced changes 1 
week earlier on attitude “premises.” This 
result did not quite attain statistical signifi- 
cance, but the delayed changes in the derived 
attitudes came considerably closer than the 
immediate changes to his consistency model 
predictions. McGuire (1960a) stated: 


People’s opinions seem to follow a “loose-link” 
model such that a considerable change can be in- 
duced on one cognition before any pull is exerted on 
related cognitions. This effect, however, is at least 
partially corrected by the continued seepage of 
change from the explicit issue to the derived one as 
time passes. (pp. 351-352) 


Dillehay, Insko, and Smith (1966) repeated 
McGuire’s studies with subjects of greater 
educational attainment. Though they found 
greater consistency 1 week later, they de- 
tected only a small and insignificant tendency 
for the delayed changes in the unmentioned 
issue to come closer to consistency predic- 
tions than the immediate changes. Two other 
studies, however, uncovered delayed effects 
similar to those noted here. Hovland, Lums- 
daine, and Sheffield (1949, pp. 70-71, 184— 
187) found that the Battle of Britain film in 
World War II had few 1-week effects but 
considerable 9-week effects on more general 
attitudes; they suggested that a “sinking-in’” 
period may be required for such derivative 
effects, Katz, Sarnoff, and McClintock (1956) 
noted a similar phenomenon when they found 
that a self-insight approach more effectively 
reduced antiblack attitudes of college sub- 
jects than did an informational one. Their 
self-insight technique, consisting of a case 
history that demonstrated the defensive na- 
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ture of prejudice, took longer for its effects to 
emerge. Later work by Stotland, Katz, and 
Patchen (1959) indicated that this delayed 
effect involved in part the subject’s realiza- 
tion that racial prejudice was inconsistent 
with her self-conception. 


Use oj Separate Samples 


The similarity of these quasi-experimental 
results from the field to laboratory results is 
especially interesting given the utilization of 
separate samples at each of the four points in 
time. There are obvious disadvantages to this 
design, and comparable sampling must be 
assumed not only for each entire sample but 
for each of the 72 demographic types em- 
ployed as analytical units. Yet there are also 
advantages to the use of separate samples. 
There are no testing, memory, or set biases 
carried over from sampling to sampling. 
There is also an enormous practical advan- 
tage. Over-time field studies with the same 
samples generally have large attrition rates 
that seriously bias their results, in part be- 
cause of urban America’s high rate of hori- 
zontal mobility. Yet many of the most in- 
teresting questions for field research require 
longitudinal designs. Following the work of 
Pool, Abelson, and Popkin (1965), this study 
demonstrates the utility of a quasi-experi- 
mental, longitudinal approach involving sepa- 
rate samples and the analytic use of demo- 
graphic types. Such a design is especially 
suited for further research on dramatic events, 
which from the present results appear capable 
of generating significant attitude change via 
processes already delineated in laboratory 
research. 
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Sex Typing and Androgyny: Further Explorations 
of the Expressive Domain 
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Previous research by Bem has indicated that androgynous individuals of both 
sexes display “masculine” independence when under pressure to conform as 
well as “feminine” nurturance when interacting with a kitten. In contrast, 
sex-typed individuals were low in one or both of these behaviors. The two 
studies reported here were designed to replicate the low nurturance of the 
masculine male and to clarify the unexpected finding that feminine females 
were low in both independence and nurturance, In the first study subjects 
interacted with a human infant, and in the second study they listened to a 
lonely student. Taken together, the results of these two studies conceptually 
replicated the low nurturance of the masculine male and demonstrated that 
the low nurturance of the feminine female does not extend to her interaction 


with humans. Finally, evidence was presented in support of Spence, Helmreich 


> 


and Stapp’s distinction between “androgynous” individuals, who are high in 


both masculinity and femininity, and “undifferentiated” 


low in both of these characteristics. 


Both historically and cross-culturally, mas- 
culinity and femininity have represented 
complementary domains of positive traits and 
behaviors. Different theorists have designated 
different labels for these domains, According 
to Parsons (Parsons & Bales, 1955), mascu- 
linity has been associated with an “instru- 
mental” orientation, a cognitive focus on 
getting the job done or the problem solved, 
whereas femininity has been associated with an 
“expressive” orientation, an affective concern 
for the welfare of others and the harmony of 
the group. Similarly, Bakan (1966) has sug- 
gested that masculinity is associated with an 
“agentic” orientation, a concern for oneself 
as an individual, whereas femininity is asso- 
ciated with a “communal” orientation, a con- 
cern for the relationship between oneself and 
others. 

More recently, scholars in a number of 
disciplines have begun to concern themselves 
with the concept of psychological androgyny, 

a term which denotes the integration of both 
masculinity and femininity within a single 


This research was supported by National Institute 
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individuals, who are 


individual (e.g., Bakan, 1966; Bazin & Fro- 
man, 1974; Bem, 1974, 1975, in press-i): 
Bem & Lenney, 1976; Block, 1973; Gelp:, 
1974; Harris, 1974; Hefner, Rebecca, & Ole- 
shansky, 1975; Heilbrun, 1973, 1976; Kap- 
lan & Bean, 1976; Pleck, 1975; Rebecca, 
Hefner, & Oleshansky, in press; Rossi, 1964; 
Secor, 1974; Spence, Helmreich, & Ste 
1975; Stimpson, 1974; Berzins & Wellin., 
Note 1). The concept of psychological ar 
drogyny implies that it is possible for an in 


p 


dividual to be both masculine and feminine; 


both instrumental and expressive, both agen- 
tic and communal, depending upon the situa- 
tional appropriateness of these various modal- 
ities; and it further implies that an individual 
may even blend these complementary modal 
ities into a single act, being able, for examp! 
to fire an employee if the circumstances wa 
rant it, but to do so with sensitivity for th: 
human emotion that such an act inevitab? 
produces. 

Empirical research on the concept of an- 
drogyny is now beginning to appear in the 
psychological literature, For example, a more 
androgynous view of oneself has recently 
been found to be accompanied by greater ma- 
turity in one’s moral judgments (Block, 1973) 
and by a higher level of self-esteem (Spence 
et al., 1975). In our own laboratory studies, 
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we e found that androgynous individuals 
n ‘ly perform cross-sex behavior with 
li ‘luctance or discomfort (Bem & Len- 
n 976) but display both “masculine” in- 
ence when under pressure to conform 
“feminine” nurturance when given 
tunity to interact with a baby kit- 
2m, 1975). 
for many individuals, traditional 
still produce an unnecessary and 
I even dysfunctional pattern of avoid- 
é which prevents the possibility of an- 
í from ever becoming a reality. For 
€ ) the study by Bem and Lenney 
ped individuals not only ac- 
wide variety of simple, every- 
(like nailing two boards to- 
inding a package of yarn into a 
because those activities happen to 
typed as more appropriate for the 
x, but they also reported discomfort 
en some temporary loss of self-esteem 
w actually required to perform such ac- 
ies. 
‘oreover, this pattern of avoidance is not 
limused to simple, everyday behaviors but 
rs to constrict the individual’s instru- 
tal and expressive functioning as well. 
aus, whereas androgynous individuals in the 
Bem (1975) study displayed high levels of 
both masculine independence and feminine 
nu 1onandrogynous individuals did 
masculine males were low in 
nce, and feminine males were low in 
idence. 
xe the nonandrogynous males, the non- 
androgynous females were also constricted, 
but their behavioral patterns were more com- 
plicated. As anticipated, masculine females 
were quite independent, but they were not 
significantly Jess nurturant toward the kit- 
ten than were androgynous females, Hence, 
it could not be concluded that the masculine 
woman was low in her expressive functioning. 
But it was the behavior of the feminine wo- 
man that was the most surprising and the 
most constricted. Not only was the feminine 
woman low in independence but she was also 
low in her nurturance toward the kitten, Of 
course, it is possible that feminine women 
might simply find animals unappealing for 


à Some reason and that they might therefore 
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display much greater nurturance if they were 
given the opportunity to interact with an- 
other human being rather than with a kitten, 
The current investigations represent a fur- 
ther exploration of the expressive domain. In 
an attempt to give the feminine women a 
fairer test of their expressive functioning, 
two additional studies were carried out, Be- 
cause we wished to clarify whether the femi- 
nine woman’s low level of nurturance was 
unique to her interaction with animals, both 
of these studies were designed to be genuinely 
interpersonal situations in which the subjects’ 
nurturant sympathies were more likely to be 
aroused, In addition, because it also seemed 
possible that feminine women might be in- 
sufficiently assertive to act out their nurturant 
feelings if the situation required that they 
take responsibility for initiating the interac- 
tion, the second study was designed not only 
to be genuinely interpersonal but also to place 
the subject into a more passive role that 
would require very little initiative or impro- 
visation and in which there would be virtually 
no ambiguity about what a subject ought to 
do if he or she wished to be nurturant. Ac- 
cordingly, the first study gave the subject 
the opportunity to interact with a human 
baby, and the second required the subject 
to listen to a fellow student who openly 
shared some of his or her unhappy emotions. 


EXPERIMENT | * 


Method 


Subjects. Subjects were 84 undergraduates (half 
males and half females) from Stanford University 
who participated in the experiment during the win- 
ter and spring quarters of 1974, All of the sub- 
jects had taken the Bem Sex Role Inventory at 
the beginning of the quarter, and all had experienced 
relatively little prior interaction with babies. One 
third of the subjects had been preselected as mascu- 
line, one third as feminine, and one third as an- 
drogynous on the basis of the androgyny t ratio, 
a difference score that measures the extent to which 


1 Carol Watson was primarily responsible for the 
design and execution of Experiment 1. She would 
like to give particular thanks to Fred Bart Astor for 
his invaluable assistance during all phases of the 
study. She would also like to express her gratitude 
to Carol Carpenter, Barbara Finn, Linda Krieger, 
Lyn Littlefield, Susie Lynn, and Dana McComber 
for serving long hours as experimenters and coders. 
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subjects distinguish between masculine and femi- 
nine personality characteristics in their self-descrip- 
tions, In general, the greater the absolute value of 
the androgyny ¢ ratio, the more a person is sex 
typed or sex reversed, with positive scores indicating 
femininity and negative scores indicating masculinity. 
The smaller the absolute value of the androgyny 
t ratio, the more a person is androgynous. An an- 
drogynous sex role thus represents the equal en- 
dorsement of both masculine and feminine personal- 
ity characteristics. 

Also participating in the study were 14 babies 
(10 males and 4 females), each of whom interacted 
with one representative of each of the six different 
sex roles. During the experimental sessions, how- 
ever, each baby was dressed in sex-neutral clothing 
and was randomly assigned to be introduced to all 
six subjects either as “David” or as “Lisa.” The babies 
ranged in age from 44 to 7 months. Naturally, great 
care was taken to protect the health and well-being 
of all of the babies who participated. 

Dependent variables, Subjects were solicited for 
a study of social responsiveness in infants and were 
then left alone with a baby for a period of 10 
minutes while we observed the interaction from be- 
hind a one-way mirror. Specifically, the subject’s 
behavior was time sampled every 10 sec by one 
of three female coders, all of whom were blind with 
respect to the subject’s sex role and all of whom 
observed an approximately equal number of mas- 
culine, androgynous, and feminine subjects of each 
sex. For each subject, the coder made 60 3-sec ob- 
servations. The following six behaviors were coded 
as present or absent: Was the subject smiling directly 
at the baby? Was the subject talking to the baby? 
Was the subject kissing or nuzzling the baby? Was 
the subject holding the baby chest to chest?2 Was 
the subject stimulating the baby in a way that in- 
volved touching (e.g, tickling, patting, stretching) ? 
Was the subject stimulating the baby in a way that 
did not involve touching (e.g, shaking a rattle, 
squeaking a toy)? These behavioral measures were 
all highly reliable, with two independent coders 
agreeing perfectly on over 90% of the observations 
for each measure. Moreover, because only nonphysi- 
cal stimulation was negatively correlated with the 
other behaviors (r= —.13), a summary measure of 
behavioral nurturance was constructed by averag- 
ing together the standard score equivalents of the 
remaining five behaviors. The mean intercorrelation 
of these behaviors was +.24, yielding an internal 
reliability (coefficient alpha) of +.62 (Nunnally, 
1967) for the composite. 

In order to derive a summary measure of subjects’ 
self-reported feelings of nurturance toward the baby, 
we also averaged together the subject’s ratings on 
seven written questions asked at the end of the 
study. These questions asked such things as how 
positive the subject’s feelings were toward the baby 
and how much the subject wanted to be affection- 
ately responsive to the baby. The mean intercorre- 
lation among these items was +.32, yielding an in- 
ternal reliability of +.82 for the composite. 
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Results 

Both the summary measure of be) -vioral 
nurturance and the summary measure ©’ sub 
jects’ self-reported feelings of nurturan: - were 
analyzed by means of a 2 (Sex of Subj. t) X 


3 (Sex Role of Subject) x 2 (Assigned 
Baby) analysis of variance, as well a: 
series of planned comparisons. The two 
gonal comparisons to be tested were tt 
lowing: (a) that feminine and androg 
subjects would both be significantly mor: 
turant to the baby than masculine su! 
and (b) that feminine and androgynous 
jects would not differ significantly fror 
another in this regard. 

The results of the overall analysis of vari- 
ance indicated a significant main effect only 
for the baby’s assigned sex, with David re- 
ceiving more behavioral nurturance than I. sa, 
F(1, 72) = 6.32, p< .02.* Thus, the pre- 
dicted effect of the subject’s sex role did ot 
reach significance in the overall analysis of 
variance for either of the dependent mea- 
sures, nor did it reach significance in the 
planned comparisons. Accordingly, our hy- 
potheses with respect to sex role failed to 
receive confirmation. 

As Spence et al. (1975) have pointed out, 
however, our use of a difference score to de- 
fine androgyny serves to obscure a potentially 
important distinction between those individ- 
uals who score high in both masculinity and 
femininity and those individuals who score- 
low in both. Moreover, they argued that those 
individuals who score low in both masculin- 
ity and femininity should not even be in 
cluded in the group defined as androgynous 
In accordance with this suggestion, the an- 
drogynous subjects in this study were subdi- ` 


2 The observers also attempted to code when the 
subject was holding the baby on his or her lap, 
but because the coding instructions confounded 
“holding the baby on the lap” with “providing 
physical support for the baby,” this behavior was 
not included in further analysis. 

3 An analysis of the individual behaviors further 
revealed that David was held chest to chest more 
than Lisa, F(1, 72) =5.81, p < .02, and that he was 
also physically stimulated more than Lisa, F(1, 72) 
= 4.86, p < .03. Perhaps college students think that 
baby girls are too fragile to be manipulated physi- 
cally, 
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on a post hoc basis into those who were 
; the Stanford median in both masculin- 
nd femininity, henceforth labeled “un- 
d ntiated,” and those who were not. As 
a lement to the analyses described earlier, 
t immary measure of behavioral nurtur- 
vas reanalyzed by means of a 2 (Sex of 

ct 4 (Sex Role of Subject) x 2 

vod Sex of Baby) analysis of variance 
jual cell frequencies as well as by the 
ct of planned comparisons described 


sults of this reanalysis revealed a 

ant main effect of the subject’s sex 

e F $) = 2.67, p= .05, with mean 

ct 9, —.15, +.18, and +.16 for 

tiated, masculine, androgynous, 

ine subjects, respectively. More- 

€ ined comparisons indicated that as 

i, masculine subjects were signifi- 

ss nurturant toward the baby than 

1¢ or androgynous subjects, ¢(68) = 

p < .05, and furthermore, that feminine 

i androgynous subjects did not differ sig- 

ni icantly from one another, ¢(68) < 1, ns. 

Firally, there was a trend for androgynous 

ts to be more nurturant toward the 

baby than undifferentiated subjects, (68) = 

Py ear a AO 

Although these supplementary results can 

only be s suggestive, they do serve to 

tually our earlier finding 

hat feminine and androgynous 

i e more nurturant toward a kitten 

than masculine males; they also suggest that 

the feminine woman’s low level of nurturance 

was situation specific and probably resulted 

from having her interact with an animal 
rather than with another human being. 


1.73 


EXPERIMENT 2 5 


The supplementary results of Experiment 
1 offer tentative support to the hypothesis 
that feminine and androgynous subjects of 
both Sexes are more nurturant than mascu- 
line subjects. Experiment 2 represents an at- 
tempt to test this hypothesis still further. 
Because it seemed possible that feminine wo- 
men might be most capable of acting out all 
of their nurturant feelings in a situation where 
they did not have to take responsibility for 
initiating and sustaining the interaction, as 
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they had to do with both the kitten and the 
baby, the situation in Experiment 2 was 
designed not only to be genuinely interper- 
sonal but also to place the subject into a 
more passive or responsive role. Accordingly, 
Experiment 2 was designed to evoke sym- 
pathetic and supportive listening on the part 
of the subject but, at the same time, did not 
require the subject to play an active or in- 
itiating role in the interaction. 


Method 


Subjects. Subjects were 84 undergraduates (half 
males and half females) from Stanford University 
who participated in the experiment during the spring 
quarter of 1974. All had taken the Bem Sex Role 
Inventory at the beginning of the quarter, with one 
third of each sex preselected as masculine, one third 
as feminine, and one third as androgynous on the 
basis of the androgyny t ratio, 

Three undergraduate experimenters and four un- 
dergraduate confederates also participated in the 
study. They interacted with subjects of their own 
sex only, were blind with respect to the subjects’ 
sex roles, and interacted with approximately equal 
numbers of masculine, androgynous, and feminine 
subjects. 

Procedure. Two males or two females, one of 
whom was a subject and one of whom was a con- 
federate, participated together in a study whose 
alleged purpose was to find out “whether you be- 
gin to feel close to another person primarily be- 
cause of the things you learn about the other per- 
son, or primarily because of the things you tell 
that other person about yourself.” The two “sub- 
jects” drew lots to allegedly determine who would 
take the role of “talker” and who, the role of “lis- 
tener,’ but in fact, the confederate always served 
as the talker and the subject always served as the 


4 All probability levels for the £ values in this 
article are based on two-tailed tests of significance. 
In addition, because of unequal cell frequencies in 
some comparisons, all comparisons for the sexes 
combined are based on individual cell means for 
males and females considered separately. 

5 Wendy Martyna was primarily responsible for 
the design and execution of Experiment 2. She would 
like to give particular thanks to Dorothy Ginsberg, 
a senior honors student in psychology who collab- 
orated in every phase of the study. She would also 
like to express her gratitude to the experimenters, 
Richard Rector, Janet Langston, and Maria Rod- 
riguez, who also coded the subjects’ nonverbal be- 
havior and helped in scheduling subjects for the ex- 
perimental sessions, and to the confederates, Sharon 
Biagi, Susan Mercure, Fred Behling, and John Men- 
singer, who also coded the subjects’ nonverbal be- 
havior and aided in the debriefing. 
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listener. The experimenter explained that the talker’s 
role was simply to talk about himself or herself 
for approximately 10 minutes. The talker was told 
to begin with some general background information 
and, at the experimenter’s signal, to move on to 
more personal material. In contrast, the listener’s 
role was primarily to listen, Thus, the listener was 
allowed to ask questions or make comments but 
never to shift the focus of the conversation to him- 
self or herself. 

After explaining the roles of talker and listener, 
and after pointing out that he or she would be able 
to hear the conversation from behind the one-way 
mirror, the experimenter left the room. At that 
point, the talker began, somewhat shyly and ner- 
vously, to deliver a memorized script. The first half 
of the script dealt with relatively casual background 
information (e.g, major, hometown, summer job, 
new apartment, etc.), whereas the second half was 
more personal. In general, the talker described him- 
self or herself as a recent and rather lonely trans- 
fer student to Stanford University. He or she talked 
about problems such as missing old friends, the 
difficulty of making new friends now that cliques 
had already been established, and having to spend 
more time alone than he or she would have liked. 
In short, the talker described feelings common to 
many new transfer students. He or she did not 
seem neurotic, just somewhat isolated, and rather 
pleased to have this opportunity to share some of 
his or her feelings with another person, 

After the talker had finished, the experimenter 
returned with questionnaires for the two “subjects” 
‘to fill out. He or she mentioned that the experi- 
ment was now over and that the subjects were no 
longer expected to play the roles of talker and lis- 
tener. The subjects were then leit alone in the room 
again while they filled out their questionnaires. 

After they had both finished filling out their 
questionnaires but before the experimenter had re- 
turned to the room, the confederate/talker turned 
to the subject and said, “I really feel better after 
talking to you, It’s too bad we didn’t get a chance 
to talk longer.” This statement was designed to pro- 
vide the subject with his or her only opportunity 
to be responsive outside of the listener role. Fol- 
lowing the subject’s response, the experimenter re- 
turned to the room and began the debriefing process. 

Dependent variables. As in Experiment 1, a sum- 
mary measure of behavioral nurturance was devel- 
oped by averaging together the standard score 
equivalents of five individual behaviors, The inter- 
correlations among these behaviors were all posi- 
tive, with a mean of +.19, thereby yielding an in- 
ternal reliability (coefficient alpha) of +.54 for the 
composite. The five behaviors were as follows: 


1 and 2. Head nods and facial reactions (non- 
verbal responsiveness): Listeners who nodded their 
heads or changed their facial expression in re- 
sponse to what the talker was saying seemed, in 
most instances, to be listening to the talker and to 
be involved. Accordingly, the experimenter time 
sampled the subject’s behavior every 15 sec and 
recorded whether or not each of these two behaviors 
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occurred. Both measures were highly re , Wil 
two independent coders agreeing perfect ov 
93% of the 40 5-sec observations for each wior, 
3. Verbal responsiveness: Although the li r Wi 
instructed to keep the focus of the conver m on 
the talker, he or she was allowed to ask tions 
and to make comments, Moreover, if the bject 
had said nothing during the entire 10 min: the 


interaction would have been very strained : 
talker would have been made to feel extrem 
comfortable. Accordingly, the interaction v 
corded on audiotape and was later scored | 
subject’s number of verbalizations. This meas 
highly reliable, with a correlation of +.92 b 
two independent coders 

4. Positive reaction to the talker’s implicit 
for further contact: The subject’s response 
statement, “I really feel better after talking t: 
it’s too bad we didn’t get a chance to talk lo: er, 
was coded into one of five categories: (1) n 
sponse, (2) acknowledgement that the talker 
spoken, (3) sympathetic response, (4) interes! 
pressed in getting together again, and (5) sp 
time offered for getting together again, Indepen: 
coding yielded 100% perfect agreement on 
measure, 

5. Degree of nurturance as perceived by others: 
Both the experimenter and the confederate/talk er 
independently rated the subject on a 5-point scale in 
terms of “how nurturant” he or she had been to 
the talker. Because they agreed within a single scale 
point on 96% of the subjects, their ratings were 
averaged together, 


the 


In addition, in order to derive a summary score 
of subjects’ self-reported feelings of nurturance to- 
ward the talker, we also averaged together the sub- 
ject’s ratings on four written questions that asked 
“how close” the subject felt to the talker, “how 
much of a real identification or empathy” he or 
she felt with the talker, “how concerned” he or shë 
felt about the talker, and “how eager” he or she 
would be to talk to the talker “in a real situation 
(not an experiment).” The scale ranged from 1 
(not very much) to 6 (extraordinarily). The mean 
intercorrelation among these items was +.63, yielding 
an internal reliability of +.87 for the composite, 


Results 


The composite measure of behavioral nur- 
turance, the individual nurturant behaviors, 
and the subjects’ self-reported feelings of in- 
volvement with the talker were all analyzed 
by means of a 2 (Sex of Subject) x 3 (Sex 
Role of Subject) analysis of variance as well 
as by a series of planned comparisons. The 


® We also attempted to code eye contact during 
the conversation, but because virtually every sub- 
ject made eye contact during every observation pe- 
riod, this variable was eliminated from further 
analysis. 
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two orthogonal comparisons to be tested were 
1 ih (a) that feminine and androgy- 
n ibjects would both be more nurturantly 


re ive to the talker than would mascu- 
li hjects and (b) that feminine and an- 
d wus subjects would not differ signifi- 
ca rom one another in this regard.” 


irst rows of Tables 1 and 2 present 

sores on the composite measure of 

| nurturance for the masculine, an- 

, and feminine males and females, 

‘ly, As the tables suggest, there was 

int main effect of the subject’s sex 

his measure, F(2, 78) = 6.55, p= 

nean nurturance scores of —.29, 

.25 for the masculine, androgy- 

feminine subjects, respectively. 

planned comparisons indicated 

predicted, masculine subjects were 

tly less nurturant than feminine or 

nous subjects, ¢(78) = 3.37, p < .002, 

thermore, that feminine and androgy- 

subjects did not differ significantly from 

on another, ¢(78) = 1.38, ns. This pattern 

of .esults replicates the supplementary find- 

ings of Experiment 1, in which subjects inter- 
acted with a baby. 

in analysis of the individual measures that 

were scored during the subject’s interaction 

with the talker provides a close-up of what 

behavior fferentiated the three groups 

lower portions of Tables 1 

y the means for each of these 

. The overall analyses of variance re- 

ver od ificant or near-significant main ef- 

fects of the subject’s sex role for head nods, 

F(2,78) = 3.08, p = .05, for verbalizations, 


TABLE 1 


MALE Nurturance TOWARD A LONELY 
FELLOW STUDENT 


Subject 
Mascu- Androgy- Femi- 

line nous nine 

Summary 

Observed nurturance —.43 —04 +05 
Self-reported nurturances 345 3.80 3.64 

26 3.0 3.1 

421. 56L -604 

offerb 3A 35 3.3 

Head nods® 17.1 18.3 19.9 

Facial reactionse 87 104 114 


ve For all subjects, n = 14, 
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TABLE 2 


FEMALE Nurturance TOWARD A LONELY 
FELLOW STUDENT 


Subject 
Mascu- Androgy- Femi- 
Measure line nous nine 
Summary 
Observed nurturance —.16 +11 +43 
Self-reported nurturance 3,59 3.45 4.13 
Individual 
Global rating» 2.8 3.1 3.3 
Verbalizations 51.6 618 64.1 
Behavioral offer 2.9 3.1 3.4 
Head nodse h 19.6 22,9 27.1 
Facial reactionse 12.5 12,2 15.0 


Note. For all subjects, n = 14, 

a Max = 6. 

b Max s 

e Max = 40. 

F(2, 78) = 3.05, p = .05, and for the global 
rating of nurturance given to the subject by 
the experimenter and the talker, F(2, 78) = 
2.76, p < .07. In addition, planned compari- 
sons for each of these measures indicated that 
masculine subjects were significantly less nur- 
turant than feminine or androgynous subjects, 
#(78) > 2.14, in each comparison, p < .05; 
and furthermore, that feminine and androgy- 
nous subjects did not differ significantly from 
one another, Finally, with respect to sub- 
jects’ self-reported feelings of nurturance to- 
ward the talker, feminine women described 
themselves as feeling significantly more nur- 
turant than androgynous women, ¢(78) = 
2.18, p < .05, but none of the other planned 
comparisons approached significance for either 
sex, nor was there a significant main effect of 
sex role for this variable. 

In addition to these findings with respect 
to sex role, the overall analyses of variance 
also revealed a number of significant main 
effects of the subject’s sex. Specifically, there 
was a significant main effect of sex for the 
composite measure of observed nurturance, 


7In parallel with the data analysis of Experiment 
1, the data from Experiment 2 were also reanalyzed 
with the androgynous subjects subdivided on a post 
hoc basis into those who were below the masculinity 
and femininity medians and those who were not. 
In contrast with Experiment 1, however, no con- 
sistent differences emerged between the androgynous 
and “undifferentiated” subjects in Experiment 2, 
Perhaps this is because the supportive listening situa- 
tion did not require the subject to play an active 
or initiating role in the interaction. In any case, 
these supplementary analyses are not discussed 


further. 
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F(1, 78) = 4.83, p < .03, for facial reactions, 
F(1, 78) = 6.90, = .01, and for head nods, 
F(1, 78) = 8.50, p = .005, with females be- 
ing significantly more responsive than males 
on each, It is interesting to note that the only 
two individual behaviors revealing a sex dif- 
ference (i.e., nodding one’s head and alter- 
ing one’s facial expression) both seem to rep- 
resent the individual’s style of moving his or 
her own body and are not particularly sub- 
stantive. No other main effects or interactions 
approached significance. 


GENERAL DISCUSSION 
Instrumental and Expressive Functioning 


As noted in the introduction to this article, 
the two studies reported here represent an 
attempt to broaden the empirical base upon 
which our earlier conclusions about sex typ- 
ing and androgyny were founded. Although 
the results that emerged from the reanalysis 
of the baby study can only be seen as sug- 
gestive by themselves, the two studies taken 
together conceptually replicate the low nur- 
turance of the masculine male, and even more 
importantly, they demonstrate that the low 
nurturance of the feminine female does not 
extend beyond her interaction with animals. 

In addition, we would now like to pull to- 
gether all of our findings on the instrumental 
and expressive domains in an attempt to reach 
some tentative conclusions about the effects 
of sex typing and androgyny. We will begin 
with the men because their data have been 
so consistent. Quite simply, only androgynous 
males were high in both the instrumental and 
the expressive domains; that is, only androgy- 
nous males were found to stand firm in their 
opinions as well as to cuddle kittens, bounce 
babies, and offer a sympathetic ear to some- 
one in distress. 

In contrast, the feminine male was low in 
independence, while the masculine male was 
low in nurturance. Because at least one third 

of college-age males are classified as mascu- 
line, it is noteworthy that masculine males 
were lower in nurturance than androgynous 
or feminine males whether they were inter- 
acting with a kitten, a baby, or a lonely fel- 
low student, In other words, they were rela- 
tively low in nurturance in all of the diverse 
situations that we designed evoke their 


S. L. BEM, W. MARTYNA, AND C. WATSON 


more tender emotions—to tug, if a li 
tle, on their heartstrings. 

The results for women are less isteni 
but the same general pattern emer Only 
androgynous women were high in bo > ind 


pendence and nurturance; feminine a! 
were low in independence; and mascu 
men were (in two situations) low in 
ance. Thus, for both men and won 
typing does appear to restrict one’s fi 
ing in either the instrumental or the 
sive domains. Masculine individuals © 
sexes are high in independence but 
nurturance, and feminine individuals « 
sexes are high in nurturance but low ii 
pendence. In contrast, androgynous ir 
uals of both sexes are capable of being oth 
independent and nurturant, both instrum ntal 
and expressive, both masculine and femi: ine. 


The Operational Definition of Androgyny 


As indicated earlier in this article, thers is 
currently a question in the literature con- 
cerning how best to operationalize the concept 
of androgyny. In most of our research, sub- 
jects have been defined as masculine, an- 
drogynous, or feminine on the basis of the 
difference between their masculinity and femi- 
ninity scores. In contrast, Spence et al. (1975) 
divided subjects at the medians on both the 
masculinity and femininity scales of the Per- 
sonal Attributes Questionnaire, an instru- 
ment very similar to the Bem Sex Role In- 
ventory, and thereby derived a fourfold 
classification of subjects as either masculine 
(high masculine-low feminine), feminin 
(high feminine-low masculine), androgynous 
(high masculine—high feminine), or “undiffer- 
entiated” (low masculine—low feminine). Fur- 
thermore, they presented evidence that the 
four groups differ significantly on a measure 
of self-esteem, with the androgynous group 
being highest and the undifferentiated group 
being lowest. 

In an attempt to clarify whether one or the 
other of these two definitions of psychologi- 
cal androgyny is likely to have greater utility 
for future research, we administered the Bem 
Sex Role Inventory along with a variety of 
other pencil-and-paper questionnaires, and in 
addition, we reanalyzed the results of our 
earlier laboratory studies with the low—-low 
scorers excluded (Bem, in press-a). The re- 
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sults in ‘icated that high-high and low-low 
scorers ‘id not differ significantly on the At- 
titude »ward Women Scale, the Internal- 
Exter» | Control Scale, the Machiavellianism 
Scale he Attitudes Toward Problem-Soly- 
ing § xor did the two groups differ sig- 
nific ı two of our three previous labora- 
tory . Moreover, as indicated earlier 


in c, there were no consistent differ- 
en the two groups in their re- 

to a lonely fellow student. 
D hese several negative findings, 
vers in our research were found 
ntly lower in self-esteem than 
high—h ers, and at least among men, 
t iso found to disclose less personal 
mation about themselves to others; more- 
scorers were found to be sig- 
responsive to a baby kitten 
scorers; furthermore, in both 
ce and the kitten studies, ex- 
ow-low scorers from considera- 
serve to strengthen the original find- 
ior both sexes. Finally, as indicated 
i yis article, there was a tendency 


over, low 


nif 


icant 


ea:lier in 


for 1>w~low scorers to be less nurturant to- 
ward a y than high-high scorers. 

On isis of the available evidence, 
then, we now believe that high-high and 


low-low scores 


with t 


do differ from one another, 
w scorers suffering low self- 
ome behavioral inhibition, Al- 
results do not allow us to predict 
vhen the two groups will differ, they 
do lea: us to concur with Spence et al. (1975) 
that ‘he term androgynous should henceforth 
be reserved only for those individuals who 
score high in both masculinity and femininity. 
At the same time, it is important to note that 
neither the androgynous nor the undifferen- 
tiated individual partitions his or her self- 
concept into the two categories of masculine 
and feminine. Accordingly, it remains un- 
clear whether the two groups differ funda- 
mentally in their basic assumptions about 
gender. In other words, androgynous and un- 
differentiated individuals are still alike in not 
being sex typed. 


exact? 


REFERENCE NOTE 


1. Berzins, J. I., & Welling, M. A. The PRF ANDRO 
Scale: A measure of psychological androgyny de- 
rived from the Personality Research Form. Unpub- 
lished manuscript, University of Kentucky, 1974. 
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Response-Contingent Payoffs and Cooperative Behavio 


Prisoners Dilemma Game 


Jerry I. Shaw 
California State University, Northridge 


This study compares the effect on cooperative behavior of two response 

contingent payoff structures with the invariant payoff structure of the 
standard Prisoner’s Dilemma (PD) game. One hundred male dyads played 
a 20-trial PD game that either rewarded joint cooperation by increased 
profits, punished joint competition by increased costs, both rewarded co- 
operation and punished competition, or left payoffs constant. Half of the 
pairs played for real money and half played for points. Although punishment 
led to higher cooperation than either reward or constant payoffs, players 
perceived themselves as more cooperative under the reward contingency. 
A pattern of findings supported the interpretation that a predisposition to 
compete leads to earlier exposure to and superior learning of the punishment 
contingency. It is suggested that while this facilitates cooperative behavior, 
it may undermine cooperative motivation, Finally, real-money payoffs sig- 
nificantly increased both interdyad variability and intradyad uniformity in 
cooperative behavior. This effect of reward size on response variation is 
viewed as a threat to the external validity of conflict studies that have used 


n the 


only trivial rewards. 


Much recent research on social interdepen- 
dence between parties with conflicting in- 
terests has employed two-person, non-zero-sum 
games such as the Prisoner’s Dilemma (PD). 
Although these studies have singled out a va- 
riety of factors as affecting cooperative and 
competitive behavior (see Apfelbaum, 1974; 
Becker & McClintock, 1967; Gallo & McClin- 
tock, 1965; Vinacke, 1969), critics have ar- 
gued that the conventional PD game has 
severe limitations as a simulation of inter- 
personal conflict (McClintock, 1972; Nemeth, 
1970, 1972). A fundamental problem is the 
relative inflexibility inherent in the usual 
application of the paradigm, Subjects typically 
are requested to respond repeatedly for many 
trials to a conflict situation that remains vir- 
tually unchanged over the entire set of trials. 
Unfortunately, the presentation of an invari- 
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ant conflict in which structural featur s are 
unaffected either by the players’ own «: tions 
or by the passage of time over trials may be 
too great an abstraction. To quote McClin- 
tock (1972): 


The reiteration of the same game lacks flexibility as 
a technique for assessing the relative strength of 
particular motives. Specifically, utilizing the same 
game with the same payoff structure for many trials 
provides little information as to what would happen 
if players were provided with more or less poin 
for choices that serve a given motivational dispos 
tion as related to other motivational dispositions, 
nor does it allow one to ascertain how the operation 
of several goals may be combined in a situation to 
affect the player’s choice behavior. (p. 294) 


Related to the inflexibility problem is the 
fact that a PD game posing a static conilict 
portrays conditions that seldom obtain in 
natural conflicts where the incentive structure 
may range considerably over successive en- 
counters between a pair of adversaries. A case 
in point is the arms race deadlock in which 
the potential for disastrous outcomes sky- 
rockets as a result of reciprocated hostilities. 

The invariant conflict situation used in 
typical PD studies was recently examined by 
Shaw and Thorslund (1975), who contrasted 
the effects of a variable- versus a constant- 
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i structure on cooperative game behav- 
vojects either played a standard PD 
ith constant rewards, or they played 
hree variable games in which rewards 
reased, decreased, or changed ran- 
er each trial. In each condition, the 
yoff per trial was identical, thereby 
‘erage reward size over the set of 
permitting the different reward 
p be varied independently. When 
sul syed for points it was found that 
the rewards (particularly the random 
nificantly enhanced cooperation 
re onstant rewards, although this 
attenuated when subjects 
! money. Moreover, the varia- 
es each produced significantly 
n cooperative behavior among 
( the constant-reward game. In- 
‘ional applications of the PD 
I instant reward structure with 
low-in 1i rewards (usually points), the 
h Chorstund (1975) results imply 
thi limiting to these specific condi- 
tic i y have precluded much 
use ration out the determinants of 
in addition, the increased 
vith the variable-reward 
s a caution in extrapolating 
o comparable natural con- 
ds are more meaningful as 
rable. 
her the effects of variable 
, the present investigation 
ilso to contrast variable with 


conti 

varian | 
structure 

from PD studi 
flicts where r 


re i 45 


Was í pened 


const: uds. In the Shaw and Thorslund 
(1975 Uy, reward changes were indepen- 
dent o layers’ behavior inasmuch as 
variation: reward structure occurred as a 


function of the passage of time over trials. 
This might correspond in a natural conflict to 
the case where goals are reevaluated owing to 
fatigue or in light of subsequent information. 
A second possibility would be to investigate 
the effects of reward variations that occur as 
a consequence of the specific actions taken by 
the players on preceding trials. Conflicts in 
which potential outcomes are contingent upon 
the adversaries? prior behavior are not un- 
usual and characterize, for example, the arms 
race spiral mentioned previously. 
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The primary purpose of the present experi- 
ment was to examine the effects of response- 
contingent rewards on cooperative game 
behavior. Two types of response-contingent 
reward structures were considered: (a) reward 
joint cooperation (RJC), in which joint coop- 
eration is followed by increasingly positive 
outcomes for continued cooperation, and (b) 
punish joint defection (PJD), in which joint 
competition is followed by increasingly nega- 
tive outcomes for continued competition, The 
two reward structures imply two distinct 
modes of conflict resolution. With RJC, the 
beneficial consequences of initial cooperation 
are expected to induce subsequent coopera- 
tion, whereas with PJD, it is the harmful 
consequences of initial competition that serve 
as the inducement for future cooperation. The 
reader has probably already recognized the 
structural similarity between these two modes 
of conflict resolution and the political strate- 
gies of détente and the “balance of terror,” 
respectively. A question of considerable in- 
terest, thus, is which of the two response- 
contingent reward structures will most suc- 
cessfully induce cooperative behavior? 

First, it seems likely that both structures 
can be effective. Studies that have used the 
“minimal social situation” paradigm (Sidow- 
ski, 1957; Sidowski, Wycoff, & Tabory, 1956) 
illustrate that people can learn to coordinate 
implicitly their behavior by adopting a simple 
“win-stay, lose-change” principle. This re- 
quires subjects to perseverate on a response 
that is followed by a positive outcome and to 
avoid repeating a response followed by a nega- 
tive outcome. It has been shown that as sub- 
jects behave according to this rule, they rap- 
idly establish a stable sequence of mutually 
rewarding (i.e., cooperative) responses (Kel- 
ley, Thibaut, Radloff, & Mundy, 1962). Al- 
though this type of simple accommodation in 
the typical invariant form of the PD game is 
often obscured by strategic considerations, 
such as to trust or not to trust, it may be 
that the reward contingencies of both RJC 
and PJD will act as a focal point for con- 
vergence of the players’ expectations. That is, 
the reward contingencies may be taken by the 
players to imply the “win-stay, lose-change” 
rule as a type of “prominent solution” (see 
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Schelling, 1960) to their conflict. If so, then 
both RJC and PJD should enhance the de- 
velopment of cooperative behavior. 

As to their relative efficacy, the speculation 
that RJC will be superior to PJD because of 
the well-established principle that reward is 
a more effective control technique than pun- 
ishment cannot be endorsed unequivocally. 
On the contrary, it seems plausible that PJD 
will be superior to RJC because cooperation 
to avoid increased costs may be more easily 
attained than cooperation to promote in- 
creased profits. This expectation is held for 
two reasons, one of which has to do with goals 
and the other with tactics. As to the former, 

the experimental gambling literature is rife 

with evidence that subjects would rather 
minimize their possible losses than maximize 
their possible gains of the same magnitude 

(Atthowe, 1960; Myers, Reilly, & Taub, 

1961; Myers & Suydam, 1964; Slovic & 

Lichtenstein, 1968). In fact, this negativity 

bias is so strongly held that Tversky (1967) 

has found that subjects will pay more than 

the expected value of a negative gamble so 
as to avoid the risk of loss connected with 
that gamble. With respect to tactical mat- 
ters, stable cooperation may be less attainable 
with RJC because it is never more costly to 
compete than it was at the outset (only re- 
wards are increased, not costs), and there is 
always the opportunity to exploit an unsus- 
pecting opponent. With PJD, however, the 
temptation to exploit the other should be mod- 
erated by knowledge that joint competition 
leads to increasingly higher costs. 

The aforementioned considerations pro- 
vided the basis for the present PD study 
wherein (a) the effects of a variable versus 

a constant reward structure on cooperative 
game behavior were further explored, and (b) 
a response-contingent reward structure in 
which joint cooperation led to higher possible 
profits was contrasted with one in which joint 
competition led to higher possible costs. Four 
variations were examined including RJC (in- 
creased outcomes for joint cooperation follow- 
ing prior joint cooperation), PJD (decreased 
outcomes for joint competition following prior 
joint competition), RJC + PJD (a combined 
structure including both the reward and pun- 
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ishment contingencies), and NC 
control containing neither conting 
much as previous research (Gall: 
1971; Shaw, 1972; Shaw & Thors 
has shown that the presence of re 
payoffs can have diverse effects 
playing behavior, each of the f 
structures was run under two co! 
absolute reward size—high (real nv 
low (points). These manipulations co 
the eight treatment combinations oi 
x2 factorial design. Included ar 
size (high vs. low), presence versu 
of reward for joint cooperation (RJ‘ 
PJD vs. PJD, NC), and presence vi 
sence of punishment for joint coi 
(PJD, RJC + PJD vs. RJC, NC). ! 
lyzing the data in this way it is poss: 1e not 
only to assess the effects of RJC an PJD 
separately but also to examine the inte» ction 
between the reward and punishment struc 
tures. As reasoned earlier, it was ex ected 
that both RJC and PJD would produce more 
cooperation than the NC control and that 
cooperation would be higher for PJD n for 
RJC. In terms of the above analysis 
would be reflected most clearly as a 1 
effect for the presence-of-punishment variable. 
No specific predictions were made as to an 
interaction between the two contingency 
structures or in relation to the reward-size 
variable. 


i 


METHOD 
Subjects 


A total of 160 male students enrolled in intro- 
ductory psychology classes at California State Uni- 
versity, Northridge, participated in this experiment 
in partial fulfillment of their course requirements: 
Ten dyads were randomly assigned to each of eight 
treatment combinations with the proviso that per- 
sons who were previously acquainted would not be 
paired in the same dyad. Immediately after the 
primary experiment was conducted, an additional 40 
subjects were assigned in pairs to two appended 
control groups, the purpose of which will be ex- 
plained later in the results section. 


Prisoner's Dilemma Matrices 


A number of different matrices were used in the 
present study. The precise number varied among the 
experimental conditions insofar as the values in the 
matrices were altered according to the prior joint 
behavior evidenced by the two players in each 
dyad. Table 1 displays the basic game matrix from 
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ubsequent matrices were derived. In all 
matrix was presented to each dyad on 
» the NC condition, this matrix was re- 


peated without alteration for 20 trials, thereby con- 
stitutit - usual invariant form of the PD game. 
In the condition, a rule of progression was 
impler uch that the values in the joint-coop- 
eratior ellow-yellow) were increased by one 
unit joint cooperation on the immediately 
preced Each of the other response combina- 


tic sir of players had no effect on any of 
the matr ues on the later trials. In brief, the 


RJC n sion was such that joint coopera- 
tion on vas rewarded by increased values for 
joint c ı on trial n +1, whereas any other 


ion on trial m had no effect on the 
trial a +1. 
rule of progression in the PJD 
that the values in the joint- 
ue-blue) were decreased by one 
7 competition on the immediately 
prec vefore, each of the other response 
o effect on any of the matrix 
trials. Stated briefly, the PJD 
as such that joint competition 
‘ished by decreased values for 
trial n+ 1, whereas any other 
on trial » had no effect on 
anti. 
wy rule of progression was 
PJD condition, which in- 


respo 
maty 


combi 
value 
rule of } 


on t 


joint com 
response 
the matrix 
Fi 
utilize 


cluded nd PJD contingencies, That 
is, eithe: ion or joint competition on 
trial n wi by increased values or pun- 
ished by decrca values, respectively, on trial 
n-+1, whereas all er response combinations had 
no effect on the ma!ix values on trial n+ 1. 
Procedure 


experim isisted of 20 consecutive trials 


OF the ects were scheduled in pairs 


and wert i on either side of an opaque parti- 
tion, whi vented them from seeing one another. 
No! communication was permitted. Instead, each 
player w with a panel switch containing 
two option: or blue. By moving his panel 
Switch towar color, each subject indicated his 
choice in the r ix. The experimenter stood visibly 
in front of the players behind a partitioned desk 


upon which the PD game equipment rested. This 
consisted of a display box divided into four equal 
quadrants within which a sequence of cardboard 
matrices could be presented. 2 
ee Seated, the pairs were given a stan- 
ign oD) lanation of the PD game. The instruc- 
ae Aires neutral regarding motivational orienta- 
s Sean was made of a specific goal. If 
the fala ald to elicit this information from 
ae menter, they were told to make their own 
FAE a a to such matters. Moreover, the subjects 
Stimulus eae in advance of possible changes in 
ie 5 values in their matrix. Instead, the opera- 
of the panel switches and the format of the 
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TABLE 1 
Basic Game MATRIX 
Piyerte ___ Player 2's choices 
choices Yellow Blue 
Yellow 10, 10 ~25, 35 
Blue 35, —25 00 


Note, The first value in every set is the payoff to Player 1, 
In the no-change control group the matrix values were pres 
sented unchanged over 20 trials; in the RJC (reward joint 
cooperation) group the matrix values for joint cooperation were 
increased only after prior joint cooperation; in the PJD (punish 
joint defection) group the matrix values for joint competition 
were decreased only after prior joint competition; in the 
RJC + PJD group the matrix progressions utilized in RJC and 
PJD were each implemented, 


matrix presentation was described to the subjects. 
Each of the four possible outcomes on a trial was 
demonstrated and questions were answered to assure 
comprehension of the game. In all conditions, the 
NC matrix was used for the purpose of this demon- 
stration, 

In the high-reward condition, all subjects weré 
initially paid $1 for their participation upon arrival 
at the laboratory. They were informed that although 
this was theirs to keep, it would be possible either 
to earn more money or to lose part or even all of 
their initial payment by the end of the experiment. 
Subjects were told that the values in the matrix 
represented points for which they would receive 
symbolic cardboard chips in both “profit” and “cost” 
denominations. Each subject was to subtract his 
costs from his profits at the end of the trials, and 
any profits remaining were to be remunerated in 
real money at an exchange rate of 2:1. This would 
be in addition to their $1 payment, Thus, perfect 
cooperation would bring the pair a total shared profit 
of between $6 (NC and PJD groups) and $9.80 
(RJC and RJC+PJD groups). If costs exceeded 
profits, however, the total loss was to be halved and 
charged against the subjects’ initial payment up to 
a maximum loss of $1. In the low-reward condi- 
tion, subjects were not given any cash stake but 
were merely told that the values in the matrix rép- 
resented points for which they would receive sym- 
bolic, nonredeemable cardboard chips. 

After the instructions were completed, the actual 
trials began. A signal light located on top of the 
PD display box indicated the onset of each of the 
20 trials. This signal light was terminated after each 
subject had made his own choice with his panel 
switch, Simultaneously, the players were informed 
of their joint behavior by a decision light, which 
illuminated that quadrant of the matrix correspond- 
ing to their mutual choices. Subjects were given 
immediate payoffs in symbolic chips, and the next 
trial was initiated. There was no cumulative feed- 
back of outcome information over the trials. The 
matrix was either changed or not changed prior to 
each trial according to the progression rule in effect 
at the time. At the end of the trials, all subjects 
completed a brief questionnaire designed to tap their 
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perceptions during the experiment. They were then 
debriefed and they pledged not to reveal the experi- 
mental procedures to other potential subjects. 


RESULTS 
Level of Cooperation 


The major dependent variable of coopera- 
tive behavior was analyzed by summing the 
number of individual cooperative choices elic- 
ited from each dyad and then converting the 
total to percentages. These data are shown in 
‘Table 2. Analysis of variance revealed a sig- 
nificant effect only for the punishment con- 
tingency, F(1,72) = 7.06, p< .01} with 
subjects behaving more cooperatively when 
punishment was present (40.6%) than when 
it was absent (29.2%). No other main effects 
or interactions approached significance. A 
parallel analysis computed with joint coopera- 
tive choices as the dependent variable re- 
vealed an identical pattern of findings except 
that the punishment contingency main effect 
was of borderline significance, F(1,72) = 
3.67, p< .07; 18.6% joint cooperation for 
punishment present versus 10.9% joint coop- 
eration for punishment absent, 

As can be seen in Table 2, the means for 
the RJC + PJD, PJD, RJC, and NC condi- 
tions considered separately are 40.1%, 41.0%, 
29.4%, and 28.9%, respectively. Planned 
comparisons using the Newman-Keuls pro- 
cedure were carried out to assess the experi- 
mental predictions that cooperation would be 
higher for RJC and PJD as contrasted to NC 
and, moreover, that PJD would be higher 
than RJC. The results of this analysis con- 
firmed that cooperative behavior was elevated 
in PJD but not in RJC as compared to NC. 


TABLE 2 


Com- 
Condition Money Points bined 
Punishment present Sth 
Reward present (RJC + PJD) 47.5 35.3 40.1 
Reward absent (PJD) 38.5 43.5 41.0 
Punishmentabsent f; 
Reward present (RJC) 29.5 29.3 29.4 
Reward absent (NC) 33.5 22,3 28.9 
Combined 37.1 32.6 34.9 
Note. RJ PJD = punish joint 


defection; NC = no change. 
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TABLE 3 
Two APPENDED CONTROL Grou! RICES 
Player 2's c! 
Player 1's —— = 
choices Yellow ue 
RJC + PJD control 
Yellow 12, 12 35 
Blue 35, —25 -3 
RJC control 
Yellow RJC, RJC 35 
Blue 35,25 
DEENA AA wuts iae n fi io Player 


In RJC + PJD ard joint cooperation n 
fection) control, the matrix values were pre 
over 20 trials; in RJC control, the matrix 
cooperation were increased only after prior joir 
(initial values: 10, 10), 


The expectation that PJD would yic 
cooperation than RJC was clearly 
Moreover, RJC + PJD produced signi: 
more cooperative behavior than either 
RJC but did not differ from PJD. The « egre 
of freedom for each Newman-Keuls co:mpath 
son were 72 and the obtained level of signifi 
cance always exceeded p< .05, excepi fo 
the RJC+ PJD versus RJC comparison, 
which exceeded p < .01. 
It is interesting to note that the combined: 
reward structure RJC + PJD did not enhanc 
cooperation more than PJD considered sepa 
rately. This result, together with the findin 
that RJC did not differ from NC, implie 
that the main effect for presence of punis": 
ment is facilitated specifically by the PJD 
response-contingent reward structure. In light 
of this, the question as to the precise mech: 
anism by which PJD increases cooperation 
merits reflection. It appears that the relative 
efficacy of PJD in contrast to both RJC and 
NC could be a consequence of either (a) the 
type of payoff contingency itself, namel: i 
“punishment,” or (b) the higher costs ovet 
trials for continued joint competition. In 
order to separate these two possible expla 
tions, two appended control groups wer" 
added immediately after completion of the 
original design. Both control groups wel 
carried out only for real money due to lack 0} 


i 


1 All p values are two-tailed. 
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available subjects. In the first group, 10 male 
dyads wore presented with the RJC + PJD 
contro! atrix displayed on the top half of 
Table ‘he values in this matrix approxi- 
mated average values obtained by all 
dyads original RJC + PJD real-money 
condi he RJC + PJD control matrix 
was ] d without alteration for a total 
of 2 \ comparison of the RJC + PJD 
cont its counterpart, the RJC + PJD 
real-r mdition, should uncover the ef- 
fects ‘iment per se because both con- 
ditior er costs, but it is only the 


latt that includes the punishment 
cont was found that the level of 
“ r significantly higher in the 
R] j l-money condition (47.5%) 
as | he RJC + PJD control con- 
ditior (18) = 2.83, p < .02, there- 


by supi ie assumption that the salu- 
tary effe veration observed when pun- 
ishment ent is mediated specifically 
by the t contingency itself. 

In itrol group, 10 male dyads 
were 1 the RJC control matrix 
display tom half of Table 3. For a 
total of ; layers were given the stan- 
dard RJC p ion of matrices, except that 
the (constant) values in the joint-competi- 
tion cell approx: nated the averages obtained 
by all dyads in ine original PJD real-money 
condition. A com .arison of the RJC control 


with the RJC -eal-money condition should 
reflect the ef Í higher costs for continued 
Joint compet i since both conditions in- 
clude the reward contingency, but it is only 
the RJ( »| that includes the element of 


higher costs. Our prior conclusion that coop- 
eration is tacilitated by the punishment con- 
tingency itself rather than by differences in 
payoff level is bolstered by the result that 
Cooperation was undifferentiated in the RJC 
Control (32.7%) and RJC real-money 
(29.5%) conditions, £(18) = 1.32, ns.? 

; What is it, then, about the punishment con- 
tingency (PJD) that makes it more effective 
than the reward contingency (RJC)? Earlier 
It was mentioned that both RJC and PJD 
might enhance cooperation because either can 
Serve as a convergent cue implying a coopera- 
tive settlement. The fact that RJC was wholly 
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TABLE 4 
MEAN Cuoice BEHAVIOR ON First Five TRIALS 


Response- Choice behavior 
Skee — — 
reward N e 
structure c&eperpe pp) ARES 
RJC + PJD 05 2.65 1.70 6.06" 
PJD 1,20 2.60 1.20 0,00 
RIC 80 1.90230 5.17 
NC 90 2.00 2.10 5.08" 
Combined -88 2:29 1.83 7.79" 


; df for combined 
n= 79. (Abbreviations: RJC ard joint co- 
PJD = punish joint defection; = no change; 
> = joint cooperation; DD = joint competition.) 

*p <.001, 


ineffective suggests that PJD worked because 
it alone served this cue function. This sup- 
position gains credence if we consider that in 
order to act as a cuing mechanism, the payoff 
contingency must be noticed early in the in- 
teraction before emerging strategies, lock-ins, 
prior commitments, and the like can render it 
ineffectual. To the extent that subjects are 
predisposed at the outset to compete rather 
than cooperate, exposure to the PJD contin- 
gency would precede that to the RJC con- 
tingency. Reasons for early dominance of 
competition include a lack of trust inherent 
in the PD game, miscomprehension of the 
dilemma choices, attempts to test the oppo- 
nent’s mettle, and so forth. Data showing that 
competitive choices do exceed cooperative 


2 Inasmuch as the idea for the two control groups 
was formulated near completion of the original 
design, two comments are in order regarding their 
proper interpretation. First, by virtue of their late 
formulation, the control groups were necessarily run 
subsequent to the experimental groups. Although this 
temporal difference renders intergroup comparisons 
more difficult, a measure of security lies in the fact 
that the control data were collected within 1 week 
of completing the original design and that the two 
control groups themselves were run in a balanced 
rather than sequential order. Second, the attempt to 
match the element of higher costs between the ex- 
perimental and control groups was not strictly possi- 
ble because the former groups were exposed to higher 
costs sequentially over trials, whereas the latter were 
exposed to higher costs at the outset. Since it is not 
known to what extent the control group findings 
reflect this temporal difference in exposure to higher 
costs rather than the operation of the punishment 
contingency itself (as has been assumed), it is per- 
haps advisable to interpret the control group fnd- 


ings with caution. 


1030 JERRY I. 
ones on early trials are provided in Table 4. 
On the average, joint competition (DD) 
occurred on 1.83 of the first five trials, where- 
as joint cooperation (CC) occurred on only 
88 of them, #(79) = 7.79, p < .001. More- 
over, the higher rate of DD choices was pre- 
sent in each experimental condition with the 
lone exception of PJD. Thus, there is strong 
evidence that subjects compete more than 
they cooperate in the preliminary stages of 
their interaction. 

Given this, it follows that they are exposed 
to the (negative) consequences of joint com- 
petition (i.e., the PJD contingency) earlier 
than they are exposed to the (positive) conse- 
quences of joint cooperation (i.e., the RJC 
contingency). Evidence that earlier exposure 
results in differential learning of the two 
payoff contingencies is already available on 
the first five trials. As can be seen in Table 4, 
DD occurs less frequently in those conditions 
that include the PJD contingency than in 
those which do not, F(1, 72) = 4.10, p < .05. 
However, the parallel observation that CC 
occurs more frequently in those conditions 
that include the RJC contingency is not sub- 
stantiated. In fact, the data are slightly, but 
not significantly, in the opposite direction 

(F < 1, ns). Differential learning of the two 
payoff contingencies remains stable over the 
entire set of 20 trials. Mean DD choices are 
less frequent when punishment is present 
rather than absent (3.71 vs. 5.30), F(1,72) 
= 4.13, p < .05. However, mean CC choices 
are not more frequent when reward is present 
rather than absent (2.83 vs. 3.06), F < 1, ns. 
Thus, while players learn to avoid joint com- 
petition where it is punished, they do not 
learn to repeat joint cooperation where it is 
rewarded, 

An analysis of the data in terms of transi- 
tion probabilities provides a trial-by-trial 
measure of differential learning of the two 
payoff contingencies. This measure indicates 
the probability of a given dyad going from 
one joint state on trial n to the same or an- 
other state on trial » + 1. If subjects are 

learning the PJD contingency, the probability 
of joint competition following joint competi- 
tion, that is, P(DD/DD), would be lower in 
the PJD and RJC + PJ}! conditions than in 
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the others. Likewise, if subjects were 
learning the RJC contingency, the probal 
ity of joint cooperation following joint 
eration, that is, P(CC/CC), would be no dif 
ferent in the RJC and RJC + PJD condition: 
than in the others. 

Evidence of learning ih 
comes from examinatio: 
state P(DD/DD), which 
value of .42 for the tw: 
conditions, and a value 
punishment-absent cond 
5.63, p < .03. Thus, th 
tendency for DD to be r 
when the PJD contingency 
lated question is whethe: 
gency induces subjects only 
DD choices or if it also l 
more frequently to CC < 
of the appropriate data ind 
cant tendency in this dir: 
P(CC/DD) values avera 
punishment-present conditions, whereas they 
average .24 for the punishment-absent condi- 
tions, F(1,72) = 5.30, p < .04. 

Evidence of not learning the RJC contin 
gency comes from examination of the transi 
tion state P(CC/CC), which reveals thal 
subjects do not repeat joint cooperation mort 
where it is rewarded in contrast to where i 
is not rewarded (.44 vs. .43; F < 1, ns). Fur- 
thermore, the P(DD/CC) transition value: 
are also unaffected by the RIC contingenc: 
(reward present = .34 vs i 
32; F <1, ns), thereby 
warding joint cooperation 
inhibiting the players’ DD 


PJD contingene 

of the transi! 

ields an aver: 

‘nishment-present 

59 for the tw 
F(1, 72) 

a signifi 

ited least ofte 

erative, A re 

PJD contin: 

oid repea' 

vem to § 


Perceived Cooperation 


The postquestionnaire 
rate their cooperativeness 
game on an 11-point bipola 
the extremes by “0” (extr 
tive) and “10” (extremely 
(b) in the future by proj 
trials out of another 20 they would play co 
operatively. These ratings revealed that sul 
jects with high rewards as contrasted to tho: 
with low rewards perceived themselves 
more cooperative (6.2 vs. 5.0), F(1, 72) 
5.25, $ < .03, and projected that they would 


uncooper 
rative), ani 
ng how man; 
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continue to be more cooperative in the future 
(8.5 vs. §.2),F(1, 72) = 6.47, p < .025. Thus, 
althoug ‘here was no difference in level of 
cooper: s a function of reward size, 
there v varked discrepancies in perception 
of coop n as related to this variable. 


In t Í the response-contingent reward 
structu ibjects perceived themselves as 
least cox tive in the NC condition (4.6) 
cooperative in the RJC + PJD 

), F(1,72) = 11.16, p < .001. 
tern occurred in their estimates 
eration with least in the NC 
conditior and most in the RJC + PJD 
condi , F(1,72) = 10.59, p < .001. 
More importsily, although subjects tended 
ect h r cooperation in the two pun- 
t-presss. conditions (8.8) in contrast 

nent was absent (5.9) F(1, 


and as 1 
conditio 
The s 
of futi 


to whe 


72) = ¢ 001, they reversed this tend- 
ency on { eived own cooperation mea- 
sure rati iselves as more cooperative 
in the ditions where reward was 
present contrast to when it was 
abse *(1, 72) = 12.39, p< .001. 
Hence of future cooperation are 
consister ictual cooperation, whereas 


perception í ual cooperation is not. 
Reward Size 


Ì Variability in Cooperative 
Behavior 


Although the reward-size variable exerted 


strong effect perceived cooperation, it did 
not influence average level of cooperative be- 
havior, F(1 ) = 1.12, p < .25. The latter 
result is not prising insofar as studies that 
have attempted to link reward size and co- 


Operative behavior have obtained negative 
support (Gumpert, Deutsch, & Epstein, 1969; 
Oskamp & Kleinke, 1970), positive support 
(Gallo, 1966: Oskamp & Perlman, 1965; 
Radlow, Weidner, & Hurst, 1968), as well as 
no relationship (Evans, 1964; Wrightsman, 
1966) between these two variables (for a 
review, see Shaw, 1972). : 
Even though reward size is not consistently 
related to average level of cooperative behav- 
lor, it may have a more reliable effect on 
variability of cooperative behavior. Two prior 
PD Studies that have examined the relation- 
ship between reward size and variance both 
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found significantly higher interdyad variance 
of cooperative responses when subjects played 
for real money as opposed to points (Knox & 
Douglas, 1971; Shaw & Thorslund, 1975). 
The results of the present study provide a 
third confirmation of this effect inasmuch as 
the overall interdyad variance was 69.2 in the 
money condition and 31.5 in the points 
condition, F(39, 39) = 2.19, p < 01. 

Another way to investigate the effects of 
reward size is to examine the tendency to 
reciprocate cooperation within dyads by cor- 
relating the number of cooperative choices for 
each member of a pair over all such pairs. 
The higher the correlation, the stronger is the 
reciprocation level. Data indicated that the 
correlation was .71 for money and .34 for 
points. Using the R to Z transformation as 
recommended in Hays (1970, p. 532), the dif- 
ference between the two correlations yielded 
a value of 2.32 (p < .02). Thus, even though 
large rewards had no effect on average level 
of cooperation, they tended to maximize dif- 
ferences among dyads and to minimize differ- 
ences within dyads. 


DISCUSSION 


The primary purpose of this investigation 
was to compare the effect on cooperation 
of two response-contingent payoff structures 
with the invariant payoffs used in the con- 
ventional PD game. Although it had been 
assumed that both RJC and PJD would pro- 
mote cooperative behavior, the results indi- 
cate that only the latter was effective. A 
pattern of evidence supports the interpreta- 
tion that the efficacy of PJD in contrast to 
RJC resides in earlier exposure to and differ- 
ential learning of the two payoff contingencies. 
Insofar as subjects are predisposed at the 
outset to compete rather than cooperate, it 
follows that they will encounter the negative 
consequences of joint competition prior to the 
positive consequences of joint cooperation. As 
a result, the two payoff contingencies are 
differentially learned such that DD is re- 
peated least often where PJD is operative, 
but CC is not repeated most often where RJC 
is operative. Moreover, the PJD contingency 
is most likely to induce a transition from DD 
to CC, whereas the RJC contingency is not 
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most likely to prevent a transition from CC 
to DD. If, as it seems, the crucial aspect is 
one of timing, then the relative superiority 
of PJD may be largely offset by procedures 
that ensure equivalent exposure to each pay- 
off contingency. This possibility is amenable 
_ to empirical verification and could be tested 
by comparing groups explicitly informed 
about the two payoff contingencies prior to 
their interaction with other groups which are 
not informed. 

A different point to be considered in this 
study relates to the measure of cooperative 
behavior. It will be recalled that the facilita- 
tive effects of the punishment contingency 
were observed most strongly with individ- 
ual cooperation as the dependent variable 
(p < .01), whereas in terms of joint coopera- 
tion, the effect of punishment attained only 
borderline significance (p < .07). From the 
viewpoint of conflict resolution, this implies 
that the punishment contingency has its pri- 
mary advantage in the prevention of mutual 
harm rather than in the promotion of mutual 
benefit. If this observation is valid, it helps 
us to understand the apparent paradox that 
although subjects cooperated more under the 
punishment contingency, they perceived them- 
selves to be more cooperative under the re- 
ward contingency. An interpretation consist- 
ent with the above observation is that the 
higher cooperative behavior in response to 
punishment is not “true” cooperation (i.e., an 
attempt to maximize joint gain) but is merely 
an instrumental response to a more salient 
payoff contingency. Recognizing this, players 
may have been reluctant to conclude that 
their cooperative behavior reflects a coopera- 
tive intention. Thus, while punishment may 
facilitate cooperative behavior, it may impair 
the development of cooperative motivation. 


This possibility deserves further consideration, 
especially in applied settings wherever at- 
tempts are niade to induce cooperation through 


the use of 1 ds and/or punishments. 


A final f that merits discussion is the 
relationshi een reward size and vari- 
ability in cx tive behavior, Although a 
spate of has failed to demonstrate 
that reward size is consistently related either 


to an increase or a decrease in average level 
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of cooperation, there is growing ev- nce that 
reward size is more reliably relate. to vari- 
ability in cooperative behavior. usistent 
with previous findings (e.g., Knox &  ‘ouglas, 
1971; Shaw & Thorslund, 1975), results 
of the present study reveal that si cantly 
higher interdyad variability in coope: ive re- 
sponding is displayed by subjects pl: ag for 
real money in contrast to those pla: ig for 
points. Three independent demonstrat ns of 
increased variability with high reward must 
be considered in the debate over th: ful- 
ness of trivial rewards, which chara: vizes 
much PD research, The present autho: akes 
issue with those investigators who have ter- 
preted their findings of a negative re tion 


between reward size and level of coope- tion 
(e.g., Gumpert, Deutsch, & Epstein, | 69; 
Oskamp & Kleinke, 1970) as dispelling or- 
ries over the generalizability of PD resc rch 
utilizing small payoffs. Indeed, the exto-nal 
validity of studies employing only small | 1y- 
offs such as points or imaginary money is 
called seriously into question by the fact 1 at 
interpretation of between-conditions dior 
ences is likely to be invalid when applied <0 
natural conflicts in which more heteroge: 2- 
ous responses may be anticipated. Moreover. 
the present findings, which show that large 
rewards increase the extent of reciprocate: ! 
cooperation within dyads, shed further ligh 
on why the attempt to relate reward size to 
average level of cooperation has led to utter 
confusion, We should not expect large rewards 
either to facilitate or to inhibit conflict be- 
havior. Rather, we should remain alert to 
both possibiilties because for any pair of ad 
versaries, large rewards will tend to homoge 
nize their interaction pattern in whicheve 
direction it takes. 
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When Actions Reflect Attitudes: The Politics of 
Impression Management 


Mark Snyder and William B. Swann, Jr. 
University of Minnesota 


A conceptual formulation of the determinants of correspondence between at- 
titudes and behavior suggests that social environments differ in the extent to 
which they provide salient and relevant “attitudinal” and “situational” guides 
to action. We constructed experimental situations that differed in the extent to 
which interpersonal cues to situational appropriateness were available and/or 
relevant attitudes were made salient. Male students formulated judgments of 
liability in a sex-discrimination court case. In this basic situation, verdicts 
were generally unfavorable to the female plaintiff and uncorrelated with previ- 
ously reported attitudes (r= .07). When attitudes toward affirmative action 
were made salient, covariation between favorability of verdicts toward the 
female plaintiff and previously measured attitudes was substantial (r= .58). 
Participants who anticipated discussing their verdicts with a disagreeing partner 
adopted a “moderation” strategy and reached decisions favorable to neither the 
plaintiff nor the defendant. Their verdicts were uncorrelated with their per- 
sonal stands on affirmative action, whether or not attitudes had been made 


salient (rs = .14 and .06, respectively). 


To what extent are the attitudes expressed 
by individuals after contemplating their posi- 
tions on social issues actually reflected in 
their actions in relevant life situations? Al- 
though most theoretical viewpoints have 
stressed the links between attitude and be- 
havior, empirical researchers have all too often 
reported weak and inconsistent relationships 
between verbal measures of attitude and ob- 
servations of social behavior (for reviews, 
see Fishbein & Ajzen, 1975; Kiesler, Collins, 
& Miller, 1969; Wicker, 1969). Just as it is 
the case that avowed attitudes are not always 
reflected in actual behavior, so too is it the 
case that changes in attitude are not always 
translated into corresponding changes in be- 
havior (e.g., Festinger, 1964), Similarly, one 
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cannot reliably modify attitudes by alio-ing 
behavioral experience. Although it is true ihat 
under certain conditions individuals who Kave 
behaved in a manner discrepant from their 
initial attitudes may change their attitudes to 
make them more consistent with their new 
behavior, this link between behavior and atti- 
tude is often small and occasionally hard to 
demonstrate (e.g., Collins, 1973). 

One outcome of the empirical search for 
links between avowed attitudes and actual 
behavior has been a critical reassessment of 
the interplay between attitudes and behavio:. 
Two rather distinct viewpoints appear to have 
emerged: the assumption of consistency and 
the assumption of inconsistency. 

The assumption of consistency. According 
to this viewpoint there do exist reliable pre- 
dictive and causal links between attitudes and 
behavior, but these relationships are often 
attenuated or even obscured by the influence 
of other factors, including low reliability of 
measurement, differing levels of specificity 
of the measures of attitude and behavior, the 
intrusion of other attitudes and competing 
motives, and the constraints of normative so- 
cial pressures (e.g., Kiesler, Collins, & Miller, 
1969). 
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WHEN ACTIONS REFLECT ATTITUDES 


„The assumption of inconsistency. According 
to this viewpoint, human social behaviors— 
including verbal statements of attitude—are 
sufficien|'y sensitive to even subtle differences 
in situa!. onal factors and normative pressures 
to prec! the existence of reliable links 
betwee: havior and inner dispositional 
states inclo ‘ing attitudes, Thus, any apparent 
consi between measured attitude and 


obser ivior may only reflect the op- 


eratio mverging pressures within the 
verbal havioral response domains. This 
viewpo been bolstered by develop- 


ment 


personality theory that have 
utility of the trait concept 


(e.g, Misch 968). 
Pei D iinants of Consistency 

Whi f human nature is correct, 
the ass i consistency or the assump- 
tion of tency? It may be that both 
perspecti ve captured a part of reality 
but that r viewpoint is a full and ade- 
quate ch: ization of the relationship be- 
tween att and behavior. Recent empiri- 
cal resea; s indicated that people differ 
in the ext to which their social behavior 
covaries w neasures of relevant attitudes: 
Some peop’ are more consistent than others 
leg., Normon, 1975; Schwartz, 1973; Snyder 
& Tanke, i press). These differences in con- 
gruence betwcen attitude and behavior can be 
conceptualiz in terms of the social psycho- 
logical construct of self-monitoring (Snyder, 
1974). According to this formulation, an indi- 
vidual in a social setting actively attempts to 
Construct a pattern of social behavior appro- 
Pilate to that particular context. Diverse 


Sources of information are available to guide 
this choic cluding (a) cues to situational 
or interpersonal specifications of appropriate- 
hess and (b) information about inner states, 
dispositions, and attitudes. 

Individuals differ in the extent to which 
they rely on either source of information. For 
those who monitor their behavioral choices 
on the basis of situational information (high 
self-monitoring individuals), the impact of 
Situational and interpersonal cues to social 
appropriateness is considerable; they demon- 
Strate considerable situation-to-situation dis- 
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criminativeness in their social behavior (Sny- 
der & Monson, 1975). Moreover, for these 
individuals, correspondence between behavior 
and attitude is often minimal (Snyder & Tanke, 
in press). By contrast, persons who guide 
their choices on the basis of salient informa- 
tion from relevant inner states (low self- 
monitoring individuals) are less responsive to 
situational and interpersonal specifications of 
behavioral appropriateness (Snyder & Mon- 
son, 1975). For these individuals, covariation 
between behavior and attitude is quite sub- 
stantial (Snyder & Tanke, in press). 


Social Determinants of Consistency 


Similarly, it may be possible to partition 
situations according to their self-monitoring 
characteristics. Perhaps some social settings 
more than others stress the relevance and 
utility of attitudes, dispositions, and other 
personal characteristics as guides to action, 
These settings should foster behavior char- 
acteristic of the low self-monitoring indi- 
vidual: In such settings, covariation between 
measures of attitude and observations of so- 
cial behavior should be substantial, and the 
impact of situational and interpersonal cues to 
appropriateness of social behavior should be 
minimal. 

It may be possible to strengthen the link 
between attitude and behavior by construct- 
ing a situation that encourages individuals to 
reflect privately upon and articulate their 
attitudes and the behavioral implications of 
these viewpoints before choosing a course of 
action. Such a procedure of increasing the 
salience of relevant attitudes might have sev- 
eral of the following effects, each of which 
ought to increase the correspondence between 
attitude and behavior: 


1. To the extent that an individual actor’s 
attention is normally focused on the demands 
of coping with the immediate situation, any 
procedures that increase attention to inner 
states will also increase the probability that 
such inner states can serve as guides to action. 

2. To the extent that one ponders and re- 
flects on one’s attitudes, it is more likely that 
one will be aware of the behavioral implica- 
tions inherent in one’s viewpoints (cf. Cart- 


wright, 1949). 
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3. To the extent that inconsistencies exist 
between the cognitive, affective, and inten- 
tional components of one’s attitude, proce- 
dures that increase the salience of these atti- 
tudes may also lead to resolutions or elimi- 
nations of these inconsistencies and therefore 
place the individual in a better position to 
use attitudes as relevant and functional 
guides for monitoring behavioral choices (ci. 
Norman, 1975). 

4. To the extent that one considers and 
consolidates one’s viewpoints or attitudes, 
one’s sense of personal responsibility for one’s 
actions (cf. Schwartz, 1973) and one’s aware- 
ness of self as a potential cause of behavior 
(cf. Wicklund, 1975) are enhanced, 

5. To the extent that one is encouraged to 
consider and contemplate one’s attitudes, any 
processes of avoiding and reinterpreting com- 
mitment (cf. Kiesler, Roth, & Pallak, 1974) 
will be short-circuited, and it will not be pos- 
sible to define one’s attitudes as irrelevant to 
the action choices at hand. 


As a result of these specific effects of atti- 
tude salience, when an actor adopts a con- 
templative orientation to choosing a course of 
action, the relatively impulsive situation in 
which “behavior” is observed may become 
more similar to the reflective situation in 
which “attitudes” are typically measured (cf. 
Harré & Secord, 1973). It is then more likely 
that the cognitive structures that guide ac- 
tions will match those from which verbal 
Statements of attitude are generated. 

Similarly, it ought to be possible to con- 
struct other social situations that promote 
the behavioral orientation characteristic of 
the high self-monitoring individual. Individu- 
als may be particularly motivated to monitor 
their behavioral choices on the basis of salient 

and relevant situational cues in settings that 
motivate a strategic approach to social inter- 
action (cf. Goffman, 1959). Considerable re- 
search on self-presentstion suggests that the 
way an presses 
dyadic interaction is very sensitive to char- 
acteristics of the r person (e.g., Cialdini, 
Levy, Herman, & Fivenbeck, 1973; Jones, 
1964; Newtson é lins'-v, 1974). In such 
“Gmpression-mana nt ituations, words 
and deeds seem n not so much 


individual 


cho 


attitudes in. 
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for the information they commu: > about 
one’s actual attitudes but rath: ye their 
strategic value in winning friend i influ- 
encing people. In settings that » ivate a 
strategic orientation to social inter om, be- 
havior ought to be quite sensitiv: social 
and interpersonal cues to situatio! ppro- 
priateness; at the same time, cova! n be- 
tween social behavior and individual > ‘tudes 
may be minimal. 

To investigate these notions, we © mined 
the relationship between measured a'. tudes 


toward affirmative action and later \ rdicts 
in a court case involving a female j»..intiff 


who alleged that she had been a victim f sex 
discrimination in hiring. These behavio ! de- 
cisions were formulated in anticipatio: of a 
discussion of the verdict with a partne For 
half of our participants, attitudes to «rds 
affirmative action were made salient | ‘ore 
consideration of the affirmative action law 
suit; for the other half of our partici) ts, 


attitudes were not made salient. In add: on, 
half of our participants learned inform: ion 
about their partners that could potent «lly 
serve as a guide to expressing their verdic( on 
the law suit; others learned nothing al) ut 
their partners. 

According to our theoretical formulation, 
when attitudes have not been made salient, 
we would expect that (a) the situational ma- 
nipulation of potentially relevant information 
about the partner should substantially affect 
judgments on the law suit, and (b) covaria- 
tion between measured attitudes toward af- 
firmative action and judgments on the law 
suit should be minimal. By contrast, whea 
attitudes have been made salient, we would 
expect that (a) impact of the situational m:- 
nipulation of information about the partner 
on judgments of the law suit should be mini- 
mal, and (b) covariation between personal 
attitudes toward affirmative action and deci- 
sions in the sex-discrimination case should be 
substantial. 


METHOD 
Overview 


One hundred twenty undergraduate male stu- 
dents in introductory psychology participated in a 
decision-making experiment in which they individu- 
ally considered a mock affirmative action law suit. 
In anticipation of discussing his verdict with a 
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„partner, each participant prepared a written expla- 
nation and justification of his verdict in the case. 
Before learning about the specific case, some par- 
ticipants were encouraged to consider privately 
their attitudes on the issue of affirmative action 
(attitude salient); others considered the specific case 
without this opportunity (attitude not salient). Be- 
fore considering the case, some participants learned 
that their partner disagreed with them about the 

“value of affirmative action (disagreeing partner) ; 

» others learned nothing about their partners’ beliefs 

(no information). 


The Measurement of Initial Attitudes 


Attitudes were measured 2 weeks before the ex- 
periment (as part of a considerably larger question- 
naire survey) by responses on 10-point scales to four 
statements that had»been.written to tap general 

attitudes toward affirmative action. The mean of 

“} each participant's self-ratings on this four-item mea- 
sure constituted our index of initial attitudes toward 
affirmative action.* 


The Judicial-Judgment Task 


Participants had been scheduled in pairs of previ- 
ously unacquainted students. Their only contact was 
a brief introduction by the experimenter as he es- 
corted the “;-vtners” from their separate waiting 

“4 rooms to the’ separate experimental rooms. Each 
Á participant re ‘ved a booklet that contained in- 
structions for lùe experiment. 

The manipulation of attitude salience. The fol- 
lowing instructions appeared in the booklets of indi- 
viduals who had been randomly assigned to the 
attitude-salient conditions: 


The material you will be reading describes an 
affirmative action law suit. The suit conerns a 
woman who claims that she was turned down for 
a job because of her sex. Since you will be sum- 
ming up the main points of the suit right after 
you read the case history, we will give you a few 
minutes to organize your thoughts and views on 
the affirmative action issue (e.g., Is it important 
to you that everyone is given equal opportunity 
in obtaining employment? Should women and 
minorities be sctively recruited to help equalize 
employment ratios? Is it a good idea to have men 
and women equally represented in employment? 
Tf you were an employer, how would you feel 
about regulations specifying how many women 
« 4nd minorities to hire?). 


PA 


1 These instructions were designed to encourage par- 
ticipants to consider privately and articulate their 
attitudes on the issue of affirmative action and make 
salient the behavioral implications of their view- 
Points (cf, Snyder & Ebbesen, 1972). These par- 
ticipants were allowed 4 minutes to think about 
their viewpoints. 

The manipulation of partner's beliefs. At this 
point, each participant who had been randomly 
assigned to the disagreeing-partner conditions read 
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instructions that informed him that his partner had 
already been in a similar experiment and had writ- 
ten an essay on a different case. Accordingly, the 
instructions continued, that essay had been included 
in the booklet to provide a general idea of the 
format for the participant’s own essay. 

The “partner’s essay” was one of two hand-written 
essays that had been designed to reflect either favor- 
able or unfavorable attitudes toward affirmative 
action. Participants in the disagreeing-partner condi- 
tions always read the essay that reflected the view- 
point opposite to their own as indicated by the pre- 
experimental assessment. Participants in the no-in- 
formation conditions were given no opportunity to 
learn of their partner’s beliefs. 

The case history. All participants then read vitae 
of two biologists, Ms. C. A. Harrison and Mr. G. C. 
Sullivan, who had applied to the University of 
Maine for a professorial position. The vitae described 
each applicant’s training, honors, professional ex- 
perience, and publications. The University of Maine 
appointed Mr. Sullivan. Ms. Harrison filed suit: 
She insisted that the decision reflected a bias against 
females. Participants then considered arguments ad- 
vanced in court by attorneys for the plaintiff (Ms. 
Harrison) and for the defendant (University of 
Maine). 

Participants were instructed to study carefully 
both sides of the case, weigh all of the evidence, and 
then prepare a written communication explaining 
and justifying their verdict, citing evidence which 
led to the judgment and discussing the implications 
of their action. All participants then wrote the essay 
on the blank sheets provided for that purpose. 

After the essays had been written, participants 
expressed their final attitudes toward affirmative 
action on the same scales used in the premeasure 
session. The experimenter then explained that the 
partners would not, in fact, discuss their verdicts on 
the court case. 


The Measure of Behavior ° 


To assess each participant’s decision on the court 
case as reflected and communicated by his essay, all 


1 Originally, five items had been written. An ex- 
amination of the correlations between individual 
items and total scores (with that item excluded) 
revealed that one item was uncorrelated with total 
scores, 7(118) = —.095, ms. Accordingly, to increase 
the internal consistency of the measure of attitude, 
that item was deleted from the attitude scale (Nun- 
nally, 1967). The internal consistency of the four- 
item attitude measure is .63, as assessed by Cron- 
bach’s (1951) coefficient alpha. Participants’ scores 
on the attitude measure ranged from 1 to 9.75, with 
a mean of 5.01 and standard deviation of 1.75. 

2 We recognize, as do most students of the rela- 
tionship between attitude and behavior, that there is 
some ambiguity associated with classifying some 
responses as “attitudinal” and others as “behay- 
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TABLE 1 
VERDICTS IN THE COURT CASE 
Attitude Attitude 
Condition not salient salient 
Disagreeing partner 5.54 5.44 
No information 3.32 5.35 


Note. Range =1 to 10, Higher means indicate greater 
favorability of the verdict toward Ms. Harrison, the female 
plaintiff. (» = 30 participants per condition.) 


essays were read by four raters who were unaware 
either of the purposes of this study or of the condi- 
tions under which the essays had been written. Each 
rater read the 120 essays and indicated the position 
represented by each essay by assigning a number 
between 1 (strongly disagree) and 10 (strongly 
agree) in response to the statement, “The author is 
in favor of hiring Ms. Harrison, the female plain- 
tiff.” The mean of these four ratings constituted our 
measure of the favorability toward Ms. Harrison of 
each participant's judicial decision-making behavior.® 


RESULTS AND Discussion 


We separately examined the effects of atti- 
tude salience and a disagreeing partner on 
(a) verdicts in the court case, as measured 
by the coders’ ratings of the essays, and (b) 
covariation between these verdicts and atti- 
tudes toward affirmative action, as measured 
in the preexperimental assessment. 


The Verdicts 


What were the effects of salience of initial 
attitudes and anticipation of a disagreeing 
partner on verdicts in the court case? We had 
anticipated that under conditions of low sali- 
ence of initial attitudes, judicial decisions in 
the court case would be influenced by antici- 
pation of a disagreeing partner but that when 


ioral” (cf, Kiesler, Collins, & Miller, 1969). None- 
theless, most researchers have focused on pencil-and- 
paper self-report psychometric scales to assess “atti- 
tudes.” They then have examined the predictive 
relationships between these measures of general 
attitudinal orientation and diverse specific verbal or 
nonverbal behavioral criteria, including voting, essay 
writing, verbal commitments and pledges, speeches, 
volunteering, physiological responses, and so on 
(e.g., Wicker, 1969). C the behavioral acts of 
deciding and communicating in writing a verdict in 
a specific sex-di court case (even one 
simulated in the psycholo; laboratory) are “be- 
which one would want to be able to predict 
from prio measures of 1 “attitudes” 
toward affirr action. 


haviors” b 


al 
scalar 
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initial attitudes had been made salie», anti 
pation of a disagreeing partner wouid ha 
little or no impact on one’s judgm= : of the 
court case. The means presented ir able 
display just such a pattern, Ther: was 
reliable interaction between attitude salient 
and partner’s beliefs, F(1,116) = 3.74, p= 
055. A planned comparison revealed 
when attitudes were not salient, judgments i 
the court case were substantially les: favor 
able toward the female plaintiff when »othi 
was known about the partner’s belie) thi 


when a disagreeing partner was antic pated 
F(1,116) = 6.90, p<.01. Similariy, 
planned comparison revealed that, wher atti- 
tudes were salient, anticipation of a © «gre 
ing partner had virtually no impact ©: ver 
dicts in the court case, F(1,116) =$, 

Furthermore, it appears that individual 
who anticipated a disagreeing partner « 
had salient attitudes, expressed and 
municated judgments more moderate 
those of individuals with neither salieni atti 
tudes nor anticipation of a disagreeing part- 
ner, whose judgments were relatively less 
favorable to the female plaintiff, covtr 
F(1,116) = 11.16, p< .002. In faci, th 
verdicts of individuals with salient attitude: 
and/or anticipation of a disagreeing partne 
were in no case reliably different fron. th 
neutral point (5.5) of the scale used b 
coders, all ts < 1, ms. By contrast, individua 
with neither salient attitudes nor anticipatio 
of a disagreeing partner rendered judgment 
relatively unfavorable to the female plaintiff 
their judgments differed reliably from thi 
neutral point of the coders’ scale, ¢(29) 
—4.96, p < .001. 

Why do attitude salience and anticipatio! 
of a disagreeing partner produce “middlc-of 
the-road” verdicts in the court case? Perkap: 
as Cialdini et al. (1973) have suggested, su 


®This measure could range from 1 (a verdi 
strongly unfavorable to Ms. Harrison) to 10 
verdict strongly favorable to Ms. Harrison). TI 
interrater correlations for this measure ranged frol 
.80 to .91, with an average interrater correlation 
.87. The measure of behavior may be viewed as 
four-item scale in which the individual items 
the responses of the four raters. From this persp® 
tive, the internal consistency of the measure is 9 
as assessed by coefficient alpha (Cronbach, 1951). 
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und decisions are the outcome of a 
process of impression management 
to afford the best possible prepara- 


z middle- 
strateg 
design 


tion ı discussion of these decisions. A 
mode position affords at least two advan- 
tages: i=) The middle ground allows the in- 
divi draw supportive arguments from 
both of the issue without appearing to 
be ir ent; (b) An individual with a 
m sition is likely to be viewed favor- 
ab e he creates the impression of 
op | rationality. Accordingly, by 
str hoosing a moderate decision on 
tt ©, an individual increases his 
Í making a favorable impres- 
tner (winning friends) and of 
lict prevail in the discussion 

ople): 
fects of attitude salience and a 
d . partner actually the behavioral 
outcor trategic impression management, 
or are “hey perhaps a reflection of actual 
“mocoration” shifts in attitudes toward af- 
firr e action? To assess this possibility, 
we «amined the effects of our independent 
variéhles on differences between attitudes 
towa affirmative action reported in the pre- 
ex ental session and those expressed after 


consideration of the court case, Attitude 
vere calculated so that posi- 
vs indicated shifts away from 
one’s osition (e.g, from “agree” 

y sree” for individuals with initial 


scores 


tive vali: 


atti ‘avorable to affirmative action). An 
unweioied-means analysis of variance re- 
vealed neither significant main effects nor 


reliable interactions (all Fs <1). Thus, it 
appears that differences in verdicts after con- 
sideration of the court case reflect strategic 
choices of the moderation strategy of behav- 
ioral self-presentation rather than any shifts 
in general attitudes, 

That the anticipation of a disagreeing dis- 
cussion partner should motivate strategic im- 
Pression management is clearly consistent with 
Other research on social communication and 
self-presentation (e.g., Cialdini et al., 1973; 
Newtson & Czerlinsky, 1974), Why, however, 
should thinking about one’s own attitudes in 
anticipation of discussing one’s decisions with 
an unknown partner (attitude-salient/no-in- 


COVARIATION BETWEEN ATTITUDE AND BrNAVIOR 
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TABLE 2 


Attitude 


Condition salient “salient” 
Disagreeing partner 
Correlation 06 ld 
Variance: Attitude measure 2.35 2.68 
Variance: Behavior measure 10,37 11.51 
n 30 30 
No information 
Correlation 07 58" 
Variance: Attitude measure 3.95 3.44 
Variance: Behavior measure 5.95 8.29 
n 30 30 


*P <.001, 


formation condition) induce individuals to 
adopt a moderation strategy of self-presenta- 
tion? Perhaps as these individuals became 
increasingly aware of their own attitudes and 
the behavioral implications of their view- 
points, they also became increasingly aware 
that they knew nothing of their partners’ at- 
titudes toward affirmative action, A middle- 
of-the-road strategy of moderation would best 
permit them to cope with either an agreeing 
or a disagreeing partner by minimizing the 
possibility of being extremely different from 
the other individual (cf. Kelley & Thibaut, 
1954). 


Covariation Between Attitude and Behavior 


What is the relationship between attitudes 
expressed by individuals 2 weeks prior to the 
experiment and their verdicts in the court 
case? We computed product-moment correla- 
tions between the behavioral measure (the 
coders’ ratings of the verdicts) and the atti- 
tudinal measure (the preexperimental assess- 
ment of attitudes) for all 120 participants and 
separately for each experimental group, These 
correlations are reported in Table 2, 

Overall, for all 120 participants, the cor- 
respondence between initial attitudes and 
judgment behavior was, at best, modest, 
r(118) = .22, p < .05. The magnitude of this _ 
correlation is quite comparable to that most 
frequently observed in studies of the predic- 
tive relationship between attitude and be- 
havior (e.g., Wicker, 1969). 
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However, when we considered the relation- 
ship between attitudes and behaviors sepa- 
rately for the four conditions defined by the 
experimental manipulations, a rather striking 
pattern emerged. When initial attitudes were 
not salient, measured attitudes and actual 
behavior were uncorrelated whether one an- 
ticipated a disagreeing partner, r(28) = .06, 
ns, or knew nothing about the partner’s be- 
liefs, 7(28) = .07, ns. 

By contrast, when attitudes were salient 
and one had learned nothing of the beliefs of 
one’s discussion partner, initial attitude 
toward the general issue of affirmative action 
served as a reliable predictor of verdicts in 
the specific court case, r(28) = .58, p < .001. 
When corrected for the attenuation caused 
by the less than perfect reliability of the mea- 
sures of attitude and behavior, this observed 
correlation is equivalent to one of .73. Thus, 
although these individuals may have been 
following a moderation strategy of self- 
presentation, their verdicts nonetheless re- 
flected their general attitudes toward affirma- 
tive action policies and procedures. 

However, when attitudes were salient and 
one also anticipated a disagreeing partner, 
there was only minimal covariation between 
these attitudes and verdicts in the court case, 
r(28) =.14, ns. Apparently, in these circum- 
stances, the forces of impression management 
stimulated by anticipation of interaction with 
a disagreeing partner outweighed the effects 
of attitude salience. 

Thus, only when attitudes were salient 
and nothing was known about the partner's 
beliefs was there a substantial predictive rela- 
tionship between measured attitudes and ver- 
dicts in the court case. The correlation be- 
tween attitude and behavior for individuals in 
this attitude-salient/no-information condition 
was reliably larger than that for persons in 

any of the attitude-not-salient/no-information, 
s=2.19, p= .029, attitude-not-salient/dis- 
agreeing-partner, z = 2.22, p = .026, or atti- 
tude-salient/disagreeing-partner, z = 1.91, p 
= .056, conditions. 


The Effects of Self-Monite 


e relationship 
ts a result of 


To examine differences 
between attitude and beh 
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variation in self-monitoring, we iputed 
product-moment correlations betwec: ‘he at- 
titudinal measure and the behavior’ asi 
separately for the high self-monito (n= 
57) and low self-monitoring (7n = ¢ roups 
defined by a median partition of | vloni- 
toring Scale (Snyder, 1974) scores lected 
during the preexperimental session.” | ovaria- 
tion between attitude and behavior s reli- 
ably larger for low self-monitoring ino duals. 
(r= .42, p < .005) than for high se’ moni- 
toring individuals (r = .03, ns), s 22; 
= .026. These data provide further »port 
for the notion that congruence betwe: atti- 
tudes and behavior will be greatest fo per- 
sons who regard their overt behaviors s ac- 
curate reflections of corresponding © posi- 
tions (cf. Snyder & Tanke, in press) 
CONCLUSIONS 
Behavioral scientists who have search. for 


evidence of pervasive consistencies be: cen 
attitude and behavior have typically i und 
only weak and inconsistent relationship: be- 
tween measures of attitude and observa ions 
of social behavior. Such outcomes have en- 
erated considerable skepticism about th: ex- 


nd 
nu- 


istence of close ties between attitud: 

behavior. However, our conceptual foi 
lation suggests that social settings and in er- 
action contexts may differ in the extent to 
which attitudinal and situational guides to 
action are salient and relevant for the indi- 
vidual. In the presence of clear and unambig 
uous social or interpersonal cues to situation: 
appropriateness, correspondence between s- 
cial behavior and these situational factors 
should be substantial. At the same time, <o- 
variation between attitude and behavior mi; ht 
be minimal, By contrast, in situations that 
stress the relevance of attitudes and disposi- 


tions as guides to action, social behavior 


ought to be less responsive to situational and 
interpersonal specifications of behavioral 
appropriateness. Behavior in these situations 
should, however, be well predicted from 
knowledge of personal characteristics, includ- 
ing measures of social attitudes. 


4 One participant did not complete the Self-Moni- 


toring Scale. His data were not entered into these 
analyses, 
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rical research, we increased the 
» between attitude and behavior 


Tn our 


correspon 
by the ma: 'wulation of attitude salience. Simi- 
larly, ing ils may be particularly likely 
to moniio ‘heir behavioral choices on the 
basis of g ‘tional guides in environments 
that (a ice either one’s sense of com- 
mitment sonal responsibility for one’s 
actions , 1971; Schwartz, 1973); 
(b) heigi mes awareness of self as a 
potential È behavior (cf. Wicklund, 
1975) circuit the process of avoid- 
ing and r eting commitment (cf. Kies- 
ler, R lak, 1974) and therefore make 


) define one’s attitudes as ir- 
s actions; and (d) provide 
e ort for congruence between 
clief (cf. Kiesler, Nisbett, & 


it impossible 
relevant to 
normati 


behavior 


Zanna, 1< 

We! ‘gested that individuals will be 
particula likely to regulate their self- 
presenta on the basis of situational cues 
in those jiteraction contexts that promote a 
Strategic «rientation for purposes of impres- 
sion man: ement and social influence, This 
self-moni g process may also be character- 


istic of sox {tings that (a) are novel, un- 
familia ntain sources of social com- 
parison (cf. »estinger, 1954: Sherif, 1937); 
(b) make j luals uncertain of or con- 
fused abou inner states (cf. Schachter 
& Singer, 


(c) suggest that one’s atti- 


tudes are soc!: {ly undesirable (cf. Dutton & 


Lake, 19 or deviant (cf. Freedman & 
Doob, 1968): and (d) sensitize one to the 
Perspective of others and motivate concern 
With social evaluation and conformity with 
referen ‘p norms (cf. Charters & New- 
comb, 1 Zimbardo, 1969). In such con- 


texts, individuals will be particularly likely to 
Monitor their behavior on the basis of avail- 
able situational cues, including the actions of 
others who appear to be behaving appropri- 
ately in the situation, 

Both our conceptual formulation and our 
empirical research Suggest that correspond- 
ence between attitude and behavior will be 
Steatest in situations that stress the relevance 
of attitudes, dispositions, and other personal 
characteristics as guides to action. However, 
we do not mean to imply that when an indi- 
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vidual is monitoring behavioral choices on the 
basis of salient and relevant attitudes, situa- 
tional factors are not important in the deter- 
mination of social behavior. For even when 
an attitude and its behavioral implications 
have been carefully articulated, any behav- 
ioral manifestation of this attitude must none- 
theless be carefully molded and shaped to fit 
a specific situational context. Thus, a par- 
ticipant in our experiment may turn to his 
attitude toward affirmative action to define 
his stand on the court case, but he must still 
look to situational cues to learn that the 
appropriate expression of his attitude is a 
hand-written essay phrased in language ap- 
propriate for a college-aged male reader with 
whom he will later discuss the content of the 
essay. 

Similarly, we do not mean to imply that 
the impact of situational pressures on social 
behavior necessarily precludes the influence 
of stable individual attitudes and dispositions, 
For example, even if all people donate more 
money to Charity X when pressed by their 
peers, the rank ordering of the size of indi- 
vidual donations may continue to reflect the 
ordering of individual attitudes toward Char- 
ity X. More generally, situational influences 
may affect the likelihood of behavior inde- 
pendently of the impact of such influences on 
the correlation between behavior and attitude, 
Thus, even when participants with salient 
attitudes strategically chose middle-of-the- 
road self-presentations to cope behaviorally 
with unknown partners, they nonetheless ex- 
pressed verdicts that were highly correlated 
with and predictable from their personal at- 
titudes toward affirmative action. 
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Evidence Against the Possibility of Utopian Societies 


Uriel G. Foa, Steven Megonigal, and Jules R. Greipp 
Temple University 


Two experiments that test the proposition that giving love facilitates the trans- 
ission of another resource are described. The first investigation deals with the 


ansmission of information and the other one with giving of money. Sixth- 


! ide students did better in a multiple-choice examination when the test 
iterial was presented by a “warm” teacher than when presentation was 
cold.” In a second experiment restitution of money of which college students 
had been previously deprived was not sufficient to eliminate their residual 
aggression toward the depriver unless returning the money was accompanied 
by an expression of affection. Lack of symmetry between provision and 
deprivation is discussed, with particular reference to modern society; the 
implication that social systems tend to decay is examined. 


i ıghout human history many proposals 
for. topian society have been advanced, and 
the ch for a stable, self-regulating system, 
resp sive to the needs of the individual still 
goc: n. Data presented in this article suggest 
tha topia might be impossible to realize. This 
evic ice was accidentally obtained in the 
cou: ¢ of two investigations meant to clarify 
the ‘ole of affection in the facilitation of social 
exchanges. 

rich and diversified body of research has 
s) own that expression of warmth and affection 


is iuences the recipient’s behavior. It was found 
‘hat social reinforcement by an adult can con- 
dition infants’ vocalization (Rheingold, Ge- 
wirt 1964; Weisberg, 1963), smiling 
t , 1958), crawling (Harris, Johnston, 
& Wolf, 1964), crying (Hart, Allen, 

i Harris, & Wolf, 1964), and playing 
(Av a, Hart, Buell, Harris, & Wolf, 1964). 
Older children improved performance in a vari- 
ety of tasks when treated with warmth (e.g. 
Becker, Madsen, Arnold, & Thomas, 1967; 
Berkowitz, Butterfield, & Zigler, 1965; Ge- 
wirtz, 1969; Hall, Lund, & Jackson, 1968; 
į isazi, 1972; Silverman & Waite, 
1969; Spence, 1973). Similar findings were re- 
+ ported with retarded children (Blair & Fox, 
1973; Hall & Broden, 1967; Insalaco, 1970; 
Patterson, 1966; Srivastava, 1968; Zimmer- 
man & Zimmerman, 1962). In a clinical setting 
warmth and praise were successfully used in 


Requests for reprints should be sent to Uriel G. Foa, 
Department of Psychology, 505 Weiss Building, Temple 
University, Philadelphia, Pennsylvania 19122. 


the treatment of stuttering (Rickard & Mundy, 
1965), passivity (Johnston, Kelley, Harris, & 
Wolf, cited in Harris, Wolf, & Baer, 1972), 
compulsive stealing (Wetzel, 1966), vomiting 
(Wolf, Birnbauer, Williams, & Lawles, 1965), 
and hysterical blindness (Brady & Lind, 1961). 
With normal subjects Pope and Siegman (1972, 
p. 79) reported that a warm interviewer in- 
creased the productivity and fluency of the 
interviewee; and a previous pleasant inter- 
personal experience was found to facilitate 
helping behavior (Berkowitz, 1972, p. 75; Isen 
& Levin, 1972). In several of these investiga- 
tions (e.g. Berkowitz, 1972; Pope & Siegman, 
1972; Spence, 1973), an effect was obtained, 
although giving affection was not contingent 
upon the emission of the target behavior. Ap- 
parently the change in need state brought 
about by receiving love increased the emission 
of socially approved behaviors. 

The two experiments reported in this article 
were planned to test the effect of affection on 
the transmission of other resources, informa- 
tion in the first study and money in the second. 
Although support was found for the notion 
that giving affection serves to facilitate other 
exchanges, the data also revealed an unsus- 
pected lack of symmetry between the effect of 
giving and denying affection. 


Srupy 1: BEHAVIOR OF TEACHER AND 
STUDENTS’ PERFORMANCE 
Method 


Four classes of students fro 
retown, Pennsylvania, Elementary 


m the sixth grade at Jar- 
School participated 
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TABLE 1 
MEAN EVALUATION AND TEST SCORES 
IN THE Two CONDITIONS 
Condition 
Score Warm Cold F ? 
Evaluation 18.7 15.2 121 .0000 
Test 18.0 16.0 43 -0000 


Note, Maximum score = 20. 


in this investigation, approximately 24 in each class, 
with both sexes nearly equally represented, In total 96 
subjects were involved, half of them were females, They 
were given a 35-minute presentation by two substitute 
teachers, a male and a female who were actually con- 
federates of the experimenter. Each confederate taught 
two classes, one in a warm and friendly manner, the 
other in a cold manner. Classes were randomly assigned 
to teachers and to conditions. Following the presenta- 
tion each pupil was given a teacher rating form and a 
multiple-choice test on the topic of the presentation. 
Procedure, The regular teacher informed the class that 
a substitute teacher would give a presentation on the 
life of wolves, The regular teacher then left, and the 
substitute teacher, accompanied by another assistant 
of the experimenter, entered the room, The latter in- 
troduced the teacher as Mr. or Miss so and so from 
‘Temple University and left. Both the assistant and the 
substitute teacher were naive about the hypothesis. 

The format of presentation and the information 
provided were the same for all four classes. To insure 
uniformity the same prepared script was followed. It 
consisted of a 5-minute introduction and a series of 
questions directed to the students in order to generate 
a controlled discussion. Most of the information was 
given during the discussion, Moreover, each teacher had 
a list of the test items and was instructed to check off 
each topic after discussing it, 

Although the information provided was strictly 
standardized, the behavior of the teachers differed in 
the two conditions. In one condition the teachers were 
warm and friendly; they asked students to call them by 
their first names, smiled frequently, and generally 
created an atmosphere of congeniality, Students were 
encouraged to answer questions and were complimented 
for what they said, Before leaving, the teacher thanked 
the class for having been so nice and told them how 
much he/she enjoyed being with them. In the other 
condition the teachers refrained from smiling, formal ad- 
dressing was retained, and the teachers answered ques- 
tions or acknowledged what the students said without 
praise. A rather cold atmosphere was maintained, al- 
though the teacher was neither hostile nor critical, At 
the end he/she just said “good-bye” and left. The ex- 
perimenter’s assistant then reentered the classroom, 
explained to the students that the substitute teacher was 
practicing teaching, and asked them to evaluate his or 
ier performance in order to assist in its improvement. 
sS 5 ER Se aon was used to justify the examina- 

A s were then handed an evaluation form, 


and after its completion, they were gi i 
I Ti 
Seas , they e given the multiple- 
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Instruments, The evaluation form cont 12 que: 
tions probing whether the teacher loved spect 
the students and assessing his or her abil conve 
information, Three alternative answers \ rovid 
to each question: “no,” “slightly,” anc sae 
maximum possible score was 24 for the mo vorabl 
evaluation. These scores were later trans d to a 
20-point scale to facilitate comparison with xamine 
ation, The test consisted of 20 questions t! overed 
the material presented to the students. Ther re four 
possible answers to each question, of which or 1e Was 
right. The number of correct answers provide he ree 
tention score, 

Results 
Students who received affection in ac ition 


to information evaluated the teacher | gher 
than those who received only inform. ion. 
Those who obtained affection also ret. ined 
more information. The mean scores are ¿iven 
in Table 1. 

Differences between warm and cold co di- 
tions were highly significant (p < .0001) by 
analysis of variance for both rating and t st 
It is evident, however, although not surp is: 
ing, that the teacher’s behavior had a stron; et 
effect on the evaluation than on acquisition o 
knowledge. This warm-cold variable explainc ¢ 
50% of variance in evaluation but only 28% o 
the variance in test results. The correlation be- 
tween evaluation and test scores was .°7: 
evaluation explained 32% of the test variance, 


Discussion 


The results indicate that provision of ove 
and status by teachers improves the studc nts’ 
retention of information concurrently pre- 
sented. The relationship between test score 
and rating is of interest because tı 1S) 
evaluation by students is often used to ¿sess 
teaching effectiveness. Our results cast some 
doubts on this practice: Rating accounted for 
only one third of the variance in test scores, 
their intercorrelation being .57. This correla- 
tion was nearly the same in the two experi- 
mental conditions, thus the love-giving be- 
havior of the teacher does not seem to be a sig- 
nificant factor in determining the size of the 
rating test relationship. Several other studies, 
reviewed by Gessner (1973), have produced 
widely divergent findings regarding the rela- 
tionship between students’ ratings of teachers 
and students’ test performance. Whereas 
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‘n his own work reported a positive 
correla‘ion of .77, other investigators obtained 
a neg: relationship. 

Tni tudy the amount of information pro- 
vid the same in the two conditions; only 
f ve varied between them. It is quite 

ibl wever, that in natural situations a 
provides more love may also give 


‘] on. When this happens the corre- 


ı rating and performance is likely 


to de and could even become negative. 
Ir sion, teachers are most effective and 
be | when they give love without reduc- 
in amount of information provided. 
Srupy 2: Givinc Love IN Money 


RESTITUTION 


1e effect of receiving love was further in- 

1 ited in the entirely different setting of 

ivation and restitution of money. In the 

i ure on aggression, the topic of res- 

has not received much attention as 

\ by Berscheid and Walster (1967). Fur- 

th «more, some of these investigations were 

© cerned with the effect of restitution (or its 

ence) on the aggressor (e.g., Walster, Ber- 

wid, & Walster, 1970). Effects of restitution 

the victim were, however, reported by Wor- 

wel (1961); in his experiment hostility induced 

yy status deprivation was subsequently re- 

duced by status restitution. Further, indirect 

evidence of the hostility-reduction effect of 

‘ion can be found in an investigation of 

ion by Aronson and Linder (1965); 

cts who overheard a person derogate 

tl. and then praise them judged this person 

more attractive than when he uniformly 
praised them. * 

In Study 2 we investigated changes in resi- 
dual aggression when the harm-doer not only 
makes restitution to his victim but also ex- 
pres ion for him. The following prop- 
sitions were advanced: (a) Deprivation of a 
ce instigates aggression in the victim 

1 the harm-doer. (b) Restitution of the 
resource of deprivation reduces aggressiveness. 
(c) When the harm-doer expresses affection for 
his former victim, in addition to making full 
Testitution, aggressiveness is further reduced 
and approaches the levél that existed before 
deprivation. 

It was thus hypothesized that residual ag- 
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gression would be highest following depriva- 
tion, lower after restitution, and further re- 
duced, reaching the predeprivation level, when 
restitution was accompanied by giving of 
affection. 


Method 


‘The design for testing these hypotheses involved three 

groups of experimental subjects and one control group. 

In all three experimental groups, a confederate denied 

the subject his fair share in the money they had gained 

in a joint task. In one group there was no restitution; 

in a second group the money was restored in a subse- 
quent task; in the third group restitution of money was 
accompanied by a brief expression of friendship. In the ~ 
last stage all subjects described their impression of the 
confederate on a rating form, thus providing an indica- 
tion of their residual aggression toward her, 

Subjects. Male volunteers from introductory psychol- 
ogy classes at Temple University served as subjects and 
were randomly assigned to one of the four groups, with 
15 subjects in each group. Confederates were several 
female students who fulfilled this role as part of their 
laboratory training. They were randomly paired with 
subjects in the various groups. 

Procedure. Subject and confederate were seated at a 
desk facing each other but were separated by a divider 
that prevented them from seeing one another. They 
were instructed to refrain from any conversation dur- F 
ing the experiment. The sole communication allowed 
was writing on a pad that they could pass around the 
screen. In this manner communication between subjects 
and confederates was strictly controlled. Their task was 
to solve a number of mathematical problems within a 
given period of time, The leader of the pair (chosen by 
drawing) was to answer the first question, then pass the 
sheet to the co-worker who would answer the second 
question. This procedure continued until the time allo- 
cated was over. The leader decided how the money re- 
ceived would be divided between himself and his” 
co-worker, and he could modify any answer given by the zi 
co-worker. These instructions were handed over in 
writing to each participant. After making sure that all 
the rules were clear, the confederate was asked to pick s 
one of two slips of paper in order to decide whether sho na 
would be leader or co-worker. Actually both slips had 
the word “co-worker” on it so that the first batch of — 
problems the subject was always the leader. Although 
appearing to be timed, the task was actually stopped 
after both partners had provided an equal number si 
answers, 6 or 7 each. The experimenter than announc' 
the reward, which ostensibly depended on the correct- 
ness of the answers and varied between $2.50 and $3. 
The leader was then asked to mark on the sheet how the 
reward was to be divided between him and his co- 
worker, Subjects usually divided the reward equally. 
The four who did not were excluded from the analysis 
oe pect then presented a new ator priba 
lems and suggested that roles could now be switched n 7 
that the confederate would be the leader. Just before a) 
gave the signal to start the second set of problems, the 


TABLE 2 
MEAN RESIDUAL AGGRESSION BY TREATMENT AND RESOURCE CLA 
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Treatment 


No deprivation, 
no restitution 
(control) 
Resource class 


Money depri 


No restitution Money 


Money (198.36) 0 01 
Goods (8.35) 3.0 OL 
Services (8.28) 2.6 OL 
Love (10.66) 1.6 ‘01 
Status (2.35) 4.2 OL 
Information (3.24) 32 05 


17.0 OL 5.8 
8.2 ns 5.6 
8.0 ns 3.8 
6.0 ns 6.2 
8.0 ns 6.4 
6.2 ns 1.4 


Note. Maximum score = 20. Main effect Fs are given in parentheses. df = 3, 116. p values refer t of 
between each two adjacent means by analysis of variance, p values appear in italics. 


experimenter announced how the subject had divided 
the reward for the first series, At the end of this second 
stage, the confederate, who was now the leader, kept 
all the money to herself, except in the control group in 
which it was again divided equally, At this point sub- 
jects and confederates in the control and the no-restitu- 
tion groups filled out the evaluation form. Those in the 
two other groups received the third and last set of prob- 
Jems. A new bogus lottery showed that the confederate 
was again to be leader. This time, however, she assigned 
all the money to the subject (which was kept to the same 
amount as in the previous stage), thus providing full 
restitution, In the “love” condition the confederate, in 
addition to allocating all the money of the third stage 
to the subject, also handed him a note saying, “I’m 
sorry T did that; I think you're a nice guy, and I like 
you. TIl explain after the experiment.” The explanation 
was then given in the debriefing that followed the rating 
of the partner. 

Instrument, The rating form consisted of a list of 24 
attributes, half of which were fillers. The other 12 indi- 
cated negative evaluation in a specific resource class 
(Foa & Foa, 1974, p. 36); there were two attributes for 
each one of the six classes, For example: hostile (love), 
arrogant (status), misleading (information), unfair 
(money, in the particular context of this experiment), 
larcenous (goods or money), and hindering (services). 

‘The order of presentation of the adjectives was varied 
to avoid sequential effects. Subjects were asked to rate 
the confederate on a 10-point scale according to how 
much each adjective was descriptive of him (from “not 
at all” to “very much so”). Thus a high score on each 
pair of resource-specific attributes indicated strong resi- 
dual hostility in the resource class to which the two at- 
tributes referred, The maximum Score was 20, 


Results 


Analysis of variance showed a main effect for 
treatment conditions in all 
cept status; the respi 
Table 2. For probing 
ment groups tha 
ses, 


resource classes ex- 
ective Fs are shown in 
differences between treat- 
i ut were relevant to the hypothe- 
simple F tests were conducted. It had been 


predicted that potential ag 
lowest in the control group 
group that experienced de 
restitution. In the two oth 
aggression was expected to 
former extreme groups but 
who received only restituti \oney 
for those who got both m« 
data provided a good deal o! 
predictions and also some further ciui 
For every class of resources, residual ageres- 
sion was significantly lower in controls than i 
the group who experienced depriva with ug 
restitution, Restitution of money alon sig- 
nificantly reduced residual ag i for 
money but not for the other resource. H vever, 
provision of love, in addition to moù , low- 


below 


deprivation; none of the differences between 


; in fact, 
, except for 


Discussion 


Asymmetry of Generalization in Givi ng and 
Taking 


Provision of affection appears to e 


transmission of another resource: Students who 
receives 


y d love retained more of the information 
provided by their teachers; restitution Ol 


nhance thi 
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Fm accompanied by an expression of warm 
feel decreased the victim’s residual aggres- 
sion he money class more than did restitu- 
tio money alone. We also found that 


\ t 


n of money resulted in a deficit of 
1 cts deprived of money that was not 
‘pressed more aggression in the love 


clas : did controls; high residual aggression 
Y was also reported in an experiment de- 
scribed by Foa and Foa (1974, p. 197), after 
su ject and confederate had engaged in mutual 
di “vation of money. Thus provision or de- 
p! ©tlon of a given resource seems to affect the 
b. ance of other resources. This generalization 
e ect also suggests that (a) provision of any 
esource (e.g, money) will increase the amount 
cf love in possession of the receiver; and (b) 
€ privation of love will produce a deficit in 


( resources as well. The evidence seems, 
} cr, to point in the opposite direction: 
of money by the harm-doer did not 
si ual hostility in the love class, which 
I d the same as when no restitution oc- 
. Apparently, then, provision of money 
not affect love. Data contradicting the 
se od proposition were reported by Foa and 
I (1974, p. 197): Residual aggression for 
money, following mutual deprivation of love, 
w s almost as low as in control subjects; taking 
a ay love did not generalize to money. 
Considered together, these various results 


suggest that generalization across resource 
cla not necessarily symmetrical; when 


are provided, generalization from 
k > money is more likely to occur than 
gen- alization from money to love. In depriva- 
tion, on the other hand, generalization from 
money to love is more likely to happen than its 
opposite. When love is provided there is a gain 
in other resources, but no love is gained when 
other resources are given. Conversely, depriva- 
tion i sources results in a loss of love, 
alth : love deprivation does not produce a 
in other resources. Consequently, the 
loss of love occurring when other resources are 
taken away is not offset by receiving these re- 
Sources. By contrast, the gain in other resources 
when love is provided is not wiped out by de- 
privation of love. In the long run the system 
Seems to favor other resources over love: Loss 
of love following deprivation of other resources 


z not compensated when other resources are 
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provided. To maintain equilibrium and avoid 
accumulation of aggressive potential, it ap- 
pears necessary that on the average, more love 
should be given than received. 

In modern society the intrinsic difficulty of 
meeting this requirement is compounded by 
institutional specialization—the existence of a 
large number of social institutions, each of 
them restricted to the exchange of a narrow 
range of resources (Foa & Foa, 1974, pp. 326- 
328). This society presents scarce opportunities 
for exchanges of love, particularly in conjunc- 
tion with other resources; in consequence, the 
property of love to facilitate other simultane- 
ous exchanges remains largely unused, on the 
other hand, loss of love resulting from depriva- 
tion of other resources does occur, 

Asymmetric processes such as these are not 
unknown in other sciences; A classic example 
is the second law of thermodynamics. In a 
closed system asymmetric processes result in 
progressive decay, and only external interven- 
tion can maintain the system in a steady state, 
The evidence for asymmetry reported here sug- 
gests that a self-regulating, utopian society 
may be impossible to achieve: Intervention and 
purposeful changes appear likely to be required 
for reducing tensions and preserving a viable 
equilibrium. 

It would be presumptuous to advocate social 
reforms based on the results of two modest ex- 
periments, but further research on this asym- 
metric process appears very much warranted 
and fraught with practical relevance. 
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Locus of Control and the Generality of Learned Helplessness 
in Humans 


Sheldon Cohen, Myron Rothbart, and Susan Phillips 
University of Oregon 


To examine the effects of learned helplessness on tasks used to assess the 
aftereffects of stress, internal and external subjects were pretreated with con- 
tingent or noncontingent nonnoxious reinforcement using a yoking procedure 
to control for amount of trials and reinforcement. Both internal and external 
noncontingent subjects performed more Poorly on subsequent tasks requiring 
a problem-solving strategy. However, only externals showed helplessness effects 


on non-problem-solying tasks. 


Recent research has demonstrated that 
organisms which experience inescapable nox- 
ious stimulation show impaired performance 
on subsequent instrumental tasks. This phe- 
nomenon, described by Seligman (1975) as 
“learned helplessness,” has been observed with 
humans (e.g, Hiroto, 1974; Krantz, Glass, & 
Snyder, 1974) as well as with infrahumans (e.g. 
Overmier & Seligman, 1967 ; Seligman & Maier, 


1967). Selig has proposed that learned 
helpi rs because the organism learns 
that its ments are independent of its 


that it lacks control over its 
this learning undermines the 
initiate further instrumental 


responses 
outcomes), and 
motivation to 
responses. 
Closely related to the work on learned help- 
lessness is Glass and Singer’s (1972) research 
on the effects of noise on poststimulation per- 
formance. These authors reported that subjects 
exposed to uncontrollable noise that is un- 


related to the ongoing task do more poorly on 
poststin sks than subjects who per- 
ceive that terminate the noise at will. 
These effect: observed on poststimulation 
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tasks as diverse as proofreading, a tolerance for 
frustration test, and the Stroop Color-Word 
task. Seligman (1975) interpreted these results 
as an example of learned helplessness. He 
argued that subjects unable to control the 
noise learn that their reinforcements are in- 
dependent of their responses, which results in 
motivational decrements manifested in poorer 


performance on poststimulation tasks. Such an 


interpretation requires two assumptions that 
are not supported by existing literature: (a) 
Learned helplessness can be induced when per- 
formance on the experimental task (task per- 
formed during noise exposure) is not instru- 
mental in escaping or avoiding the aversive 
stimulus, and (b) helplessness will generalize 
to a wide range of cognitive tasks including 
ones that do not require a directed problem- 
solving strategy. (Test tasks used in previous 
studies require subjects to initiate responses in 
a trial-and-error fashion; cf. Wortman & 


Brehm, 1975, p. 299.) The first purpose of this 
study is to examine this last assumption. 


Specifically, we have tried to determine 
whether task impairment on the Glass and 
Singer aftereffects tasks can be replicated when 
a more standard learned helplessness pretreat- 
ment is used. 

Also related to the work on learned helpless- 
ness is the social learning construct, internal- 
external control of reinforcement (Lefcourt, 
1966; Rotter, 1966). Internal-external locus of 
control refers to the degree to which an in- 
dividual feels that his reinforcements are con- 
tingent upon his actions. Those who feel they 
control their reinforcements are labeled in- 
ternals, whereas those who perceive their out- 
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comes to be independent of their responses are 
labeled externals. The relationship between this 
personality factor and learned helplessness has 
been studied by Hiroto (1974), who reported 
that externals display greater helplessness 
effects than internals. Hiroto, however, in- 
cluded in his study only those subjects with 
extreme scores on the locus of control scale— 
at least one standard deviation from the mean. 
To determine whether people with internal- 
external scores that are less extreme are simi- 
larly affected by a helplessness manipulation, 
the present study used a less select group of 
subjects. 

A third purpose of the present research is to 
examine Seligman’s (1975) suggestion that 
learned helplessness should occur not only when 
an organism experiences uncontrollable noxious 
stimulation, as in the studies cited earlier, but 
also in situations in which the individual lacks 
control over nonnoxious contingencies. Evi- 
dence relevant to this hypothesis is supplied by 
two recent studies. Both Hiroto and Seligman 
(1975) and Roth and Kubal (1975) reported 
that subjects receiving noncontingent rein- 
forcement on a concept-formation problem 
showed deficiencies on a subsequent problem- 
solving task. Although these studies support 
the suggestion that learned helplessness is in- 
duced by noncontingency rather than by some 
Property of noxious stimulation, generaliza- 
tion of their results is limited by a subtle con- 
found. In both studies, subjects in the non- 
contingent group were on random 50% 
reinforcement schedules, However, subjects in 
the contingent groups received substantially 
greater percentages of reinforcement. (The 
exact percentage for any subject depended on 
the number of trials required to learn the dis- 
crimination.) Thus, in manipulating whether 
the reinforcement is administered contingently 
or noncontingently, these studies have also 
varied the subject’s actual success or failure. 
Recent work on learned helplessness induced by 
noxious contingencies has avoided this problem 
by yoking contingent and nhoncontingent sub- 
jects, with each member of a yoked pair re- 
ceiving the identical number of reinforcements 

(cf, Hiroto & Seligman, 1975). In line with this, 
the Present study manipulated contingency of 
nonnoxious reinforcement without varying the 
percentage of reinforcements received by the 


S. COHEN, M. ROTHBART, AND S. PHILLIPS 


contingent and noncontingent grou; 
procedure similar to the one desc: 
is used to accomplish this purpose. 
The present study, then, is an 
induce learned helplessness by pretri 


jects with noncontingent nonnoxious orce= 
ment by means of a yoking procedure does 
not confound contingency with perc: re of 
reinforcement. To determine the de. e to 
which a helplessness pretreatment on n in- 
strumental task will generalize to other asks, 
two of Glass and Singer’s (1972) afte: fects 
tasks, a problem-solving task, and a ta not 
requiring a problem-solving strategy wer. ised 
during the test phase. Finally, subjects’ > ores 
on a locus of control scale were used to o ter- 
mine the effects of this individual differen: on 


susceptibility to the learned helpless ess 
pretreatment. 

METHOD 
Subjects 


Subjects were 22 female and 20 male undergrad: «te 
students who received extra credit in an undergradu te 
psychology course for participating in the study. 


Locus of Control 


An abbreviated version of Rotter's (1966) Intern: l- 
External Locus of Control (I-E) Scale was administer-d 
to a large class of undergraduate psychology students. 
Political and ideological items were eliminated from 
Rotter’s original scale on the basis of a factor analysis 
performed by Mirels (1970). Thirteen items with the | 
highest loadings for a factor designated as control over | 
personal events were chosen for the modified scale along $ 
with six buffer items. Test administration was con- f 
ducted by experimenters other than those associated 
with the experimental task and in a context unrelate | 
to the experiment, The locus of control tests were scored 
in the external direction, resulting in a bell-shaped 
distribution ranging from 0 to 13. A median split 
excluding subjects who fell within 1 point of the medion 
(excluding less than 10% of the sample) produced two 
experimental groups, Subjects falling below the median 


were labeled infernals and those above the median > 
were labeled externals. 


Pretreatment: C ontingent Versus 
Noncontingent Reinforcement 


A computerized concept-formation task served as (he 
vehicle for the experimental manipulations, The con- 
cept-formation task was based on the spati 
ships between pairs of designs. Subjects wer 
with a repeating series of 32 
stimulus materials deve 
described by Roth and 


al relation- 
e presented 
different slides based on 
loped by Postman (1954) and 
Bootzin (1974), Each slide con- 


et 
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tained stimulus figure and two possible response 
figures of the response figures was related to the 
stimul gure in a manner that conformed to a specific 


prin A line from the left design in the figure was 
tra ! to the right design in the figure, 

vutset of the experiment, each subject was 

t perimenter A that the task was a test of 

ning ability. The format of the slides was 

descr > the subject, and an explanation of the 

ce tion task was given, In brief, each sub- 

iat one set of figures was arranged ac- 

ticular rule or principle. His task was 

t of figures that he believed conformed 

te iate principle. He was told that he would 

re ck after each trial informing him of 

w! \oice was correct or incorrect. The same 

p :d to all of the slides presented, and his 

discover the appropriate rule in as few 

The experimenter then showed the 

isk with a similar format and demon- 

: of systematically testing hypotheses. 

al task itself was executed in the 

ccond experimenter (B) as soon as the 

nter ascertained that the subject clearly 

nature of a concept-formation problem. 

lides were projected on a 12-inch 

y screen directly in front of the seated 

lide was exposed for 4 sec, after which a 

ond” message appeared on an oscilloscope 

loc cctly above the projection screen, The sub- 

J a ‘ied to take as Jong as he wished to 

cl ither the right or left set of figures by pushing 

í wo corresponding response keys placed on the 

t ont of him. After the subject pressed one of 

, a “right” or “wrong” message appeared on 

t ope. Immediately after the feedback message, the 

rd ready appeared. When the subject was ready to 

roceed, he pressed either of the response keys, which 

i tation of the next slide. The subject 

c as much time as he wished before pro- 

the next slide. He was told to continue 

s until he was notified to stop, even if he 
i he had discovered the correct principle. 

r each 20 trials, the experimenter showed the 

t (in all conditions) one of eight hints. The hints 

were yraduated so that the first one simply reiterated 

the instruction and the final ones described the solution 

to the problem, The hints were placed beside the sub- 

ject, who was free to refer back to them whenever he 

wished. The hints were used to ensure that all subjects 


sub 


receiving contingent reinforcement would learn the 
G within the 200 trials that were allowed. 
Postcxperimental interviews suggest that subjects who 
received noncontingent reinforcement (and thus non- 
informative hints) were not suspicious about the hints 


but that they assumed instead that the problem was a 
difficult one. 

Depending upon the subject's assignment to one of 
the experimental conditions, his performance on the 
concept-formation task was reinforced contingently or 
Noncontingently. An equal number of internals and 
externals were (otherwise randomly) assigned to both 
the contingent and noncontingent feedback conditions. 
Experimenter A, who explained the task to the subject 
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and later administered the aftereffects tasks, was un- 
aware of the subject’s assignment to the experimental 
conditions or whether the subject was in the group of 
internals or externals, 

In the contingent feedback condition, the subject was 
told he was right whenever he chose the set of figures in 
which a line had moved from the left design to the right 
one. The alternative choice resulted in a wrong message, 
The task was terminated when the subject reached a 
criterion of 10 consecutive correct choices, For each 
subject in the contingent feedback condition, data on 
the number of correct responses and the number of 
trials to criterion were saved in a computer file. Each 
subject in the noncontingent feedback condition was 
yoked to a subject in the contingent group. The yoked 
subject received the same number of correct responses 
and total trials as the contingent subject. For the non- 
contingent subject, however, the reinforced responses 
were distributed randomly over the total number of 
trials. 


Test Phase Task Assessment—Aflereffects Tasks 


At the termination of the concept-formation task, 
the subject was taken to another room where two test 
tasks (Glass & Singer, 1972, aftereffects tasks) were 
administered by Experimenter A, In the first test task, 
an adaptation of the Stroop Color—-Word test (Jensen & 
Rohwer, 1966; Stroop, 1935), the subject was required 
to read (or name) several lists of stimulus material as 
quickly and as accurately as possible. Subjects were 
timed separately on each of three separate pages of 
material, Each page was printed in paragraph form and 
contained each of four stimuli appearing in random 
order approximately 42 times each. The first sheet con- 
tained the color names red, orange, green, and blue, The 
second sheet contained color patches of the same four 
colors. Finally, the third sheet contained the Stroop 
task, in w the name of each of the four colors is 
printed in one of the other three colors, and the subject 
is required to name the colors that the words are printed 
in rather than reading the words themselves, The 
experimenter recorded the number of errors the subject: 
made on each of the three readings in addition to the 
time required for each, The Stroop task was chosen as 
an example of a task not requiring a problem-solving 
strategy. 

The second test task was a measure of tolerance for 
frustration adapted from Feather (1961). Four sets of 
line diagrams were placed in front of the subject. The 
diagrams were printed on 5 X 7 inch (12.7 X 17.8 cm) 
cards, and each of the four piles contained multiple 
copies of the same diagram or puzzle. The subject’s task 
was to trace over all of the lines of the diagram without 
tracing any line twice and without lifting the pencil from 
the figure. The piles were placed in a specific order such 
that the subject would work first on an insoluble puzzle, 
second on a soluble one, third on another insoluble 
puzzle, and finally on another soluble one. A subject 
could take as many trials on a given diagram as he 
wished, but he was limited to a maximum of 40 sec:per, 
card. The subject could choose to continue working on 
the same diagram or move on to the next pile at any 
time, but he could hot return to a previous pile after 
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proceeding to another. The cards were placed face down 
until the subject began working on the pile. If the sub- 
ject successfully completed the task for one diagram, he 
was to proceed to the next pile immediately, The num- 
ber of cards attempted on each of the four diagrams and 
the amount of time spent on each one was recorded. The 
more time (also, the more cards) a subject spent on the 
insoluble puzzles (Cards 1 and 3), the greater his 
tolerance for frustration, This task is considered a 
problem-solving task, 

Finally, each subject was asked to complete a short 
questionnaire designed to assess his reactions to the 
concept-formation task. On 7-point scales, subjects 
indicated how difficult, how stressful, and how frustrat- 
ing they felt the problem had been. Subjects also used 
7-point scales to evaluate their effort on the task, their 
degree of success on the task, and the degree to which 
they felt success on the task was under their own con- 
trol. Subjects were debriefed following completion of the 
questionnaire, 


RESULTS 
Performance on Concept-Formation Task 


Since the procedure required that one half 
the subjects in the noncontingent condition be 
yoked to internal-contingent subjects and the 
other half to external-contingent subjects, it 
was necessary to determine whether the inter- 
nal- and external-contingent subjects per- 
formed differently on the concept-formation 
task. A comparison of internal and external 
subjects in the contingent condition revealed 
no differences in number of trials to reach 
criterion (for internals, M = 91.80, for ex- 
temals, M = 88.36, £ = .15, ns) and no differ- 
ences in the proportion of successful trials (for 
internals, M = .57, for externals, M = .58, 
t= .26, ns). Thus, subjects in the noncon- 
tingent condition who were yoked with in- 
ternals received virtually the same number of 
trials and proportion of successes as did sub- 
jects who had been yoked with externals. 


Manipulation Check 


To determine whether the experimental 
manipulation achieved the desired effect, an 
analysis of covariance was carried out on 
measures of perceived control, perceived suc- 
cess, perceived difficulty, st ressfulness, frustra- 
tion, and effort. 
` An analysis of ratings of perceived control 
indicated that subjects in the random condi- 
tion felt significantly less control than subjects 
in the contingent condition, for contingency, 
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F(1, 41) = 9.96, p < .005; ther 
effects due to locus of control or t 
action between contingency ani 
control. 

Since one purpose of the yoking 
was to control for the proportion of 
and to vary only the contingency of 
forcement, it was important to d 
whether perceived success varied bet 


mine 
n the 


two conditions. Analyses of covariance Í per- 
ceived success ratings revealed no sig) icant 
main effects or interactions. 

An analysis of perceived difficulty «tings 
indicated only that subjects in the no icons] 
tingent condition perceived the task as nore 
difficult than did subjects in the conti sent 


condition (p < .02), Despite differenc. in 
perceived difficulty, subjects did not r vort 
expending significantly more effort in the on- 
contingent than contingent condition. No. did 
subjects in the noncontingent condition ite 
the stressfulness or frustration of the tas! as 
greater than did contingent subjects, 

Thus, the checks on the manipulation in- 
dicate the experimental manipulation succ ss- | 
fully influenced perceived control without n- 
fluencing feelings of perceived success. f 


The Stroop Test 


Data on error rates from the Stroop test 
showed no significant differences, so only data | 
on the speed of completion are presented here. _ 
Three different measures were obtained for the © 
Stroop test: (a) number of seconds to read » 
words alone; (b) number of seconds to name 
color patches; and (c) number of seconds io 
name the color in which a word is printed 
(where the color in which the word is printed 
is different from the color described lexically by 
the word). ý 

There were no significant differences among h 
groups in the speed with which they read the 
words alone. However, significant differences 4 


Tn all analyses of variance, 
Proportion of successful trials wi 
to separate variance due to diff 
pairs from variance due to experimental treatments. 
Although this procedure accounts for some residual 
variance, there were no differences between the con- 
tingent and noncontingent groups (who were yoked) or 
the internal and external groups on either of the 
covariates, 


number of trials and 
ere used as covariates 
erences between yoked 
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URE 1 Performance on color naming and Stroop as a function of contingent versus 
noncontingent feedback on a prior concept-formation task. 


€ h reading the color patches and 
il task itself. It is apparent from 
I performance on the color patches 
p task is comparable, and in both 
i vas a significant interaction be- 
t ingency and locus of control. The 


pe ince of internals and externals was 
simila: ontingent conditions, but the 
pe { the externals in the noncon- 


ti litions was significantly impaired. A 
} ct for contingency seems to be due to 
i rong cilect of noncontingency on the 
eternals (see Table 1). 


TABLE 1 


YSIS OF COVARIANCE FOR COLORS 
AND Stroop TASK 


n Source SS df MS F 
Colors 
10,866.95 41 265.05 
gency (A) 1,040.17 1  1,040,17 4.81% 
al-External (B) 235.61 1 235.61 1.09 
A 1,278.16 1 1,278.16 5.91** 
No, trials 241.99 1 241.99 1.12 
Proportion correct 583.16 1 583.16 2.70 
Residual 7,786.86 36 216.30 
Stroop 
Total 62,194.35 41 1,516.94 
A 6,324.25 1 6,324.25 5.10% 
B 2,706.72 «1 i 2.18 
AXB 7,293.35 1 5.88* 
No. trials 1,019.87 1 +82 
Proportion correct 1,640.44 1 1.32 
Residual 44,636.13 36 , 1,239.89 
* p <05. 
™ P< .02. 


Frustration-Tolerance Task 


Insoluble cards. Data from the insoluble cards 
(1 and 3) were combined and are presented in 
Figure 2. As Figure 2 indicates, there is a strong 
main effect due to the experimental manipula- 
tions, with subjects in the contingent condition 
persisting at the insoluble task longer than 
subjects in the noncontingent condition, for 
the contingency main effect, F(1, 41) = 6.85, 
$ < .02; see Table 2. There were no inter- 
actions or effects due to locus of control. 

Soluble cards. Although it was not our in- 
tention to analyze the data from the soluble 
cards, some interesting differences emerged for 
these cards as well. Figure 2 indicates that for 
the soluble card (2), subjects in the noncon- 
tingent condition took longer to solve the 
problem than subjects in the contingent con- 
dition? Also there is the hint of a main effect — 
for locus of control, with externals spending the 
greatest amount of time, for the interaction, 
F(1, 41) = 3.79, p < .06. x 


DISCUSSION 


Both internals and externals showed the 
expected helplessness effects on the puzzle- 
solving tasks. These results are consistent with 
those of previous work in the area (8 
Hiroto & Seligman, 1975; Roth & Kubal, 1975) 


2 Data from the other soluble card (4) are not in- 
cluded here because the amount of time spent on the 
last card is not independent of the time spent on the 
previous cards, However, the data are consistent with 
those presented in Figure 2. 
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Figure 2. Persistence at soluble and insoluble tasks as a function of contingent versus 
noncontingent feedback on a prior concept-formation task. 


and suggest that pretreating subjects with non- 
contingent nonnoxious reinforcement can in- 
duce subsequent deficits on problem-solving 
tasks. The strength of the helplessness pre- 
treatment is suggested by its impact on soluble 
as well as insoluble puzzles, Moreover, even 
though the Stroop task was administered first, 
and thus intervened between the pretreatment 
and the puzzle task, both internals and ex- 
ternals showed deficits in puzzle performance. 

Unlike the puzzles, the effects of the helpless- 
ness pretreatment on the Stroop—a non- 
problem-solving task—are mediated by indi- 


TABLE 2 


ANALYSIS OF COVARIANCE FOR INSOLUBLE 
AND SOLUBLE TASKS 


Source 


SS dg Ms P 
Insoluble task 
Total 2.146.629,00 41 52,356.81 
Contingency (A) 335,308.00 1 335,308.00 6,85% 
Internal-External (B) 285.35 1 285,35 .00 
AXB 26,368.09 1 : 54 
No. trials 13,933.95 1 128 
Proportion correct 228.83 1 -01 
Residual 1,763,355.00 36 48,982.08 
Soluble task 
Total 29,890.99 41 729.05 
A 5,366.27 1 5,366.27 9,18%%+ 
B 2216.77 4 2,216.77 3.79% 
AXB 285.34 1 285.34 49 
No. trials 777.71 1 777.71 1.33 
Proportion correct 10.74 1 10.74 02 
Residual 21,055.25 36 584.87 
$p <.06, 
** p <.02 
> <0 
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vidual differences in locus of control, Al 
both internals and externals show help! 
effects on the puzzles, only external 
deficits on the Stroop task. 


noncontingent pretreatment and their gen- 
eralized expectancy, helplessness for externals 
generalizes to a wider range of tasks. Thus 
extemals may be more susceptible to a non- 
contingent event in the sense that expectancies 
created by that event will be more general. In 
effect, the event may reinforce externals’ gen- 
eralized expectancy while creating only a task- 
Specific (limited to problem-solving tasks) 
expectancy for internals, 

Although it is striking that externals pre- 
treated with noncontingent teinforcement per- 
formed more poorly on the Stroop (a non- 
problem-solving task that nevertheless is quite 
difficult), it is even more Surprising that the 
experimental treatment affected the speed with 
which subjects could name the colored patches. 
Such an effect on such a low-effort task indi- 


8 the generality of the learned helplessness 
pretreatment for external subjects, It is puz- 
zling, however, that similar effects were not 
found for the reading of the names of the colors, 
This may indicate that there is a threshold of 
difficulty, or effort, below which the effect does 
not occur, Data indicating that naming color 
patches requires considerably more effort than 
reading color names (Fraisse, 1969) is con- 
sistent with this interpretation. 

In a recent article, Hiroto (1974) reported 
that externals performed more poorly 
problem-solving tasks than internals, regard- 
less of whether their pretreatments consist of 
contingent or noncontingent reinforcement. He 
found no interaction between the contingency 
pretreatment and I-E scores. This suggests 
that externals entered his xperiment in a 
helpless state (at least relative to internals) 
and that the contingency treatment did not 
further affect this state, This interpretation is 
further supported by the data Hiroto collected 
“3 on the pretreatment tasks, which indicated that 
Ppa externals were slower to escape and avoid than 
g J internals, externals required more trials to 
į Teach avoidance criterion, and externals made 
v fewer avoidance responses during pretreat- 

ment. The results of the present experiment do 
not support this interpretation. Intemals and 
externals in the contingent reinforcement group 
required the same number of trials to criterion 
on the pretreatment task, and there were no 
main effects for locus of control on any of the 
test phase tasks. 

The discrepancy between the results of the 
present study and Hiroto’s study (Hiroto, 
1974) may be attributable to the different 
means of assigning subjects to the internal 
and external conditions, In the present experi- 
ment only those subjects falling within one 
point of the median I-E score (less than 10% 
of the sample) were excluded. Hiroto, however, 
only used subjects scoring at least one stan- 
dard deviation above or below the median. The 
selection of only those with extreme scores 
might explain the chronic helplessness effect 
that he found. It is also important to note that 
the pretreatment task in Hiroto’s study was an 
| aversive escape-avoidance task as compared 

With nonaversive contingencies involved in the 

present experiment. This suggests a second 

possible explanation for the discrepant results, 
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that externals faced with aversive stimuli give 
up immediately, whether the aversive con- 
tingencies are controllable or not, but do not 
give up immediately when the contingencies 
are nonaversive, 

Finally, equating contingent and noncon- 
tingent groups on number of trials and number 
of reinforcements allows us to determine 
whether perceived failure is a necessary cause 
of learned helplessness. Data from the Roth 
and Kubal (1975) study (in which contingency | 
and amount of reinforcement were confounded) 
indicates that contingent subjects perceived 
that they were more successful than their non- 
contingent counterparts. However, in the 
present study, responses on the postexperi- 
mental questionnaire indicate that the experi- 
mental manipulation influenced perceived-éon- 
trol without influencing feelings of perceived 
success: Noncontingent subjects reported that 
they were just as successful as contingeS 
subjects. These results suggest that the helh ; 
lessness effect is a result of the subject learning 
that he cannot control his reinforcements, no 
an affective response to failure. A study i 
which contingency and success are manipulated 
independently could provide further clarifig 
tion of this point. 
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Erratum to Keating and Brock 


In the article, “The Effects of Prior Reward and Punishment. on 
Subsequent Reward and Punishment: Guilt Versus Consistency” by 
John P. Keating and Timothy C. Brock (Journal of Personality and 
Social Psychology, 1976, Vol. 34, No. 3, pp. 327-333), two errors 
appeared in the left-hand column of page 331. The equation in line 


24 reads: 


It should be changed to read: 


í LL It should be changed to read: 


F(1,44) = 2.04, p < .05. 


F (1,44) = 2.04, 
The equation in lines 31 and 32 reads: 


F(1,44) = .18, p = .02, 


F F(1,44) = .18, 
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7 According to White (1975), all election 
landslides have had an intoxicating effect on 
\, presidents, Immediately following an election 
f landslide, the president is likely to perceive 
pe himself as an instrument of destiny. This 
‘gear delusion of omnipotence increases the 
“ptobability that the president will attempt 
Y to carry out even his more extreme policies. 
For example, Roosevelt in 1937 (victory by 
à a 60.8%, 11,000,000-vote landslide) nearly 
jae his second administration by his 
upreme Court packing plan” to ensure 
g the constitutionality of his policies. In 1964, 
I yndon Johnson (61%, 16,000,000-vote land- 
slide) immediately initiated a bread-and- 
butter policy that resulted in the pouring of 
huge sums of monies into social programs as 
¡well as an unauthorized foreign war. Richard 
f Nixon in 1972 (60.7%, 18,000,000-vote 
, lardslide) envisioned total control in his 
“second term and the return to conservative 
jndamentals in domestic affairs, In short, 
presidents who received a mandate from the 
eople immediately began to impose their 
i ideologies on the electorate. We 
Efer to this as the mandate phenomenon. 
We proposé that the mandate phenomenon 
Jis not limited to the political arena. Rather, 
any leader’s behavior is determined in part 
by the support he perceives that he has from 
the group he leads. As the number of persons 
favoring a particular person for leadership in- 
\creases, that leader is less likely to anticipate 
criticism for his actions, is less concerned 
about dissenting others, and may even at- 
tach invulnerability to his decisions, This 


T Requests for reprints should be sent to Russell D. 
Clark III, Department of Psychology, Florida State 
University, Tallahassee, Florida 32306. 
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The Mandate Phenomenon 


Russell D. Clark II and Lee B, Sechrest 
Florida State University 


Groups consisting of six male persons were asked to elect a leader who would 
be responsible for deciding the extent to which he was willing to commit the 
entire group to the possibility of experiencing an unpleasant later event. The 
Jeader received either a unanimous vote of confidence, majority support, or was 
4 randomly selected. It was predicted and confirmed that leaders with unanimous 
support make the riskiest decisions, We refer to this as the mandate phenomenon, 


exceptional confidence is most likely to occur 
when the leader perceives that he has over- 
whelming support from his group. A leader 
who has been elected by a very large majority 
will be more likely to commit the entire group 
to a riskier course of action than a leader who 
has either been elected by a narrow majority 
or one who has been randomly assigned as 
the leader. In the case of the narrow majority, 
the leader has no mandate, and assuming only 
two candidates, there is at least one other 
person who is widely thought to be capable 
of representing the group. In other words, 
the leader must entertain the possibility of 
being criticized by group members, and such 
a risk can be reduced by choosing a less ex- 
treme course of action, Roughly the same 
pressures toward caution should operate for 
arbitrarily appointed leaders, 

There is an extensive literature on variables 
affecting the emergence of a leader (Stogdill, 
1974, pp. 200-271). Styles of the leader, 
status, reinforcement contingencies, rate of 
talking, variations in group tasks, size of 
group, group structure, interpersonal space, 
and position in the communication network 
have been shown to affect leadership emer- 
gence. However, investigators have ignored 
the relationship between a leader’s perceived 
social support and the kinds of decisions that 
he makes. Given the current theoretical and 
empirical emphasis on leader-follower rela- 
tionships and the fact that many leaders are 
elected, this is an incredible omission. 

The choice-shift literature and Janis’s 
(1972) notion of “groupthink” are tangen- 
tially related to the mandate phenomenon. In 
the choice-shift literature, it has been repeat- 
edly found that over a variety of populations, 
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manipulations, and tasks, individuals in 
groups make more extreme judgments or en- 
gage in more extreme behaviors than do in- 
dividuals alone. The shift in decisions usually 
occurs only when there is an existing tendency 
among the individuals to favor a particular 
alternative and when there is heterogeneity 
among the individuals’ initial choices (Cart- 
wright, 1971; Clark, 1971; Moscovici & 
Doise, 1974; Myers & Lamm, 1976; Pruitt, 
1971a, 1971b; Vinokur, 1971). The mandate 
phenomenon differs from the choice-shift ef- 
fect in that the extremity behavior is not de- 
pendent upon the exchange of task informa- 
tion among group members, and the leader 
will commit the group to a more extreme 
course of action even without knowledge of 
group members’ preferences, 

Janis’s (1972) concept of “groupthink” re- 
fers to decision-making processes in which a 
highly cohesive group is so preoccupied with 
maintaining a consensus that all critical 
thought is obliterated. The group discussion 
centers around facts and opinions that sup- 
port their initially preferred policies, and 
counterfacts and new opinions are ignored, 
Under these circumstances, a group is likely 
to make very risky decisions, even those for 
which failure is likely and the potential con- 
Sequences are extremely severe, This process 
may well occur as a result of a mandate, but 
it can occur in its absence, for example, 
Kennedy and the Bay of Pigs incident. 

The purpose of the present experiment was 
to empirically demonstrate that the mandate 
phenomenon can, in fact, occur under con- 
trolled conditions. Male subjects met in 
groups consist*ng of six members and were 


asked to < ` a person who would be re- 
sponsible ‘iding the extent to which he 
was williz ‘mmit the entire group to the 
possibili periencing an unpleasant 
later eve ı brief discussion of an ir- 
relevant t the exchange of personal- 


‘eader was either elected 
(mandate leader), a 

randomly assigned. It 
‘est decisions would 
leaders. 


ity inventor: 
by a unanim 
simple majority, 0 
was predicted that 
be made by the mand 
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METHOD 


— E 


Subjects 


Twenty-three male subjects volunteered to partici- 
pate in an experiment entitled “group decision mak- 
ing.” Six subjects were from the general psychology | 
subject pool, and the remaining subjects were fresh- 
men and sophomores from one of five fraternities, 
The former participated to satisfy a class requi 
ment, and the incentive for the fraternity subje 
was a keg of beer. Each fraternity was told that 
for each member who volunteered for the experi- 
ment, a slip of paper with the name of the subject 
and fraternity would be placed in a hat, and at the 
end of the experiment one slip of paper would be 
drawn and the winning fraternity would receive a 
keg of beer. In actuality, the fraternity received 
cash that could be used to purchase a keg or any | 
other necessity. 

There were seven mandate subjects, eight major- 
ity subjects, and eight randomly assigned subjects, 
Subjects were run in groups of six. The six general! 
psychology subjects were run in the same session, 
and in each of the other sessions only two subjects 
were from the same fraternity. 


Procedure 


Subjects were met by the first author and MA 
very attractive young female assistants. As soon as» 
the six subjects were seated, the following instruc- 
tions were read to them. (With the exception of the 
election procedure, the procedure was adapted from 
Bem, Wallach, & Kogan, 1965.) 1 


With our continued interest in manned space ex- ` 
ploration and with our recent environmental con- 
cerns, scientists have become increasingly concerned 
with the effects of various physiological stimula- 
tions on the ability of persons to perform various, 
tasks. For example, it is well-known that back’ 
ground noise interferes with the ability of an in: 
dividual in solving simple mathematical problenm; 
and that background noise interferes with th 
performance of a crew or group of men who mu 
perform joint tasks, tasks which require them { f 
coordinate their efforts. In our research, we are 
concerned with the effects of physiological stimu- 
lation on group performance. We particularly want 
to know if the typical effects of a decrement in 
group performance due to aversive stimulation oc- 
curs when group members have been allowed to 
elect a person who is responsible for selecting the 
particular stimulation that each member must 


3 ihis: task was selected because it required the 
leaders to commit the group to behaviors with 
potentially aversive consequences. Also, the nature of 
the task coincides with the strong physiological 
orientation of our department, which is reflected in 
the department’s reputation on campus. Thus, sub- 


jects would presumably see nothing unusual about 
the nature of the task. 


} 


A 
i 


undergo and who is responsible for coordinating 

the up activities during the problem-solving 

task, 1n most of the previous studies, groups have 

not kad a choice of the intensity of noise that 

th exposed to, and furthermore, if a person 

onsible for coordinating the activities of 
group members, he was assigned rather 
d by the group, 


ch we are interested in having people 
stimulation for a period of 15 min- 
zaging in a 45-minute group session 
ñ jal and mathematical problems. The 
to elect a leader and select an odor 
We will then decide on a suitable 
t to the university health clinic for 

ion of the experiment, 


ribe the odor stimulations and then 

for selecting the leader. In front 

le 1], you can see nine odor stimu- 

of the odors we are testing are 

‘duce effects of nausea and intense 

| portion of the population. The side 
within an hour and leave no perma- 

‘ou can also note, for each of the 

i ons, the approximate percentages of 
likely to suffer these side effects. 
ote that the payment is larger if 
to choose an odor for which the 
1e side effects is greater. However, 
cts do occur, it is impossible for 
© provide a meaningful contribu- 
up, and since we are primarily in- 
erformance of individuals who do 
1e side effects, those who do will 
inated from the experiment and 
f $1 for their time and incon- 


3 to be t 


a flat fec 


or selecting a leader is as follows: 
n an opportunity to get acquainted 
he role of students in advising uni- 
y s. Then each person will evaluate 
hi C personality traits. You will be pxo- 


V such trait inventories. Fill out each 
on to be retained by you, and the other 
five will be collected and distributed to the other 


“ve Members of the group. 


After comparing each other’s responses fo the in- 
ve elect the person whom you think would 
be the best leader of your group. In doing so we 
would like you to look at each member's overall 
Pattern of responses rather than whether or not 
they were high on all of the traits. For example, 
a person who regards himself as very selfish (a 
trait not included in the inventory) may be a very 
fine leader, Whereas selfishness is not usually a 
valued trait in our society, some selfish persons 
may be highly effective leaders. 


Please note that there are two restrictions to the 
election process. First, unless a person receives a 
Majority of votes, one of you will be randomly 
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TABLE 1 
PERCENTAGE or POPULATION EXPERIENCING ODORS 
AND SCHEDULE OP PAYMENTS 


Schedule of 


Odor % payments ($) 


A 
s 
Š 
| 


“ER he ao ony 
PS 
© 


assigned to select one of the odor stimulations 
which will be experienced by the entire group 
and then to subsequently lead the group on the 
group problem-solving tasks, For an individual 
to become an elected leader, he must receive at 
least a majority of the votes. Second, the condi- 
tions of the experiment require that you do not 


vote for yourself, Last quarter when we allowed oe 


group members to vote for themselves, we rarely 
found that in a six-member group that a single 
individual would receive four votes. But, toward 
the end of the quarter, when we asked individuals 
not to vote for themselves, we frequently found 
that a given individual would not only receive 
three votes (a majority) but sometimes four votes 
or even all five votes, We frankly don't under- 
stand why this occurred, but in order to study the 
effects of odor stimulation on group performance 
with elected leaders we are employing the same 
Procedure. 


After the initial instructions, subjects were taken 
across the hall by the experimenter to a room with 
six armchairs that were arranged in a circle. In front 
of each chair was a letter (A-F). The seat that 
the subject sat in was his “assigned” subject iden- 
tification. After a 15-minute discussion of the role 
that students should play in advising the university 
on university matters, the subjects returned to the 
Previous room and were given six copies of a “per- 
sonality inventory.” The inventory consisted of 5- 
point scales of the following traits: approachability, 
intelligence, helpfulness, considerateness, courteous- 
ness, respectfulness, agreeability, carelessness, depend- 
ability, and reasonableness. Each subject was then 
Placed in one of six small rooms. On the door of 
each room was a letter indicating each subject's 
identification. Subjects filled out the inventories in 
different colored pens to increase the believability 
of the later feedback manipulation, t 

When the subjects had completed the inventories, 
the inventories were collected, and each subject’s 
copies were distributed to the other subjects. Ballot 
sheets were also given to the subjects. After approxi- 
mately 5 minutes, the inventories and ballot sheets 
were collected, Subjects remained in their rooms 


1060 


TABLE 2 


Leavers MEAN Risk Decisions (% oF POPULATION 
EXPERIENCING THE SIDE EFFECTS) AS A 
Function or Group Support 


Experimental condition Risk (%) 
Mandate 63 
Majority y 44 
Randomly assigned 34 


with their doors closed. Within 3 minutes, subjects 
were given envelopes that informed each person that 
he was the leader of the group. In addition, the 
leaders were given instructions reminding them of 
the nature of the task. 

Experimental conditions. Subjects in the mandate 
condition received a message that all of the other 
five group members had elected them as the leader. 
In the majority condition, subjects were informed 
that three group members had chosen them, with 
two members choosing others. Finally, subjects in 
the randomly assigned leader condition were in- 
formed that no group member had received a ma- 
jority of the votes, and they had therefore been 
randomly assigned to make the decision for the en- 
tire group. In each group, one third of the subjects 
were mandate, majority, and randomly assigned 
leaders, respectively. Subjects were randomly as- 
signed beforehand and without the experimenter’s or 
the assistants’ knowledge. 

Dependent measure. The percentage of the popu- 
lation experiencing the side effects constituted the 
dependent measure. During debriefing, the entire ex- 
perimental setup was explained to the subjects, with- 
out ever revealing the hypothesis. Instead, subjects 
were told that we were interested in determining 
whether a leader’s decision varied as a function of 
his perceived support from the group. Subjects were 
informed that group members had been randomly 
assigned to the experimental conditions and that 
the vote that they had received had notheing to do 
with either the group discussion or the trait inven- 
tories. Subjects were asked if they thought that per- 
sons would make different decisions as a function 
of perceived group support. Not a single subject re- 
Ported that a mandate leader would make a riskier 
decision in this experimental situation, Instead, they 
either reported no differences among group support 
conditions or reported that decisions made by man- 
date leaders would be more cautious, 


RESULTS AND Discussion 


Subjects’ willingness to undergo aversive 
consequences by conditions is presented in 
Table 2. A one-way analysis of variance 
showed that there were significant differences 
among the three experimental conditions, F (2 
22) = 6.07, p< .01. As can be seen. from 
Table 2, the results were in the expected or- 
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der. The Newman-Keuls proc: 
5% level of significance) reve: 
mandate subjects made riskier « 
either the majority or randomly 
jects; the latter two did not differ 
from each other. It should be 
that the effect of the experiment: 
tion was quite strong, accounting 
the total variance.* 


The present results have shown hat the 
mandate phenomenon can be produ. | under 
controlled conditions. The mandai leaders’ 
were more likely to commit their ups to 
aversive stimulation than were ma ity or 
randomly assigned leaders. We bel’ e that 
this behavior occurred because man: lead- 
ers, in contrast to majority or rando ly as- 


signed leaders, experienced an inci ise in 
self-esteem, were less concerned about wel- 
fare of other group members, were les likely 
to anticipate group criticism, and if ci ticism 
were to occur, the mandate leader co ld re- 
ply by saying, “Don’t blame me; ya’ll unani- 
mously elected me.” As a result of these 
processes, the mandate leaders attach in- 
vulnerability to their decisions, In shori hav- 
ing received total support from his group, a 
leader will endorse a more extreme course of 
action, even when he does not know the 
others’ preferences and when the consequences 
may be unpleasant for himself and the other 
group members. 

An alternative view is that the mandate 
leader perceives that the group members ex- 
pect him to be more assertive, and in this 
task the only way to fulfill this expectation 
was to choose a more noxious alternative for 
the group. Although this notion may account 
for part, or all, of the mandate phenomenon, 
we prefer the view that the extremity behavior 
of the mandate leader occurs because, with 
overwhelmingly group support, he can make 
decisions and act as he sees fit, without con- 


cerning himself with opposition within the 
group. 


?An anonymous reviewer Suggested that these 


an examination of the data showed 
Psychology subjects, if anything, i 
were more extreme than the fraternity subjects. 


f 
J 


THE MANDATE PHENOMENON 


Generalizations based on these results may 
þe misleading unless group support is viewed 


as relative. In contrast to the present case, 
as the yeoup increases in size, particularly 
when th group is quite large, it becomes 
highly i: bable for a given person to re- 
ceive su! from every group member. How- 
ever, lc in universal support will not 
necessarily s opress the subsequent mandate 


effect because a leader’s perception of a man- 
date depend. apon the differential support for 
indidates under similar circum- 


one or mor 
stances. | sample, White (1975) has re- 
ferred tí 2oosevelt, Johnson, and Nixon 


electior idslide victories, yet approxi- 
mately the voting population chose 
the op; ndidate. But, it is still appro- 
priate t < of Roosevelt, Johnson, and 
Nixon a late leaders since most previous 
winning | ents won by considerably nar- 
rower marg 

Recall th : in the experimental situation 
leaders mad ‘heir decisions without anticipat- 
ing group fe. ‘back and without knowledge of 
their preferer es, Research is currently under- 
way to dete mine the effects of subsequent 
group ce attempts and knowledge of 
the gro references on the decisions of 


leaders \ vary in the magnitude of group 
support the meantime, the results of the 
present : «periment have clearly demonstrated 
that the magnitude of social support alone 
is a potentially powerful predictor of certain 
aspects of a leader’s behavior. 
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Reassessing the Validity of Laboratory—Produced Attitude 


Gary L. Wells 
Ohio State University 


In their 1974 study, Hendrick and Seyfried found that experimental group 
subjects given an attitude change treatment (reading a persuasive essay) were 
more attracted to a stimulus stranger who espoused their posttreatment atti- 
tudes than to a stimulus stranger who espoused their pretreatment attitudes. 
Control subjects, however, remained more attracted to a stimulus stranger 
espousing their pretreatment attitudes. Hendrick and Seyfried concluded that 
this attraction transfer demonstrated the validity of laboratory-produced atti- 
tude change. The present study added a new experimental group, which did not 
receive the posttest, in an attempt to eliminate potential demand and/or eval- 
uation apprehension interpretations, which are applicable to Hendrick and 
Seyfried’s results, Hendrick and Seyfried’s results were replicated with the 
original experimental group but not with the new experimental group. It is 
suggested that Hendrick and Seyfried’s results cannot be interpreted as indi- 


lange 


cative of genuine attitude change. 


Byrne (1971) and his associates have con- 
sistently demonstrated that people are more 
attracted to a stranger whose attitudes are 
similar to their own than to a stranger with 
dissimilar attitudes. On the basis of this find- 
ing, Hendrick and Seyfried (1974) presented 
an ingenious solution to the problem of dem- 
onstrating the validity of laboratory-produced 
attitude change, They reasoned that genuine 
attitude change should result in subjects be- 
ing more attracted to a stimulus stranger who 
espouses their posttest opinions than to a 
stimulus stranger who espouses their pretest 
opinions. 

To test this hypothesis, Hendrick and Sey- 
fried (1974) yoked subjects on the basis of 
their responses to a pretest opinion survey. 
In a 2 X 2 design, one half of the subjects 
were assigned to the experimental group, 
which received a persuasive essay designed to 
affect attitudes on a student opinion survey, 
and each experimental subject’s yoked partner 
was assigned to a control group. Later in the 


I would like to thank Anthony Greenwald, Rodney 
Bassett, John Harvey, Michael Leippe, Robert Mc- 
Peck, Thomas Ostrom, and Richard Petty for com- 
ments on an earlier draft. In addition, Clyde Hen- 
drick deserves special thanks for Promptly providing 
materials and commenting on an earlier draft of this 
paper. 

Requests for reprints should be sent to G: s, 
Department of Social Psychology, 404C Wee wk 
Avenue, Columbus, Ohio 43210, 


quarter, the experimental group receiv d a 
persuasive communication and again lled 
out the student opinion survey (the cc. trol 
subjects did not meet that day). The Ę ext 
day, all subjects rated two stimulus stra ers 
who were individually designed for each ub- 
ject. In the experimental group, the two 
stimulus strangers exactly matched eacl. ex- 
perimental subject’s own pretest (pretest 
stimulus stranger) and posttest (posi (est 
stimulus stranger) responses to the stucient 
opinion survey. Each control subject’s pretest 
and posttest stimulus strangers were exactly 
the same as their individually yoked partner's 
pretest and posttest strangers from the ex- 
perimental group, As predicted, Hendrick and 
Seyfried found that the control subjects were 
more attracted to the pretest stranger than to 
the posttest stranger, whereas the experimen- 
tal subjects were more attracted to the post- 
test stranger than to the pretest stranger. 
They concluded that the essay-induced at- 
titude scale change was genuine, 

There is an important assumption that 
underlies the Hendrick and Seyfried predic- 
tion. This assumption is that the similarity- 
attraction situation employed is itself free of 
demand characteristics (Orne, 1962), evalua- 
tion apprehension (Rosenberg, 1969), and 
other artifacts. Suppose, for example, that the 
posttest attitude scale responses were arti- 
ficial, due perhaps to apprehensive subjects 
having expressed an attitude they believed 
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an al 


ATTITUDE CHANGE VALIDITY 


would make them look good and/or having 
responded to the demand characteristics of 
the situation. When they participated in the 
experimen! the second day, the subjects might 
still have been trying to look good to the ex- 
perime? and the appropriate strategy 
would ! een for the subjects to rate the 
opinion recently expressed to the ex- 
perimen ıs having desirable qualities (in- 
telligen usted, etc.), Thus, for the ex- 
perimen yup subjects, it would have been 
logical t ive rated the posttest stranger 
(who e matched each subject’s own 
most rec -pression of opinion) in a highly 
desirable ier. Perhaps the subjects even 
felt that vere evaluating themselves. 

Tt is no’ cessary to assume that the ex- 
pressions «< nion toward the strangers were 
sincere or ine, Note that the subjects in 
the experim: al group rated a stranger whose 


an exact replica of their own 
This by itself might elicit 
ne and his colleagues have 


responses wi 
recent postte 
suspicion, B 


abandoned t : perfect matching procedure 
(called “iden ty mirror? stimulus pattern) 
in favor of a ‘constant discrepancy” pattern 
primarily | ise of the potential extraneous 
factor of ion (Byrne, 1971, p. 16). It 
should be l that Hendrick and Seyfried 
did chee! uspicion by asking their sub- 
jects to a paragraph giving their own 
conceptio» of what the experiment was about. 
However h questionnaires are of unknown 
validity, Page (1973) has récently found 
that que naires have a false-negative 
problem, is, questionnaires tend to score 


aware subjects as unaware. In addition, ques- 


tions that ask subjects to generate hypotheses 
May falter because the experimenter ignores 
hypotheses that are incorrect even though a 


given subject's hypothesis may have led to 
the same results as the experimenter’s hy- 
pothesis. 

One way to eliminate the possibility that 
subjects in the experimental group are trying 
to make their stated (but not necessarily 
genuine) opinions look good is to eliminate 
the posttest. If it is not genuine attitude 
change, experimental group subjects will not 
Prefer the posttest stranger, since they have 
Not filled out the posttest questionnaire and 
thus cannot make themselves look good with 
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high ratings of the posttest stranger. How- 
ever, if the findings of Hendrick and Sey- 
fried were not due to evaluation apprehen- 
sion, the results should not be changed by the 
elimination of the posttest. The present study 
was designed to test this idea. To do this, it 
was necessary to yoke three subjects and as- 
sign the third yoked partner to an experi- 
mental group that did not receive a posttest. 
The posttest stimulus stranger for each set of 
three subjects was based on the posttest re- 
sponses of the experimental group member 
who did receive the posttest. Note that this 
three-group design allows a direct replication 
of Hendrick and Seyfried’s two-group design 
and yields a more appropriate comparison be- 
tween the control group and the experimen- 
tal-no-posttest group. 

Thus, the design is a 3 X 2 mixed factorial, 
The between-subjects variable was exposure 
to a persuasive message followed by a post- 
test (experimental-posttest group) versus ex- 
posure to an irrelevant message (control 
group) or exposure to the persuasive message 
not followed by a posttest (experimental-no- 
posttest group). The within-subjects variable 
was the pretest and posttest stimulus stran- 
gers’ opinions. 

A preference for the posttest stranger in 
both experimental groups would be relatively 
strong evidence of attraction transfer and 
therefore the validity of attitude change. Tf 
the experimental-posttest subjects preferred 
the posttest stranger and the experimental- 
no-posttest subjects preferred the pretest 
stranger, this would indicate that Hendrick 
and Seyfried’s findings should not neces- 
sarily be taken as evidence for the validity of 
laboratory-produced attitude change. 


METHOD 


Subjects 

The subjects were students in several sections of 
introductory psychology at Ohio State University 
and participated in the study in partial fulfillment of 
the course requirements. In keeping with Hendrick 
and Seyfried’s procedure, those who responded favor- 
ably to at least four of the five student opinion sur- 
vey items and no less wa ean on the fifth 

jects) were selecte 

Oe ae A the quarter, 173 subjects took 
the pretest, but only 135 subjects us „per cell) 
satisfied both the attitude response criterion above 
and the yoking criterion, to be described in the Pro- 
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cedure section. On the first day of the experiment, 
10 more subjects per cell were eliminated because of 
a failure of 1 of the yoked members to attend. The 
total number of subjects was 35 per cell. 


Materials 


The current study employed a modification of the 
stimulus materials and dependent variables materials 
used in the Hendrick and Seyfried study. These in- 
cluded a counterattitudinal essay arguing against 
students voting in local elections, a five-item Likert- 
type scale called student opinion survey, and an in- 
terpersonal rating scale. Modifications were necessary 
in Hendrick and Seyfried’s persuasive message and 
the student opinion survey because the items were 
peculiar to Kent, Ohio. In addition, pilot testing in- 
dicated that the message failed to produce effects of 
the magnitude observed in the Hendrick and Sey- 
fried study. Therefore, fake statistics about student 
lack of knowledge were added, as well as a line at- 
tributing the message to a “professor in political sci- 
ence.” This modification resulted in significant dif- 
ferences between control and experimental groups 
with a 24-hour delayed posttest (X = 22.4 and 16.4, 
respectively), 

The five student opinion survey items were state- 
ments about students voting in local elections (eg., 
all students 18 years of age and older who reside in 
Columbus should be allowed to vote in Columbus 
elections.) The subjects responded on a five-point 
Likert-type scale with endpoints labled strongly agree 
and strongly disagree. This study also employed an 
irrelevant message on day-care centers for the con- 
trol group and a questionnaire on day-care centers 
for the control and experimental-no-posttest groups. 


Procedure 


At the first part of the quarter, all subjects filled 
out the five-item student opinion survey, which was 
embedded in a larger survey. The five items appeared. 
together on page 3 of a five-page survey. Insofar as 
possible, triads of subjects were yoked on the cri- 
terion of identical response patterns on the pretest. 
Perfect matching was possible for 32 of the 35 sub- 
jects per cell. The remaining subjects varied by only 
one response alternative on a single item. One mem- 
ber of each triad was randomly assigned to each of 
the three conditions. 

Later in the quarter, the subjects were called and 
asked to participate in a person Perception study, 
The subjects whose yoked Partner(s) did not attend 
(n=19) the experiment were given the student 
opinion survey again to check on Possible shift in 
attitude from the first part of the quarter. Group 
ord oon 21 to 27 subjects per session, and 
each of the five sessions had i 
each condition. oe ee 


Upon arrival, the control subjects 


duate students and faculty (ir- 


ted to a pro- 
lence. The subjects then me 
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their impressions of and attraction to rofessor 
on a form attached to the end of the e id filled $ 
out the questionnaire on day-care cen The ex. 
perimental-posttest group (designed to r te Hen. 
drick and Seyfried’s findings) read the ec arguing 
forcefully against student voting in lo sections 
(relevant message), which was attributed he same 
political science professor, and subseq ’ rated © 
their impressions of and attraction to t} ofessor, 
After these ratings were completed, th ve-item 
student opinion survey (posttest) was ag admin. 
istered, with the written explanation that »wledge 
of the subject’s own reactions was necessa yecause 
“your attitudes can influence your respon: to the 
person who wrote the essay.” This survey ed ex- 
actly the same format and instructions as i e pre- 
test. The experimental-no-posttest group given ~ 
the same treatment as the experimental g: p just 
described except that the day-care survey (i levant 
questionnaire) was administered with the wi. cn ex- 
planation that “information is also needec on an 
unrelated project.” This irrelevant questionn © was 
included to keep the experimental-no-postt sub- 
jects from finishing earlier than the subject in the 
other two groups. 

The subjects returned the next day and we ~ told, 
“Yesterday you formed an impression of a verson D 


based on an essay he wrote, Today we wish 


ou to 
form an impression of two more people based n the 
attitudes they express.” Booklets were dist buted 
that contained two completed student opinion sure 
veys (counterbalanced for order), one repres-nting 
the pretest stranger and the other the posttest 
stranger. The subjects evaluated the stimulus <tran- 
gers’ intelligence, knowledge of current event, mo- 


rality, and adjustment. Two items measured « trac- 
tion: “How much do you think you would like this 
Person?” and “How much would you enjoy work- 
ing with this person?” Two other items me: sured © 
perceived similarity: “How similar are your attitudes ~ 
to this person’s attitudes?” and “Excluding this 
person's attitudes, how similar do you think you 
would be in other areas, such as interests, values, 
Personality, etc.?” All items were measured on 7- 
Point scales with endpoints appropriately labeled. 
The items measuring evaluation of stimulus str ngers 
were identical to those employed by Hendrick and 
Seyfried. 


After the ratings were completed, the subjects 
were asked to try to “fill out the five-item attitude | 
scale exactly as you recall doing at the beginning of # 
the quarter.” This question was included because 
even though Hendrick and Seyfried did not get a 
measure of recall, they stated that “it seems likely 
that such recall for the experimental group would 
yield a mean closer to posttest than to initial pre- 
test attitudes” (p, 870). In addition, the subjects 
were asked to write a brief Paragraph giving their 


1 Hendrick and Seyfried’s 
meet on the first day and the: 
sage. 


control group did not 
refore received no mes- 


k 
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TABLE 1 
Mean ÅTTITUDE For ALL CONDITIONS 


Experimental with 


Experimental without 


posttest Control posttest 
= tudy Pretest Posttest Pretest Posttest Pretest  Posttest 
W 176) 22:3 16.2 22.4 22. 2 aoe 
H & Seyfried (1974) 21.2 16.3 21.1 rie sie = 


Note before essay and posttest = after essay, 


concep what the experiment was about. They 
were t illy debriefed as to the nature of the 
experim vd were asked to tell no one of the 
experin 2 weeks. 
RESULTS 

Altitud: nge 

The scores on the student opinion 
survey ai ven in Table 1. These scores are 
based on sum of the five items for each 
subject an thus could range from 5 to 25. 
The pretes! -cores for the subjects in all three 
groups we: virtually identical (22.3, 22.4, 
22.4), as si uld be the case due to yoking. 


The possibil y that the subjects in the con- 


trol group w uld shift their attitude responses 
from t} part of the quarter was tested 
by obs: such shift in the subjects who 
filled o e pretest but whose yoked part- 
ner(s) not attend. The data for these 
subjects indicated no opinion shift for the 
group = 19) mean from pretest to post- 
test (X 22.5 for both measures). In addi- 
tion, a 5 (Items) X 19 (Subjects) matrix of 
responses indicated zero shift in 82 of the 95 
cells. Thus, attitude stability for the control 
Subjecis seemed adequate.? The pretest to 
posttest shift for the experimental—posttest 


group indicates that the essay did affect their 
attitude scale responses (X = 22.3 and 16.2, 
respectively, and F(1, 68) = 9.34, p < 01). 

It should be recalled here that this study 
used the student opinion survey score of each 
subject in the experimental-—posttest group as 
the posttest stranger for their yoked partner 
in the experimental—no-posttest group. In 
having done this, this study assumed that the 
Postpersuasion opinion for each experimental- 
No-posttest subject was closely approximated 
by the posttest score of that subject’s yoked 
Partner from the experimental-posttest group. 
An estimate of how well a given subject’s 


actual posttest score can predict that sub- 
ject’s yoked partner’s posttest score was ob- 
tained by an internal analysis of the experi- 
mental-posttest group. Each subject in the 
experimental-posttest group was yoked to an- 
other member of the experimental-posttest 
group on the basis of pretest scores, Of the 
35 subjects, 30 met the yoking criteria de- 
scribed earlier with at least one other subject. 
Of these 30 subjects, 26 (86.6%) had actual 
posttest scores that were closer to their own 
yoked partner’s posttest than to their own 
pretest. Thus, since the experimental-no-post- 


test subjects’ posttest stimulus stranger was — 


based on their yoked partners’ posttest score, 
86% of the experimental—no-posttest subjects 
should have given higher ratings of attraction 
to the posttest stimulus stranger than to the 
pretest stimulus stranger. 


Attraction, Similarity, and Evaluation Scores 


The two attraction items and the two simi- 
larity items were correlated .72 and .67, re- 
spectively. Thus, as in the Hendrick and 
Seyfried study, the two attraction items were 
summed to form a single attraction score and 
the two similarity items were summed to form 
a single similarity score. Table 2 gives all 
six measures for all conditions along with the 
results obtained by Hendrick and Seyfried. 
It should first be noted that the data pat- 
tern of Hendrick and Seyfried replicated well. 
In the control condition, the subjects rated 
the pretest stranger more favorably on intelli- 
gence and attraction and felt the pretest 
stranger to be more similar to _themselves 
than the posttest stranger. Consistent with 
Hendrick and Seyfried, the subjects in the 
experimental condition who had received a 


2 Hendrick and Scyfried’s data indicate no shift in 
101 of 145 cells. 


i, 


£ 
~ 
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TABLE 2 
TTRACTION, SIMILARITY, AND EVALUATION SCORES FOR HENDRICK AND 
“eile SEYFRIED AND CURRENT REPLICATION FOR ALL CONDITIONS R 
Faedra with Coul male thout 
Item Pretest Posttest Pretest Posttest Pretest est 
Attracti 8.2 10.1, 10.5, 8.6, 10.6, 
pare (932) L3) (AL) (89) (o 
tellige: 44a 5.4, be Sa. 5.66 
T pag G8) GA Ga} G S 
Knowledge of current events 4.5a Sh -5b Sab 5.55 
j 49) 520) Go) (E59 G 
Morality 4.29 5.25 4.2, 4.0, 41a 4 
k (44) (5.30) GA) Gos) Go oi 
Adjustment 4.Ta 5:6b Dab Dab Day 
rs 48) Gh) gma" lea) tat 
Similarity 8.14 11.3, 10.6, 81a 10.7, 8 
; (8.34) (12.15) (11.7)  (8.0,) 5) ( 
Note. Pretest = evaluation of pretest stranger and posttest = evaluation of posttest stranger, The values in parenthese re- 


sent the results obtained by Hendrick and Seyfried (1974). Wi 
differed at the .05 level by the Newman-Keuls test. 


posttest rated the posttest stranger higher on 
attraction, similarity, and intelligence, 
Analysis of variance indicated a significant 
Experimental Condition X Stimulus Stranger 
interaction on all six dependent variables: at- 
traction, F(2, 102) = 7.94; intelligence, F(2, 
102) = 7.15; knowledge of current events, 
F(2, 102) = 9.33; mortality, F(2, 102) = 
13.12; adjustment, F(2, 102) = 10,67; simi- 
larity, F(2, 102) = 10.27—with Pp <.05 for 
all measures, In addition, a Newman-Keuls 
test was performed within the 3 (Conditions) 
X 2 (Pretest—Posttest) analyses for each de- 
pendent variable question. These results show 
that the experimental-posttest group’s pat- 
tern of significance generally replicates the 
pattern of Hendrick and Seyfried’s (1974) 
study. Most importantly, however, for the 
experimental group that did not receive a 
posttest, the ratings of attraction, similarity, 
and intelligence are significantly higher for 
the pretest stimulus Stranger than for the 
posttest stimulus stranger. In addition, the 
experimental—no-posttest group did not sig- 
nificantly differ from the control group on 


any of their ratings of the pretest and post- 
test strangers, 


Recalled Pretest Attitudes 


The attitude each subject recalled holding 
on his pretest was computed by summing 
across the five attitude items. Mean recall 


thin each of the 3 X 2 analyses, means with no common su 


scores for the experimental-posttest, con <0 
and experimental-no-posttest groups 
19.2, 23.0, and 21.1, respectively. A 3 (( on- 
ditions) Xx 2 (Pretest, Recall of Pret st) 
analysis of variance revealed a signific int 
interaction, F(2, 102) = 7.23, p<.05. A 
Newman-Keuls comparison of pretest to re- 
call for each group indicated that recall ‘or 
the control and experimental—no-posti st 
groups did not significantly differ from <c- 
tual pretest scores (p= .35). The experi- 
mental-posttest group’s pretest recall, how- 
ever, approached a difference that was nea: ly 
significant from their actual pretest score 


(p = .06). 


Suspicion 


Each subject’s paragraph on what they 
thought the experiment was about was scored 
by three raters (blind to condition) accord- 
ing to whether the subject mentioned that 
the experiment may have been interested in 
attraction (liking, rating, etc.) as a function 
of attitude change. All three raters inde- 
pendently indicated that four subjects gen- 
erated this idea and all four subjects were in 
the experimental-posttest group. In a bi- 
nomial distribution (the subjects could either 
generate the hypothesis or not), the proba- 
bility of observing all four cases in only one 
of the three conditions by chance is small 


($ = .03). 
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DISCUSSION 


The results of this experiment suggest that 
Hendrick and Seyfried’s results are replicable 
but may be due to an artifact. The experi- 
mental subjects who did not get a posttest 
failed to rate the posttest stimulus stranger 
in a desirable manner. In addition, the pilot 
data preclude an argument that the experi- 
mental—posttest subjects were committed as 
a function of filling out the posttest, and the 
experimental—no-posttest subjects were free 
to “slip” back to their initial attitude during 
the 24-hour interval between reading the per- 
suasive message and the attraction measure, 
The mean opinion in this study without delay 
of posttest was almost exactly the mean ob- 
tained in the pilot testing with a 24-hour 
delay (16.2 and t, respectively). Instead, 
it appears that th» elimination of a posttest 
reduced the differential evaluation apprehen- 
sion between exp::imental and control condi- 
tions. That is, the experimental subjects did 
not have a postte: response that they could 
have tried to mat- look good by giving high 
ratings. The fact '>at the experimental-post- 
test subjects were ‘he only ones to generate 
a hypothesis containing comments regarding 
both attitude change and attraction also sug- 
gests that differential cues were available for 
conditions. As noted earlier, however, there 
is generally a false-negative bias in suspicion 
measures, and there were no doubt more sus- 
Picious subjects. This index simply reflects 
the differential suspicion between conditions. 

The recall data are also not supportive of 
genuine attitude change. Hendrick and Sey- 
fried noted that the experimental subjects, 
after attitude change, just about “disowned” 
a stranger represented by their pretest atti- 


1067 


tude and that “this much of a shift would 
not be expected unless subjects had largely 
forgotten their original prepersuasion atti- 
tudes” (1974, p. 870). However, in this study, 
the experimental—no-posttest subjects’ recall 
did not significantly differ from their actual 
pretest opinions. For the experimental—post- 
test subjects, the recall of pretest attitude 
was significantly different from their actual 
pretest attitude; however, their recall was 
equally deviant from their actual posttest at- 
titude, and it is possible that the experimen- 
tal-posttest subjects, having highly evaluated 
the posttest stranger, distorted their recall 
intentionally in favor of the posttest stranger. 

These results may appear disappointing to 
attitude change researchers. However, the 
conclusion is not that laboratory-produced 
attitude change is invalid. The conclusion is 
that the important study conducted by Hen- 
drick and Seyfried provides equivocal evidence 
for the validity of laboratory-produced atti- 
tude change. 
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Wells claims that the results of the 1974 Hendrick and Seyfried study are due 

to demand characteristics in the sense that the subjects in the latter study 

might have shown pseudo-attitude-change and subsequently given bogus at- 

traction responses consistent with the pseudo-attitude-change. Wells bases his 

argument on the experimental findings that the subjects who did not receive 

an attitude posttest were more similar to control subjects in attraction re- i 
sponses than to subjects who did receive an attitude posttest. In the present E 
experiment, triplets of subjects who were yoked on an attitude pretest were 

exposed to a persuasive message. Two subjects in each triplet completed a 

relevant attitude posttest, and the third an irrelevant posttest. The next day 

all subjects evaluated three stimulus strangers on attraction; within each triplet 

there was a pretest stranger and two strangers identical to the two attitude 

patterns obtained from the two relevant posttest subjects. The results support 

the hypothesis that variability in the patterning of attitude posttest responses 

accounted for Wells’ attraction data and the discrepancy between his data and 

the Hendrick and Seyfried study. 


Wells (1976) proposes that Hendrick and perimental-posttest condition, Wells conc udes 
Seyfried’s (1974) results might be due to de- that Hendrick and Seyfried’s original r: sults 
mand characteristics, evaluation apprehension, were due to evaluation apprehension ai |/or 
and other artifacts. To make this argument demand characteristics, 

Wells has to assume that The many problems with the demand oiar- 
the posttest attitude scale responses were artificial, due AERE ennen Mro notedinthg discusion: 
perhaps to apprehensive subjects having expressed an One basic empirical fact is that Wells obta ned 
paride they pa o ake ges look koo ita different results for attraction ratings from the 
or having responde o e demand characteri: j y i 5 ag) 

y the dituation, When they Participated in pte mee hi opea Sa attitude et e . 
the second day, the subjects might still have been trying "1 irom the subjects who did not omple 
to look good to the experimenter, and the appropriate such a posttest. Those empirical findings need 
strategy would have been for the subjects to rate the to be accounted for in a satisfactory way. 
a foveal tr Peano ya eet net thrust of the present paper is to prov ide 

an account. We focus on an experimental ` 

Wells replicated Hendrick and Seyfried’s analysis of Wells’ experiment and the data he 
(1974) experiment, using their control and ex- obtained. One reasonable explanation sug- 
perimental-with-attitude-posttest conditions gested itself, which could be experimentally — 
and an additional experimental condition in tested and which, if it worked, would satisfac- | 
which subjects did not complete an attitude torily account for the presumed discrepancy 
posttest. Because attraction to the stimulus between Hendrick and Seyfried’s (1974) and 
strangers was comparable for the control and Wells’ experiments. To develop our argument, 
experimental-no-posttest conditions and both itis necessary to explicate a crucial assumption 
of these conditions were different from the ex- of Wells’ experiment, note a potential fallacy 

We are greatly indebted to Dawn Davis i = ioe. aye et basis, 
Petros for their help in data callection na can oe in thi eee aos 
Thanks are also due Gary L. Wells for his detailed» this paper, 
responses to questions about his study, A crucial experimental assumption that 
ep ke uae te eee be sent to Clyde Hend- Wells had to make was that “the postpersua- 
Kent, Ohio 4424p. noes Kent State University, Sion opinion for each experimental =no-post test 

subject was closely approximated by the post- h 
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test score of that subject’s yoked partner from 
the experimental-posttest group” (p. 1065). 
An intern analysis was presented in which the 
experimental-posttest subjects were yoked on 
the ba their pretest attitudes, so that their 
postt titude scores could be compared. 
Wells that of 30 such experimental-post- 
test si who could be yoked, 26 (86.6%) 
had a posttest scores closer to their 
yoked ‘TS posttest than to their own 
pretest `. Wells then concluded that 86% 
of the perimental-no-posttest subjects 
should r the posttest stimulus stranger 


more favor vly on attraction than the pretest 
stimulus st :nger, if Hendrick and Seyfried’s 
(1974) resu: s were not artifactual, 

There is potential fallacy in Wells’ argu- 
ment. The | \sic problem is that Wells’ argu- 
ment (and : -thodology) did not take into ac- 
count the pu -rning of altitude change over indi- 


vidual attit le items. Overall attitude scores 
are usually sed on the sum of the scores of 
Several indi dual items, a procedure followed 
by Wells as vell as by Hendrick and Seyfried. 
Two subject may have an identical pretest 
attitude palin, the same summed posttest 
summary : after exposure to a persuasive 
essay, | pattern of the individual item 
posttest nses may be quite different. In 
fact, th lute attitude discrepancy scores 
(definec he absolute difference in subjects’ 
posttest titude response per item, summed 
across į ) may be quite large. For example, 
assume subjects, A and B, each with pre- 
test ratings of 6 on each of two attitude items 
(assume e values can range from 1 to 7). 
After reading an essay, Subject A gave a post- 
test rati f 1 on Item 1 and a rating of 6 on 


Item 2. ct B gave a rating of 6 on Item 1 
and a rating of 1 on Item 2. Each subject would 
be scored as having a final and identical post- 
test score of 7, a considerable change from the 
Pretest score of 12. However, the pattern of the 
two posttest scores is very different, and in 
terms of absolute discrepancy scores, the two 
Subjects are a total of 10 scale units discrepant 
from each other. Such discrepancy scores would 
have 4 very potent effect on attraction responses 
if the two posttest patterns were attributed to 
two stimulus Strangers. Byrne (1971) has shown 
that Subjects are sensitive to even very small 
discrepancies in attitude scores in making at- 
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traction responses: “It would seem that indi- 
viduals are responding to extremely small de- 
grees of discrepancy between themselves and 
the strangers as well as to the more gross vari- 
able of agreement versus disagreement” (p. 
80). 

Our present argument is that Wells was mis- 
led in the interpretation of his data because he 
failed to take into account the discrepancy 
patterns that are possible for a given overall 
amount of attitude change and, further, that 
subjects’ attraction Tesponses are exquisitely 
sensitive to such discrepancy scores, 

The logic of this argument Suggests a fairly 
direct experimental test. The basic rationale 
for the experiment is as follows, Suppose that 
three subjects, A, B, and C, are yoked in terms 
of an identical pattern of pretest attitude re- 
sponses. The three subjects read a persuasive 
essay, but only A and B complete a relevant 
attitude posttest (C completes an irrelevant 
posttest). The following day Subjects A, B, 
and C each evaluate and rate their attraction 
to three stimulus strangers based on the 
strangers’ attitudes on the issue, One stranger 
is the three subjects’ identical pretest attitude 
pattern, a second stranger is Subject A’s post- 
test attitude pattern, and the third stranger is 
Subject B’s posttest attitude pattern. 

The general prediction is that a subject will 
be attracted to a stranger as a direct function 
of how similar the stranger's attitude pattern 
is to the subject's actual attitude pattern. It is 
assumed that attitude change is genuine, not 
artificial. The problem for prediction is to 
determine how each subject would be expected 
to rate attraction to each of the three strangers, 
on the assumption that attitude change scores 
represent true change. 

Some definitions concerning attitude change 
scores will be useful. Let Pre designate the three 
subjects’ pretest score on a given item, Ap Sub- 
ject A’s posttest score on that item, and Bp 
Subject B’s posttest score on the same item. 
An absolute discrepancy score is the absolute 
difference between two different responses to 
the same item. A total absolute discrepancy 
score, D, is defined as the sum of the absolute 
discrepancy scores over all items on the attitude 
scale. Three such total discrepancy sum scores 
are presently of interest: Dı = È |Ap — Bpl ; 
D:=5 |A, — Pre]; Ds = È |B, — Pre]. 


a 
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Within any triplet of subjects, the designa- 


tion of A and B is essentially arbitrary (and 


can be constrained so that D: = D;). With 
these definitions and some additional auxiliary 
assumptions five derivations can be made. 


Derivation 1. It is assumed that a persuasive 
communication results in some change, and 
thus by definition Da = D; > 0. It would also 
be expected that Dı > 0, on the average, be- 
cause it is improbable that over pairs of sub- 
jects, A and B would change their responses on 
all attitude items in an identical manner. Fur- 
thermore, if it is assumed that subjects do not 
show consistency among themselves in their 
item-to-item responses after reading a per- 
suasive essay, then the magnitude of Dı may 
often be comparable to the magnitude of D, 
and D; (i.e, Di = Dz = Ds). This equality is 
most likely to hold true within a range of 
moderate attitude change per item.! 

Derivation 2. The prediction that Di = Dy 
= Ds yields two immediate implications. 
First, Subject A’s perceived similarity and at- 
traction to the B posttest stranger should be 
about the same as Subject A’s similarity and 
attraction to the pretest stranger. Second, 
Subject B’s perceived similarity and attraction 
to the A posttest stranger should be about the 
same as Subject B’s similarity and attraction 
to the pretest stranger. 

These implications follow because if D, 
= Dz = Ds, on the average, Subjects A and B 
would be just as different from each other after 
persuasion as each is different from their pre- 
test attitude pattern. And these results would 
be expected even though Subjects A and B 
might have identical posttest attitude scores 
calculated as the simple sum of the individual- 
item-rating responses. Thus, the equality of 
two final posttest attitude scores may mask a 
wide discrepancy in the patterning of change 
responses to individual items. It is to such pat- 
terns of discrepancies that subjects can and do 
respond with great sensitivity. 

Derivation 3. Because Subject A’s final atti- 
tude pattern is identical to the A posttest 
Stranger’s pattern, it follows that Subject A 
should be significantly more attracted to the 
A posttest stranger than to either of the other 
two strangers. 

Seep EN same reasoning applies to 
2 Should be more attracted to 
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the B posttest stranger than to cit of the 
other two strangers. 

Derivation 5. The pattern of Subje C’s at- 
titude posttest scores is unknown | «use C 
does not complete a posttest, and alt ch the 
above arguments cannot be made C di- 
rectly, some fairly direct deduction. an be 
made. It is assumed that genuine titude 
change occurs whether a subject com; tes an 
attitude posttest and that the proces such 
change is in all respects the same as sub- 
jects who do complete a posttest. Ii then 
following Derivation 1, Subject C w id be 
expected to have the same lack of con tency 
with respect to Subjects A or B in item.’ -item 
responses after reading a persuasive ess. ~ that 
Subjects A and B manifest with respect cach 
other. It follows that Subject C’s (un! own) 
posttest attitude pattem would be bout 


equally discrepant from each of A’s a: | B’s 


posttest attitude patterns and from the |) ctest 
attitude pattern. 
Formally, the following hypothetical, ‘otal 


discrepancy sum scores may be defined: 


Di=¥ |C,—A,|;D5= > |C,— 1 ; 
De = X |Cp— Prej, 


where Cp is Subject C’s (unknown) positest 
attitude score on a given attitude item and 
the other terms are as previously defined. The 
discussion of the last paragraph above implies 
that Di = D; = Ds, with Dy > 0. There is no 
reason to expect Dg to differ from D, or Ds. 
Thus, it follows that in general, Di = D; = Ds 
=D,= D; = Ds. 


The prediction for Subject C’s attraction to ` 


the stimulus strangers follows directly: Sub- 
ject C should be equally attracted to the A post- 
test, B posttest, and pretest stimulus strangers. 

Thus, the generic prediction for a subject 
who does not take an attitude posttest (Sub- 
ject C) is that there will be no difference in 
attraction ratings for the three Stimulus strangers. 
Tn a given case, of course, the attitude discrep- 


1 This equality would not hold true within the range 
of extreme attitude change. For ex: 
gave the most extreme 


Tesponses to every item, then 
Di = 0 because all atti i 


use tude patterns would be identical. 
However, within a range of moderate attitude change, 
which was exhibited in both the Hendrick and Seyfried 


(1974) and Wells’ (1976) studies, the prediction that 
Di = D: = Deis tenable. eres 


ample, if all subjects ~ 
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cores between C’s (unknown) post- 


ancy u 
test response pattern and the attitude pretest 
response pattern may be more or less than the 
discrepan: y sum scores between C’s (unknown) 
posttest ern and A’s posttest pattern or 
betwee posttest pattern and B’s posttest 
patter: t is, in a given instance it might 
occur t} 1s Æ D, or that De Æ Ds. 

Such nequality may have occurred in 
Wells’ ( study because in the experi- 
mental ¿© -dition of no posttest, the pretest 


rated as more similar than the 
ct’s) posttest stranger. Ratings of 


stranger 
(yoked su 


perceived + vilarity can be quite diagnostic in 
attraction + idies, and the similarity ratings in 
Wells’ stu suggest that the theoretical 


summed at ‘ude discrepancy scores between 
the attitud: retest and the (unknown) experi- 


mental-no-) sttest attitude responses may 
have been aller than the summed attitude 
discrepanc) ores between the attitude pretest 


and the at 
amental-po 


ide posttest scores of the experi- 
st subjects. 


METHOD 


Subjects 

During st week of the quarter, students from 
introdu chology classes, in a large testing ses- 
sion, co the five-item Student Opinion Survey 
used by and Seyfried (1974), which assessed 
student toward voting in local elections. The 
subject ected for possible participation in the 
experi they responded favorably to voting in 
local e! four items and no lower than neutral 
on the item and if they did not give the most 
extrem jle response on all five items. 

Yok« ets of subjects were created by matching 
Same-s¢ ects who had identical response patterns 
on the e pretest. A total of 24 such triplets were 
constit nd perfect yoking was possible for all but 
two trip ad in those two instances two of the three 
Subjects had an identical attitude pretest pattern and 


the third subject differed by only one response alterna- 
tive on one item. 

There were a total of 72 subjects, 36 of each sex, and 
Sex of subjects was balanced over triplets. Within each 
triplet two subjects were randomly assigned to receive 
a relevant postessay attitude posttest, and the third sub- 
Ject was assigned to receive an irrelevant posttest. The 
Subjects within each triplet who were to receive a rele- 
vant attitude posttest were arbitrarily designated as 
A- or B-type subjects (as discussed in the introductory 
Section), and the subject who was to receive an irrele- 
vant posttest was designated as a C-type subject. For 
analytic purposes these three catgories, A, B, and C, 
Were considered as three levels of a variable. Thus, there 
were 12 subjects of each sex randomly assigned to each 
category. 
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Procedure 


The subjects were recruited by phone by a research 
assistant of the same sex to serve in two 30-minute ex- 
periments on impression formation to be held on 2 
consecutive days. The nature of the recruiting was such 
that the subjects participated in groups ranging in size 
from 1 to 15 individuals and typically included subjects 
of both sexes and all three categories (A, B, or C). The 
subjects received point credits for their participation, 
which was an optional part of the course requirements. 

The actual procedures followed Hendrick and Sey- 
fried (1974) closely. The main exception was that the 
essay the subjects read for the impression formation 
task of the first session was attributed to a professor of 
political science, following Wells’ (1976) procedure, The 
relevant attitude posttest completed at the end of the 
first session was identical to the attitude pretest. The 
irrelevant posttest dealt with alcoholism and its treat- 
ment. During the second session, the subjects studied 
the “attitude responses” of three stimulus strangers, 
formed impressions, and rated each on attraction, simi- 
larity, and related measures (the rating instrument was 
identical to that used by Hendrick and Seyfried), 
Within each triplet of subjects, one stranger's attitude 
response pattern was identical to the three subjects’ — 
pretest pattern, a second stranger’s pattern was identi- 
cal to Subject A’s posttest pattern, and the third 
stranger's pattern was identical to Subject B’s posttest 
pattern. All six orders of presentation of the three 
strangers were used, and orders were counterbalanced 
across subjects. 

After the ratings, the subjects were asked to write a 
response concerning what they thought the experi- 
ment was about. Only one subject indicated any 
suspicion about the authenticity of any of the experi- 
mental materials. 


Experimental Design 


Sex of subjects, subject category type (A, B, or C), 
and order of presentation of the three stimulus strangers 
(at six levels) were between-subjects variables. Type of 
stimulus stranger (A-type, B-type, or pretest-type) 
was a within-subjects variable. All four variables were” 


fully orthogonal to each other. ‘ 
‘ 
RESULTS 


As noted previously, the designation of Sub- 
jects A and B within each triplet was arbitrary. 
Tn terms of the reasoning in the introductory 
section, it was desirable that the sum of the 
absolute attitude discrepancy scores between 
pretest and posttest be equalized (i.e, Da 
= D). These scores were calculated and the 
A and B labels were switched as necessary 
Jets until D2 and D; were approxi- 


vithin tri I p 
TT ET With this requirement satisfied, 
the five derivations could be tested. 
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TABLE 1 
MEAN SUMMED ATTITUDE SCORES AND ÅTTITUDE DISCREPANCY SCORES 


; 
f 


Posttest 
Score Pretest Subject A Subject B F ratio 
Attitude 20.78 15,9 15.8 12.8* 
Attitude discrepancy Dı = XJA,—B,| D:= Z]Ap— Pre] Ds = |B, — Pre| 
5.3 5.3 5.7 9 

Note. Means with different superscripts differed at the .05 level by the Newman-Keuls test. 

*p <,0002, 

Derivation 1 the latter two means did not differ fro cach 
. other? 

It was predicted that the magnitude of the Š one : 3 
attitude A enine scores between the post- Finally, it was expected from Deriv: ion 1 
kesta tor EME A and B and between the tPat the average amount of attitude ange 
pretest and Seas for each of A and B were Pet item would be within the middle ra ve of 


approximately equal (i.e, Dı = Dz = Ds). 
The mean discrepancy scores are shown in the 
second row of Table 1. The |A, — Pre| and 
the |B, — Pre| discrepancies are equal of 
course because of the label switching noted 
above. The mean of interest is the |A, — B,| 
discrepancy. This mean is 5.3, almost identical 
to the other two means. The F ratio of .9 for 
the main effect indicates that there are no dif- 
ferences among the three attitude discrepancy 
means.” 

These results indicate that the patterning of 
attitude posttest ratings was such that Sub- 
jects A and B were just as different from each 
other on their posttest, on the average, as each 
was different from the attitude pretest pattern, 
This preliminary result was crucial in account- 
ing for the attraction results. 

Even though the discrepancy scores between 
A and B were large, it was expected that a 
simple sum of the posttest attitude ratings 
would yield a final posttest attitude of about 
the same magnitude for A and B. The means 
are given in the first row of Table 1. The means 
listed are the pretest attitude mean for sub- 
| jects in Category C (pretest means for subjects 
in the other two categories were virtually iden- 
tical because of yoking) and posttest means for 
subjects in Categories A and B. The size of 
these means is comparable to that obtained by 
Hendrick and Seyfried (1974) and Wells 
(1976). The F ratio of 12.8 for the main effect 
is significant, and the Newman-Keuls test in- 
= dicates that the pretest mean differed signifi- 

cantly from each of the posttest means but that 


iin 


possible change scores and with wide it: 1-to- 


item variability. This expectation implie. that 
D; would be independent of Dy and Ds. © tis- 
tically, D; should be uncorrelated with « her 


Ds or D;. The correlation between Dı an D: 
was .02, the correlation between D; and D 


vas 

1S, and thus, the assumption of independ nee 
was met. 

All of the assumptions of Derivation 1 ere 

satisfied, providing an adequate basis to a: ess 


the other results. The other derivations vere 
all concerned with attraction and perce: ved 
similarity of subject types A, B, and C for 
posttest strangers A, B, and the pretest 
stranger. The remaining results are presented 
in terms of the attraction and similarity data, 


and the basis of support for each derivation is 
noted as appropriate. 


Altraction and Perceived S imilarity 


As in Hendrick and Seyfried (1974) and 
Wells (1976), the two attraction ratings and 
the two similarity ratings were each summed 


? The attitude dis. 
each subject triplet 
sex of subjects and 


subjects, order of presenta- 
attitude score (A’s posttest, 
Only the main effect of cate- 


seal 


38) 


$ 


to provi le an overall measure of attraction and 


similarity, respectively. Each measure was sub- 

jected to inalysis of variance with between 
variab|: ubject sex, category type of sub- 
ject, and order of presentation, and a within 
variabli three stimulus strangers. 

The 1 for the attraction and similarity 
scores a wn in Table 2. The several deriva- 
tions {foi data imply an interaction of 
Type of ct Category X Type of Stimu- 
lus Stran Í a rather precise nature. Sig- 
nificant in tions of the predicted form were 
obtained { both measures; attraction, F(4, 
72) = 5.9,  < .0006; similarity, F(4, 72) 
=9.0,p<. 01.4 

The specif predictions were tested by indi- 
vidual compa isons, using the Newman-Keuls 
test. The out »mes of these testsare best sum- 
marized in s al fashion: 

1. As pre d, Category A subjects’ ratings 
of the pret stranger and the B posttest 
stranger did >t differ for either attraction or 
perceived sir arity (Derivation 2). 

2. As pi ed, Category B subjects’ rat- 
ings of thi st stranger and the A posttest 
stranger l not differ for either measure 
(Deriva 

3. Cai \ subjects were significantly 
more att to and perceived themselves as 
more si} the A posttest stranger than to 
either < ther two strangers (Derivation 
3). 

4. Ca ‘3 subjects were more attracted 
to and | ed greater similarity to the B 
posttest wer than to either of the other 
two strangers (Derivation 4). 

5. As predicted, Category C (no attitude 
posttest jjects’ ratings of the three stran- 
gers did no: differ on either the attraction or 
similarity measures (Derivation 5). 


It is also worth noting that for both attrac- 
tion and similarity, A-, B-, and C-type sub- 
jects did not differ in their ratings of the pre- 
test stimulus stranger. Also, Type A subjects’ 
ratings of the A posttest stranger were signifi- 
cantly higher than Type B’s and C’s ratings of 
this stranger, and the latter two ratings did not 
differ from each other. Finally, Type B sub- 
Jects’ ratings of the B posttest stranger were 
significantly higher than Type A’s or C’s rat- 
ings of the B posttest stranger, and A’s and C’s 
ratings did not differ from each other. 
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TABLE 2 
MEAN ATTRACTION AND PERCEIVED SIMILARITY SCORES 


Stimulus strangers 


Subject 
type Pretest A-Posttest B-Posttest 

Attraction 

A 8,5» 10,3" 7.9% 

B 8.5» 8.3» 10.5* 

c 9.0% 8.3% 8.3 
Similarity 

A 8.3% 11,08 7.7% 

B 7.4 1.7 10.8" 

Cc 8.9» 8.3% 7.8 


Note, Means within a row or a column with different super- 
scripts were significantly different at the .05 level by the 
Newman-Keuls test. 


Other Measures 


The interaction of Subject Category X Type 
of Stimulus Strangers was significant (p < .05) 
on the measures of intelligence, knowledge of 
current events, morality, and adjustment. The 
form of the interaction was in each case the 
same as the form for the similarity and attrac- 
tion measures. Thus, these data essentially con- 
firmed the attraction-similarity results and, in 
the interest of economy, need not be presented. 


Discussion 


The key assumption of the present experi- 
ment was that even though final attitude scores 
for two subjects receiving a communication are 
identical, the patterning of change responses to 
individual items can be very different, leading 
to quite different results for perceptions of sim- 
ilarity and for attraction. The summed attitude 
discrepancy scores shown in Table 1 provide 
direct evidence for such variability in the pat- 
tern of responses to individual attitude items 
and demonstrate that such variability occurred 
even though the final summed attitude post- 
test scores were virtually identical for subject 


types A and B. $ 
Given these basic results, if one assumes that 


£ REL 
4 Type of Subject Category Xx Type of Stimulu: 
ER was the only significant effect in the analysis 


of variance of the perceived similarity scores. For the 


i igni Sex of 
tion scores, one other effect was significant, 
E Order ‘of Presentation, F(5, 36) = 3.3, 


p < .02. This interaction effect did not appear to be 


meaningful and is not discussed further. 


`~ 
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altitude change is a genuine phenomenon, then 
the attraction and similarity results shown in 
Table 2 follow directly. The demand character- 
istics — evaluation apprehension argument, as 
Wells (1976) developed it, cannot account for 
the present data. According to Wells’ argu- 
ment, because both A- and B-type subjects 
showed the same amount of overall attitude 
change, they should have been equally at- 
tracted to A and B posttest strangers (because 
their attitude change was artificial and the sub- 
sequent attraction responses were a charade). 
Obviously such a result did not occur, and the 
demand argument fails to account for the 
present data. 

It can be recalled that the generic prediction 
for subjects not taking an attitude posttest was 
no difference in attraction toward the three 
stimulus strangers. This result was found. 
Wells found that experimental-no-posttest 
subjects rated a pretest stranger as more at- 
tractive than the posttest stranger of a yoked 
partner, a result discrepant with the present 
data. Wells’ result is easily explained by the 
similarity data he obtained; the pretest 
stranger was more similar than the posttest 
stranger for the experimental-no-posttest con- 
dition. In the present experiment, there were 
no differences in perceived similarity to the 
three strangers for subjects not taking an atti- 
tude posttest. Perceived similarity scores are a 
useful diagnostic tool. Wells’ similarity data 
suggest that, could they be known, the 
summed attitude discrepancy mean, |posttest 
— Pre], was probably smaller for the condition 
with no attitude posttest than for the condition 
with an attitude posttest. Such variation could 
occur for a variety of reasons. Careful experi- 
mental control would seem, however, more 
likely to lead to the rational generic prediction 
of no difference, as developed in Derivation 5. 

The demand characteristics accusation is 
easily made and is hard to refute directly. For 
example, Wells (1976) failed to emphasize that 
Hendrick and Seyfried’s (1974) procedures for 

the first day focused on impressions of an es- 
say writer; phenomenally, the attitude post- 
test was a very secondary event. The brunt of 
any evaluation apprehension — demand char- 
acteristics experienced by subjects should have 


been focused on the person perception ratings 
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completed immediately after readi 


le essay, 
not on the attitude scale. 

Basically, Wells has to assume xistence 
of extraintelligent subjects who ıd their 
way directly to the true purpose o experi- 
ment, but who, despite such intelle | quali- 
ties, need approval so strongly th: ey will 
not only willingly appear to be gullib’ -ouls by 
faking attitude change but will try i be con- 
sistent in faking attraction respons: a day 
later. If such extreme compliance eò ‘s, one 
wonders why subjects suddenly be me so 
secretive when asked to give their ide. about 


the nature of the experiment. 


One interesting aspect of the deman char- 
acteristics concept is that it is indefinit v flex- 
ible. If one version is quashed, anot +r can 
immediately be constructed. The ini: | Te- 
viewer of this paper proposed anothe. such 
version, as indicated in a quote from | s re- 
view :° “Basically what the present study. 10ws 
is that when you show subjects on a secon. day 
three attitude profiles, one of which me ches 
exactly the one they turned in on a pre ious 
day, they will like the profile which matc. es.” 


Fortunately, the demand character tics 
implication of the quote is in this case c sily 
refuted. We calculated attraction scores $ a 
function of the actual summed discrepancy 
scores of subjects’? own responses in relation to 
the simulated responses of the three strangers. 
The attitude discrepancy means were 0 (sub- 
jects’ own posttest attitude pattern), 3.52 (for 
the stranger closest to the subject’s posttest 
pattern), and 7.59 (for the st ranger most differ- 
ent from the subject’s posttest pattern), The 
attraction Tesponses were 10.4, 8.8, and 7.5, 
respectively, an approximately linear relation. 
Thus, the subjects were not simply working on 
a rule of “pick the perfect match and reject the 
other two strangers,” but instead were respond- 
ing to the subtle gradations in discrepancy, a 
result Byrne (1971) and his students have 
found many times. Importantly, the subjects’ 
responses to increasing discrepancy are ex- 
ceedingly sensitive, If Such responses were the 
result of the subjects’ conscious strategy, they 
Would require a much more sophisticated rule 
than that mentioned above, 


* We are indebted to Monte M. i care 
review of the shear: € M. Page for his careful 


1 


4 


VALIDITY OF ATTITUDE CHANGE 


One other alternative interpretation of the 
present study is that the attitude items did not 
constitute a proper scale. If attitude items are 
sufficiently internally consistent, then it ought 
to be appropriate to focus only on final attitude 
summary scores. For the pretest population 
(W = 933), the interitem correlations ranged 
between .41 and .63 and the item sum score cor- 
relations ranged between .75 and .88. Thus, 
psychometrically the scale was very adequate. 
However, classic psychometric assumptions are 
concerned with static responses, not with the 
dynamic aspects of response change along a 
rating scale. Change on a specific item seems to 
involve an “elasticity factor,” which includes 
many variables, such as a subject’s past his- 
tory, the specific nature of the item, nature of 
the communication, and the particular type of 
experimental manipulations used. Such an item 
elasticity factor is basically the conception that 
made Derivation 1 a reasonable assumptive 
basis for the present experiment. 

The concept of demand characteristics is 
used much too loosely in current social research. 
It is a hydra-headed concept in the sense that 
new versions of it are so easily invented for any 
set of social data. Because of this flexibility, the 
concept is frequently vacuous, reducing to the 
generic statement that “something else might 
have caused the results.” In the present in- 
stance, Wells’ (1976) use of the concept pro- 
vided the basis for a tentative reinterpretation 
of the Hendrick and Seyfried (1974) data. It 
cannot, however, in its relatively unelaborated 
form, adequately predict or account for the 
present data. 

To argue against a particular demand char- 
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acteristics explanation is not, as is commonly 
presumed, to take the position that people are 
unconscious entities that are compelled to re- 
spond, by simple reaction, to environmental 
events. We would be the first to agree that the 
subjects in an experiment are aware of theirsur- 
roundings and actively attempt to interpret the 
situation. What we are arguing against is an 
uncritical acceptance of rather vague demand 
characteristics explanations, such as in Wells’ 
(1976) study. Before such an explanation can 
be exclusively accepted in the present case, 
more unequivocal evidence must be presented 
that the true experimental purposes were very 
transparent and that the subjects were ac- 
tively employing the sophisticated strategy 
that would be necessary to produce the present 
results. We are pleased, however, that Wells 
conducted his experiment because it provided 
the incentive for a more carefully detailed dem- 
onstration of the validity of laboratory-pro- 
duced attitude change. 


© We are indebted to Alice H. Eagly for this observa- 
tion and, more generally, for her patience in serying as 
editor throughout these extended arguments. 
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Attitude Change Validity: Reply to Hendrick and kof 


Gary L. Wells 
Ohio State University 
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In their reply to the Wells (1976) study, Bukoff dismiss this prediction by st... ing tha 
Hendrick and Bukoff (1976) claim that the in a given case, the two experimeni. | group 
results obtained by Wells can be accounted for may have absolute discrepancy score that at 
by observing the patlerning of attitude change. greater between themselves than thc- are to 
Hendrick and Bukoff argue that although the the pretest. That is, if three Subjects (A,B,C) 
summed attitude scores of experimental- have exactly identical pretest opinion .atterns 
posttest and experimental-no-posttest groups across five items (e.g., X = 4.00 on x 5-point 
should not differ, the absolute attitude dis- scale) and two of the subjects (A anc B) are 
crepancy scores (defined as the sum of the yen an attitude change treatment (; osttest 
absolute difference in the subjects’ five posttest = 3.0), the absolute item discrepanc v could 
attitude items) may be quite large. An empiri- þe significantly greater between A and 3 than 
cal study by Hendrick and Bukoff demon- between B and C, Clea 
strated that the subjects exposed to the is possible and given 35 
persuasive communication had a posttest at- may be a few 
titude pattern that was equally as deviant (in but the overal 
absolute discrepancy for all five opinion items) low. In fact, th 
from their yoked experimental partners’ pat- such a hypoth 
tern as it was from the subjects’ own Pretest. study, and su 
Hendrick and Bukoff conclude that the post- E 
test itself was not a contributing factor but Bukoff position is that they ignore the half 
rather the discrepancy evidence completely ac- of the data from the Wells study that was un- 
counted for Wells’ data, thereby reaffirming af 
the validity of the original Hendrick and Sey- 
fried (1974) study, 

There are two major gaps in Hendrick and (1976) study cl 
Bukoff’s item discrepancy argument. First, 
Hendrick and Bukoff admit that the generic lower ratings to the Pretest stimulus stranger 
prediction from their item discrepancy notion than did the co 
is that experimental-no-posttest subjects 
should show no difference in attraction ratings the pretest stran 
for a pretest versus posttest stimulus stranger jects. The disc 
(see Table 2 in Wells, 1976). Hendrick and these comparisons because pret 


Requests for reprints should be sent to Gary L, Wells, Sponses for each experimenta 


Department. of Psychology, Ohio State University, the control Sroup. 
404C West 17th Avenue, Columbus, Ohi, 43210,” 


Wri 


od Bukoff fail to address the suspi- 
tained by Wells, as well as the recall 


Hendrick 
cion data 


data upo which Hendrick and Seyfried had 
made s) ‘fic, yet unconfirmed, predictions. 
Hendric) 2nd Bukoff assert that the demand 
characte’ -tics hypothesis is used too “loosely.” 
Howeve’ ihe current author’s predictions, 
derived the demand hypothesis, have re- 
mained t. If there is an element of loose- 
ness on er side of the issue, it is with 
Hendrick 1 Bukoff’s suggestion that Wells 
obtained sults that do not conform to an 
expected . neric pattern. In conclusion, as 
stated in \ ¿lls (1976), the current author's 
position is vt that laboratory-produced atti- 


tude change is invalid but that the important 
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Hendrick and Seyfried study is not solid evi- 
dence for the validity of laboratory-produced 
attitude change. 
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Motivation, Modeling, and Altruism: 
A Person X Situation Analysis 


John P. Wilson 
Cleveland State University 


This study investigated the interaction of human motivation and situational 
factors on helping behavior. Male and female subjects of three motivational 
orientations—safety, esteem, or an equal and moderate need for both safety 
and esteem—were placed in one of three experimental conditions. The sub- 
jects were either (a) alone, (b) with two passive bystanders, or (c) with two 
bystanders, one helping and one passive, Subjects were exposed to a simulated 
explosion which ostensibly injured the experimenter. As predicted, the results 
indicated that esteem-oriented subjects helped consistently across situations, 
whereas safety- and middle-oriented subjects modeled the behavior of the by- 
standers. In the alone condition, esteem-oriented subjects helped more than 
middles, who, in turn, helped more than safety-oriented persons. The results 
were discussed in terms of the social influence hypothesis and a Person X Situa- 


tion model of altruism. 


In recent years the study of altruism has 
demonstrated the importance of situational 
and modeling factors as major determinants 
of helping behavior (e.g., Bryan & Test, 1967; 
Clark & Word, 1974; Latané & Darley, 1970) 
but has not successfully shown how per- 
sonality factors interact with situations in de- 
termining helping responses. More recently, 
Staub (1974) has called for a Person x Situa- 
tion analysis of altruism to ascertain which 
personality factors are associated with con- 
sistency in helping responses across situations. 
Thus, in a series of studies on aiding a dis- 
tressed person, Staub (1974) used a more 
inclusive conceptualization of personality than 
a single trait characteristic and found that 
persons with a strong prosocial orientation (a 


composite personality measure which assessed 
an individual’s concern with the welfare of 
Others ) were significantly more helpful across 


rimental situations than were 
low prosocial orientation. The 
tudies suggest that the rela- 
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tionship between personality and altruism is a 
complex one which necessitates a more precise 
conceptualization of the interaction of per- 
sonality orientations and situational in- 
fluences, a point Bem and Allen (1974) have 
noted as well. 

Further evidence of the relevance of per- 
sonality characteristics to altruism.has been 
shown in a preliminary study by Michelini, 
Wilson, and Messé (1975) which examined 
safety and esteem motives, as characterized by 
Maslow (1970), and found that esteem-ori- 
ented males helped a female in an accidental 
situation significantly more often than did 
safety-oriented males, The results indicated 
that esteem-oriented Persons were concerned 
with demonstrating competency, whereas 
safety-oriented individuals failed to help be- 
cause their high anxiety inhibited responding. 
Additionally, research from naturalistic as well 
as experimental studies has shown that safety 
and esteem motivational processes interact 
with situational variables in determining role - 
differentiation, productivity, and the type of 
emergent social structures in small groups 
(e.g., Aronoff, 1967, 1970; Messé, Aronoff, & 
Wilson, 1972; Wilson, Aronoff, & Messé, 
1975). Thus, the results of these studies yield 
confidence that when personality variables are 
adequately conceptualized and assessed, the 
interaction of persons and situations can be 
accurately predicted, Therefore, the purpose 
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of this study was to examine more specifically 
the interaction of these variables in a com- 
plex situation involving a realistically simu- 
lated accident which ostensibly injured an 
experimenter. i; 
edicting the nature of interactions of 
is and situations, it is necessary to con- 
the conceptualization of each class of 
iables. However, when doing research of 
this type it is necessary to have a valid mea- 
sure of the personality constructs, The mo- 
tives used in this study were measured by a 
sentence completion test (SCT; Aronoff, Note 
i). To summarize briefly, safety-oriented in- 
dividuals are characterized as being highly 
anxious, mistrustful, passive, and dependent. 
“They have strong feelings of personal incom- 
petence, uncertainty, insecurity, inefficacy, 
and view the world as an unpredictable and 
therefore uncontrollable place. Esteem-ori- 
ented persons, in contrast, are less anxious, 
achievement-oriented, assertive, and striving 
for competence and efficacy in interpersonal 
relations, They have strong feelings of per- 
sonal adequacy and believe that they can 
master situations in realistic, functional, and 
adaptive ways (Wilson & Aronoff, 1973). 
Several studies have demonstrated the pre- 
dictive validity of the SCT, In a naturalistic 
study, Aronoff (1967) found that safety-ori- 
ented persons usually came from broken 
homes or disrupted family situations in which 
they had suffered significantly greater experi- 
ences of loss of parents and siblings before 13 
years of age than did esteem-oriented persons. 
As Maslow (1970) predicted, this familial and 
environmental situation, in which physical 
and psychological deprivation and traumata 
occurred, was strongly associated with the de- 
velopment of the safety-oriented personality. 
In contrast, however, esteem-oriented persons 
tended to come from intact and stable families 
characterized by high degrees of parental in- 
volvement with their children. Furthermore, 
Aronoff (1967) found that the social patterns 
of individuals could be predicted from the 
scores on the SCT. For example, he found that 
persons who were classified as safety oriented 
by the SCT typically worked in groups with 
authoritarian leadership and hierarchical so- 
cial structures; tended to have legalistic, for- 
mal marriages; denied aggression; and were 
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concerned with securing and defending per- 
sonal property. In contrast, persons who were 
classified as esteem oriented by the SCT 
worked in groups with democratically shared 
leadership and egalitarian social structures; 
had marriages based on mutuality; openly ex- 
pressed hostility; and were less concerned 
with defending property or possessions. More- 
over, in a follow-up study, Aronoff (1970) 
found that when esteem-oriented persons en- 
tered a hierarchical social structure, the struc- 
ture tended to become egalitarian and thus 
more congruent with the motives of the group 
members, More recently, these findings were 
replicated and extended in a series of studies 
concerned with the influence of these motives 
on small group processes (e.g., Aronoff & 
Messé, 1971; Wilson et al., 1975). Thus, the 
results of these studies provide support for 
the SCT as a valid measure of a robust set of 
motives relevant to the study of Person X 
Situation interactions, 

The situations typically studied in research 
on altruism have involved individuals or 
groups of bystanders performing a task or in- 
teracting together (e.g., Latané & Darley, 
1970). The argument for the bystander effect 
found in these studies states that in ambiguous 
situations persons compare their responses to 
those of others as a means of deciding on an 
appropriate course of action, In this sense, 
then, the bystander serves as a role model of 
a particular type whose behavior provides a 
definition of the situation. Implicit in this con- 
ceptualization of the situation is the assump- 
tion that the situation possesses properties 
which influence all individuals in the same 
way. Clearly, however, all persons do not per- 
ceive situations alike. Thus, while situations 
may have characteristics or dimensions com- 
mon to an individual or group, their effect de- 
pends on the persons in them as well as the 
nature and quality of their activity (Bowers, 
1973). 

In regard to the definition of the situation 
and the bystander’s subsequent response to it, 
Latané and Darley (1970) state that the com- 
petency of the bystander is an important per- 
sonal attribute associated with helping. More 
recently, Wilson and Wilson (1976) have sug- 
gested that high self-esteem, conceptualized 
as an enduring and broadly based sense of 


1080 


-personal competence, is a personality variable 

most relevant to the prediction of interactions 
between persons and situations, Therefore, 
knowing the degree to which an individual has 
a strong or weak sense of personal competence 
should permit unambiguous predictions of be- 
havior in situations in which help is needed. 
Moreover, our research has shown that 
strongly safety-oriented individuals depend on 
other persons to provide informational cues as 
to the appropriate set of responses in group 
situations. Further, these safety-oriented per- 
sons use external cues and external attributes 
of status as the basis of role assignment in 
small groups, whereas esteem-oriented per- 
sons focus on internal cues and personal mani- 
festations of task competence as the basis of 
group organization (Messé et al., 1972). 
These results suggest that safety-oriented in- 
dividuals are more influenced by the behavior 
of others and are likely to model their re- 
sponses as a means of reducing anxiety and 
uncertainty. 

In contrast, however, esteem-oriented indi- 
viduals function autonomously in accordance 
with a strong sense of personal competency. 
Rather than relying on others for the ap- 
propriate response cue, esteem-oriented indi- 
viduals initiate activity appropriate to the 
situation. Thus, these findings suggest that 
when confronted with a bystander interven- 
tion situation, esteem-oriented persons would 
be expected to help more frequently than 
safety-oriented persons because of their in- 
ternal locus of control and strong sense of 
personal competency, In contrast, safety-ori- 
ented individuals would be more affected by 
the behavior of bystanders who are providing 
a definition of the situation for them. If no 
bystanders are present, they would help less 
than esteem-oriented persons because their 
characteristically high levels of anxiety would 
generate a tendency toward passivity and 
withdrawal. 

Therefore, these findings generated the fol- 
lowing hypotheses for this study: 


Hypothesis 1: Strongly esteem-oriented per- 
sons help more in a bystander situation than 
do strongly safety-oriented individuals or 
those with an equal and moderate need for 
both motives (middles), 
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Hypothesis 2: There is more help given in 
situations with an active helping bystander 
than in situations where individuals are alone, 
where, in turn, more help is given than in 
situations with passive bystanders, 

Hypothesis 3: There is an interaction be- 
tween motivational orientation and situational 
conditions in determining altruism, Esteem- 
oriented individuals are consistent in their 
helping behavior across situations, whereas 
safety- and middle-oriented persons model 
the behavior of bystanders present in the 
situation. 


Mertuop 


Subjects 


Subjects were recruited through announcements in 
introductory psychology courses and by an advertise- 
ment in the university newspaper which solicited 
undergraduates interested in earning money or re- 
search credit by Participating in Psychological re- 
search. The announcements presented the number of 
times and places where assessment interviews were 
held. Approximately 600 undergraduate students ap- 
peared for an interview, 

During the assessment, which lasted approximately 
1 hour, the SCT, as mentioned above, was given to 
measure the extent to which the subjects were con- 
cerned with safety and esteem motives, Ten coders, 
trained in the rating procedure, scored the responses 
to the SCT by recording each time a Person expressed 
concern for safety or esteem motives, A respondent’s 
total score for each motive across the 40 items (ie, 
the total number of sentences expressing either safety 
or esteem motive themas) was used as an index of 
the strength of the motivational orientation, As Fiske 
(1971) has noted, this mode of measurement permits 
a wide range of response alternatives to the sentence 
stems. For example, the following are typical re- 
sponses which indicate a safety-oriented motive state, 
For the SCT stem: (1) I should like to . ++ g0 away 
to a very quiet place for a very long time; (6) Iji 
could only .. . find security; (26) If I were in 
charge... I'd resign in a hurry; (33) If I am put 
under pressure... I panic! On the other hand, re- 
sponses indicative of strong self-esteem generally ex- 
press statements of personal competency and high 
self-regard. For the SCT stem: (1) I should like to 
- . . be as perceptive and receptive as possible during 
my life; (6) If I could only . . . fulfill all my goals 
with extra time left over; (26) If I were in charge 
. . . things would be done Properly and fairly; (33) 
Ij I am put under pressure... I react well and ac- 
complish more of the task, Thus, each test was scored 
by two coders, and interjudge reliabilities (as mea- 
sured by the product-moment correlation coefficients) 
were .73 and .75 for safety and esteem motives, re- 
spectively. 
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Construction and Validation of the SCT 


The SCT was initially designed as a 60-item test 
consisting of incomplete sentences to measure physio- 
logical, safety, affiliation, and esteem motives (Aro- 
noff, 1967). However, after extensive testing in cross- 

ultural research (Aronoff, 1967, 1970) as well as use 
as a selection instrument for experimental studies in- 
veral thousand undergraduates (e.g., Aronoff 
1971; Messé, et al., 1972), the SCT was 
cted to an item analysis and reduced to a 40- 
which reliably discriminated between the 

for every sentence stub (Aronoff, 


empirical development of the SCT was pat- 
the methodology used by McClelland 
Atkinson (1958) in their research on 
nt motivation. In this assessment procedure 
motives are conceptualized as relatively enduring 
tates sil which influence goal-directed be- 
d by Winter, 1975). Therefore, in the 
ch, motive strength was measured, in 
t ihe frequency with which responses of a 
!-cehned class occurred to the sentence stems, 
thus, the more frequently a specific motive theme is 
expr , the more probable it is to be prepotent 
wi the personality of the individual. Reviews of 
other similarly constructed scales have shown the 
tence completion method of personality assessment 
be a sensitive measure of motive states and of 
erally high validity and reliability (e.g., Daston, 
i8; Murstein, 1965), Additionally, the research 
in the introduction yields confidence in the SCT 
selection instrument which measures well-defined 
ives states which constitute a typology of per- 
onality (Maddi, 1976). 


Design 


Subjects were selected by the criteria that they 
scored in the top 15th percentile for one motiye and 
bottom 15th percentile on the other motive, as scored 
by both judges. The middle group of subjects was 
selected by the criterion that they fell between the 
45th and 55th percentiles in the scores of both judges. 
‘The mean scores for esteem and safety motives were 
as follows: esteem-oriented subjects, 22.68 and 9.56; 
safety-oriented subjects, 10.31 and 21.47; “middles,” 
16.49 and 16,69. Thus, a revised subject pool was 
compiled that included about 45% of the original 
sample. From this pool, 196 persons were used in this 
experiment: 30 strongly safety-oriented males and 34 
strongly safety-oriented females: 32 middle males 
and 33 middle females; and 33 strongly esteem-ori- 
ented males and 34 strongly esteem-oriented females. 
The subjects were randomly assigned to the experi- 
mental conditions in which they were either alone 
(Condition 1), with two passive confederates (Con- 
dition 2), or with two confederates, one helping 
model and one passive bystander (Condition 3). In 
Condition 1, there were 34 males (10 safety, 12 mid- 
dle, 12 esteem) and 35 females (12 safety, 11 middle, 
12 esteem). In Condition 2, there were 29 males (8 
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safety, 10 middle, 11 esteem) and 32 females (10 
safety, 12 middle, 10 esteem). In Condition 3 there 
were 32 males (12 safety, 10 middle, 10 esteem) and 
34 females (12 safety, 10 middle, 12 esteem), The 
unequal number of subjects in each motivational ori- 
entation was due to subject attrition from the pool 
because some subjects failed to show up for the study 
or had already heard of it from another subject. 


Procedure 


When contacted, a subject was led to believe that 
he would be asked to complete a medical question- 
naire concerned with the relationship between per- 
sonality and the prediction of future physical health, 
especially heart disease. The subjects were told to re- 
port to a waiting room in the psychology building. 
In the experimental condition involving bystanders, 
the subject was informed that a group administration 
of the test would take place. On arrival, the subject 
either sat alone or with two confederates of the same 
sex. The two male experimenters, systematically 
varied over treatment conditions, informed the sub- 
jects that they were to go to a second room where 
the testing would be completed. The subjects then 
walked to the experimental rooms which were two of 
five laboratory rooms located in an L-shaped wing of 
the psychology building. On entering the corridor of 
the research wing, the subjects walked into the equip- 
ment room. They were informed that this room was 
the computer room where the experimenter worked in 
computing statistics and doing other tasks related to 
the study. 


Appearance of the Experimental Room 


The room contained many items of electronic equip- 
ment which were arranged to create the effect of a 
computing laboratory. The apparatus included a 
feed-in-relay computer which was approximately 5 
ft. high and 2 ft. wide (1.52 X .61 m) on which were 
moving reel-to-reel magnetic tapes; a number of red, 
blinking indicator lights; an illuminated digital dis- 
play window which systematically changed six digit 
numbers; various control buttons and switches; sev- 
eral read-out windows which presented numerical 
scores; as well as other types of indicator panels, 
Located on top of the computer was a Hunter elec- 
tronic digital timer, model 1520 which displayed 
bright orange-colored numbers and a calibrated 1 
1 pyrex laboratory beaker which was empty and 
ostensibly being stored there for another experiment. 
Directly adjacent to the computer was a small rec- 
tangular table on which was located an inoperative 
Electrohome videotape control monitor model 502- 
202. The room also contained metal storage shelves on 
which were stored electronic timers, event recorders, 
perceptual form boards, and other types of equipment. 

After being seated and given a release form to sign, 
the ‘subjects were told that the computer was being 
used to analyze the data from the medical question- 
naire. To increase the credibility of this manipula- 
tion, the subjects were given a demonstration of the 
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computer by supposedly programming data into the 
machine to obtain a statistical correlation which ap- 
peared in the illuminated digital display window. 
After this demonstration, the subjects were handed 
the Jenkins Activity Survey for Health Prediction, 
Form B and explained that the purpose of the survey 
was to correlate the data from the medical question- 
naire with scores from the personality test which they 
had previously completed. Any questions regarding 
the health survey were answered. The subjects then 
proceeded to the experimental room which was lo- 
cated down a short hallway approximately 11 ft. 
(3.35 m) from the computer room. The experimenter, 
unaware of the subject’s motivational orientation, 
seated the subjects at a rectangular table which was 
located in the middle of the room about 5 ft. (1.52 
m) from the door, Three chairs were arranged around 
the table so that the subjects always sat in the seat 
closest to the door. The confederates, when present, 
sat directly across from, and to the right of, the 
subjects, 


Experimental Conditions 


In Condition 1, subject alone, the experimenter gave 
the directions for the medical questionnaire and ex- 
plained the use of four oscillator keys which cor- 
responded to the response alternatives on the survey. 
The subject was informed that the keys were con- 
nected to the computer by wires which ran through 
a small conduit in the wall dividing the computer 
room and the testing room. Any questions regarding 
the medical survey and the equipment were answered. 
As he left the room, the experimenter informed the 
subjects that he would be processing some tests in 
the computer room during the testing and then 
closed the door to within 5 in. (12.70 cm) of the 
frame. For the next 3 min., the experimenter activated 
the computer to create a set of clicking and whirling 
sounds that conveyed the impression that he was 
working with the computer. At the end of 3 min. the 
experimenter simulated an explosion of broken glass 
by dropping a large wooden board from a height of 
4% ft. (1.40 m) on which was a plastic cafeteria tray, 
a dozen pieces of silverware, a wooden bowl, a 
plastic dish, and a plastic laboratory beaker. When 
the material crashed on the tile floor, the experi- 
menter screamed loudly, as if in pain: “Oh . . . oh, 
oh ...oh.... My foot oh oh oh my foot my foot 
ah oh oh shit, oh my foot, it’s broken it’s broken, 
my foot oh oh oh... I? The experimenter con- 
tinued groaning for approximately 20 sec then lapsed 
into silence. A tape-recorded procedure was not used 
since extens pilot studies in these rooms revealed 
that the recorded cries and crash sounded hollow 
and fake to the subjects, whereas the procedure used 
sounded real, startling, and impressive to virtually 
every subject. 

In Condition 2, the subject and two passive by- 
standers, and in Condition 3, the subject, helping 
model, and passive bystander, all of the same sex, 
the procedure was essentially the same as Condi- 
tion 1 except that three prenumbered IBM answer 
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sheets were placed on the table in front of each 
chair in the testing room. In the group administra- 
tion, the subjects were instructed to sit in the chairs 
so that the numbers on the Jenkins Activity Survey 
corresponded to the numbers on the IBM answer 
sheet. The subjects were informed that the results 
would be programmed into the computer from the 
IBM answer sheets, When the explosion occurred, the 
confederates in Condition 2 remained passive, gave 
no verbal or nonverbal indication that help should 
be given, and continued working on the test. If the 
confederate’s opinion was solicited by the subject in 
regard to the nature of the explosion and whether 
they should inquire as to what had happened, the 
confederate appeared perplexed and said, “I don’t 
know.” In Condition 3, immediately after the explo- 
sion occurred, the helping confederate, seated di- 
rectly across from the subject, looked up from the 
test booklet and said, “Jesus, what was that?” He 
(she) pushed away from the table and walked into 
the computer room and asked, “What happened? 
Let me help you!” In Condition 3, only one confed- 
erate helped the experimenter so that the experi- 
mental design would permit comparisons between 
situations. Thus, everything is identical to Condi- 
tion 2 except for the helping model. 


Dependent Measure 


The helping response was defined as the subject 
leaving the testing room coming into the computer 
room to aid the experimenter. Subjects who re- 
mained in the testing room were credited with a 
no-help response. The results are expressed as the 
percentage of subjects in a condition who helped 
the experimenter, 


RESULTS 
Manipulation Check 


Following the experiment each subject was 
extensively debriefed and asked a standard 
set of questions about the study, Table 1 
presents the results of the responses to the 
questions and clearly demonstrates that the 
simulated crash as well as the cries of the 
experimenter were realistic, highly believable 
and emotionally arousing. : 


Tests of Hypotheses 


_ Hypothesis 1—effect of motivation on help- 
ing. To examine the effect of motivational 
orientation and situational conditions on help- 
ing the experimenter, a 3 X 3 x 2 (Situation 
of Subject X Motivational Orientation x Sex) 
design was analyzed by the contingency par- 
titioning procedure suggested by Winer 
(1971). Hypothesis 1, that there were differ- 
ences in helping by motivational orientation 
was strongly supported, Wah s 18.90, p z 
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-0005. Table 2 presents the observed per- 
centage of help for each condition as classi- 
fied by motivational orientation. 
Hypothesis 2—effect of situations on help- 
iz. The results of the contingency partition- 
rongly supported Hypothesis 2, (2) = 
p < .0005. As Table 2 shows, the great- 
entage of help, 83%, occurred in Con- 
n 3 in which the bystander actively mod- 
helping response, as compared to 75% 
dition 1 and 38% in Condition 2. 
toihesis 3—Motive X Situation and 
j The hypothesis that there is an inter- 
ion between motivation and situation was 
) strongly supported, x?(4) = 15.69, p< 


005, 


th ests of Differences Across Motives 
Lach Situation 


ondition I—Subject alone. Examination of 

ihe influence of motivation within Condition 
1 indicated a significant effect, x2(2) = 24.85, 
2 < 0005, To minimize the probability of 
committing a Type I error, the critical level 
individual comparisons was set at p< 
)167, which was derived by dividing p = .05 
by three, the number of individual compari- 
sons made, Individual comparisons between 
tives indicated that, as predicted, esteem- 


TABLE 1 
TIONS ASKED DURING DEBRIEFING 
EXPERIMENTAL MANIPULATION 


RESUL Q 
OF SUBJECTS 


Percentage 


of subjects 
answering 
Question asked in debriefing positively 
1. Did you hear a crash or explosionlike 
noise? 100 
2. Was the sound of the crash realistic and 
believable? ‘ 100 
3. Was the cry of the experimenters believ- 
able, i.e., thought he was hurt? 95 
4, What did you think had happened? 
a) Computer blew up 80 
b) Other apparatus fell or exploded 20 
5. What did you think had happened to 
experimenter? 5 
a) bleeding, cut or seriously injured 34 
b) uncertain as to nature of problem 66 
6. Did you have an emotional response to 
crash, i.e., heart acceleration, sweaty 
palms, etc.? 60 
7. Did you think that this research was 
valuable and worth doing? 100 
&, Did you find the use of deception ac- 
ceptable to study the problem? 90 
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TABLE 2 
OBSERVED PERCENTAGE OF HELPING FOR SAPETY-, 


MIDDLE-, AND ESTEEM-ORIENTED PERSONS IN 
THE EXPERIMENTAL SITUATIONS 


Experimental situation 


Passive Helping 

Motivational Alone bystander bystander M 
orientation n=69 n=61 n=66 helping 

Safety 55 17 71 50 

Middle 65 23 95 60 

Esteem 96 71 86 85 

M help 15 38 83 65 


oriented subjects helped more frequently than 
did middle- or safety-oriented subjects, x (1) 
> 7.16, p < .005, who were not significantly 
different from each other, x?(1) = .53, 

Condition 2—Two passive bystanders, 
Analysis of the influence of motivation within 
Condition 2 indicated a significant effect, 
x°(2) = 15.62, p< .0005. Individual com- 
parisons found, as predicted, that esteem- 
oriented persons helped more frequently than 
middle- or safety-oriented subjects, y2(1) > 
10.24, p < 005, who were not significantly 
different from each other, y?(1) = .22. 

Condition 3—Active, helping bystander and 
passive bystander. Analysis of the effect of 
motivation within Condition 3 revealed a sig- 
nificant effect, (2) = 5.26, p < .05. Indi- 
vidual comparisons confirmed that middle- 
oriented subjects were significantly more help- 
ful than safety-oriented persons, x?(1) = 6.64, 
$ < .005. However, esteem-oriented subjects 
were not significantly different in helping 
from either safety- or middle-oriented per- 
sons, x*(1) < 1.62. 


Further Tests of Differences Across Situations 
Within Each Motive 


Safety oriented. Examination of the influ- 
ence of situations on safety—motivation re- 
vealed a significant effect, y*(2) = 12.34, p 
< .005. Individual comparisons between the 
three conditions indicated that, as predicted, 
safety-oriented subjects helped more when 
alone and with the helping model than with 
passive bystanders, x?(1) > 6.03, p< .01. 
However, there were no significant differences 
between Situations 1 and 3 in the degree of 
help rendered, x°(1) = 1.31. 
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Middle oriented. Analysis of the influence 
of situations on middles showed a significant 
effect, x?(2) = 30.24, p < .0005. Individual 
comparisons between conditions indicated 
that, as predicted, more help was given when 
alone and with the helping model than with 
the passive models, y?(1) > 7.98, p< .01. 
Further, there was significantly more help 
given in the situation with the helping model 
than when alone, x*(1) = 5.72, p < .05. 

Esteem oriented. Analysis of the influence 
of situations on esteem-oriented persons indi- 
cated a significant effect, x°(2) = 5.26, p < 
.05. However, in support of Hypothesis 3, 
individual comparisons of the experimental 
conditions indicated that there were no sig- 
nificant differences in helping between situa- 
tions, x?(1) < 1.31. 


Additional Tests of Modeling Effects 


To examine the effect of the active helping 
model in Condition 3, comparisons were made 
between safety-, middle-, and esteem-oriented 
subjects who left the room before the active 
confederate. The results showed that of those 
who helped, 68% of the esteem-oriented sub- 
jects left the room before the confederate as 
compared to 26% of the middle-oriented and 
12% of the safety-oriented subjects, y°(2) = 
14.01, p < .0005. 


Discussion 


The results of the present research suggest 
a modification of the social influence hypothe- 
sis proposed by Latané and Darley (1969) 
which contends that individuals in a bystander 
situation are influenced by the behavior of the 
other group members who provide a definition 
of the normative behavior appropriate to the 
situation. The results of this study suggest 
that helping behavior is not determined by 
situational factors alone, rather by the moti- 
vational states of the bystanders and situa- 
tional forces in interaction with each other. 
Thus, the influence of bystanders in a situa- 
tion may vary depending on the motivational 
state of these potential helpers and the type 
of information generated by their actions. In 
this study, when individuals are exposed to a 
startling crash and the cries of the experi- 
menter, it is the highly anxious, mistrustful, 
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safety- and middle-oriented persons who are 
most likely to model the behavior of the by- 
stander(s) irrespective of their actions. How- 
ever, further research is necessary to deter- 
mine if esteem-oriented individuals are more 
subject to social influence processes when the 
modeling cues of the bystanders provide goal 
path clarification for effective helping be- 
havior. 

While the safety-oriented individual ap- 
pears to be strongly affected by the social 
influence process, the esteem-oriented person 
seems less subject to these environmental 
constraints. As hypothesized, the esteem-ori- 
ented subjects were relatively consistent in 
their helping responses across situations even 
though there was a nonsignificant decrease in 
Condition 2, Thus, when confronted with the 
accident, they are not concerned with others’ 
evaluation of their performance and can at- 
tend to the situation more fully in a problem- 
solving manner. This interpretation is sup- 
ported by the research of Mossman and Ziller 
(1968), who found that high self-esteem per- 
sons are able to ignore self-evaluative cues in 
problem-solving situations in order to process 
effectively information leading to a successful 
conclusion, 

The strong bystander effect shown in this 
study was consistent with the findings of other 
studies in which the passivity of the bystander 
in Condition 2 appeared to inhibit the fre- 
quency of helping (e.g., Latané & Rodin, 
1969). In the terminology of Latané and 
Darley (1970), the passivity of the confed- 
erates may have influenced the subjects’ per- 
ception of the situation as an ambiguous one 
and thereby led to an interpretation that noth- 
ing had happened to warrant help. While this 
is plausible, it is contradicted by the fact, as 
shown in Table 1, that all subjects reported 
during the debriefing that they heard the cries 
of the experimenter and thought that the com- 
puter or some other piece of apparatus had 
blown up. Given that this is so, it is likely 
that the anxiety of the more safety-oriented 
subjects, in addition to the passivity of the 
confederates, inhibited their tendency to re- 
spond to the accident. Further, this explana- 
tion is consistent with Schacter’s (1959) find- 
ing that under stressful conditions highly 
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anxious persons prefer to be with others in a 
similar situation in order to reduce emotional 
arousal. Thus, the nature of the bystanders’ 
activity is important, since they are modeling 
onses which differentially affect others in 
situation. It appears then, that safety- 
esteem-oriented persons utilize different 
rategies in dealing with the emotional 
associated with the sound of the 
safety-oriented individual looks 
to the bystanders and compare his 
nse to theirs, whereas the esteem-ori- 
ented person looks internally to his own eval- 
uation of the situation and is comparatively 
ffected by the behavior of others. It is 
therefore, that the influence of the 
would be even more pronounced 
y-oriented persons if two or more 
ilederates were to leave the room. 

Phe question can be raised, however, as to 
the point at which esteem-oriented persons 
are strongly affected by the action of the by- 
standers. Under what conditions do esteem- 
oriented individuals initiate the helping pro- 
cess and when are they more likely to be 
strongly influenced by situational factors? The 
results of this study indicated that, in general, 
steem-oriented individuals were initiating the 
helping sequence rather than modeling the be- 
havior of the bystander. For these persons, 
demonstrating competence is necessary to the 
maintenance of their self-image as an effica- 
cious individual. Thus, it is likely that they 
would organize and initiate helping qyehavior 
in a variety of situations unless the personal 
costs for helping were so great that direct 
intervention would be physically punishing or 
detrimental to their sense of self-esteem. 

It can be concluded from the present study 
that personality differences, as expressed in 
motivational orientation, and situational fac- 
tors interact in the determination of helping 
behavior. As Bem and Allen (1974) have 
shown, some individuals are consistent in 
behavior some of the time, while others vary 
greatly between situations. The results of this 
study suggest there is much value in identi- 
fying the motivational characteristics associ- 
ated with individual response consistency, 
since these processes may mediate social in- 
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fluence processes and underlie the mechanisms 
of helping behavior as well. 


REFERENCE NOTE 


1. Aronoff, J. A scoring manual for safety, afiliation 
and esteem motives. Unpublished manuscript, 
Michigan State University, 1972. 
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Predictions were tested as to the persistence of change after sensitivity training. 
Using concepts derived from Kelman, it was predicted that internalizers will 
show most durable benefit from training, whereas externalizers (compliants and 
identifiers) will show less. Questionnaire measures were used based on socio- 
metric rankings of influence sources during training. The subjects were 199 
professionals of mixed sex and age attending 5-day sensitivity training programs 
in England. As predicted, at the close of training internalizers did not differ 
from externalizers on ratings of benefit by self or by other trainees. Trainers 
rated internalizers higher (p < .05). Five months later, internalizers were rated 
higher than externalizers on benefit by self (p < .01), and there was a trend in 
the same direction for ratings by others, Between the two testings of exter- 
nalizers’ benefit there were significant decreases for self-benefit rating (p < .01) 
and other-benefit rating (p< .01), whereas internalizers showed no significant 
decrease. The findings support typological approaches to the study of social 
influence and underline the need for caution in selecting the criterion measures 


to be used immediately after sensitivity training. 


Practical attempts to influence individual 
behavior through training or psychotherapy 
constitute a much underused resource in the 
study of social influence. Most laboratory re- 
searchers have assumed social influence to be 
a unitary process, thus bolstering their view 
hat the study of influence in laboratory set- 
tings does provide a microcosm representative 
of social influence in everyday life. Such as- 
sumptions may well have some validity, since 
there are some settings in everyday life that 
do have properties in common with the lab- 
oratory setting. One class of settings in which 
such commonalities are minimized are those 
in which attempts are made to influence an 
individual’s behavior’ in a manner that per- 
sists after the transaction is complete. Sensi- 
tivity training and psychotherapy provide in- 
stances of such settings. The laboratory re- 
searcher investigating social influence typically 
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examines changes within his research setting; 
the trainer or therapist is much more con- 
cerned with whether his client carries any 
behavior changes with him into other settings. 

The central proposition of this paper is that 
the preconditions for the internalization of 
social influence are quite different from the 
preconditions for maximizing other forms of 
influence that are limited to the immediate 
situation. A typological approach to the study 
of social influence has been proposed by Kel- 
man (1958, 1963), who contrasts internaliza- 
tion with identification and compliance. Kel- 
man’s model has been critically reviewed by 
Kiesler, Collins, and Miller (1969), who con- 
clude that many aspects of Kelman’s model 
remain untested. The principal empirical tests 
reported to date are those by Kelman (1958; 
Note 1) Kelman and Eagly (1965) and Klein 
(1967). Each of these studies was a labora- 
tory-based study in which the three postulated 
bases of social influences were manipulated 
as indepndent variables in an attempt to af- 
fect the amount and persistence of change in 
attitudes or behavior. 

In Kelman’s model, compliance occurs when 
an individual accepts influence in the hope 
of gaining some reward or avoiding some pun- 
ishment to which the influence source has 
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access. A compliant action is done for ex- 
pediency’s sake, not because the individual 
has changed his beliefs. Kelman (1958) 
found, as predicted, that influence from a 
source to whom high-means control had been 
attributed persisted only when the attitude 
change measure would be seen by the source 
of means control. Klein (1967) obtained a 
comparable finding. 

Identification-based influence is said to oc- 
cur when an individual changes his behavior 
toward that preferred by an attractive figure. 
This influence has been manipulated in the 
studies by attributing messages to sources 
thought to have high-reference-group status 
(Kelman, 1958; Kelman & Eagly, 1965) or 
by presenting the message as “widely ac- 
cepted” (Kelman, Note 1). Change is pre- 
dicted to persist only when the identification 
source remains salient. 

Internalization-based influence occurs when 
an individual accepts influence because the 
change is congruent with his value system. 
Internalization has been manipulated by at- 
tributing messages to an expert (Kelman, 
1958) or by stressing the importance of main- 
taining value congruence (Kelman, Note 1). 
In the cases of both identification and inter- 
nalization, no validatory evidence has been 
published that the message recipients do in- 
deed perceive the source in the way intended. 
Kelman (1958) indicates that some differ- 
ences were obtained between conditions in the 
manner in which sources were perceived, but 
it does not appear that ratings of attractive- 
ness or credibility were included. Thus, the 
studies have generated the predicted effects, 
but we cannot be sure that this is because of 
the way the influence sources were differen- 
tiated. To cite just one possible complication, 
in these anticonformist times, the knowledge 
that the majority of students favor a particu- 
lar viewpoint on an issue can be relied on to 
influence a substantial number of message re- 
cipients away from that viewpoint. Nor are 
students as prone as they used to be to ac- 
cepting an expert as credible just because 
they are told that he is an expert. 

The present study differs in a number of 
important respects from those so far reviewed. 
It was conducted in a field setting so that 


the incidence of different influence types was 
of necessity measured rather than manipu- 
lated. Furthermore, the influence types were 
formulated in a manner that does not pre- 
cisely correspond with Kelman’s definitions. 
This reformulation arose from criticisms of 
the original model, which will be outlined. 


‘THEORY 


According to Kelman’s model, compliance 
arises when A is in a position to exert means 
control over B. The influence that is achieved 
is merely a superficial conformity leaving B 
unchanged. As soon as A ceases to be present 
the behavior required by him is no longer 
shown by B. If we consider social as well as 
material rewards, then almost everyone has 
some measure of means control over everyone 
else. In a small group some will certainly 
have more ability to influence outcomes either 
by virtue of their access to resources external 
to that group or through superior interaction 
skills, but the curve of distributed compliant 
power will be flat rather than steep, This 
suggests that compliance is a very widely dis- 
tributed form of social influence that becomes 
more conspicuous in the absence of other bases 
of influence. The concept has two compo- 
nents: First, B perceives A as preferring thai 
he behave in certain ways, and for whate 
reasons are important in the specific instan 
B accedes to those preferences; second, di 
spite acceding to those preferences, B’s own 
preferences remain at variance with what he 
is doing. If the second of these component 
is eliminated from the definition, the concep! 
becomes extremely broad and would encom- 
pass most forms of social influence. If we re- 
tain this second component, the implication 
is that compliance will be found primarily in 
settings that B is unable to leave. These can 
include both such short-term settings as re- 
quired laboratory experiments and such long- 
term settings as uncongenial jobs, Compliance 
of this type is somewhat less likely in settings 
that stress voluntarism such as therapy wl 
training. Compliance is defined in this study 
as a process occurring when B accepts influ- 
ence from those who confront him but who 
are not attractive to him. 

Identification is distinguished from com- 
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pliance by Kelman as an influence state based 
on A’s attractiveness to B. The relationship 
of B to A is sufficiently rewarding that he 
wishes to continue it and thus behaves in 
whatever ways he believes will encourage A to 
persevere. The motivation of B in this is that 
the relationship with A enables him to develop 

l sustain a more certain conception of him- 

Kelman distinguishes between reciprocal 
‘le identification and classical identification. 
In the former, B behaves in a manner com- 
plementary to A in the hope that this pleases 
A. For example, B might adopt a passive or 
submissive role in relation to a dominant A, 
who clearly needs followers, In classical iden- 
tification, B believes that by becoming more 
like A he will perpetuate the relationship. 
Classical identification can be regarded as a 
special instance of reciprocal role identifica- 
tion, in which B assumes that the role A 
would like him to adopt is that of mimicry. 
The common element in these various types 
of identification is that B permits himself to 
he influenced by those whom he finds attrac- 
ive to maintain his definition of self. Iden- 

‘ication can therefore be defined as the op- 
posite of compliance: a process occurring 
when B accepts influence from those who are 
ıtiractive to him but who do not confront 
him, 

The concept of internalization has received 
considerably less attention from researchers 
than have those of compliance or identifica- 
tion. Kelman sees internalization as a process 
occurring when B makes a change in his be- 
havior because he sees that that change would 
have personal utility for him. B derives his 
idea of making a change from A, whom he 
observes to have goals congruent with his own 
but whom he perceives as able to achieve 
certain goals more effectively than he does, 
There are thus two components to this process: 
B finds A a credible source of influence, but 
A does something that confronts B with the 
difference between A and B. 

Kelman lays considerable stress on the first 
of these points, that is, value congruence as 
the basis for internalization. In the present 
study this definition is broadened to include 
other bases for influence. Numerous research- 
ers (e.g., Byrne, 1969; Newcomb, 1961) have 


documented the strong link between similarity 
and attractiveness so that those scoring high 
on value congruence would presumably find 
one another attractive. Consequently, Kel- 
man’s differentiation of credibility from at- 
tractiveness is not always likely to be useful. 
By defining internalization in terms of at- 
tractiveness rather than credibility, one makes 
it possible to include as bases for internal- 
ization not only value congruence but also 
attitudinal or behavioral similarity, 

The second element in Kelman’s definition 
of internalization is that B behaves in a way 
that confronts A with the difference between 
A and B. This is not experienced by A as a 
pressure for change from B, but it does lead 
him to explore ways of changing. This ele- 
ment is no more than implicit in Kelman’s 
presentation but is here given a more definite 
status. The term confrontation is used to 
convey the challenge or impetus for change 
experienced when B becomes aware that A’s 
behavior shows B’s current behavior to be 
less effective than it might be. In Kelman’s 
model such a challenge is sharply differenti- 
ated from the pressure toward change experi- 
enced in compliance through differential ac- 
cess to means control. In the present study 
both types of impetus for change will be re- 
ferred to as confrontation. Although Kelman 
finds it important to differentiate the separate 
bases of such confrontation by contrasting 
interpersonal pressures for change in com- 
pliance with intrapersonal pressures in inter- 
nalization, it is here seen as more important 
to stress the common element in the process 
of confrontation, namely the challenge to 
one’s current pattern of behavior. Compliance 
and internalization may still be differentiated 
in terms of the differential attractiveness of 
the influence source. Internalization is there- 
fore defined as a process occurring when B 
accepts influence from those who are both at- 
tractive to him and who confront him, 

There is no consensus in the literature as to 
the most fruitful manner in which to con- 
ceptualize social influence. The present defi- 
nitions depart from Kelman principally in 
regard to internalization, but other authors 
have used each of the terms, particularly 
identification, in a variety of other ways (see 
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eg., Bronfenbrenner, 1958). Some critics 
would undoubtedly argue that Kelman’s defi- 
nition of internalization constitutes no more 
than a specific type of identification, in which 
the basis for attractiveness happens to be 
value congruence. The reasons for preferring 
the present formulation is that it provides a 
firmer basis for predicting that internaliza- 
tion-based influence will persist, whereas in- 
fluence based on compliance or identification 
is less likely to do so. 

The reasons for expecting internalization- 
based influence to be most persistent can be 
most clearly presented in terms of Kelley’s 
(1967, 1973) attribution theory. Kelley is 
concerned with the conditions under which we 
attribute causality for events to ourselves, to 
others, or to circumstances. If one were to 
ask a compliant person why he behaved as 
he did (and could rely on an honest answer), 
he would be likely to say “because of the 
way A confronts me.” Likewise an identifier 
would attribute his behavior to his wish to 
please A. An internalizer would find it less 
easy to attribute his behavior to A. The in- 
ternalizer knows that A is both attractive 
and yet behaves in ways that confront him. 
Faced with these two concurrent facts, it is 
not easy for him to attribute changes in him- 
self to the effect of A. If A is attractive to 
him, then he might assume that, like an iden- 
tifier, he could sustain his self-concept by 
doing what A wishes. But A’s confronting 
behavior shows B that he is not succeeding: 
There is a divergence betwen his self-concept 
and how he might like to behave. Similarly, 
if the internalizer finds himself confronted 
by A’s behavior, he might model his changed 
behavior on A while derogating A to himself, 
as would a complaint subject. But this course 
of action too is made difficult by A’s attrac- 
tiveness. In effect, each of the possible reasons 
the internalizer might advance for his having 
changed undermines the other. The potential 
internalizer might convert himself into a com- 
pliant responder by seeing his source as less 
attractive or into an identifier by focusing 
his attractions on those who do not upset his 
self-concept. To the extent that he does not 
use these options he must choose to wrestle 
with the complexity and possible confusion 
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of being confronted by those who are also 


attractive. Thus B can decide that face’ vith 
paradoxical encouragements to both ge 
and stay the same, he chose to change so 
doing B is attributing internal causa to 
himself. And if B finds that he is a to 


cause changes in his behavior in one sv iting, 
he is likely to try out similar changes in 
other settings. Similar mechanisms may well 
underlie the procedures of psychotherapy 
(Strong, 1970; Collins, Note 2). A variety 
of writers have discussed the role of the ex- 
perience of paradox in inducing therapeutic 
change (Perls, Hefferline, & Goodman, 1951; 
Watts, 1961; Watzlawick, Beavin, & Jackson, 
1967). Support for the notion that persist- 
ence of change is associated with feeling per- 
sonally responsible for one’s actions has also 
been obtained from experimental studies of 
forced compliance (Collins & Hoyt, 1972). 
This study tests the utility of the three 
modes of social influence in predicting the 
outcome of sensitivity training. Sensitivity 
training provides an ideal setting in which to 
test the model because it is usually on a 
brief but intensive basis, making it easier to 
distinguish the effects of training from the 
numerous other potential sources of personal 
change in daily living. Since sensitivity train- 
ing occurs in groups, each participant is ex- 
posed to a wide variety of potential influen: 
sources. The changes found during sensitivity 
training are substantial, but not all of these 
changes persist long after the end of train- 
ing. Smith (1975) reviewed 100 studies of 
sensitivity training, each of which involved 
the use of a repeated measures design, some 
form of control group, and a training period 
of not less than 20 hours. Of studies using 
measures completed immediately after ti 
ing, 78% detected significant change on 0. 
or more measures that was not shown by 
controls. Among this sample, 31 studies also 
included measures collected 1 month or mo: 
after training was complete. Only 68% of 
these studies detected significant persistence 
of change. Thus the evidence is that change 
frequently does arise from sensitivity train- 
ing and that some but not all of it does per- 
sist. The changes detected included more 
favorable self-concept, reduced prejudice, 
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modified scores on Schutz’s (1958) FIRO-B 
and Shostrom’s Personal Orientation Inven- 
tory (1966), and changed behavior as per- 
ceived by others not present during training. 


METHOD 


ubjects were participants in four 5-day resi- 
tial sensitivity-training programs organized by 
Group Relations Training Association and held 
England. The programs were held annually dur- 
ng 1972-1975 and comprised a total of 199 par- 
e participants comprised a mixed pop- 
of professionals including teachers, social 
psychologists, nurses, training officers, and 
s. There were 88 women and 111 men rang- 


ing in age from 21 to 50 years. The participants 
were mostly strangers to one another at first. 
The design for each of the programs was some- 


what similar 


The majority of time was allocated 
of training groups, each of which con- 
9-11 members and 1 or 2 trainers. Scheduled 
a roup meetings during the week took up 
0-25 hours, although some groups certainly met 
onsiderably longer hours. The remainder of the 
scheduled program was given over to a variety of 
community activities, which differed somewhat in the 
various programs, Community activities included 
laboratory exercises, research-form completion, pres- 

tations of themselves by each training group in 

n, a variety of nonverbal activities designed by 
the trainers, formation of temporary new groups, 
and so forth. Scheduled community activities aver- 
aged 10 hours. 

The data to be reported in this paper were col- 
lected on the morning of the final day of the pro- 
gram and by mail 5 months later. The measures of 
compliance, identification, and internalization were 
derived from ratings on a number of 7-point scales. 
Participants were asked to rate how much each of 
the members and trainers of their group had influ- 
enced them, how much they liked them, how much 
they trusted them, and how much each had be- 
haved in ways that made them feel tense. Each 
point on these rating scales was anchored by a verbal 
description. The anchors used for the extreme points 
of the various scales were: (a) “has not influenced 
me at all—“has influenced me a very great deal; 
(b) “did not like/her at all”-“liked him/her as 
much as I ever liked anyone”; (c) “did not trust 
his/her genuineness at all”-“I trusted his/her gen- 
uineness more completely than I ever trusted any- 
one”; (d) “did not make me feel tense at all” — 
“made me feel extremely tense.” The classification 
of participants into the three influence types was 
achieved through correlating each participant’s in- 
fluence ratings with his ratings of liking, trust, and 
tension. The liking and trust ratings were used as 
measures of attractiveness. Although perceptions of 
trust and liking are frequently strongly associated, 
some perceivers do distinguish them and it was con- 
sidered worthwhile to retain them as separate in- 
dexes. The ratings of tension were used as a mea- 
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sure of confrontation with the perceiver. As discussed 
earlier, the required measure needed to encompass 
not only situations in which A demanded overtly 
of B that he behave in a certain way but also the 
much more frequent occurrence in which A’s be- 
havior confronts B simply by its occurrence, whether 
or not A intends it to influence B. The incidence 
of tension was regarded as a valid indicator of the 
incidence of such confrontation. 

The allocation of participants to influence type was 
made according to the definitions given earlier, which 
are expressed here in their operational form: 

1, Complaint participants were those with a nega- 
tive correlation between influence and liking and a 
negative correlation between influence and trust but a 
positive correlation between influence and tension. 

2. Identifying participants were those with a posi- 
tive correlation between influence and either liking or 
trust and a negative correlation between influence 
and tension, 

3. Internalizing participants were those with posi- 
tive correlations between influence and either liking 
or trust as well as a positive correlation between in- 
fluence and tension. š 

The second set of measures required were measures 
of the incidence of change arising from the experi- 
ence, Smith (1975) has described the diversity of 
measures currently used to assess the effects of sensi- 
tivity training. Some evidence is available that 
change on these various indicators does not neces- 
sarily correlate highly (Smith, 1971). The present 
study used as its change criterion a global 7-point 
rating of “benefit,” completed by participants and 
trainers, The rating was completed with regard to 
oneself and to each of the other group members at 
the close of the week, and again 5 months later with 
regard to oneself only. The use of a global criterion 
Was seen as most appropriate at the present stage of 
the project. Since no particular hypotheses are here 
advanced as to the types of change to be expected 
from, for example, internalizers, the required measure 
is one that maximizes the chances of detecting the 
changes that do occur. The benefit ratings assigned to 
a particular member by all trainees in his training 
group except himself were averaged and are referred 
to as “other benefit.” The participant’s own “self- 
benefit” rating is treated as a separate criterion of 
change. Ratings by the group trainers were used as a 
third criterion measure, referred to as “trainer bene- 
fit.” An additional follow-up measure was available 
from the final two programs. Five months after these 
two programs, two people closely associated with 
each trainee were also asked to make ratings of how 
much the trainee had benefited. 


+ Participants on the final two programs were in- 
stead asked to rate how much others liked them. 
This change was adopted to approximate more 
closely the circumstance that would make it maxi- 
mally difficult for internalizers to attribute causality 
externally. 
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TABLE 1 
MEAN RATINGS OF BENEFIT AT THE END OF TRAINING 
Other Trainer 
Influence type Self-benefit benefit benefit 
Externalizers 5.6 (57) 4.6 (58) 4.1 (58) 
Internalizers 5.7 (99) 4.6 (108) 4.6 (108)= 
Unclassifiable 6.0 (8) 48 (33) 4.8 (32) 
Totalsample 5.6 (164) 4.7 (199) 4.5 (198) 


Note. Values of n are given in parentheses after each mean. 
In the self-benefit column 1 is somewhat lower because of those 
trainees who failed to include themselves in the ratings. 

* Mean for internalizers differs from that for externalizers 
(z = 2.00, p <.05), 


RESULTS 


Of the 199 participants, 5 were classified 
as compliant, 53 as identifiers, and 108 as in- 
ternalizers. The remaining 33 could not be 
classified, In seven instances, this was be- 
cause the respondent assigned the same socio- 
metric rating to all his sources, on the in- 
fluence criterion, In two instances, the par- 
ticipant was no longer present at the time the 
rankings were completed, whereas in the re- 
maining 24 instances the respondent failed to 
complete some or all of the rankings. Since 
these so-called unclassifiable participants con- 
stitute more than 10% of the participants, 
the data available for them is also presented. 

The small number of compliant participants 
obtained was not surprising given the values 
of sensitivity trainers, Since it was predicted 
both for compliants and for identifiers that 
they would show change during training but 
that this change would tend not to persist, 
the data for the two types were pooled, Par- 
ticipants in the combined category are re- 
ferred to as externalizers, since they should 
attribute change to sources external to them- 
selves, 

The mean benefit ratings completed at the 
end of the training programs are shown in 
Table 1, It can be seen that all types were 
favorable about the effects of the program on 
themselves, The verbal anchors corresponding 
to the relevant points on the rating scale 
were; 4—“derived some benefits,” 5—“de- 
rived some important benefits,” and 6—“de- 
rived some very important benefits.” No 
type was perceived as gaining greater benefit 
from the program either by self-benefit rat- 
ings or by other-benefit ratings. However, the 


PETER B. SMITH 


trainers were significantly more favorabl 
about the benefits achieved by internalizers,’ 
The assessment of benefit by trainers may be 
expected to be influenced by their much 
greater previous experience in sensitivity 
training. In particular, they can be expected 
to be more aware than most participants that 
change during training does not guarantee the 
persistence of change subsequent to training, 
In the present sample the trainers were more 
impressed by the changes shown by internal- 
izers. The means for benefit of unclassifiable 
participants were highest on all three criteria, 
although the differences were not significant, 

Participants were contacted by mail 5 
months after the training experience and in- 
vited to complete a further questionnaire that 
included the benefit rating. Usable responses 
were obtained from 61% of externalizers, 
69% of internalizers and 75% of unclassi- 
fiables. The means for these ratings are shown 
in Table 2. As predicted, benefit is now rated 
significantly higher by internalizers than by ` 
externalizers, Ratings completed indepen- 
dently by associates of the trainee also favor 
internalizers, but a much smaller sample of 
these was obtained and the difference does 
not achieve significance. 

Since the number of ratings available at 
follow-up is substantially reduced, it is im- 
portant to clarify whether the significant dif- 
ference in favor of internalizers is in me 
way a consequence of differential response, ` 
Table 3 shows that for those subjects who 
provided ratings on both occasions there 
a significant decrease for external: j 
whereas internalizers showed no significat 
change. The difference between the chazges 
for externalizers and internalizers is also sig 


TABLE 2 


Mean RATINGS or BENEFIT 5 Monrtns 
AFTER TRAINING 


Influence type 


Self-benefit Other bencñt 
Externalizers 4.3 (35 3.5 (15 
Internalizers 5.3 oe 3.9 ey 
Unclassifiable 5.2 (6) 4.0 (4) 
Total sample 5.0 (105) 3.9 (42) 


ole. Values of m are given in parentheses after each mean. 


* Mean for internalizers differs fı x iz 
Pesan on ts Irom that for externalizers 
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nificant (t = 2.01; p < .05). Thus the change 


Pi in self-benefit rating at follow-up is a real 


effect rather than an artifact of response rate, 

In a similar manner Table 3 shows the 

anges found in ratings of benefit by others. 
it is necessary to bear in mind that 
benefit ratings were completed by dif- 

nt raters on the two occasions. At the end 
he week the other-benefit ratings were an 
verage of ratings completed by each of the 
other members of the trainee’s training group. 
Five months later the ratings were based on 
responses by one or at most two close as- 
sociates of the trainee. These differences can 
be expected to contribute considerable vari- 
ance to changes in other-benefit ratings over 
5-month interval, but there is no reason 
to expect such variance to favor the hypothe- 
sis of differential change between external- 
izers and internalizers, The table shows that 
other-benefit ratings for externalizers declined 
significantly, whereas those for internalizers 
did not. The difference between the change 
for externalizers and that for internalizers, 
however, does not achieve significance (¢ = 
1,59). 

The changes in ratings for subjects who 
proved unclassifiable resemble more closely 
the pattern shown by externalizers. However, 
the small number of cases precludes a firm 
conclusion. 


the 


Discussion 


The findings support the hypothesis that 
internalizers are better able to sustain change 
arising from sensitivity training. At the ter- 
mination of training the superiority of inter- 
nalizers was apparent only to the trainers and 
not to the trainee himself or to other trainees. 
However, 5 months later there were signifi- 
cant declines in externalizers benefit as rated 
by self and others, whereas there were no 
significant changes in internalizer benefit. 

Although the measures used in this study 
did not derive precisely from Kelman’s con- 
ceptualization of social influence, the find- 
ings certainly support the utility of a typo- 
logical approach to the study of social in- 
fluence. It is important to bear in mind that 
neither the present model nor Kelman would 
assert that externalizers will never success- 
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TABLE 3 
MEAN CHANGE IN BENEFIT RATINGS OVER TIME 


Influence type Self-benefit Other benefit 
Externalizers —.9 (33)" —1.5 (15)* 
Internalizers —.3 (61) —.6 (26) 
Unclassifiable —1.2 (5) —1.5 (4) 

Total sample —.6 (99) —1.0 (45) 


Note. Values of n are given in parentheses after each mean, 
^ Changes significant by two-tailed £ test (p <.01). 


fully retain changes induced by training. In 
terms of the Kelman model, such success 
should occur where there is continuing salience 
of the influence source or of someone similar 
or where there is continuing surveillance by 
the influence source. In selecting for study 
training groups composed of strangers, the 
present project chose that population most 
likely to show internalizers as superior. Where 
sensitivity training is conducted within or- 
ganizations, very different findings may be 
expected. 

Tt was suggested that the reason why in- 
ternalizers would show better retention of 
change was that the pattern of their in- 
fluences within the training group would en- 
courage them to see themselves as causal of 
their change. Since no data are reported in 
the present study concerning the actual causal 
attributions used by trainees, this aspect of 
the model remains untested. An equally valid 
way of summarizing the findings would be to 
state that those who showed most durable 
changes were those who experienced most ten- 
sion from their influence sources, To differ- 
entiate this summary from the attributional 
analysis previously advanced, data would be 
required containing a higher proportion of 
compliant subjects. 

Explanation of the changes that were found 
in terms of increased self-attribution of 
changes does find some support from a study 
by Allen, reported in Lieberman, Yalom, and 
Miles (1973). Allen interviewed former en- 
counter group participants, asking them how 
they had attempted to maintain changes aris- 
ing from their group experience. She found 
that those who had succeeded in maintaining 
change stressed the taking of personal initia- 
tives, thinking out what one wanted to do, 
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and so forth. Those who had failed to main- 
tain change laid greater stress on the reac- 
tions of others subsequent to training, a pat- 
tern clearly similar to that predicted for ex- 
ternalizers. The encounter groups studied by 
Allen were also composed of initial strangers. 

The superior ability of trainers to differ- 
entiate internalizers at the close of training 
has some importance. At the present time 
many of the published studies of processes 
occurring during sensitivity training use as 
criterion measures some form of rating com- 
pleted at the close of training, often by train- 
ees. It is by no means clear how often such 
measures are validly predictive of subsequent 
change. Miles (1965 ) also found that ratings 
by trainers were predictive of subsequent be- 
havior change, whereas ratings by trainees 
were not, 

The creation of an internalization-oriented 
climate of influence within training groups is 
likely to depend on numerous variables, Smith 
(1974) has shown that in experimental train- 
ing groups, internalization was greatest when 
the composition provided some personal con- 
frontation. In the present sample, the propor- 
tion of internalizers in the 20 training groups 
we studied varied from 25% to 86%. Further 
studies will be required to clarify the degree 
to which this variation arises from personal- 
ity predispositions of the trainee, the mix of 


each particular group, or the orientation of 
the trainer, 
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Toward a Causal Model of Love 


„Abraham Tesser and Delroy L, Paulhus 
Institute for Behavioral Research, University of Georgia 


Under a proposed systems model of love, (a) thought about other and love for 
other have a positive causal impact on one another, (b) dating frequency and 
love have a positive causal impact on one another, and (c) dating has a posi- 
tive impact on “reality constraints,” which, in turn, (d) have a negative impact 
on love. Questionnaires administered 2 weeks apart to a 212-member panel of 
college students served as the data collection vehicle. Temporal order of the 
measures was used to determine causal direction, and path analysis was used 
to analyze these correlational data. Results support Hypotheses a, b, and d but 


do not support Hypothesis c. 


Some recent work demonstrates that 
thought about some person, thing, or idea in- 
the likelihood that one’s attitude will 
polarize (e.g., Sadler & Tesser, 1973; Tesser & 
Conlee, 1975). Perhaps this effect occurs be- 
cause thought alters one’s salient cognitions, 
making them more consistent with the initial 
attitude, Since affect is related to cognitions, 
thought polarizes attitudes. Given that per- 
sons like those they date, thinking will lead 
to the generation of favorable attributes 
(Tesser & Cowan, 1975) or the distortion of 
unfavorable attributes (Tesser & Cowan, in 
press). Thus, the more thought about a previ- 
ous date, the greater the resulting love for 
that date. Reciprocally, we assume that ex- 
tremity of feeling about some object increases 
the likelihood of its being thought about. In 
the present context, then, thought about a 
date should be a positive function of one’s 
love for that date. 

We have suggested that thought induces a 
systematic alteration of cognitions. Such 
changes in cognitions, although consistent with 
the initial affect, may be inconsistent with 
reality. Persons need to maintain an accurate 
picture of the world and are quite responsive 
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Tesser, who is now at the Department of Psychology, 
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Paulhus is now at Columbia University. 


to reality (Byrne & Clore, 1967; Festinger, 
1954; White; 1959). Thus, thought-induced 
changes in cognitions (and attendant attitude 
change) will be attenuated to the extent that 
such changes are inconsistent with “testable” 
reality. In a recent study (Tesser, 1976), it 
was reasoned that although thought results in 
greater attitude change than no thought (i.e, 
distraction), the presence of the attitude ob- 
ject during thought should facilitate reality 
testing and hence reduce attitude polarization 
compared to thought about an absent object. 
The results for female subjects* were as ex- 
pected. 

Speculation consistent with this theorizing 
and research is not new to the interpersonal 
attraction literature. According to Kerckhoff 
and Davis (1962), a “point so often stressed 
in the literature [is] that couples go through 
a period of idealization and perception distor- 
tion which may lead to disillusionment (or 
“reality shock”) at a later date” (p. 302). In 
the present study, the term reality constraints 
is defined as knowledge that is inconsistent 
with one’s own thought-produced expectations 
about one’s date, Granting our assumption 
that individuals are initially attracted to the 
persons they date, thought-produced expecta- 
tions about those persons should have a posi- 


1 Although males were also run, the independent 
variables had no systematic effect on them, Since the 
stimuli were art reproductions, this sex difference was 
interpreted as reflecting better developed schemas in 
women than men for thinking about this stimulus 
domain. (See Tesser, 1976; Tesser & Leone, in press, 
for discussion.) 
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Ficure 1. Model of love. 


tive valence, Thus, knowledge inconsistent 
with these expectations, that is, reality con- 
straints, should lead to reduced positivity, and 
hence, to decreased love. Information pro- 
vided by others can serve as a source of such 
“disconfirming expectancies” as well as con- 
tact with the individual himself. Being with 
the individual forces awareness of his looks, 
actual behavior, mannerisms, etc. Thus, in the 
present study, we expect dating frequency to 
have a positive effect on reality constraints. 
Although there are a number of variables 
that affect dating frequency (e.g., availability 
of time and money, geographic proximity, 
transportation), we would expect love also to 
increase dating frequency. On the other hand, 
there are a number of reasons to expect dating 
frequency to have a positive, reciprocal im- 
pact on love. For example, frequency of ex- 
posure per se produces positivity (Zajonc, 
1968). Increased dating increases predictabil- 
ity, which is associated with positivity. Dating 
tends to be associated with “good times,” 
which could lead to classically conditioned 
positive regard for the date (Lott & Lott, 
1968; Staats, 1975). Increased dating is as- 
sociated with an increased number of common 
experiences, which, in turn, could lead to 


greater similarity, and similarity produces at- 
traction (Byrne, 1971). Dating often in- 
volves the voluntary expenditure of time and 
money, and one way to justify this expendi- 
ture is to see the date as very attractive 
(Festinger, 1957). Increased dating can serve 
as a public commitment to the date, and in- 
creased commitment tends to “freeze” one’s 
attitude (Kiesler, 1971). 

In sum, attitude extremity produces 
thought. Since thought biases cognitions in the 
direction of the initial attitude, unexpected 
information (i.e., reality constraints) will tend 
to be in a direction opposite to the initial at- 
titude. Thus, thought polarizes attitudes, and 
reality constraints moderate such changes. 
Dating produces reality constraints and has a 
reciprocal, causal relationship to love. Finally, 
we assume that these variables do not com- 
prise a closed system but have determinants in 
addition to those explicitly dealt with here, 
that is, “‘residual causes.” The entire model is 
shown in Figure 1, 


Design and Analysis 


As noted above, several of the hypothesized 
relationships have been subject to previous 
laboratory tests, However, we share with a 
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growing number of our colleagues (e.g., Mc- 
Guire, 1973) a certain discomfort in relying 
exclusively on the outcome of laboratory ex- 
periments, In the present study, we systemati- 
cally and simultaneously evaluate a system of 
nownanipulated variables that may exhibit 
yrocal causality and from which we can 
proximate causal inferences. We do this 
hrough path analysis of time-lagged variables 
and “retrospective questions.” 
Path analysis is useful for testing an explicit 
multivariate causal model by seeing if 
xe empirical correlations among the relevant 
variables are consistent with the causal model. 
One question that path analysis addresses is, 
Does a hypothesized cause in fact have an im- 
pact? This question is answered descriptively 
by computing the path coefficient from the 
hypothesized cause to the hypothesized effect. 
Under some conditions a path coefficient is 
identical to a standardized regression coef- 
ficient; therefore, multiple regression analysis 
and path analysis are very closely related. 
Since most of us are familiar with the usual 
ind of multiple regression approach, a brief 
comparison might be useful. Both techniques 
allow for the ready estimation of percentage 
of variance accounted for. Path analysis, how- 
ever, appears to be better adapted for theory 
testing. For example, suppose Variable A 
causes B, which in turn causes C. Since A has 
its effect on C only indirectly through B, A 
will not share any variance with C that it 
does not also share with B. Thus, if we were 
to use a multiple regression analysis and re- 
gress C on A and B simultaneously, A would 
get a zero weight and drop out as being ir- 
relevant. On the other hand, a path analysis 
approach, by separately regressing C on B 
and regressing B on A, would show both A’s 
direct effect on B and its indirect effect on C. 
In addition to providing estimates of indirect 
effects, path analysis provides for a second 
test of theory that is absent in the usual mul- 
tiple regression approach, The second test ad- 
dresses the question, Does the model as a 
whole adequately fit the data? This question 
is answered by comparing the observed cor- 
relations among the variables with the ex- 
pected correlations given the correctness of 
the model. Finally, path analysis provides for 
the decomposition of the overall correlation 
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between any two variables into their causal 
and noncausal components. 

The assumptions (Land, 1969) underlying 
the use of recursive path analysis include the 
following: The variables are measured on an 
interval scale, and the relations among the 
variables are linear and additive. The causal 
flow is unidirectional; if A causes B, B would 
not also cause A. (It is this assumption that 
makes the model recursive.) The causal direc- 
tion can be unambiguously stated; that is, if 
there is a link between A and B, it is possible 
to assume a priori that A is the cause of B 
rather than vice versa. 

Another important aspect of our design in- 
cluded the measurement of each variable at 
two points in time and the use of retrospec- 
tive questions. The temporal ordering of vari- 
ables is a crucial component in causal infer- 
ence that we attempted to exploit in this non- 
experimental study. Recall our model as pre- 
sented in Figure 1. There are a number of in- 
stances of reciprocal causality, for example, 
thought produces love, and love produces 
thought. These instances appear, at least 
superficially, to be inconsistent with the as- 
sumption of recursiveness. Since a cause must 
precede an effect in time, it is possible to un- 
ambiguously infer causal direction through 
use of a panel design. Accordingly, each mea- 
sure was administered on two occasions sepa- 
rated by 2 weeks. Further, each of the vari- 
ables, except love, was measured using “retro- 
spective questions.” Subjects were asked, for 
example, to report the extent to which they 
thought about their partner during the im- 
mediately preceding 2 weeks. Love was mea- 
sured as of the particular point in time of the 
administration, Assuming that responses to 
the retrospective questions actually reflect the 
state of the variables for the time asked about, 
within each administration love could be an 
effect but not a cause of any of the other vari- 
ables. Figure 2 shows each of the variables at 
each administration. The a priori causal model 
specified in Figure 1 is mapped onto these 
variables in a way that is consistent with the 
assumptions of path analysis.” 


2 There are two kinds of variables in a path model, 
namely, exogenous variables, which cause but are not 
influenced by any of the other variables explicitly 
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Ficure 2. Proposed causal model for path analysis. (Each single-headed arrow denotes a hy- 
pothesized causal path. The curved, two-headed arrows denote unanalyzed correlations among 


exogenous variables.) 


Let us focus first on the paths between ad- 
ministrations. Assuming some psychological 
inertia, we expect a particular variable at one 
point in time to exert a causal influence on the 
state of that variable at a later point in time. 
More importantly, by using temporal order to 
determine causal direction between different 
variables, the specific model to be tested is 
formally recursive. For example, thought at 
Time 1 (T1) causes love at Time 2 (L2), and 
love at Time 1 (L1) causes thought at Time 2 
(T2), but L2 is not hypothesized to cause T1, 
and T2 is not hypothesized to cause L1. Simi- 
larly, the relationships between love and dat- 
ing and love and reality constraints are as- 
sumed to exert a reciprocal causal impact on 
one another. However, since this influence is 
assumed to be time dependent rather than 
simultaneous, they are shown recursively in 
Figure 2. 

The temporal order is also maintained 
within the endogenous (see Footnote 2) vari- 
ables measured at Time 2. Variables measured 
by retrospective questions (thought, reality 
constraints, dating) are hypothesized to cause 
love, a contemporaneously measured variable. 
Note that each endogenous variable has a 
residual cause. This residual includes all in- 


specified in the model, and endogenous variables, 
which are at least partially determined by and may 
also serve as causes of other explicitly specified vari- 
ables, 


fluences on the variable that are not specified 
by the model. It is assumed that each residual 
is uncorrelated both with the other residuals 
and with the variables immediately determin- 
ing the endogenous variable in question. 

Because of this assumption the variables at 
Time 1 and at Time 2, while measured in 
exactly the same way, are treated differently. 
At Time 2, the variables are hypothesized to 
be causally related; at Time 1 they are 
treated as correlated, exogenous variables. Re- 
call that the variables at Time 1 are hypothe- 
sized to affect those at Time 2. There is an 
analogous set of unmeasured variables at 
Time 0 that is affecting those at Time 1. The 
variables at Time 0, like those taken at Time 
2, are expected to be causally related. Since 
they are expected to be correlated, the as- 
sumption of independence of residuals would 
be violated if we treated the Time 1 variables 
as endogenous. Path analysis does, however, 
allow for the correlation of exogenous vari- 
ables. Although these correlations have an 
impact on the estimated path coefficients, the 
correlations remain unanalyzed in any causal 
sense and are treated as “givens.” Thus the 
Time 1 variables are treated as exogenous. 
(We assume that the unmeasured variables 
prior to Time 1 have their effect on the 
variables at Time 2 only via their effect on 
the variables at Time 1.) 
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TABLE 1 
MEANS, STANDARD DEVIATIONS, AND CORRELATIONS AMONG THE MEASURES 


Variable 1 2 3 4 5 6 7 8 M sD 

inistration 1 

rhought (T1) — .728 129 430 741 612 —.027 464 9.83 3.59 

Love (L1) = — 224 451 .748 .830 094 .495 50.66 19.49 

ity constraints (RC1) — 086 154.279 242 104 5.08 1.80 

z (D1) — 414 404 108 [806 3.07 2.87 
ration 2 

ght (T2) — 764 161 503 9,20 3.75 

e (L2) æ „103 .505 49.27 20.67 

y constraints (RC2) — 070 498 1.72 

= 2.95 3.16 


Dating (D2) 


METHOD 
Subjects 


Subjecte were run in intact classes of introductory 
sociology, introductory psychology, and psychology 
of adjustment. The final sample (subjects who were 
unmarried and who completed both administrations 
of the questionnaires) consisted of 96 males and 106 
females. Their mean age was 19.47. 


Procedure 


The experimenter told each class that he was 
interested in studying attitudes toward dating, that 
»articipation in the study was completely voluntary, 
and that responses would be kept confidential. Since 
going out on a formal date is fast becoming an 
archaic custom, dating was defined as “going out 
with someone of the opposite sex where there is some 
romantic interest involved on somebody’s part. This 
might mean just going for a walk with them or 
staying with them at a party.” All but four of the 
unmarried students in all of the classes volunteered 
to complete the questionnaire. 

Exactly 2 weeks later, the experimenter returned 
to each class with the second questionnaire. Since 
the questions in the first administration concerned 
the last person the subjects had dated at that time, 
subjects were reminded to keep that same person in 
mind while answering the questions this second time. 
They were told: “Now, you may feel differently 
about this person or you may feel the same way, but 
we want you to think about your present feelings, 
so forget about your previous answers.” (Because of 
absences, about 25% of the questionnaires could not 
be matched.) After completing the second question- 
naire, subjects were debriefed, thanked, and returned 
to their instructor. 


Instrument 


The instrument, which contained items in addition 
to those under discussion, took about 10 minutes to 
complete. Thought was indexed by the sum of two 
retrospective 7-point scales asking how often (1 = 
not at all, 7 = very often) and how much time (1 = 
none at all, 7 =a great deal) subjects thought about 


the target person “during the Jast 2 weeks.” The 
corrected internal consistency of this measure was 
.95. Love was measured by Rubin’s (1970)? nine- 
item love scale, a scale with good construct validity 
and a reported internal consistency in the low .80s, 
To reflect our definition of reality constraints, sub- 
jects were asked: “To the extent that you acquired 
any new information about the (target) person, did 
that new information tend to confirm or contradict 
your expectations?” This also was a retrospective 
question and referred to the prior 2-week period. 
Note that in order to make its predicted association 
with love positive, this item was inversely scored 
so that high levels of reality constraints were asso- 
ciated with small numbers (1=contradicted my 
expectations, 7 = confirmed my expectations). Fi- 
nally, dating was indexed by the subject’s self-report 
of the number of dates with the target person in 
the past 2 weeks. Means, standard deviations, and 
intercorrelations for each of the measures are pre- 
sented in Table 1. 


RESULTS AND DISCUSSION 


For the model proposed in Figure 2, path 
(standardized regression) coefficients were 
obtained by the regression of each effect or 
endogenous variable (normalized) on only 
the variables (also normalized) postulated to 
be the immediate causes of each effect or 
endogenous variable. A path coefficient can be 
interpreted as the proportion of the standard 


8 Rubin distinguishes between liking and loving. 
Liking refers to a traditional conception of interper- 
sonal attraction. Loving includes affiliative and de- 
pendent needs, a predisposition to help an orientation 
of exclusiveness. Rubin has developed separate mea- 
sures for each. Even though both were included in 
this study, the major focus and the results presented 
pertain to love. The resulting pattern of path coef- 
ficients with liking was similar, as one might expect 
given the correlation between liking and loving (r= 
.63), but the fit between model and data was not 
quite as good. 
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The correlations among the exogenous variables are shown appended to the two-headed arrows.) 


deviation of the endogenous variable ac- 
counted for by an immediate cause, with the 
effects of all other hypothesized immediate 
causes held constant. Like the more tradi- 
tional regression coefficient, the path coeffi- 
cient can be tested for statistical as well as 
practical significance, In this case, the path 
coefficients from D1 to RC2 (prospi 088, 
#= 1.27) and RC2 to L2 (Praxce = —.04, 
t < 1) failed to meet this test (D1 = date at 
Time 1; RC2 = reality constraints at Time 
2). Therefore, RC2 and the paths leading to 
and from it were dropped. The path from 
D1 to L2 was also nonsignificant (prop; = 
—.086; ¢ = —1.42) and it too was deleted.‘ 
The path coefficients for the revised model 
reflecting the deletions were calculated and are 
shown in Figure 3. 

Before examining these path coefficients in 
detail, we should note that a path coefficient 
represents only one component of the correla- 
tion between two variables. Given a particu- 
lar causal model, it is possible to analytically 
decompose the correlations between each en- 
dogenous variable and all the other variables 
in the system. (Recall that correlations among 
the exogenous variables are treated as givens, 
They go unanalyzed, and no particular causal 
significance is attributed to them.) Each cor- 
relation can be decomposed into a causal 
component and a noncausal component (Al- 
win & Hauser, 1975), The causal component 
can be further subdivided: The path coeffi- 
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coefficients are significant, T) 


ADMINISTRATION 
AT TIME 2 
(ENDOGENOUS VAR 


hose with an asterisk 


asterisk are significant beyond the 05 level. 


cient indexes the magnitude of the direct 
causal effect. The indirect causal effect refers 
to the extent to which a cause results in a 
change in an effect indirectly by causing a 
change in an intervening or mediating varia- 
ble. Suppose, for example, that Variable A 
Causes Variable B, which in turn causes Varia- 
ble C. A has a direct causal effect on B of 
magnitude py, and an indirect causal effect 
on C of magnitude rapor. The magnitude of 
the indirect causal effect is the product of the 
path coefficients along the path leading from 
cause to effect. If there is more than one such 
path, the total indirect causal effect is the 
sum of these products. The noncausal effect 
refers to the extent to which two variables are 
correlated not because one causes the other 
but because both are either results of the same 
cause or of correlated causes, Table 2 shows 
the decomposition of each correlation given 
the model in Figure 2. 


Thought and Love 


on. love for that date. Both T1 and T2 are 
implicated in this hypothesis, but before look- 


*The deletions were 


actually made sequentially, 
The path from D1 to L2 


Was not significant regard- 
less of the presence of the path from RC2 to L2 and 
the path from RC2 to L2 was not significant regard- 
less of the presence of the path from D1 to L2. 


= 


TOWARD A CAUSAL MODEL OF LOVE 1101 


TABLE 2 
DECOMPOSITION OF CORRELATIONS AMONG MODEL VARIABLES 


Causal association er Noncausal association = Total associ- 
Direct + Indirect = Total ERSA aen Ae Aon 
Pre- Ob- 
able dicted* tained 
T1LT2 prem = AIS frifri 323 741 741 
(418) (323) 
T2L2 pieta — 370 rripa + rrn prena + -390 700.764 
(370) (444) (041) 
rrer piercer + ree’ prem 
(015) (=.110) 
TiL2 prer Previpiere .006 rmp + rripreripiers + 606 612.612 
(=.149) ~ (455) (436) (119) 
Trw prep + rrincipiarcs 
(.038) (013) 
LiT2 Preta = 444 rum prets 304 -148 TAL 
(444) (.304) 
RC1L2 Ptarer — (.099) TROLD + rrorrepier: + 180 279.279 
(.099) (134) (057) 
rnc” Pipe + rreiripier 
(008) (—.019) 
Lib2 fous = 165 E AS 330 495 495 
(.165) (330) 
D2L2 Lape - 081 Piatarpeta + roer prore + Al4 495 .505 
081) (297) C186) 
rosrcipiercr + FLTirpzT 
(.010) (—.069) 
D1IL2 — pomi X .059 tonais + roir prer: + -381 440 404 
Piape (.270) (.153) 
(.059) roinciPierci + apa ah 
(.009) 013) 
(lee 
— 064 
D1D2 D2D1 — .732 foin 074 „806 806 
732) (074) 
LiL2 Pista rupi + 176 rutiprenipiate + runipoipiso + -053 829 830 
599) (164) (113) (027) 
$onapiD rurcipPiarer + rumpiars 
{ (.022) —.109) 
p2T2 — —- ad Prerirripippspi + prauiruipippan + 401 401 503 
(132) (146) 
Pripp + Preirrnapona 
(.073) (.050) 
T1D2 — ar = ruDi$om + rrappara 435 435 464 
(315) (120) 
D1T2 — — = romipreri + ronipri .380 380 414 
(180) (200) 
RCIT2 — — — TROLIÊTLI + FROTIPTET 154 153.154 
(100) (054) 
RC1D2 = =- — TRCIDIŻDD: + FRCLIPDLI .106 106.104 
(.069) (037) 


Note. T1 = thought at Time 1; T2 = thought at Time 2; L1 = love at Time 1; L2 = love at Time 2; RC = reality constraints 
at Time 1; RCD S enlity constraints at Time 2; D1 = date at Time 1; D2 = date at Time 2. Values in parentheses refer to the 
numerical value of each expression. z é k 

a Where there is A direct cane link between the variables, the predicted and obtained association are logically equivalent. The 
differences that appear in the table are due to rounding error. On the other hand, in the absence of a direct causal link, the predicted 
and obtained associations need not be the same and their discrepancy reflects on the adequacy of the model. 

» This correlation is decomposed elsewhere in this table. 
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ing at their relationship with L2 let us exam- 
ine their relationship to each other. Asšum- 
ing some “psychological inertia” we hypothe- 
sized a causal path from T1 to T2. As can be 
seen from Figure 3 and row 1 of Table 2, the 
hypothesized direct effect is substantial (Prr 
= 418). The balance (.323) of the overall 
association (rpypo = -741) comes from the 
fact that there is an unanalyzed correlation 
between T1 and Li (rey = 728), and L1 
has a causal impact on T2 (Pron. = .444; 
728 X 444 = 323), 

The direct effect of thought on love (T2) 
during the 2-week period immediately pre- 
ceding the measurement of love (L2) is prove 
= .370. This effect is substantial and provides 
Strong support for the hypothesis. Note that 
the overall association between these variables 
iS rove = .764. The difference between the 
causal and total association reflects noncausal 
associations primarily involving L1, although 
D2 (dating at Time 2), RCL (reality con- 
straints at Time 1), and T1 also enter in. 

The longer term effect of thought on love 
can be examined by looking at the impact of 
T1 on L2. Recall that T1 reflects the amount 
of thought during the 2-week period that 
ended 2 weeks prior to the measurement of 
L2. The direct effect, Prom = —.149, is nega- 
tive and contradicts the hypothesis. However, 
as can be seen in Table 2, row 3, T1 has a 
direct effect on T2 (Prem = -418) and T2, in 
turn, has a direct effect on L2 (brer = 370), 
This indirect effect of T1 on L2 (Preri prove 
= .155) is comparable in magnitude but op- 
posite in sign from the direct effect, and thus 
the total causal impact of T1 on 12 is near 
zero. Although the total causal relationship is 
negligible, the correlation between the varia- 

bles is substantial (reine = 612), Again, L1 
appears to play the most important role in 


These field data appear to supplement the 
laboratory findings concerning the role of 
thought in attitude change. They extend the 
apparent time range of the effect. Most of 
the laboratory work has used time frames 
Measured in seconds and minutes (e.g., Tesser 
& Conlee, 1975). The present data suggest 
that the amount of (self-reported) thought 


ABRAHAM TESSER AND DELROY L, PAULHUS 


during a period of up to 2 we ids to 
result in increased attitude polari (i.e, 
love). In short, “idealization” (+ Hf & 
Davis, 1962) does play a role i ntic 
attraction. 

The longer term effects of th we 
complex and provide information th ld 
be impossible to obtain with a sh n 


laboratory thought manipulation and , 
to detect with a less analytic approac 
correlational data. The overall causal imp 
is negligible because the negative direct effec 
and positive indirect effect are compensating. 
The indirect effect of T2 is straightforward 
and consistent with our hypothesis: Ty has 
an impact on T2, and T2 causes love, The 
direct negative effect of T1 on love is more 
problematic, Why should the immediate ef- 
fect be positive and the delayed effect be 
negative? 

Perhaps thought-generated changes in feel- 
ings require continued thought and/or other 
things beyond thought (e.g., various kinds of 


Another more plausible Possibility assumes 
that thought has only a relatively short-term 
impact on love, That is, the causal interval is 
less than 2 weeks and, therefore, T1 has no 


“causal” Impact on L2. Since T1 has a strong 


relationship 
with L2), 


gives Ti a negative weight to “suppress” some 


Psychological inertia (T1) and love (L1), 
Taken together they account for most of the 
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variance (R* = .642) in thought; all other 
(residual) causes combined therefore account 
for only 36% of the variance, 
There is yet another interpretation of the 
relationships between thought and love, 
v, Spuriousness.’ Assume that the retro- 
thought measure summarizes the 
of affairs 1 week prior to the love mea- 
of the same administration: T1 repre- 
nts Week 1, L1 represents Week 2, T2 rep- 
resents Week 3, and L2 represents Week 4. If 
Jove and thought are not causally related but 
y correlated because they share a 
common cause, we would expect rayn1 = rurra 


Prone > Yang, Since each of the first three 
correlations represents time differentials of 1 
week, while the fourth represents a differential 


of 3 weeks, The obtained zero-order correla- 
tions are consistent with this interpretation 
(.728, 741, .764 and .612, respectively). Al- 
though this explanation cannot be unequivo- 
cally ruled out, we are inclined to discard it 
for two reasons. First, it is not immediately 
apparent what the common cause of love and 
thought might be. Second, previous laboratory 
esearch has shown that thought, at least 
under some conditions, does play a causal role 
in attraction, 

Reality Constraints 

We hypothesized that reality constraints 
would have a negative impact on love. (Since 
the concept of reality constraints was in- 
versely scored, the sign of the path should be 
positive.) The design allowed for two tests of 
this hypothesis: one involving the immediate 
effects (Pierce) and one involving the longer 
term effects (Prore1). Recall that the former 
was insignificant and dropped from the model. 
The long-term effect, though small, does, as 
predicted, significantly affect L2. 

A priori one would not expect the delayed 
effect to be stronger than the immediate effect. 
Since the observed result was unanticipated, 
and since both coefficients are small in abso- 
lute value, pure chance is a possible explana- 
tion. On the other hand, there may in fact be 
a delayed impact. New information may re- 
quire an “incubation period” before it be- 
comes operative, or people may be reluctant 
to recognize the invalidity of their own expec- 
tations, and therefore it takes time to aban- 
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don them. Perhaps they only abandon them 
after thoroughly testing the new information, 
and this testing takes time. Finally, it may be 
that it takes time for disconfirmed cognitive 
expectancies or beliefs to impact feelings (e.g., 
McGuire, 1960). 


Dating 


We expected dating to increase reality con- 
straints. The only way to test this hypothe- 
sis, using temporal order to determine causal 
direction, is by examining the path coefficient, 
Prozor. It will be recalled that this coeffi- 
cient was not significant and was dropped. 
There are several possible reasons why the 
impact of dating was negligible. Dates may be 
on their best behavior and offer little chance 
for disconfirming positive beliefs, Altercasting 
may be relatively easy in the dating situa- 
tion, with the result that partners set one an- 
other up for mutual self-fulfilling prophecies, 
The dating situation may be one in which 
“interpretation” of behavior in accord with 
expectancies is relatively easy since there are 
relatively few negative consequences of such 
interpretation. 

We hypothesized that love would produce 
dating and that dating would produce love. 
Both hypotheses appear to have been con- 
firmed. The path from L1 to D2, though mod- 
est (Poor = -165), is significant, as is the 
path from D2 to L2 (pəpə = .081). It was 
also possible to examine the delayed impact of 
dating on love by looking at the effect of D1 
on L2. Recall, however, that this effect 
(Pu2p1) Was nonsignificant and dropped from 
the model leaving only a negligible indirect 
effect through D2. 

The effect of love on dating is small, How- 
ever, it (L1) and previous dating (D1) are 
the only hypothesized causes of current dat- 
ing, and together they account for 67% of the 
variance in current dating (R? = .672). Since 
the causal influence of prior dating history, 
by itself, accounts for so much of the variance 
(Po2p1 = .732), there is not much variance 
left for L1 to control. Perhaps the magnitude 
of Poor. should be interpreted in this light. 

Similarly, the lion’s share of the variance in 


5 We are grateful to an anonymous reviewer for 
suggesting this interpretation, 


“eae 
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one’s present feelings about someone (L2) can 
be accounted for in either the direct (Pro = 
-599) or indirect (.177) causal effects of one’s 
past feelings (L1). When one adds to that 
the effects of thought and reality constraints, 
the meaning of the size of the contribution of 
dating to love is put into perspective. (Note 
that the model accounts for over 75% of the 
variance in L2, R? = .758.) 


Reproducing the Correlations 


The path coefficients and correlations among 
exogenous variables can be used to estimate 
the remaining correlations. Wherever the 
model does not specify a direct path between 
two variables, the estimated correlation be- 
tween these variables and the actual correla- 
tions are not necessarily the same. There are 
six such relationships in the present model: 
D112 and the last five entries in Table 2. On 
the whole, the model does fairly well. (The 
relevant numbers are in the last two columns 
of the table.) The absolute difference between 
the estimated and observed correlations, after 
converting each to z, was obtained and aver- 
aged over the six cases. This value was com- 
fortably small, .027 z units. The largest dis- 
crepancy, .102 for D2T2, appears to be within 
tolerable limits. Finally, Land (1973) has de- 
veloped an extremely sensitive test for assess- 
ing whether the discrepancy between esti- 
mated values and obtained values for the 
model as a whole is beyond chance. In this 
case, x?(6) = 11.39, which is not significant. 


CONCLUSIONS 


Although path analysis can lead to the re- 
jection of an inappropriate model, it cannot 
“prove” that a particular model is correct. It 
can only confirm that a particular model is 


enansar Wke Obtained correlations. Path 


coefficients estimate the direct causal rela- 
tionships among the theoretical constructs 
only to the extent that the measures are both 
reliable and valid. As indicated earlier, the 
love measure seems to have satisfactory reli- 
ability and validity (cf. Rubin, 1970). The 
other measures are, however, ad hoc, and the 
reader is cautioned to consider this in weigh- 
ing the results. 

We have used retrospective questions as a 
way of temporally ordering the data in order 
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to establish causal direction. The most im 
portant problem unique to such a technique 
concerns relevant systematic bias in memory 
(e.g., the more we love someone, the more 
likely we are to recall spending time with hin 
and thinking about him). The use of a panel 
to establish temporal order has its problems 
too. Since respondents answer on two occa- 
sions, they can systematically bias their an- 
swers to be consistent or to exaggerate 
changes. The time between administrations, 
2 weeks, was chosen arbitrarily. This interval 
might be longer or shorter than the optimal 
causal interval for any or all of the variables. 
To minimize these problems, retrospective 
questions appeared first on the questionnaire, 
responses were kept confidential, and the im- 
portance of accuracy and honesty was stressed 
to the respondents. 

Taking the present problems into account 
and considering previous research and theory, 
it appears to us that a number of tentative 
conclusions can be drawn from this study: 
(a) Degree of love for a person appears to be 
a direct function of amount of recent thought 
about that person. On the other hand, the 
long-term effect of thought on love might be 
negative. (b) Thought appears to be a direct 
function of degree of love. (c) The role of 
reality constraints in determining love is am- 
biguous. Since its immediate effect was not 
significant and its delayed effect was small, 
further research is needed. (d) Dating appears 
to be both a cause and effect of love, but in 
neither case is the relationship strong. (e) 
Exposure through dating does not appear to 
play much of a role in the disconfirmation of 
one’s expectancies about others. (f) When one 
includes prior states of each of the variables 
as determinants of present states, the model, 
while certainly not complete, accounts for 
most of the variance of each endogenous 
variable. 

Finally, the methodology used here appears 
promising as a vehicle for studying a wide 
range of social-psychological problems. The 
measurement of variables at two points in 
time provides some leverage in approximating 
causal inferences with nonexperimental data, 
and path analysis helps to untangle the web 
of intercorrelations among nonmanipulated 


P: 


i 
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variables. Although the logic of time-lagged 

-measurement and path analysis is certainly 

not new, these techniques appear to have been 

q relatively infrequently by social psy- 

ists. Their utility in the present study 

succes.s that they may have broad applicabil- 
ought to be exploited more often. 


ity 
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Conditioning and Discrimination of Affect and Attraction 


Paul R. Bleda 
U.S. Army Research Institute, Arlington, Virginia 


This experiment assessed the operation of conditioning and discrimination pro- 
cesses as reflected in the affective experience and evaluative behavior of indi- 
viduals within a competitive group context. Male (n =45) and female (n 
=57) subjects were randomly paired with a member of the same sex and 
were then separated into three teams, which competed for the highest score 
on a manual task. All members received either favorable or unfavorable eval- 
uations from one of their opponents and then indicated their reactions to both 
other groups. The results showed that the team responsible for unfavorable 
rather than favorable feedback evoked significantly (p < .001 in all cases) more 
depression, anxiety, and hostility in recipients as well as more negative ratings 
of that team’s attraction (direct conditioning). In addition, affective and eval- 
uative responses to the control group were intermediate (p < .001 in all cases) 
to those elicited by opponents directly responsible for unfavorable or favorable 
evaluations (discrimination). Moreover, the control team evoked more hostility 
in males (p <.001) and more anxiety and less attraction in both sexes (p < .05 
in both cases) after unfavorable rather than favorable feedback had been re- 
ceived (indirect conditioning). Also, the opposing teams generated more anxiety 
(p < .02) and hostility (p < .005) in males than in females. It was concluded 
that the processes that mediate interpersonal social behaviors also operate in 


an intergroup context. 


The reinforcement-affect (Byrne & Clore, 
1970; Clore & Byrne, 1974) and associative re- 
inforcement (Lott & Lott, 1974) models of at- 
traction apply basic principles of learning to 
the attraction situation. Within both of these 
frameworks, social interactions are conceptual- 
ized in terms of the reinforcement properties 
inherent in relevant stimuli in a situation and 
responses to them. A major premise of these 
formulations is that any reinforcing event (un- 
conditioned stimulus) will evoke in those ex- 
posed to it an implicit affective or anticipatory 
response (unconditioned response) that is as- 
sumed to fall along an unpleasant-pleasant 


Portions of this article were presented at the 21st 
International Congress of Psychology held in Paris, 
August 1976. 
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express his appreciation to Sharon Estee Bleda for her 
helpful comments and to Donn Byrne, who served as 
advisor to the thesis. It also should be noted that the 
views expressed in this article are those of the author 
and do not reflect those of the U. S. Army or any other 
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subjective continuum. Through tempor: 
sociation with such affect, previously unkni: 
and presumably neutral persons (conditior | 
stimulus) become capable of eliciting a con: 
tioned response that mediates various forms « 
approach-avoidance responses including verba 
evaluations of attraction. 

Empirical support for these propositions has 
been provided by a demonstration (Singh, 


1974) of the differential impact of varying mag- ' 


nitudes and proportions of positive reinforce- 
ment on the affective experience and evaluative 
behavior of recipients. Subjects received vari- 
ous proportions of personality similarity, atti- 
tude agreement, and positive personal evalua- 
tions that had been found to vary in 
reinforcement magnitude from low to high. 
Recipients then indicated either their immedi- 


a 


ate personal feelings or their attraction toward » 


the stranger directly associated with the rein- 
forcements. The results showed that with 
greater magnitudes of rewards and punish- 
ments the slopes of the linear relationships be- 
tween proportion of positive reinforcements 
and both self-ratings of affect and attraction 
tended to increase, whereas the y intercepts 
decreased. The parallel findings for personal 
feeling and attraction support the notion that 
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evaluations of another person are mediated by 


. the implicit affective responses evoked by rein- 


forcements linked directly to him. 
logical derivation of these models is that 
re temporal association of a neutral 
and reinforcement would be a suffi- 
ndition for the attraction effect (Byrne, 
|. Thus, a more stringent test would be 
ided by a situation in which there were no 
strumental relationship between the one 
being evaluated and the unconditioned stimu- 
lus. Lott and Lott (1960) conducted such a test 
by varying the amount of material reward (e.g., 
toys) given in games involving three children 
ind determining each child’s choice of two 
classmates to accompany on an imaginary ex- 
cursion. Though rewards were not made con- 
tingent on the performance of the other players, 
children who received toy prizes selected their 
play group members more often than did those 
not reinforced. These findings support the con- 
tention that positive attitudes toward others 
develop when rewards are administered in their 
presence. Moreover, Lott, Lott, and Matthews 

969) demonstrated that the vicarious experi- 

ce of reward through observation of another’s 
successful as compared to unsuccessful per- 
formance evoked positive attitudes toward 
others merely present in the situation. 

Further evidence of indirect conditioning 
effects was provided in a study by Griffitt and 
Guay (1969), Subjects created a series of origi- 
nal stories that were purportedly assessed by 
one of two unknown peers. After receiving 

ither favorable or unfavorable feedback, sub- 
jects were given a moderately similar (.50) at- 
titude scale from one of the strangers. Attrac- 
tion toward the latter person was found to vary 
only as a function of the proportion of favor- 
able evaluations administered and not as a 
consequence of responsibility for them. Al- 
though these results establish an indirect con- 
ditioning effect based on the nature of the 
evaluations rendered, they do not reveal any 
tendency toward discrimination between others 
according to their direct or indirect association 
with reinforcement. However, this lack of 
evaluative discrimination may have been due 
to a social link inadvertantly established be- 
tween the two possible story raters through the 
operations used in this particular study." 


McGinley and McGinley (1972) replicated 


cient 
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the indirect and direct conditioning effects 
reported by Griffitt and Guay (1969) under 
circumstances in which a nonresponsible ob- 
server was immediately present in the situa- 
tion. Subjects received either .25, .50, or .75 
favorable creativity ratings (positive reinforce- 
ments) and then evaluated both the rater and a 
person who acted as an observer. The results 
indicated that attraction toward both the per- 
son responsible for the reinforcements and one 
merely present during their administration was 
a positive function of the proportion of rewards 
received. As in previous work (Griffitt & Guay, 
1969), there was no indication that subjects 
discriminated between responsible and nonre- 
sponsible others. 

The ability of individuals to discriminate 
between simultaneous sources of reward and 
punishment has been indicated in another 
study (Hughes, Note 1). Subjects told se- 
quences of original stories to two judges who, 
in turn, gave either consistently favorable or 
unfavorable feedback. Also, each rater was 
either present or absent during the other’s ap- 
praisal of the stories. Regardless of the physi- 
cal proximity of the judges while administering 
reinforcements, recipients were more attracted 
to the one responsible for the favorable rather 
than the unfavorable evaluations. It was con- 
cluded that to the extent that individuals can 
cognitively isolate the direct source of their 
affect, they will discriminate in terms of their 
subsequent evaluative behavior. 

Although evidence exists for the operation of 
both conditioning and discrimination processes 
in the attraction situation, the occurrence of 
both of these processes has never been shown. 
One aim of the present work is to demonstrate 
the concurrent operation of these processes as 
they are reflected in the affective experience 
and evaluative behavior of individuals who are 
given social reinforcements. Such a demonstra- 
tion would provide empirical validation of a 
logical possibility either ignored or inade- 
quately examined by previous research. A fur- 
ther goal was to show that the same processes 
that mediate these effects at the interpersonal 
level also operate in an intergroup context. 


1 Participants overheard the person responsible for 
their evaluations ostensibly introduce the other un- 
known peer to the experimenter upon their arrival, 
thereby conyeying that they were prior acquaintances. 
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These goals were pursued using a competitive 
arrangement to assess the impact of selected 
factors on the socioemotional responses of 
group members who received evaluations from 
an opposing team. 
It was hypothesized that favorable evalua- 
tions provided by another team would elicit 
more pleasant feeling states and more positive 
ratings of attraction toward that group than 
would unfavorable ones (direct conditioning). 
Moreover, these responses were expected to 
become conditioned to a third, initially neutral 
team, which would evoke differential affect and 
attraction in accordance with the favorability 
of the evaluations rendered (indirect condi- 
tioning). However, recipients of reinforcements 
were expected to respond more extremely to 
the opposing team directly responsible for in- 
tergroup evaluations than to a control team 
merely present at the time of their administra- 
tion (discrimination). Thus, this experiment 
seeks to extend previous findings by focusing 
on the affective component of conditioning and 
discrimination processes. This would provide a 
more adequate test of the basic tenets of the 
reinforcement models of attraction. 


METHOD 
Design 


The experiment wasa 2 X 2 X 2 X 2 mixed factorial 
with three between variables (type of evaluations, sex 
of subject, and order of responding to both opponents) 
and one repeated measure (responses of group members 
to two other teams). Subjects of both sex were paired 
according to sex and received unfavorable or favorable 
feedback from a competing same-sex team. The major 
dependent measures were the affective (depression) and 
evaluative (attraction) responses of members to the 
team responsible for the evaluations and a control group 
presented in counterbalanced order, 


Subjects 


The subjects were 45 male and 57 female students in 
an introductory psychology course at Purdue Universi ty 
whose participation satisfied part of a class requirement. 
The responses of 20 additional subjects (14 males and 
6 females) were replaced because they discerned the 
actual purpose of the study. All subjects were randomly 
assigned to the experimental treatments. 


Procedure 


Upon entering the laboratory, subjects were paired 
with a partner of the same sex (five male and four female 
confederates were used) and assigned to one of three 
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experimental rooms that were equipped wit! osed- 
circuit TV system. This arrangement allowed ine pre- 
sentation of recorded auditory and visual i: ctions 
to teams located in three separate but adjacer IMS, 
Participants were seated by a male experiment p= 
posite sides of a small table and informed that ( re 
members of Team A and that the other two tean re 


B and C. The teams were provided with a set of » en 
instructions that introduced the study as one conc od ® 
with how newly created teams formed initial impres+i ns 
of each other in a competitive situation. The s: 3 
were told that $5 would be awarded to thc p 
achieving the highest total score on a manual task € 
performed later. 

Survey of political opinions. Each member of Team \ 
and C was instructed to complete a brief survey of p i- 
tical opinions, which required responses along a 6-poi. t 
scale on four politically oriented issues (i.e. amnes , > 
pardon of Richard Nixon, legalized abortion, and | 
ing). All subjects completed this survey since they h 
been led to believe that they were members of Team 
Teams A and C were further instructed to place the cor 
pleted forms in an accompanying envelope and to de 
liver them to Team B, It was arranged so that the con 
federate, who always was seated closest to the door, 
could inconspicuously deliver these surveys to the 
experimenter, 

Manipulation of intergroup evaluations, To lend credi 
bility to the experimental manipulations, the alle 
instructions to Team B regarding its use of the atti 
tudinal information about the other teams were pre 
sented to all subjects over the headphones, After 
inspecting the surveys, Team B was to indicate its im 
pressions of both teams on separate intergroup judg 
ment scales. This instrument contained six 7-point 
scales for rating the other teams in terms of their open- 
mindedness, thoughfulness, sincerity, trustworthiness, 
and likability. A space also was provided for making 
additional comments about the other group. 

The experimenter always completed the intergroup 
judgment scales in a manner that was either very un- uf 
favorable or favorable, Teams that received the unfavor- 
able evaluations were rated as being very narrow- 
minded, thoughtless to a slight degree, very insincere, 
untrustworthy, stupid, and very dislikable, In addition, 
the members of Team B indicated that they were un- 
impressed with the political views of Team A and 
thought that the members of Team A seemed like a 
couple of “typical Hoosiers.” On the other hand, groups 
receiving favorable feedback were rated as being very 
open-minded, thoughtful to a slight degree, very sincere, 
trustworthy, intelligent, and very likable. Also in- 
cluded was a gratuitous compliment from the members =y 
of Team B indicating that they had been impressed ` 
with the views of Team A and thought them to be inter- 
esting people. The evaluations were deposited by the 
experimenter under the doors of the experimental rooms 
after the manual task was performed. 

Performance of the manual task. The instructions fo: 
the manual task were presented via videotape on T\ 
monitors. This task involved assembling various sma?! 
metal parts in a specific order on the Purdue Pegboar:] 
(Science Research Associates, 1948). The subject's 
Score on this task was the number of units completed on 


ne. 


a single 60-sec trial with each performer timing the 
other. After finishing the task, both participants com- 
yteted a modified version of the Interpersonal Judgment 
(IJS; Byrne, 1971), which included five 7-point 
rating their partner in terms of his/her intel- 
iturity, knowledgeability, likability, and de- 
s a work partner, 
4 “nl of affective responses. Following their com- 
1 of the IJS, the teams were instructed to inspect 
1p evaluations made of them by Team B, Each 
tedly received varying amounts of in- 
ts opponents in the form of attitudinal 
m both opponents for Team B, evalua- 
rom Team B for Team A, and no in- 
leam C, the control group. It was further 
ext it to assess the impact of these different 
types of information on intergroup relations, each team 
nember was to complete two separate copies of the 
rlened form (52 items) of the Multiple Affect Ad- 
List (MAACL; Zuckerman & Lubin, 
Phe subjects indicated separately how Teams B 
d C made them feel by checking a list of adjectives 
that described the subjective feelings evoked by each 
ticam. The order of assessing feelings elicited by the 
other two teams was counterbalanced. 
ment of evaluative responses, After finishing the 
each subject completed two separate group 
evaluation scales, which included eight 7-point rating 
les for evaluating each of the opponents in terms of 
ility, friendliness, cooperativeness, kindness, fair- 
, desirability, integrity, and pleasantness. The order 
evaluating the teams was counterbalanced. 
Manipulation checks. Subjects were administered a 
final questionnaire composed of items designed to pro- 
ide a check on the experimental manipulations, They 
ere to indicate the team that had evaluated their 
group and the nature of that group’s ratings on a scale 
from 1 (very unfavorable) to 8 (very favorable), Also, 
the degree of anger evoked by the evaluations was as- 
sessed on an 8-point scale ranging from 1 (very little) 
Ito 8 (very much). Finally, subjects were given a full 
f explanation of the study and were allowed to leave. 


had pur 
rmation abc 
intorma 
tive feed 
for 


e+ 


jective Check 


1965 


RESULTS 


Preliminary analyses revealed that none of 
the main or interactive effects involving the 
order of responding to the opposing teams 
reached statistical significance for any of the 
major dependent measures. Consequently, 

,# scores were collapsed across both levels of this 
"variable in all subsequent analyses. 


Check on the Experimental Manipulations 


Most subjects (93%) were able to identify 
correctly the team that was responsible for 
their evaluations. In addition, a 2 X 2 analysis 
of variance showed a main effect of the type of 
evaluations rendered on perceived favorability 
responses, F(1, 94) = 343.08, p < .001. Un- 
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favorable evaluations were rated more nega- 
tively than were favorable ones (1.54 vs. 6.81). 
Also, a 2 X 2 unweighted-means analysis of 
variance of anger responses indicated a main 
effect due to the type of feedback received, 
F(A, 94) = 62.78, p < .001, such that unfavor- 
able evaluations evoked more anger than did 
favorable ones (4.51 vs. 1.22). 


Affective Responses 


Responses obtained on the MAACLs were 
scored separately for the affective dimensions of 
depression/elation, anxiety/well-being, and 
hostility/affection, Along each of these con- 
tinua positive, negative, and zero values repre- 
sent unpleasant, pleasant, and neutral feeling 
states, respectively. Scores along these dimen- 
sions could range in value from 18 to —6 for 
depression, 9 to —3 for anxiety, and 6 to —10 
for hostility. 

A series of 2X 2 X 2 unweighted-means 
analyses of variance revealed parallel effects of 
certain variables on all the affective measures, 
In particular, the type of evaluations rendered 
significantly influenced depression, F(1, 94) 
= 23.56, p < .001, anxiety, F(1, 94) = 46,69, 
p < .001, and hostility, (1,94) = 106.98, 
p < 001. Group members who received un- 
favorable as compared to favorable feedback 
evidenced more depression (.98 vs. —1.14), 
anxiety (1.06 vs. —1.08), and hostility (.54 
vs. —4.05) toward their opponents. In all 
cases, these main effects were qualified by 
significant interactive effects between the 
type of evaluations rendered and responsi- 
bility for them (p < .001 in all cases). Subse- 
quent Newman-Keuls analyses showed that 
compared to their corresponding control team 
the teams responsible for unfavorable evalua- 
tions evoked more depression (2.28 vs. —.33), 
anxiety (1.80 vs. .32), and hostility (2.32 ys. 
—1.29), whereas teams responsible for favor- 
able feedback elicited more elation (—1.88 vs. 
—.14), well-being (—1.76 vs. —.40), and af- 
fection (—4.30 vs. —3.19). In all of the above 
comparisons, the affective responses evoked 
by the control teams were less extreme (p 
< .001 in all cases) than those elicited by teams 
responsible for either unfavorable or favorable 
evaluative feedback. 

Further comparisons of the feelings evoked 
by the control team after the administration of 
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TABLE 1 


Mean Hostiiry RESPONSES EVOKED BY EACH 
OF THE OPPOSING TEAMS 


Sex of subject 
Type of evaluation 
dered 


renderi Male Female 
Unfavorable 

Team B 3.02 1.62 

Team C -03 —2.60 
Favorable 

Team B —4.31 —5.56 

Team C 3.18 —3.14 


Note. Positive, negative, and zero values represent un- 


pleasant, pleasant, and neutral feeling states, respectively. 


either unfavorable or favorable evaluations 
showed significant differences in the cases of 
anxiety (p < .05) and hostility (p < .001) but 
not for depression (F < 1). That is, following 
the administration of unfavorable as compared 
to favorable evaluations, the control team 
evoked more anxiety (.32 vs. —.40) and less 
affection (—1.29 vs. —3.19). In the case of 
hostility (only), this effect was qualified by a 
borderline significant interactive effect (p 
< .06) involving sex of subject, type of 
evaluations, and responsibility for evaluations. 
Inspection of the data presented in Table 1 sug- 
gests that the indirect conditioning of hostility 
to the control team was due to males rather 
than females who had received unfavorable 
feedback. In addition, the subject’s sex was 
found to have a significant main effect on both 
anxiety, F(1, 94) = 5.63, p < .02, and hostil- 
ity, F(1, 94) = 8.81, p< .005, such that 
males evidenced more anxiety (.36 vs. —.30) 
and hostility (—1.12 vs. —2.42) than females 
in response to the opposing teams. 
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Evaluative Responses 


Responses obtained on the group evali (ion 
scales were scored from 1 (most negative’ i> 8 
(most positive). Mean ratings of group ro m- 
bers toward each of their opponents after bx ug 
administered different types of group ev 
ations are presented in Table 2. The re 
of a series of 2X 2 X 2 unweighted-m 
analyses of variance indicated significa t 
main effects on all of the evaluative dimensio: s 
(p < .001 in all cases) due to the type £ 
evaluations rendered. Specifically, more neg- 
tive ratings of the opposing teams were ev - 
denced by recipients of unfavorable as cor 
pared to favorable evaluations. 

The main effects due to the type of eva! 
ations rendered were qualified in all cases (« 
cept integrity) by interactive effects betwe 
the type of feedback received and responsibili 
for it. Subsequent Newman-Keuls analys: 
showed that the corresponding control tear 
elicited less negative ratings than the tean 
responsible for the unfavorable evaluations = 
and less positive ratings than the team pro- 
viding favorable ones (p < .01 in all cases). 
These findings clearly indicate that recipients 
discriminated evaluatively between the teams 
responsible and those not responsible for their 
evaluations. Moreover, ratings of the control 
team in terms of likability, friendliness, kind- 
ness, and attraction were significantly less 
positive after the receipt of unfavorable rather 
than favorable evaluations (p < .05 in all 
cases). Thus, indirect conditioning of evalua- 
tive behavior occurred in addition to the pre- 
viously described discrimination effects. 


TABLE 2 
MEAN EVALUATIVE RESPONSES TOWARD EACH OF THE OPPOSING TEAMS 


Type of evaluations rendered 


Unfavorable Favorable 
Male — Female __ Male Female ay 

Dimension Team B Team C Team B Team C Team B Team C Team B Team C 
Likability 2.46 4.46 3.16 4.73 5.96 5.32 638 500 
Friendliness 2.53 4.04 320 4:38 5.73 T 505 626 486 
Cooperativeness 3.19 401 2.61 4.27 4.91 4.09 5.12 4.35 
Kindness 2.54 3.92 311 4.56 5.63 4.92 6.18 5.04 
Fairness 215 419 2.53 444 468 438 5.78 5.06 
Desirability 215 419 3.26 455 564 496 6.17 504 
Integrity 4.13 418 3.50 429 418 478 521 4.50 
Pleasaniness 212 4.40 2.77 4.60 5.54 4.80 634 5.07 
Attraction 461 8.65 642 9.28 11.60 10.28 12.55 10.04 
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TABLE 3 


ZERO-ORDER CORRELATIONS BETWEEN AFFECTIVE AND ATTRACTION RESPONSES TO 
‘Teams B AND C ror Born Sexes 


Response 1 2 4 5 6 7 8 
sepression to B — 
repression to C .26** = 
y to B om AL z 
7 14 Age 1** res 
stility ‘oo 16, ree oe, = 
6. Hostility to C .24* 54" .24* Le Ane a 
7. Anger 630" 10 63°** 15 TUNS, 24% _ 
8, tion to B —.68%"* —.16 — 71 — 279°  — gA — 44eee gg 
9 i —.22" —.22* =- =.15 —30"* 419 28 359% 


The results also indicated significant main 
fects due to the sex of subject on a number of 
‘alive measures including friendliness, 
(1, 94) = 3.30, p < .07, kindness, F(1, 94) 
= 6,75, p< .01, fairness, F(1, 94) = 6.40, 
p < 02, desirability, F(1, 94) = 4.90, p < .03, 
pleasantness, F(1, 94) = 6.87, p < .01, and 
the combined index of attraction, F(1, 94) 
= 5,03, p < .03. Along all of these dimensions, 
males responded more positively than did 
males. None of the interactive effects involving 
this variable reached statistical significance. 


Supplementary Analyses 

A series of zero-order and partial correla- 
tional analyses were used to further explicate 
the nature and degree of the relationships 
among the affective and attraction responses 
associated with the other teams. These analy- 
ses were performed separately for each sex as 
well as for the combined sample. 

A fective responses to Teams B and C. Zero- 
order correlation coefficients among the vari- 
ous response measures for the combined sample 
are given in Table 3. The correlations indicate 
positive relationships between the correspond- 
ing depression responses associated with both 
teams and between anxiety responses evoked 
by these two teams. However, the correspond- 
ing feelings of hostility associated with both 
opponents were correlated highly for males 
(r = 67, p < .001) but not for females (r 
= .15, p > .05). Similarly, the degree of anger 
produced by Team B’s evaluations was corre- 
lated with hostility toward Team C for males 
(r = 49, p <.01) but not for females (r 
= .02, p > .05). 


Affective responses to and attraction toward 
Team B. The results in Table 3 show that at- 
traction toward Team B was negatively corre- 
lated with each of the affective responses 
associated with that team. However, partial 
correlations between each of the feeling states 
and attraction with the remaining affective re- 
sponses held constant revealed significant rela- 
tionships only in the cases of hostility, r(58, 
137) = — 43, p < .001, and anger, r(78, 135) 
= — Al, p < .001, but not depression, (18, 
357) = — 14, p > .05, and anxiety, 7(38, 157) 
= — .08, p > .05, These results suggest that 
attraction toward Team B was mediated 
primarily by the feelings of hostility and anger 
evoked by their evaluations. 

Affective responses to Team B and attraction 
toward Team C. The zero-order correlations 
further indicated that attraction toward ‘Team 
C was related significantly to depression, hos- 
tility, and anger for the combined sample. 
When these relations were subjected to partial 
correlational analyses, only hostility evoked in 
males by Team B was significantly related to 
attraction toward the control team, r(59, 137) 
=— .28, p< .05, when the other affective 
responses were statistically controlled. These 
findings suggest that the hostility evoked in 
males by the team responsible for the inter- 
group evaluations may influence their attrac- 
tion toward the control group. 


Discussion 
The nature of evaluative feedback adminis- 
tered by an opposing team was found to in- 
fluence recipient reactions to that group, 
Specifically, members of teams receiving un- 
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favorable as opposed to favorable evaluations 
responded more negatively along a number of 
affective dimensions including depression, anxi- 
ety, hostility, and anger as well as an evalua- 
tive one, attraction. These parallel effects of 
intergroup evaluations on the affective experi- 
ence and evaluative behavior of group mem- 
bers corresponds closely to the reactions of 
individuals to interpersonal evaluations (eg., 
Singh, 1974). In addition, given the particular 
circumstances and variables investigated in the 
present work, attraction toward the team re- 
sponsible for reinforcements appeared to be 
mediated by feelings of hostility and anger 
rather than depression or anxiety. 

Indirect conditioning of affective and con- 
sequent attraction responses to a competing 
group not responsible for the administration of 
reinforcements also was found. In particular, 
the control team evoked more hostility in males 
and greater anxiety and less attraction in both 
sexes after subjects had been given unfavorable 
feedback from another group. Furthermore, 
since there was no order effect, it cannot be 
argued that these differential responses to the 
control group were due to an undifferentiated 
“halo” phenomenon. These findings provide 
further support for the basic tenets of the rein- 
forcement models of attraction. 

Recipients were found to discriminate be- 
tween the opposing groups according to the 
nature of the evaluations rendered and re- 
sponsibility for them. Along both affective and 
evaluative dimensions, responses to the control 
team were less extreme than those evoked by 
groups directly responsible for either rewards 
or punishments. Hughes (Note 1) proposed 
that to the extent individuals isolate the source 
of their affect in a particular situation, these 
feelings will not become conditioned to proxi- 
mate stimuli merely associated in time with 
the administration of reinforcements. Accord- 
ing to this argument, if indirect conditioning of 
affect does occur, then the ability of recipients 
of reinforcement to isolate the cause of their 
immediate feelings must have been incomplete. 
Since discrimination effects appeared to be 
Stronger than conditioning effects in the pres- 
ent work, the uncertainity that may have oc- 
curred could have stemmed from the competi- 
tive nature of the situation and the lack of any 
physical ‘contact between participants. 


BLEDA 
The degree of affective and evaluati lis- 
crimination depended to some extent c- the & 


sex of group members. In general, ma! 
denced greater feelings of anxiety and hos! ‘ity 
and less attraction toward their opponents 
did females. Moreover, feelings of host 
evoked in males by the evaluative actions -f 
one competitor played a major role in dete: 
mining their responses to another team as- 
sumed to be present in the situation. 
contrast, the hostility elicited in females by t 
evaluations administered by one team did n 
condition or influence attraction toward a noi 
responsible third party. These latter finding 
Suggest that in the present circumstance 
males were either less willing or less able tha 
females to discriminate between their opp 
nents in terms of the affect that they evoked 
This sex difference may have been due to th: 
greater intensity of hostility that males evi 
denced in response to unfavorable feedback as 
compared to females. That is, the level of 
hostility evoked in individuals may be related 
to the degree to which it is displaced or condi- 
tioned to others in the immediate situation, 
Alternatively, previous work (Rubin, 1970) has 
shown that females are better able to discrim- 
inate between sentiments such as love and at- 
traction that they have toward a particular 
person. This greater discriminatory sophistica- 
tion of females in the affective domain also may 
apply to the sentiments they evidence between 
people. 


n 


: 


The present findings demonstrate that the =f) 


processes that mediate various social behaviors 
at the interpersonal level also operate in an in- 
tergroup context. Specifically, empirical evi- 
dence was obtained for the simultaneous 
operation of conditioning and discrimination 
processes in determining affective and evalua- 
tive responses to other groups associated di- 
rectly and indirectly with reinforcements. 
Moreover, the magnitude of conditioning 
effects was related to the sex of members who 
received reinforcements. ‘This latter relation 
was interpreted in terms of the possible greater 
sophistication of females in the affective 
domain of experience. 
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A Test of the Bargaining Theory of Coalition Formatio 
in Four-Person Groups 


H. Andrew Michener, John A. Fleishman, and Jerry J. Vaske 
Department of Sociology, University of Wisconsin—Madison 


Using eight veto and eight nonveto distributions, this study tested predictions 
from Komorita and Chertkofi’s bargaining theory of coalition formation 
against competing predictions from minimum resource and pivotal power the- 
ories, Male subjects in four-person groups negotiated the division of rewards 
in an unconstrained, face-to-face format. For the nonveto distributions, results 
indicate that the bargaining theory is superior to minimum resource and 
pivotal power theories in predicting payoff division, and at least as good in 
predicting coalition formation. For the veto distributions, however, results 
indicate that the bargaining theory is less effective than the minimum resource 
and pivotal power theories in predicting payoff division. 


Over the past 2 decades, social scientists 
have formulated several theories of coalition 
formation, of which the most prominent are 
minimum resource theory (Gamson, 1961a, 
1961b) and pivotal power theory (Rapoport, 
1970; Shapley & Shubik, 1954). Recently, 
however, an alternative theory of coalition 
formation, the bargaining theory, has been ad- 
vanced by Komorita and Chertkoff (1973), 
The bargaining theory, unlike other theories, 
attends closely to the dynamics of negotiation 
in coalitional situations, Its predictions occa- 
sionally differ from those of minimum re- 
source and pivotal power theories. Since the 
bargaining theory is new, it has undergone 
only modest empirical testing to date. The 
purpose of this article, therefore, is to test 
the bargaining theory against minimum re- 
source and pivotal power theories in a series 
of situations where the theories make compet- 
ing predictions. 
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To make the discussion concrete, consid 
a coalitional situation involving four perso: 
(A, B, C, and D) where each person controi 
some resources. As defined by Gamson (1964, 
p. 82), resources are weights controlled by 


the participants such that some specified 


quantity of these weights (termed the quota) 
is needed to win a desired payoff. The quota 
may be any arbitrary value. For example, as- 
sume that person A controls 7 resource units, 
B controls 5 units, C controls 3 units, and D 
controls 1 unit, and that a quota of 9 re- 
source units is necessary to win the payoff, No 
individual has sufficient resources to attain the 
payoff by himself, so a coalition is certain to 
form. In this example, any of seven coalitions 
has adequate resources to win (i.e, AB, AC; 
ABC, ABD, ACD, BCD, ABCD) and the 
members of the winning coalition can split the 
payoff in any manner they choose. 

As this example shows, the important re- 
search questions for coalition theory are (a) 
which one of the various possible coalitions 
actually forms and (b) how do the members 
of the winning coalition divide the payoff 
among themselves? 

Beginning with only the information on the 
distribution of resources, each of the theories 
mentioned above tries to answer these basic 
questions. Because they differ in the cognitive 
decision processes ascribed to the coalition 
members, the theories occasionally make com- 
peting predictions. Consider in turn the de- 
cision processes posited by each theory. 
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BARGAINING THEORY OF COALITION FORMATION 


As described by Gamson (196la, 1964), 
minimum resource theory postulates that per- 
sons try to maximize their gains, that is, to 

+t the largest possible share of the payoff. 

heory assumes that persons import into 
alition situation a norm of equity (or 
pariy”), which prescribes that rewards 
ud be divided in proportion to the re- 
wces contributed by members of the win- 
iing coalition, Minimum resource theory pre- 
dicts the formation of that coalition having 
the minimum combined resources necessary to 
win, This coalition is the one that maximizes 
1yoffs to the individual members if they 
de payoffs as prescribed by the parity 
norm, 
ivotal power theory, like minimum re- 
theory, assumes that individuals try to 
uze their share of the payoff. This the- 
postulates, however, that people expect 
the payoff to be divided in proportion to their 
pivotal power, not in proportion to their re- 
sources, A person’s pivotal power (a concept 
introduced by Shapley, 1953, and by Shapley 
% Shubik, 1954) is calculated by examining 
hose potential winning coalitions of which he 
is a member and determining which of them 
would no longer be able to win were he to 
withdraw from membership.t An individual 
has a high level of pivotal power if, by with- 
drawing, he can cagise numerous potential 
winning coalitions to fail. In general, pivotal 
power weights are different from resource 
weights, Using a rationale analogous to that 
of minimum resource theory, pivotal power 
theory predicts the formation of that winning 
coalition whose members have the minimum 
combined pivotal power. 

Previous research has shown that both min- 
imum resource theory and pivotal power the- 
ory have a moderate level of predictive ac- 
curacy (Kelley & Arrowood, 1960; Vinacke 
& Arkoff, 1957), Chertkoff (1971) demon- 
strated that minimum resource theory is 
slightly more accurate than pivotal power the- 
ory in predicting the formation of coalitions. 
A recent study (Michener, Fleishman, Vaske, 
& Statza, 1975) replicated this finding but 
indicated that minimum resource theory is 
correct only on 40% of its predictions in four- 
person situations. Moreover, as this study 
showed, both theories perform well in predict- 
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ing payoff division (explaining about 75% of 
the variance), but this accuracy is conditional 
on prior information concerning which coali- 
tion actually formed. 


The Bargaining Theory of Coalition 
Formation 


Komorita and Chertkoff (1973) suggest 
that predictive accuracy can be improved by 
examining the dynamics of the bargaining 
that leads to the creation of coalitions and 
the division of payoffs, Accordingly, their new 
bargaining theory places heavy emphasis on 
the negotiation process. 

The bargaining theory shares with pivotal 
power and minimum resource theories the as- 
sumption that people are motivated to maxi- 
mize their payoffs, Like the others, this the- 
ory focuses on the initial distribution of re- 
sources, but it differs in the decisional pro- 
cesses it postulates, Komorita and Chertkoff 
propose that each person in the bargaining 
situation calculates his “probable expected 
outcome.” To make this calculation, a person 
estimates what his share of the payoff would 
be in a particular coalition if the payoff were 
divided in proportion to the resources con- 
tributed by each coalition member, The per- 
son also estimates what his payoff would be if 
everyone in the coalition received an equal 
share. His probable expected outcome lies 
halfway between these two estimates. Having 
calculated his probable expected outcome for 
every potential winning coalition, a person 
prefers to join the coalition which offers the 
highest expected outcome. The bargaining 
theory then predicts that a coalition actually 
forms if it maximizes the probable expected 
outcomes for two (or more) persons, In a 


1 Person ïs pivotal power (Shapley value) is com- 
puted from the formula: 


z (s— Dias s)! 
SCN ni 


e= [o(S) — o(S — (i), 

where ø, is person ?’s pivotal power, n is the number 
of persons in the coalition situation (n is always 
equal to 4 in the present study), s is the number of 
persons in a given subset (coalition), v(S) is the 
value (payoff) to a given coalition, and v(S — {i}) 
is the payoff to the coalition if person i withdraws 
from membership. The summation in the formula is 
over all subsets S of N, the four-person situation, 


1116 


given coalitional situation, only one of the 
potential coalitions meets this condition. The 
bargaining theory also predicts that, once the 
coalition forms, the members divide the pay- 
offs so that each receives his probable ex- 
pected outcome.? 

As an example of the bargaining theory’s 
predictions, consider again the four-person 
situation where A has 7 resource units, B has 
5, C has 3, D has 1, and where the quota 
needed to win is 9. This distribution is sym- 
bolized as D(9:7-5-3-1).* For this distribu- 
tion, the bargaining theory predicts that a 
coalition consisting of persons A and C, sym- 
bolized as C(7-3), forms. The coalition has 
enough resources to meet the quota, and it 
offers persons A and C a higher probable ex- 
pected outcome than other coalitions, Within 
this coalition, A’s expected share of the pay- 
off is 60%, which is halfway between a divi- 
sion based on equality (50%) and a division 
proportional to resources (70%), Person C’s 
expected share is 40%, also halfway between 
equality (50%) and equity (30%). Thus, 
for D(9:7-5-3-1), the bargaining theory pre- 
dicts the formation of C(7-3), with payoffs 
split 60%-40% between persons A and C. 

Minimum resource and pivotal power the- 
ories offer a different prediction for this situa- 
tion. Both predict C(5-3-1), With respect to 
payoffs, minimum resource theory predicts 
that B, C, and D receive 5/9, 3/9, and 1/9 
of the payoff, respectively, This division is 
predicted because C(5-3-1) contains a total 
of 9 resource units and payoffs are divided in 
proportion to resources contributed. Pivotal 
power theory, while predicting the same coali- 
tion as minimum resource theory, offers a 
different prediction for the division of pay- 
offs. It forecasts that B, C, and D split the 
payoff on a 3/7, 3/7, 1/7 basis. Since the 
pivotal power weights of B, C, and D are 
3/12, 3/12, and 1/12, respectively, the total 
pivotal power (“joint Shapley value”) in 
C(5-3-1) is 7/12. Dividing the payoffs in pro- 
portion to pivotal power means that B re- 
ceives (3/12) /(7/12), or 3/7, C receives 3/7, 
and D receives 1/7. 

Although extensive data exist on minimum 
Tesource and pivotal power theories, research 
on the bargaining theory is sparse. The the- 
orists themselves (Chertkoff, 1971; Komorita 
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& Meek, Note 1, as described in Komo: iia & 
Chertkoff, 1973) have conducted initia! sts 
using the tetradic distribution D(90: 9- 
30-30). Minimum resource and pivotal r 
theories predict the formation of C(30-3 ) 
for this distribution, while bargaining th 
predicts the formation of C(80-30), Th 
tests offer encouraging results, Cheri 
(1971) found that C(80-30) predomins 
and that the payoff division corresponded wiii 
the predictions from bargaining theory, T) 
procedures in this study, however, made 
easier for subjects to form two-person th: 
three-person coalitions, The rules permitté 
each person to designate only one other ba 
gainer whom he desired as a coalition partne 
so that a three-person coalition could for: 
only when a previously established (nonwin 
ning) two-person coalition added a new men 
ber. Thus, a two-person coalition require; 
fewer bargaining steps than a three-perso: 
coalition, As Komorita and Chertkoff (1973, 
p. 159) note, this experiment may have been 
biased in favor of the bargaining theory 
(which predicted a two-person coalition) and 
against minimum resource and pivotal power 
theories (which predicted a three-person co- 
alition). 

The subsequent study by Komorita and 
Meek (Note 1) permitted subjects to send 
written messages proposing a coalition to 
more than one person. This reduced the bias 
by facilitating the formation of three-person 
coalitions in one stage. The data from this 
study support the bargaining theory, showing 
that C(80-30) formed most frequently. More- 
over, the division of payoffs within this co- 
alition corresponded more closely to bargain- 
ing theory predictions than to minimum re- 


2 Komorita and Chertkoff emphasize that predic- 
tions based on this logic apply only to situations in- 
volving a single encounter (or an initial instance) of 
coalition formation. They also offer asymptotic pre- 
dictions for situations involving repeated instances of 
coalition formation with the same distribution of re- 
sources. This article tests only the predictions for 
initial instances and therefore does not review the 
bargaining theory’s treatment of multiple iterations, 

The general form of this symbolization for a 
four-person group is D(Q:A-B-C-D), Q refers to the 
quota necessary to win; A, B, C, and D designate 
the resources held by the four members, with A being 
the strongest, B being the second strongest, and so on. 
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TABLE 1 
RESOURCE DISTRIBUTIONS AND THEORETICAL PREDICTIONS 


Predictions from Predictions 


Distribution disiributisn Shapley values Gnd ce peter. lhareauntig 
and type D(Q:A-B-C-D) (A-B-C-D) theories theory 

| veto D(14:9-6-3-2) 7/12-3/12-1/12-1/12 C(9-3-2) 
: 2 nonveto 9: 6/12-2/12-2/12-2/12 C(3-3-3) 
#3 veto 7/12-3/12-1/12-1/12 C(7-1-1) 
#4 nonveto 6/12-2/12-2/12-2/12 C (5-4-3) 
#5 ve 7/12-3/12-1/12-1/12 C(15-3-2) 
7/12-3/12-1/12-1/12 C(5-2-1) 
6/12-2/12-2/12-2/12 C(43-2) 
#8 nonveto 5/12-3/12-3/12-1/12 (53-1) 
9 veto 7/12-3/12-1/12-1/12 C(9-5-1) 
#10 nonveto D(11:10-5-4-2) 6/12-2/12-2/12-2/12 C(5-42) 
£11 veto D(11:7-6-2-2) 7/12-3/12-1/12-1/12 C(7-2-2) 
#12 nonveto D(18: 13-8-6-4) 5/12-3/12-3/12-1/12 C(8-6-4) 
#13 veto 43-1-1) 7/12-3/12-1/12-1/12 C4141) 
i 7/12-3/12-1/12-1/12 C(9-3-1) 
5/12-3/12-3/12-1/12 C(6-4-2) 
C(7-5-3) 


5/12-3/12-3/12-1/12 


source or pivotal power predictions (Komorita 
& Chertkoff, 1973, pp. 159-161). 
Although the procedure used by Komorita 
and Meek (Note 1) reduced the constraints 
n negotiation, it did not eliminate them en- 
irely. A three-person coalition could form in 
one step, but this happened only if one of the 
potential members communicated a prefer- 
once for the three-person coalition and if none 
of the other potential members communicated 
a preference for any other coalition, Spon- 
taneous formation of a three-person coalition 
obviously required a great deal of tacit co- 
ordination, The results of this study are more 
convincing, but the procedures may still be 
biased in favor of the two-person coalition 
predicted by the bargaining theory. 

Moreover, supportive as this earlier work 
is, it lacks generality. The only resource dis- 
tribution investigated was D(90:80-30-30- 
30), and the results may be limited by the 
idiosyncratic nature of this distribution, Data 
from a variety of distributions would provide 
more conclusive evidence. 


The Present Study 


The goal of the present study is to conduct 
a fair, competitive test of the bargaining the- 
ory against minimum resource and pivotal 
power theories, To assure some generality for 
the findings, this study used 16 tetradic re- 
source distributions. Table 1 presents these 


distributions, along with the corresponding 


predictions from each theory. 

The present study used tetradic distribu- 
tions, rather than the more frequently studied 
triadic distributions, because the three the- 
ories do not make distinct predictions for 
coalition formation in triads (see, e.g., Komo- 
rita & Chertkoff, 1973, p. 157). However, the 
three theories do offer discrete predictions for 
some (not all) tetradic distributions. The 16 
distributions in the present study were se- 
lected because the formation predictions from 
bargaining theory do not overlap with those 
from minimum resource and pivotal power 
theories.‘ These distributions therefore con- 
stitute a set for which a competitive test of 
bargaining theory is possible. 


4¥For the 16 distributions used in this study, the 
coalition formation predictions of bargaining theory 
differ from those of minimum resource and pivotal 
power theories, This is crucial for a competitive test 
of the bargaining theory. The predictions of mini- 
mum resource theory, however, are identical to those 
of pivotal power. There exist some tetradic distribu- 
tions where the formation predictions of minimum 
resource theory differ from those of pivotal power 
theory (see, e.g., Michener et al., 1975), but we are 
not aware of any tetradic distributions where the 
formation predictions of each theory differ from 
those of the other two theories. The theories, how- 
ever, do make different predictions regarding the 
division of payoffs for the distributions investigated 
here. 
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Like an earlier study with tetrads (Willis, 
1962), the present study uses an uncon- 
strained, face-to-face bargaining format. This 
minimizes factors that might otherwise in- 
hibit the formation of three-person coalitions. 
Since the various theories predict coalitions of 
different sizes for the distributions under 
study (see Table 1) , this is a necessary safe- 
guard, 

The 16 distributions in Table 1 form two 
subsets. Eight are veto distributions, for they 
share the property that one person (A) must 
be included in any winning coalition. That is, 
no coalition can win unless it includes A, be- 
cause the remaining members (BCD) lack 
sufficient resources to win by themselves, The 
remaining eight distributions are nonveto dis- 
tributions, because persons B, C, and D can 
form a winning coalition among themselves, 
Person A does not have veto power in these 
situations, 

Pivotal power theory can make predictions 
for both veto and nonveto distributions. While 
the minimum resource and bargaining the- 
ories were originally intended to apply only 
to nonveto distributions, they are both highly 
general in conception and can be asked to 
make predictions for veto distributions, as 
well. The distinction between veto and non- 
veto distributions has been shown to have a 
strong effect on the division of payoffs in 
tetrads (Michener et al., 1975; Michener & 
Sakurai, 1976), For this reason, the present 
study includes an equal number of each type 
to examine the accuracy of all theories under 
both conditions, 


METHOD 
Subjects 


Male undergraduates (N = 128) at the University 
of Wisconsin served as subjects. The subjects, who 
volunteered in expectation of receiving cash pay- 
ment, comprised 32 4-person groups. 

Laboratory procedures consisted of repeated mea- 
sures on each group. Each of the 32 tetrads was ex- 
posed serially to the 16 resource distributions in 
Table 1. In the entire study, 512 instances of po- 
tential coalition formation were observed (ie, 32 
groups X 16 distributions). 


Procedure 


Upon arriving at the laboratory, subjects were 
ushered into a room containing a square table and 
four chairs, Rather than introduce subjects to each 
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other by name, the experimenter assigned « trad 
member a different letter of the alphabet. ugh- 
out the experiment, subjects addressed or ther 


in terms of their letter designations, 

At the outset, the experimenter gave eacli 
set of written instructions that character ed 3 
periment as concerned with bargaining in four on 
groups. The instructions presented an exa: 
tetradic coalition problem and listed all p 
alitions that could form (including the grand 
tion). 

On each of the 16 rounds, 100 payoff points 
available. The instructions directed each subje 
get as many of these payoff points as _possib); 
each round and indicated that, in addition to 
ceiving a regular $2 payment for participating, 
the study, those bargainers most successful in 
cumulating payoff points would win cash prizes 
subject’s total payoff points (over all 16 roun 
would be compared with the winnings of th 
subjects in other groups having the same letter des 
nation. For each of the four designations, the ba 
gainer with the most payoff points would recei 
$10 and the bargainer with the next highest tot; 
would get $5 (a total of eight prizes). This incen 
tive structure was to assure that subjects would 
try to obtain the most payoff points possible. 

At the beginning of each round, the experimenter 
gave each subject a Resource Distribution form 
indicating how many resource units each person in 
the group controlled. The distribution of resources 
differed from round to round, corresponding to the 
16 resource distributions in Table 1. The Resource 
Distribution form also Specified the quota necessary 
to win the 100 payoff points on that round. Since 
the resource units controlled by any individual bar- 
gainer were always fewer than the quota, subjects 
realized that payoff points could be obtained only 
by pooling resources in a coalition with some other 
member(s). Persons not included in a coalition re- 
ceived no payoff points. This placed subjects in a 
mixed-motive situation because, to form a coalition, 
they had to induce at least One other member to 
join. Yet, the more payoff they offered a potential 
partner, the less they had for themselves. 

To ensure that the negotiations were free and un- 
constrained, the experimenter left the room after 
passing out the Resource Distribution forms, Dur- 
ing each round, subjects bargained face to face in 
a four-way discussion. These discussions were typi- 
cally vigorous and animated. A round ended either 
when a coalition formed or when a 4-min. time limit 
expired. 
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member wrote both his initials and the number of 
payoff points received on the Coalition Contract. 
When the round ended (almost always through the 
{formation of a coalition rather than through ex- 
piration of the time limit), the subjects summoned 
th imenter. He collected the Coalition Con- 
tr į the Resource Distribution forms and then 
passed out new forms for the next round. 
The Coalition Contract forms, which carried data 
ilition formation and payoff division, were the 
me source of dependent measures in this study. 
Additional information was collected on a brief, 
two-page questionnaire which each subject com- 
eted after Rounds 4, 8, 12, and 16, This provided 
on perceptions of each bargainer’s power and 
inñuence, as well as information on the subject’s 
rences among the 11 possible coalitions. 
pon completion of Round 16, the experiment 
nded. The experimenter answered any questions the 
sul h bout the study and its hypotheses. 
Į ) prizes and four $5 prizes were subse- 
aid by mail. 


Design 


Several aspects of the research design should be 
noted. First, all groups were exposed to the 16 dis- 
tributions in the same sequential order. This order 
s shown in Table 1. The design manipulated the 

source distribution facing the various groups on 

ìe first round. Thus, Group 1 started with Distri- 
ution #1 and ended with Distribution #16. Group 
2 started with Distribution #2, proceeded through 
the order, and ended with Distribution #1. Group 
16 started with #16 and ended with #15. Group 
17 began the order again, starting with #1 and 
ending with #16. 

Another property of this study is that, over the 
16 rounds, subjects rotated through the A, B, C, 
and D roles in a Latin square pattern, This meant 
that no subject monopolized the strongest resource 
position (A) and that no subject was recurringly 
relegated to the weakest role (D), thereby avoiding 
he confounding of subjects with roles. Subjects 
were randomly assigned to the A, B, C, or D role 
on the first round. Under the Latin square pattern, 
the subject occupying the A role on Round 1 sub- 
sequently occupied the B, C, and D roles on Rounds 
2, 3, and 4 and then rotated back to the A role 
on Round 5. Other subjects rotated in a similar 
pattern, such that each subject occupied the four 
roles four times over the 16 rounds. Subjects knew 
in advance that they would control varying amounts 
of resources on different rounds, but they were not 
told the exact nature of the rotation scheme. 


RESULTS 
Coalition Preference Data 


For each of the 16 distributions, the bar- 
gaining theory predicts the formation of a 
coalition different from that indicated by 
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minimum resource and pivotal power theories, 
but the predicted coalitions have at least 
one person in common. For example, in Dis- 
tribution #1 (in Table 1), bargaining the- 
ory predicts an AB coalition, while the other 
theories predict an ACD coalition. Person A 
is common to both coalitions, and thus his 
preferences are crucial. A similar situation 
exists with respect to each of the other dis- 
tributions. 

To obtain data on preferences, the ques- 
tionnaires following Rounds 4, 8, 12, and 16 
asked each subject to indicate his sentiments 
regarding membership in each of the poten- 
tial coalitions. The item asked, “From your 
own standpoint, how desirable were the vari- 
ous possible coalitions?” Subjects rated each 
of the 11 logically possible coalitions (i.e., 
AB, AC, ... , ABCD) in terms of their 
“desirability” (9-point scale). 

Table 2 reports the mean preferences 
among the predicted coalitions for subjects 
in the A, B, C, and D roles for each of the 
resource distributions, These data highlight 
the choices of the crucial person in each in- 
stance. In general, the data support bargain- 
ing theory. Mean scores for subjects in the 
critical roles favor the coalition predicted by 
bargaining theory for 11 of the 16 distribu- 
tions. 

A 16 (Distribution) x 3 (Theory) anal- 
ysis of variance, with repeated measures on 
the three theories (minimum resource, pivotal 
power, bargaining theory), shows a signifi- 
cant main effect of theories, F(1, 15) = 
11.168, p < .05, but no significant main ef- 
fect of distributions.’ The critical persons 
preferred the coalitions predicted by bargain- 
ing theory significantly more than those pre- 
dicted by minimum resource and pivotal 
power theories. Separate planned comparisons 
were also run on coalition preferences under 


5 Unlike other analyses reported below, analysis of 
the preference data does not include the sequence of 
presentation of the distributions as a factor in the 
design. Since the preference data were collected only 
on Rounds 4, 8, 12, and 16 (and therefore include 
only 128 observations rather than 512), the analysis 
necessarily assumes no sequence effects. This assump- 
tion appears tenable, since no sequence effects emerge 
in the analyses of other dependent variables such as 
coalition formation or payoff division. 
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TABLE 2 
MEAN PREFERENCES FOR PREDICTED COALITIONS AS A FUNCTION OF RESOURCE DISTRIBUT! 
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ROLE 


Resource 
distribution Predicted Role Role Role 
D(Q:A-B-C-D) coalitions® A B x 
1 D(14:9-6-3-2. 7.75 7.25 
i ‘ : 5.00 7.00 
#2 D(9:83-3-3) 7.25 7.75 6.88 6 
7.86 6.00 7 
#3 = D(9:7-4-1-1) 7.00 8.00 
1.15 5.88 6.6 
#4 D(12:11-5-43) 7.75 6.0 
7.13 8.13 6.88 
#5 D(20:15-10-3-2) 8.25 8.63 
4.88 8.13 7.50 
#6 D(8:54-2-1) 5.88 8.00 
6.75 6.25 6.75 
#7 = =D(9:8-4-3-2) 6.75 hes + $ 
6.13 5.50 6.50 
#8 D(9:7-5-3-1) 7.00 8.25 
6.50 7.63 7.88 
#9 D(15:9-7-5-1) 6.63 7.13 
5.50 6.50 7.88 
#10 D(11:10-5-4-2) 7.50 7.88 
7.50 7.25 6.38 
#11 D(11:7-6-2-2) 8.25 7.75 
6.63 7.63 5.88 
#12 D(18:13-8-6-4) 7.50 8.63 
6.25 8.13 6.25 
#13 D(6:4-3-1-1) 7.13 7.50 
4.13 7.38 7.75 
#14 D(13:9-5-3-1) 5.38 8.25 
6.13 6.38 7.00 
#15 D(12:9-6-4-2) 7.13 7.50 
5.75 5.25 7.38 
#16 D(15:11-7-5-3) 8.00 7.25 
6.00 6.38 6,75 


a For each resource distribution, the first coalition is that predicted by bargaining theory, and the second coalition is that predic\ d 


by minimum resource and pivotal power theories. 


nonveto and under veto distributions, Results 
indicate that, for the eight nonveto distribu- 
tions, the coalitions predicted by the bargain- 
ing theory were preferred to those predicted 
by minimum resource and pivotal power the- 
ories, F(1, 15) = 13.067, p < .05. A similar 
comparison for the eight veto distributions 
reveals the same effect, F(1, 15) = 34.080 
p<.05. 


? 


Coalition Formation 


Table 3 presents the basic findings on coali- 
tion formation for each of the 16 distribu- 
tions, The table also shows each theory’s 
formation predictions, permitting a detailed 
comparison with the data. Overall, subjects 
formed 35 grand coalitions, 148 three-person 
coalitions, and 323 two-person coalitions, In 
6 instances no coalition formed. It is clear 
that two-person coalitions were most numer- 


ous, despite the unconstrained bargaining 
format. 

Table 4 summarizes the fit between pre- 
dictions and observations by showing the 
number of correct predictions for each the- 
ory. Since minimum resource and pivotal 
power theories each predict only 1 coalition 
per round, these theories each make a total 
of 512 predictions (i.e., 32 groups X 16 dis- 
tributions). For one distribution ( #2), how- 
ever, the bargaining theory predicts the oc- 
currence of more than 1 coalition, so the 
total number of coalitions predicted by this 
theory exceeds 512. One cannot assess the 
relative adequacy of the three theories merely 
by comparing the number of correct predic- 
tions; bargaining theory might have more 
correct predictions than the other theories 
simply because it makes more predictions in 
toto. For this reason, a correct prediction 
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TABLE 3 
FREQUENCY OF COALITION FORMATION AS A Function or RESOURCE DISTRIBUTION 
3 Resource Coalition forming 
ribution distribution 
mherand tyne. D(Q:A-B-C-D) ABCD ABC ABD ACD BCD AB AC AD BC BD CD None 
D(14:9-6-3-2) 5 20b 
D(9:8-3-3-3) 3 1ipr 4b 8b 5b 1 
D(9:7-4-1-1) 1 17b 
D(12:11-5-4-3) 2 spr 5 6 11b 
D(20:15-10-3-2) 1 24b 
D(8:5-4-2-1) 3 lib 1 
D(9:8-4-3-2) 2 3pp 11 5 10b 
D(9:7-5-3-1) t Gr 11 12b 1 
i D(15:9-7-5-1) 1 25b 
10n > D(11:10-542) 2 10pr 5 4 10b 
D(11:7-6-2-2) 2 18b 1 
eto DU8:13-8-6-4) 3 1 10pr 11 7b 
D(6:4-3-1-1) 3 1 6pr 22b 
1 9 D(13:9-5-3-1) 3 BOS Tipr: 15b 1 
15 nonyeto D(12:9-6-4-2) 1 1 1 Opt 9 13b zt 
# 16 nonveto D(15:11-7-5-3) 2 1 lipr 11 Tb 


arce theor: 


ratio (defined as the ratio of correct predic- 
tions to total predictions made on a given 
vund) has been computed on each of the 
rounds for each of the three theories. 
vean correct prediction ratios appear in 
Table 4. 
The correct prediction ratios were analyzed 
a 16 (Sequence of Presentation of the 
Distribution) X 16 (Distribution) X 3 (The- 
ory) split-plot factorial analysis of variance, 
, with two groups per cell (cf. Kirk, 1968, 
4 PP- 298-307). (The sequence variable can 
* 


TABLE 4 
PREDICTIVE ACCURACY OF MINIMUM RESOURCE, 
PrvoraL POWER, AND BARGAINING THEORIES 
OF COALITION FORMATION 


Theory 
Minimum Pivotal Bargaining 
Index resource power theory 
” 
A Total predictions 
j Veto 256 256 256 
Nonveto 256 256 320 
Total 512 512 576 
Correct predictions 
Veto ik 71 158 
Nonveto 65 65 87 
Total 136 136 245 
Mean correct prediction ratio 
Veto 277 277 O17 
Nonveto 254 254 .296 
Overall .266 266 456 


_ tremely conservative. 


signates coalitions predicted to form by pivotal power theory; r designates coalitions predicted to form by minimum 
y; b designates coalitions predicted to form by bargaining theory. 


be thought of as the distribution to which 
a group was exposed on the first bargaining 
round). Each group was nested within 1 of 
the 16 levels of sequence. The repeated mea- 
sures factors, distribution and theory, were 
crossed with sequence. The results indicate 
that the difference between theories is statis- 
tically significant, F(1, 16)= 35.373, p< 
.05. In general, the bargaining theory is su- 
perior to minimum resource and pivotal 
power theories in predicting coalition forma- 
tion for the 16 distributions in this study. 
No other effects attained significance; in par- 
ticular, the sequ nce variable did not influ- 
ence the correct > pees ratios.® 

Separate planne comparisons were also 
run on correct prediction ratios under non- 
veto and under veto distributions. Results 
indicate that, for the eight nonveto distribu- 
tions, the mean correct prediction ratio of 
the bargaining theory did not differ signifi- 


an 


6 Since the study design involved repeated mea- 
sures over 16 resource distributions for each of the 
32 groups, the Geisser-Greenhouse correction for de- 
pendency of observations has been applied to all F 
tests reported in this article. This correction is ex- 
Computational procedures are 
identical to those of a conventional test except that 
the degrees of freedom are reduced. For a detailed 
description of the Geisser-Greenhouse correction, see 
Kirk (1968, pp. 142-143). 
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TABLE 5 
Mean PAYOFF POINTS ror Persons WITHIN EACH COALITION FOR VETO DISTRIBUTIONS 


Resource 
distribution Coalitions Role Role Role Ro! 
D(Q:A-B-C-D) forming A B c D 
#1 D(14:9-6-3-2) C(9-6-3-2) 60.00 15.40 12.80 11. 
C(9-6-2) 60.00 25.00 15.( 
C(9-3-2) 74.33 12,83 12 
C(9-6) 71.35 28.65 
#3 D(9:7-4-1-1) C(7-4-1-1) 25.00 25.00 25.00 
C(7-+-1) 60.00 25.00 
C(7-1-1) 77.38 11.31 
81.82 18.18 
#5 D(20:15-10-3-2) 40.00 20.00 20.00 
85.29 8.14 
77.7\ 22.29 
#6 D(8:5-4-2-1) 35.00 21.67 21.67 
C(5-2-1) 65.00 19.64 
C(5-4) 71.12 28.88 
#9 D(15:9-7-5-1) C(9-7-5-1) 20.00 40.00 10,00 
C(9-5-1) 72.50 14.17 
C (9-7) 76.20 23.80 
#11 D(11:7-6-2-2) C(7-6-2-2) 47.50 17.50 17.50 
Caa 71.27 14.27 
C(7-6) 78.33 21.67 
#13 D(6:4-3-1-1) C(4-3-1-1) 35.00 21.67 21.67 21.67 
C(4-3-1) 60.00 20.00 20,00 
C(4-1-1) 86.00 7.00 7.00 
C(4-3) 72.32 27.68 
#14 D(13:9-5-3-1) C(9-5-3-1) 45.00 23.33 15.00 i $ 
EN 82.00 11.50 
C(9-3-1) 77.64 13.36 
C(9-5) 81.20 18.80 


cantly from those of minimum resource and 
pivotal power theories, F(1, 32) = .096, ns. 
A similar comparison for the eight veto dis- 
tributions shows the bargaining theory to be 
superior, F(1, 32) = 64.106, p < .05. The 
critical values of all tests reported here have 
been adjusted to maintain an error rate of 
.05 experimentwise.” 


Payoff Division 


Tables 5 and 6 report the mean payoff 
points for persons A, B, C, and D within 
the coalitions that formed during the experi- 
ment, Table 5 presents these results for the 
eight veto distributions, while Table 6 pre- 
sents them for the eight nonveto distribu- 
tions. In general, payoffs are related to role, 
with person A getting the most and person 
D getting the least. 

A comparison across Table 5 and Table 6 
highlights the effect of the veto-nonveto 
variable on payoff division. In veto situa- 
tions, the payoffs to person A (who holds 
veto power) are markedly higher than in 


nonveto situations, while those to person B, 
C, and D decline accordingly. Analysis of 
variance indicates that this effect of veto on 
payoffs is significant for each of the bargain- 


7Qne can place all the theories in perspective by 
comparing them against a baseline model, which is 
analogous to Gamson’s (1964) “utter confusion” the- 
ory. The baseline model assumes that the formation 
of any given coalition—even a nonwinning coalition 
—is as probable as the formation of any other coali- 
tion. Since this baseline model predicts every logically 
possible outcome, it is always correct although it does 
not reduce uncertainty. The mean correct prediction 
ratio for the baseline model is .0833 under both non- 
veto and veto distributions. This mean is consider- 
ably lower than that for any of the three theories. { 
Analysis of variance using the correct prediction ra- 
tios indicates that the baseline is surpassed by the 
bargaining theory under both nonveto, F(1, 32) = 
34,225, p < .05, and veto distributions, F(1, 32) = 
216.455, p<.05. It is also surpassed by minimum 
resource and pivotal power theories under both non- 
veto, F(1, 32) =22.101, p< .05, and veto distribu- 
tions, F(1, 32) = 28.579, p< .05. These comparisons 
show that all three theories have a substantia] ca- 
pacity to reduce uncertainty regarding the formation 
of coalitions. 
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ers, $ < .01 in all cases, The sequence vari- 
able has no significant effect on the division 
of payoffs. 

How accurate are the various theories in 
re ing the division of payoffs? As noted 
each theory makes a prediction re- 
which coalition forms and how the 
s are divided within that coalition. Each 

eory is sufficiently general, however, to pro- 
vide a prediction regarding payoff division 
for each of the potential winning coalitions. 
‘Thus, even though the coalition predicted 
by a theory may not have formed on a 
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given round, it is still possible to test that 
theory’s payoff predictions for the coalition 
which actually formed. Payoff predictions 
in this instance are conditional upon knowl- 
edge of which coalition actually formed. Since 
these conditional payoff predictions provide 
the fairest test of the various theories, they 
are used here to test goodness of fit. 

To assess the accuracy of a given theory, 
a goodness-of-fit score was computed for each 
group within each distribution (making a 
total of 32 X 16 = 512 scores for that the- 
ory). These scores are the square root of the 


TABLE 6 
:¿ PAYOFF POINTS FOR Persons WirHin EACH COALITION For Nonveto DISTRIBUTIONS 


Resource 


bution Coalitions Role Role Role Role 
D(Q:A-B-C-D) forming A B C D 
#2 D(0:8-3-3-3) C(8-3-3-3) 30.00 23.33 23.33 23.33 
C(3-3-3) 31.18 37.36 31.46 
ces) 64.00 36.00 
C(8-3) 67.87 32.13 
C(8-3) 52.00 48.00 
#4 D(12:11-5-43) C(11-5-43) 25.00 25.00 25.00 25.00 
Cts) 28.62 33.13 38.25 
C(i1-5) 45.80 54.20 
C(A) 67.67 33.33 
C(11-3) 64.18 35.82 
#7 D(9:8-4-3-2) C(8-4-3-2) 15.50 27.50 27.50 27.50 
C(8-3-2) 30.00 10.00 10.00 
C(43-2) 35.67 35.33 29.00 
C(8-4) 69.36 30.64 
C(8-3) 58.00 42.00 
C82) 69.50 30.50 
#8 D(0:7-5-3-1) C(7-53-1) 25.00 25.00 25.00 25.00 
C(7-5-1) 50.00 25.00 25.00 
(5-3-1) 40.67 36.83 22.50 
C(7-5) 56.45 43.55 
€(7-3) 56.66 43.33 
#10 D(11:10-5-4-2) C(10-5-4-2) 25.00 25.00 25.00 25.00 
C(10-4-2) 80.00 10.00 10.00 
C(5-42) 34.80 31.80 33.40 
C(10-5) 57.20 42.80 
C(10-4) 56.25 43.75 
(10-2) 66.50 33.50 
#12 D(18:13-8-6-4) C(13-8-6-4) 25.00 25.00 25.00 25.00 
C(13-8-4) 70.00 15.00 15.00 
C(8-6-4) 41.50 37.50 21.00 
(13-8) 55.45 44.55 
C(13-6) 58.00 z 2.0 Ma 
15 D(12:9-6-42 C(9-6-4-2) 25.00 5.00 i . 
s $ C(9-6-4) 33.00 34.00 33.00 
C (9-4-2) 80.00 10.00 10.00 
CAD 37.60 41.00 21.40 
C (9-6) 52.50 47.50 
e e are 
#16 D(15:11-7-5-3 C(11-7-5-3) 50.00 17.50 ; f 
‘ 2 SHEED 80.00 10.00 10.00 
(7-5-3) 43.82 36.55 19.63 
C(11-7) 55.91 44.09 
65.71 34.29 


: 
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Fıcure 1. Mean goodness of fit for payoff data. 


sum of the squared differences between each 
member’s observed payoff and predicted pay- 
off (where the summation runs over the four 
group members), that is, 


4 
ft= 4/2 (0; — Pi}. 


i=l 


Separate goodness-of-fit computations were 
performed for each theory. Figure 1 pre- 
sents the mean fit scores for each of the three 
theories under both veto and nonveto condi- 
tions. The smaller the mean fit score, the 
more accurate the theory. In nonveto distri- 
butions, the mean fit score of bargaining 
theory (M = .1503) is smaller than those of 
minimum resource (M = .1925) and pivotal 
power (M = .1789). In veto distributions, 
however, the mean fit scores of minimum re- 
source (M = .2585) and pivotal power (M 
= .2106) theories are smaller than that of 
bargaining theory (M = .3213). 

Since the primary goal of this research is 
a competitive test among the theories, the 
relative adequacy of one theory vis-à-vis an- 
other was ascertained by means of a 16 
(Sequence) X 16 (Distribution) X 3 (The- 
ory) split-plot factorial analysis of variance, 
with two groups per cell. The goodness-of- 
fit scores on payoff division served as the 
dependent variable. The results show a sig- 
nificant main effect for theories, F(1, 16) = 
17.277, p < .05, such that, across all 16 dis- 
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tributions, pivotal power theory v e most 
accurate. No other effects, includir se of | 
sequence of presentation, attained a: -ptable 
levels of statistical significance. 

Planned comparisons of the mear 1ess- 
of-fit scores for each theory were í cted 
separately under nonveto and under vi'o dis- 
tributions. For the eight nonveto di ibu- $ 


tions, bargaining theory is superior b::^ to 
pivotal power theory, F (1, 32) = 12.880 » < 


.05, and to minimum resource theory, (1, 
32) = 28,044, p< .05. For the eight -cto 
distributions, however, both pivotal power he- 


ory, F(1, 32) = 192.983, p < .05, and rini- 
mum resource theory, F(1, 32) = 62..07, 
p < .05, are superior to the bargaining th ory 
in predicting payoffs. The bargaining theo y’s 
superior accuracy under nonveto distr’ u- 
tions is particularly important, because iuis 
theory was developed primarily to pre: it 
payoffs in the nonveto case.’ 

Further analyses show that pivotal po 
and minimum resource theories do not dil 
under nonveto distributions, F(1, 32) 
2.912, ms, while pivotal power is supciior 


8 As indicated previously in Footnote 2, the K 
morita-Chertkoff bargaining theory offers two sets <í 
predictions. One set pertains to a single encounter (or 
initial instance) of coalition formation; the other sct 
(called the “asymptotic prediction”) pertains to rc- 
peated instances of coalition formation with the 
same distribution of resources. The procedures used 
in the present study differ from those used previously 
(Komorita & Meek, Note 1) to test the asymptotic 
predictions. Not only did groups change resource dis- 
tributions from round to round, but individual sub- 
jects shifted roles. Since the asymptotic predictions 
pertain to situations where a group is exposed re- 
peatedly (over rounds) to a given distribution, they 
are not directly applicable to the present study. Be- 
cause they may be of peripheral interest to some 
readers, however, the bargaining theory’s asymptotic 
predictions were checked against the payoff data. For 
nonveto distributions, the mean fit score was 1515; 
for veto distributions, it was .2844. Tests indicate 
that, for nonveto distributions, the asymptotic 
predictions were not significantly different from the 
initial-trial bargaining theory predictions, F(1, 32) 
= .023, ns, were superior to the minimum resource 
predictions, F(1, 32)= 28.157, p < 05, and were 
superior to the pivotal power predictions, F(1, 32) 
= 12.574, $ < .05. For veto distributions, the asymp- 
totic predictions were superior to the initial-trial 
bargaining theory predictions, F(1, 32)= 22.807, > 
< .05, inferior to the minimum resource predictions, 
F(1, 32)= 11.236, p < .05, and inferior to the piv- 
otal power predictions, F(1, 32) = 91.229, p < .05. 
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under veto distributions, F(1, 32) = 36.132, 


-p < .05. The critical values of all tests re- 


ported above have been adjusted to maintain 
vperimentwise error rate of .05. 


Discussion 


By providing data on both nonveto and 
listributions, the present study extends 

vious arch on the bargaining theory 

f coa xrmation, The findings corrobo- 
rate earlier tests (Chertkoff, 1971; Komorita 
fe Note 1) in demonstrating the su- 

y of this theory for nonveto distribu- 
t they also show that the bargaining 
is less effective for veto distributions. 
veto distributions, results show 
lictions from bargaining theory 

ose from minimum resource and 
power theories with respect to pref- 
ences among potential coalitions, equal 
them with respect to the formation of coali- 
tions, and surpass them with respect to the 


, division of payoffs. The superiority of bar- 


ining theory suggests that persons in non- 

to distributions use (or act as if they use) 

point midway between equity and equality 
in determining their probable expected out- 
in any given coalition. Predictions based 
n this assumption are more accurate than 
those based solely on equity (minimum re- 
source theory) or on proportionate Shapley 
values (pivotal power theory). 

\Vhy is bargaining theory superior to the 
other theories (under nonveto distributions) 
n predicting preferences and the division of 
payoffs, but only equal to them in predicting 
the formation of coalitions? For the eight 
nonveto distributions in the present study, 
bargaining theory always predicted the emer- 
gence of a two-person coalition, while mini- 
mum resource and pivotal power theories al- 
ways predicted a three-person coalition. The 
source of inaccuracy, however, lies not so 
much in the formation of three-person coali- 
tions, but in the frequent formation of un- 
predicted two-person coalitions. Data in the 
present study (and in the related study by 
Michener et al., 1975) show that, for nonveto 
tetradic distributions, two-person coalitions 
are far more likely to emerge than three- 
person coalitions. As indicated in ,Table 3, 
subjects formed 165 two-person coalitions but 
only 72 three-person coalitions on the 256 
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nonyeto rounds in the present study. The rea- 
son for this seems clear: Two-person coali- 
tions involve simpler negotiations, require less 
coordination, and usually provide higher pay- 
offs to members than three-person coalitions. 
The present study differed from previous 
research on the bargaining theory by utiliz- 
ing a free, face-to-face negotiation format. 
This format placed a premium on bargaining 
skill. To receive any payoff at all, a subject 
had to secure membership in some coalition, 
and this led to a competitive volley of offers 
and counteroffers. In such circumstances, con- 
siderations other than maximizing probable 
expected outcome doubtless came into play 
and led to the emergence of two-person co- 
alitions not predicted by bargaining theory. 
With respect to the veto distributions, the 
present study shows that the bargaining the- 
ory had considerable success in predicting 
coalition formation (because it predicted the 
only two-person coalition that could win, 
while the other theories predicted three-person 
coalitions), but it had less success in predict- 
ing the division of payoffs. In fact, as de- 
picted in Figure 1, the bargaining theory is 
significantly less effective than the other two 
theories in predicting payoffs under veto dis- 
tributions. Moreover, the goodness-of-fit scores 
for payoffs show the bargaining theory to be 
less accurate under veto than under nonveto 
distributions, F(1, 32)= 163.357, p< .05. 
The same holds true for minimum resource 
theory, F(1, 32) = 24.335, p < .05, but not 
for pivotal power theory, F(1, 32) = 5.613, 
ns (adjusted for experimentwise error rate). 
In fairness, one must remember that the 
bargaining theory was originally advanced to 
predict payoffs under nonveto distributions, 
and not under veto distributions. Never- 
theless, it is informative to ask why the 
bargaining theory performs less adequately 
under veto than under nonveto distributions, 
The answer lies, in part, in differences in the 
patterns of interaction among subjects in veto 
and nonveto distributions. When person A 
had veto capabilities, he could induce the 
other members to bid against one another, 
thereby bolstering his own payoffs at their 
expense. On veto rounds, person A occasion- 
ally acted like an auctioneer, soliciting in- 
creasingly favorable offers from the various 
members and sometimes securing as much as 
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99% of the payoff. The high payoffs to per- 
son A and the concomitant low payoffs to the 
other members suggest that subjects in veto 
distributions were more concerned with mere 
inclusion in the winning coalition than with 
criteria such as equity or/and equality. 

Further analyses of the observed payofis 
to each person (A, B, C, D) indicate that, 
for veto distributions, the bargaining theory’s 
predictions were not sufficiently extreme. 
That is, the bargaining theory markedly un- 
derestimated the payoffs received by person 
A (who had veto capabilities) and overes- 
timated the payoffs received by B, C, and D. 
Minimum resource theory also underestimated 
the payoffs to A and overestimated those to 
B, C, and D; but, since it is based on an 
equity postulate (rather than on a combina- 
tion of equity and equality, as is the bar- 
gaining theory), the degree of underestima- 
tion (for A) and overestimation (for B, C, 
and D) was less extreme. 

The best theory under veto distributions 
was pivotal power theory, which postulates 
neither equity nor equity/equality. This the- 
ory is based instead on a power principle, 
where power is derived via membership in 
winning coalitions. That is, a person has high 
pivotal power if he has the capacity to con- 
vert (a lot of) winning coalitions to losing 
coalitions by withdrawing from membership. 
Under veto distributions, person A’s pivotal 
power was very high (because no coalition 
could win without him as a member), while 
B’s, C’s, and D’s pivotal power was relatively 
low. For this reason, pivotal power provided 
close estimates of the payoffs to all persons, 
and therefore proved superior to both mini- 
mum resource theory and the bargaining the- 
ory under veto distributions. 

In sum, these results highlight the inter- 
action between distribution type (i.e., the 
veto-nonveto variable) and theory. Under 
veto distributions, payoff predictions based on 
power processes (pivotal power theory) sur- 
pass those based on equity (minimum re- 
source theory) or on equity/equality (bar- 
gaining theory). Under nonveto distributions, 
however, payoff predictions based on equity/ 
equality (bargaining theory) are more accu- 
rate than those based either on equity or on 
pivotal power. This interaction between the- 
ory and distribution type indicates that there 
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is an underlying difference betwen veto an 
nonveto distributions in the decision criteria 
actually used by subjects. While pivotal pows 
remains the superior theory for the veto case, 
the Komorita-Chertkoff bargaining theory 
represents an advance over the other theori 

for the nonveto case. 


REFERENCE NOTE 


1. Komorita, S. S., & Meek, D. A test of the ba 
gaining theory of coalition formation. Unpub: 
lished manuscript, Indiana University, 1972. 
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- Personality Patterns Related to Own-, Joint-, and Relative-Gain 


Texas Woman's University 


Maximizing Behaviors 
Richard P. Bennett Joseph P. Carbonari 
University of Houston 


Three modes of behavior have been hypothesized ìn response to the Prisoner’s 
Dilemma game and other cooperative/competitive games: a behavior mode 
that maximizes one’s own gain, a behavior mode that maximizes joint gain, 
or a mode that maximizes relative gain. Since all behavior is a function, in 
part, of the individual’s personality, it was hypothesized that a distinct per- 
sonality pattern would be expected to be associated with each mode. Using a 
decomposed game format offering three unique choices and the Edwards Per- 
sonal Preference Schedule, the responses of 239 12th grade students were ana- 
lyzed in a two-factor multivariate analysis of variance model. Although three 
unique response groups were identified, only two distinct personality patterns 
merged: a joint and an own-relative mode, Proposed explanations of these 
findings include the interaction of personality structure with authoritarianism 
and with cognitive and ego/moral developmental stages. 


Cooperation and competition are two pri- 
mary modes of interactive behavior among 
individuals. Research into the causes of co- 
operation and competition has ranged widely 

d has resulted in a massive body of litera- 

ire, Internal motives of the participants have 
cen shown to be important aspects of be- 
havior. Some individuals have shown a prefer- 

for a cooperative mode of interaction 
egardless of the situation and regardless of 
possible loss to themselves. Others have opted 
for competition even when the expectancy of 
success is low. Research into the psychosocial 
y characteristics of motive have ranged from 
wif-esteem (Graf & Hearne, 1970), to inter- 
ionalism (Lutzker, 1960), to authoritarian- 
(Kelley & Stahelski, 1970). Studies of 
versonality are typified by Tiller (1970), who 
elated Rotter’s Introversion—Extraversion 
scale to cooperative and competitive behaviors. 
Despite its importance, research involving the 
motives associated with these behaviors has 
been limited in scope and has been incon- 


/ sistent in direction. One possible reason for 


the inconsistencies in the past may lie in the 
inefficiency of the dichotomous definition of 
competition versus cooperation. The two are 
usually seen as polar opposites, thus creating 
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a constant sum paradigm. That is, an incre- 
ment in cooperative behavior is automatically 
associated with a decrement in competitive 
behavior. Use of different paradigm might 
better explain this general class of behavior. 


Own, Joint, and Relative Behavior 


Breaking with the tradition that concen- 
trated on cooperative and competitive be- 
haviors as the response limits, Messick and 
McClintock (1968) proposed that three be- 
haviors could be identified in terms of the 
allocation of rewards. The subject could opt 
for the behavior of maximizing his own reward 
(designated own). As a second alternative, the 
subject could maximize the difference between 
his own reward and that of the other partici- 
pants (designated relative). Finally, the subject 
could choose to maximize the total reward of 
all participants (designated joint). Of the 
three behaviors, the last requires some rede- 
finition. It is possible to define a reward struc- 
ture that will provide a maximum gain for all 
participants that would simultaneously pro- 
vide the subject with a maximum own re- 
ward. In the paradigm used by Messick and 
McClintock, the maximum joint gain was al- 
ways shared equally, thereby minimizing the 
differences among the participants. Con- 
sequently, without changing the intent of 
Messick and McClintock, joint behavior is 
better described as behavior that minimizes 
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the reward difference between the subject and 
the other participants while at the same time 
maximizing the sum of their outcomes. 

Research has shown that there are tendencies 
for particular personality types to respond in 
certain ways to interactive situations. Using 
scores obtained in response to Rotter’s 
Introversion-Extraversion scale, Tiller (1970) 
found internals to be more cooperative than 
externals. Bixenstine, Potash, and Wilson 
(1963) measured subjects on scales of Freudian 
neurotic behavior (approval of ethical be- 
havior based on rigid rules) and normal be- 
havior (approval of ethical behavior based on 
reason) and found that cooperative behavior 
was significantly related to the difference be- 
tween the scores on the two scales. These 
studies suggest that the cooperative and com- 
petitive behaviors observed were related to 
personality variables. It follows that the own, 
relative, and joint behaviors hypothesized by 
Messick and McClintock should likewise show 
a relationship to personality variables. 


Murray’s Theory of Motive 


Because the individual has the capability 
to display to some degree any of the interactive 
behaviors, the assumption was made that the 
personality variables involved were traits 
rather than types. One of the best known trait 
theories is that of Murray (1938), who estab- 
lished a personality structure based on psycho- 
logical needs (which he also referred to as 
drives or motives). Within this framework, all 
human behavior could be attributed to the 
strength of the various needs. 

To support the existence of needs as an 
explanation for human behavior, Murray 
reasoned that physical survival depends on 
the attainment of certain goals. Although 
these goals are attained by all organisms, 
different organisms use means that vary. Even 
the same individual may vary his means of 
obtaining the same goal repeatedly, although 
effective past actions tend to become estab- 
lished as habitual. 

Certain positions emerge from Murray’s 
theory. First, the personality defined by the 
needs structure is not fixed. Second, although 
personality is variable, the habitual aspects 
of goal attainment will result in slow changes. 
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Third, there is a relationship 
, I 
personality and behavior. 


en the 


The original catalog of needs ha orgone © 
modification and redefinition such rat of 
Edwards (1959). From the presen of 15; 


it is hypothesized that specific nee: ve a 


relationship to own, joint, and rc be- 
haviors as described by Messick Mc- 
Clintock. 

Behavior intended to maximize one own 
gain would seem most closely related the 
need for achievement with elemen! of 
autonomy and dominance. Edwards d ines 


achievement in terms of doing one’s be , to 


be successful. Autonomy is described as ing 4 
independent of others in making deci. ons. 
Dominance is defined, in part, as the « sire 
to be a leader. These characteristics de: vibe 
the individual who is primarily concerned ~ ith 


taking charge and assuring his own goals 
Relative behavior, that intended to xi- 

mize the difference in gains, is expecte. to 

show the strongest relationship to the nds 


for dominance and aggression. Aspects of ` 


dominance that apply to a relative motivation 
would be telling others how to act and arguing 
for one’s own view. Aggression, as defined by 
Edwards, includes attacking other’s points o 
view and getting revenge for insults. Ay 
element of heterosexuality, in the sense o! 
conflict between the sexes, may also be pres 
ent when the interaction involves both sexes. 
Thus, relative behavior is most likely to ap- 
pear as a combination of attacking others 
(aggression) and leading others (dominance). 

Joint behavior should be closely related to 
the needs for affiliation and nurturance. The 
need for succorance may also be present as an 
element. Edwards sees affiliation as sharing 
and doing things with friends, whereas 
nurturance implies helping friends and others 
who are less fortunate. Generosity is also 
specified. Edwards defines succorance as 
basically the opposite of nurturance, i.e., the 
desire to have others help and share with one- 
self. To minimize the difference in gain be- 
tween oneself and others is typical sharing 
behavior that would also be expected from 
one who wants to be with and to help and be 
helped by others. 


It is then hypothesized that (a) own be- 
havior is related to a personality strong in 
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achievement with elements of autonomy and 
dominance; (b) relative behavior is associated 
with aggression and dominance with an ele- 
t of heterosexuality; and (c) joint be- 
will occur when the personality is 

in affiliation, nurturance, and suc- 


METHOD 


ubjects 


The subjects were 239 12th-grade students (111 

nale ; female) at a suburban Houston, Texas, 

ondary school. Minority membership of the group 
t exceed 5%, 
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lhe two-person 2 X 2 matrix game is a 
ised tool for measuring cooperative and 
behavior, Behavior leading to the highest 

win has been operationally defined as co- 
». The lowest mutual gain has been operation- 
ally “defined as competitive (or defecting) behavior. 

Concurrently with their definition of own, joint, and 
relative behaviors, Messick and McClintock (1968) 
described a technique for decomposing the matrix 
format into two alternatives, each of which permits 

e subject to “keep” a specified reward and, at the 

ome time, “give” a second specified reward to the other 
participant. The reward structure for the two paradigms 
is mathematically equivalent. If two subjects were to 
hoose the cooperative keep/give alternative, the total 
ut each would receive would be equal to the re- 
associated with the mutually cooperative choice 
in the matrix format, Mutually competitive choices 
or mixed choices also produce consistent rewards be- 
tween the two forms. 

The difference lies in the effect of the other partici- 
nt, In the decomposed format, the other is a shadowy 

who is the recipient of a portion of the reward. 
other may or may not be seen as the donor of 
urther reward, In the matrix format, the second partici- 
pant is seen as one who shares in the decision process 
and whose strategy becomes a factor, 

Messick and McClintock (1968) found six 2 X 2 
matrix paradigms of which three could be decomposed. 
Two of these are the Prisoner’s Dilemma (PD) game 
and the Maximizing Difference Game (MDG). Since 
both forms of each game offer only two choices to the 
subject, neither can provide a unique response for each 
of the three behaviors described: own, joint, and rela- 
tive. The PD game’s reward structure is such that the 
best own alternative is identical to the best relative 
alternative. In the MDG, own and joint are similarly 
confounded. The third decomposable game, called 
Chicken, offers no best alternative to anyone preferring 
the own behavior. Conflict is limited to that between 
joint behavior and relative behavior. In all three games, 
relative behavior falls in the category labeled com- 
petitive, and joint behavior is cooperative. Only those 
using own behavior must switch from one response 
ategory to another, 
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Motivation, the propensity to act, has been de- 
scribed as a function of motive, incentive, and ex- 
pectancy (Atkinson, 1958), One’s behavior that is 
supported by motivation will not necessarily be in- 
dicative of one’s intentions or goals, i.e., one’s motives. 
A strong motive cannot activate behavior if the ex- 
pectancy of success is low or if there is no incentive 
available. Both of these factors are subjective. Each 
individual will judge the expectancy of success dif- 
ferently. Incentives likewise will vary. Some incentives 
are internalized and are completely controlled by the 
individual, Other incentives are external. They may 
not be controlled by the individual, but they are 
judged by him. Consequently, attempts to assess 
motive by observing behavior are impeded by the 
uncertainties of incentive and expectancy. One partial 
solution to this problem is to establish extremes of 
incentive and expectancy and then analyze the dif- 
ferential in the resulting behaviors. An alternate 
solution is to neutralize incentive and expectancy as 
much as possible. 

Because this study sought to determine the in- 
dividual’s motives, the following steps were taken. to 
neutralize the effects of incentive and expectancy: (a) 
A three-choice decomposed format was selected that 
offered unique choices for each of the three behaviors, 
(b) The anonymity of the other was maintained in the 
instructions to the subjects. (c) No feedback of results 
was provided to minimize any expectancy that might 
develop from predicting the strategy of the hypothetical 
other, (d) Although the reward structure varied from 
trial to trial, the reward values were kept as nearly 
equal as the definition of the behaviors would permit, 
The median personal reward for the own behavior was 
22.0 points. For using the relative behavior, the median 
personal reward was 16.1 and the median difference 
between the rewards was 14.7 points, Joint behavior 
produced a personal median gain of 17.5 and a difference 
of 1,5 points. 

Kuhlman and Marshello (1975a, 1975b) used a 
three-choice format involving four game styles. The 
triple dominance style offered unique choices for the 
own, joint, and relative motivations. The joint alter- 
native set the difference between the scores of self and 
other at zero. The PD game style confounded the own 
and relative motivations in one alternative, provided 
a second alternative for joint, and left the third alter- 
native neutral. The MDG followed that of the PD 
game except for confounding own with joint rather 
than with relative. The single dominance style provided 
one alternative that confounded all three motivations 
and two other alternatives that were neutral. There 
were three payoff formats in each style, resulting in a 
total of 12 different formats. Each was presented twice 
to the subject. 

The instrument used in the present study was similar 
to the triple dominance style. Three alternatives were 
provided, each of which offered the subject a specific 
keep and give payoff structure (see Figure 1). The three 
alternatives met the following criteria: (a) The alter- 
native having the best keep payoff is operationally 
defined as the own choice. (b) The alternative having 
the smallest difference between keep and give values is 
defined as the joint choice. Unlike the cooperative 
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OWN 
JOINT 
RELATIVE 


Ficure 1, Format for decomposed items offering three 
unique alternatives. (o=own; j=joint; r=relative.) 


choice of the triple dominance format, the value kept 
always exceeded that given. (c) The alternative having 
the largest difference between keep and give values is 
associated with the relative choice. To guarantee the 
uniqueness of the three choices, the keep (K) and give 
(G) values must satisfy the following equations: 


Kown > Kretative > Kjotnty oF 

Kown > Kjoint > Kretativer [1] 
(K — G)retative > (K — G)own > (K — G) joints [2] 
(E + G)joint > (K + G)own > (K + G)retative [3] 


Equations 2 and 3 can be satisfied under either condi- 
tion defined in Equation 1. 

A set of 48 three-choice items were prepared in 
accord with the above equations, One half of the items 
were designed with Kjoint greater than Kyetativee The 
remainder reversed that relationship. These 48 items 
comprised the behavioral instrument. 

Personality. The personality instrument used was 
the Edwards Personal Preference Schedule (EPPS; 
Edwards, 1959). The EPPS yields scores on 15 non- 
orthogonal personality characteristics: Achievement, 
Deference, Order, Exhibition, Autonomy, Affiliation, 
Intraception, Succorance, Dominance, Abasement, 
Nurturance, Change, Endurance, Heterosexuality, and 
Aggression. 


Procedure 


The two instruments were administered on separate 
days approximately 2 weeks apart. The subjects were 
tested in seven groups ranging from 20 to 90 in number, 
The behavioral instrument was administered on the 
first day, the EPPS on the second. 

Instructions for the behavioral instrument were 
Stated in neutral terms. The “other” was identified as 
possibly being “an acquaintance or a stranger, some- 
one you like or someone you dislike, perhaps a charity, 
perhaps my [the experimenter’s] pocket.” This was 
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intended to offset any biases that m 


vor one 
behavioral mode. The subjects knew th choices 
were hypothetical and that no actua would ` 
result. This was necessary to minimize ex cy and 


consistent with the conditions of grou; g and 
subject anonymity. 
The scores from the behavioral instr 


were 
used to identify subjects having a prop r each 
of the three behaviors: own, joint, and relat Three 
groups were established in which membe was 
defined as a majority of responses in any one gory, 
Any subject failing to achieve a majority of type 
of response was excluded from the analysis, «e 15 
scale scores from the EPPS were subjected to. two- 
factor (Groups Sex) multivariate analysis of va: «nce, 
RESULTS 

The responses to the behavioral instru: ent 
produced group memberships as follows: wn, 
111 (60 males, 51 females) ; joint, 46 (161 les, 
30 females); and relative, 56 (27 male: 29 
females). The probability that an indivi ual 


will select a majority (24 or more items) by 
chance is less than .037 based on the trino:: ial 
distribution. A total of 26 subjects (10. 4) 
did not achieve this criteria and were ex- 
cluded from the final analysis. In comparison, 
Kuhlman and Marshello (1975a), using 24 
trials involving four different game styles, 
found that 66% of their subjects responded 
consistently (50% choices of one motive). 

Both raw scale scores and centile equiva- 
lents were available from the EPPS. The latter 
were used in the analysis because the raw 
scores for male and female subjects were not 
on a common scale. A 15-variable two-factor 
multivariate analysis of variance was then 
performed. 

Significant main effects were indicated for 
both sex (p < .0001) and group membership 
(p = .0276; see Figure 2). The main effect 
of sex was caused by higher scores for the 
male subjects on the Endurance and Deference 
scales and lower scores for Intraception 
Autonomy, and Change. Within the main 
effect of group membership, significant u 
variate F ratios were indicated for four EP 
scales: Affiliation (p = 0042), Aggression 
(p = .0062), Intraception (p = 0142), and 
Heterosexuality (p = .0406). The own and 
relative groups had lower centile rankings on 
Affiliation and Intraception and were higher on 
Aggression and Change. The interaction effect 
(Sex X Group Membership) was marginally 
significant (p = .0629), 
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EPPS SCALE 
AFF AGG INT HET 


76 


68 


60 


52 


Percentile 


44 


MEAN SCALE SCORE 


36 


of 50% of one alternative, (APF 
Heterosexuality.) 


Two of the postulated relationships were 
found. The group preferring joint behavior had 

significantly higher mean score on the 
\ffiliation scale than either of the other two 

roups. The relative group was significantly 
higher on Aggression and Heterosexuality 
han the joint group, although the relative 

soup was not significantly different from the 
own group, 

The general hypothesis that the three be- 
havioral groups would display different person- 
ality characteristics was not supported. When 
the infinite variety of possible loadings on the 
!5 scales of the EPPS are considered, the 

imilarity of the configurations produced by 
ihe own and relative groups is quite signif- 
icant. It is possible that there was inadequate 
separation between the two groups. Although 
membership in either group required a majority 
of the 48 responses to be of that type, the 
remaining responses are not accounted for 
in the use of a unidimensional statistic. It is 
conceivable that many members of the own 
and relative groups were strongly attracted to 
the other response as a second choice, pro- 
ducing the similarity of personality char- 
acteristics. To test this possibility, the three 
groups were reduced to include only those 
subjects who had indicated at least three 
fourths of their responses in the same category 
(chance p < 1.02 X 10-8), When these data 


2. Significant personality characteristics when group membership is defined as the selection 
Affiliation; AGG = Aggression; INT = Intraception; HET = 


were subjected to the same two-factor multi- 
variate analysis of variance, the results as- 
sociated with the main effect of groups con- 
firmed those of the earlier analysis. 
Discussion 

One explanation for the results obtained is 
that there are only two unique personality 
types involved, joint, which would be related 
to the cooperative behavior of the PD game, 
and own-relative, which would account for 
competitive behavior. But, if own and rela- 
tive are confounded in a single personality 
type, how does one explain the two different 
behaviors? Perhaps the behaviors were not 
different. In our social system, some people 
may perceive some resources to be finite and 
quite limited while others perceive them to be 
infinite. Two such people, although having 
essentially the same personality traits, might 
well behave differently because of their per- 
ception of the resource. The one who perceives 
a limited resource might seek to maximize his 
gain aware that someone else must lose, thus 
exhibiting relative behavior. The other, per- 
ceiving an infinite resource, might seek to 
maximize his gain regardless of someone else’s 
gain, thus exhibiting own behavior. In either 
case, the goal would be the same, but the 
behavior would be a partial function of both 
persons’ perceptions. 
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These results have bearing on those found 
by Kelley and Stahelski (1970). Using a PD 
game that inherently confounds the relative 
and own behaviors as the competitive alterna- 
tive, they developed a series of hypotheses and 
conclusions that were built on the presumed 
dichotomous nature of cooperation and com- 
petition. There was no awareness indicated 
that competition within the PD game para- 
digm could support two unique behavioral 
alternatives. Although Kelley and Stahelski’s 
definition of cooperative behavior is quite 
similar to that of joint behavior, their definition 
of competitive behavior is essentially that of 
own behavior. 

The existence of conceptual confusion was 
emphasized in two of the studies Kelley and 
Stahelski (1970) cited. Sermat and Gregovich 
(1966) used the matrix game called Chicken. 
Chicken offers conflict only for relative and 
joint behaviors. Anyone preferring own be- 
havior finds the two alternatives identical and 
would be expected to display a random 
pattern of cooperative and competitive choices. 
Despite the dissimilarity between the PD game 
and Chicken, Kelley and Stahelski compared 
their PD game results to those of Sermat and 
Gregovich. 

The second work cited was that of Apfel- 
baum (1967). She had established three be- 
havioral groups: D, high personal gain but 
not at the expense of the other; E, equalizing 
scores; and P, maximizing difference and own 
score simultaneously. These bear a strong 
resemblance to own, joint, and relative be- 
haviors, respectively, To make these results 
fit the two-choice pattern, Kelley and Stahel- 
ski (1970) merged the E (joint) and D (own) 
groups. In effect, they were treating the data 
as if they had been obtained from an MDG, 
which confounds the own and joint behaviors. 
This, too, they compared to their own PD 

game results. 

One of the conclusions drawn by Kelley 
and Stahelski (1970) is the “triangle hypoth- 
esis,” which states in essence that an in- 
dividual’s position with regard to the co- 
operative-competitive dichotomy determines 
the extent to which he can recognize the 
existence of the two unique behavioral types. 
The more cooperative the individual, the more 
likely he is to recognize that both competitors 
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and cooperators exist. The place of own be- 
havior was determined by Kuh and 
Marshello (1975a), who found th: e in- 
dividualist (own) held the middle d in 
recognition of two behavioral ty; This 
would account for that portion of K and 
Stahelski’s competitive group that dic og- 


nize two distinct behaviors. 

The present study supports the find of 
Kuhlman and Marshello (1975a, 1975) ind 
Kuhlman and Wimberley (1976) that eab- 
lished the existence of three unique behav rs. 
Kuhlman and Marshello (1975b) also fo nd 
that two of the three groups were simil: in 
authoritarianism. In contrast to the prevent 
study, however, the two groups were own | nd 
joint rather than own and relative. This if- 
ference may be an explanation of how the -<o 
behavioral types could be based on the sir le 
personality structure. Given this particu r 
personality (own-relative), higher auth 
tarianism produces relative (or competi 
behavior, whereas a lower authoritarianism 
score facilitates own (or individual 
behavior. 

This hypothesis could be extrapolated to the 
second personality type (joint). The present 
study did not attempt to isolate an altruistic 
group (maximizing the gain of the other), Con- 
sequently, no personality structure was defined, 
Had this been achieved, one might hypothesize 
that it could have shown strong similarity to 
the joint personality type. It would be no 
more unreasonable for the joint personality 
type to support both joint and altruistic be- 
haviors; the level of authoritarianism would 
determine which of the two behaviors would be 
displayed. 

Another explanation of the two behaviors 
associated with the single personality type 
may lie in cognitive factors. Users of the PD 
and Maximizing Difference games know that 
they are non-zero-sum games in nature, But 
they may not be so viewed by subjects who 
encounter them. If the two groups differ 
cognitively, two reasonable explanations are 
that the two groups are functioning at dif- 
fering cognitive levels or that the own group 
has learned to recognize a non-zero-sum 
situation. 

The implication that the non-zero-sum fol- 
lows the zero-sum situation in cognitive de- 
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velopment is based on the assumption that 
young children seem to function in a zero-sum 
manner. If the non-zero-sum situation were 
ed through practice, it would necessitate 
environmental factors to explain why 
vembers of this relatively homogeneous 
group had achieved the knowledge and others 
had not. This may be true. But the idea of 
ering cognitive stages is not incompatible 
vith Piaget’s stages of cognitive development. 
Non sum behavior might very well be a 
function of the formal operational stage. 
possibility that an explanation for the 

Uring of own and relative behaviors may lie 
1 tage theory of development requires that 
consid n be given to both Kohlberg’s 
i ral development (Kohlberg, 1969) 

) nger’s theory of ego development 
(Loevinger & Wessler, 1970). Both theories 
involve sequential progression through stages. 
Both offer stages that are essentially inde- 
pendent of age that would permit the in- 
dividual members of a particular age group to 
occupy several stages. 

This hypothesis is supported by the earlier 
findings of Deutsch (1960), who identified 
four types of behavioral responses to a PD 
came. Two of these were based on initial 

ponses. The individual who was initially 
voperative was considered trusting, whereas 
the individual who was initially competitive 
was called suspicious. The trusting group was 
associated with lower F scores than the sus- 
picious group. These are precisely the results 
found by Kuhlman and Marshello (1975b). 

Deutsch cited Adorno, Frenkel-Brunswik, 
Levinson, and Sanford (1950), who concluded 
that the authoritarian personality is a struc- 
(ure in which the superego is incompletely 
integrated with the ego. Deutsch contended 
that the difference was not of structure but of 
content, that the differences reflected failure 
to internalize changes in value. Such changes 
are expected to occur as the individual trans- 
fers from one stage of ego or moral develop- 
ment to another. The resulting implication is 
that the level of authoritarianism could shift 
parallel to a stage transfer. Thus, an individual 
of a particular personality type (e.g. relative 
own) could indicate one behavior at an earlier 
stage and the other at a later stage. However, 
data are insufficient at this time to attempt to 
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draw any specific comparisons between own, 
joint, and relative behaviors and the different 
stages. 

The only firm conclusion that can be drawn 
from this study is that while three unique be- 
haviors were demonstrated, they were sup- 
ported by only two unique personality patterns. 
Several explanations are possible, but no 
definitive answer to the question can be offered. 
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In a test of Argyle and Dean’s affiliative conflict theory, the present study ex- 
amined the effects of acquaintance on nonverbal intimacy and the compensatory 
adjustment of one interactant to a change in the gaze direction of her coactor. 
Female dyads of friends and female dyads of strangers engaged in discussions of 
Thematic Apperception Test pictures during which time their immediacy be- 
haviors were recorded. Prior to the third discussion, one member of each dyad 
was enlisted as an accomplice and instructed not to look at her coactor. As 
predicted, friends engaged in more individual gaze and mutual gaze and spent 
more time smiling than did strangers. However, the reduction in the gaze of 
he accomplice member of a dyad did not elicit a compensatory increase in the 
immediacy behaviors of the other member. It was concluded that while the 
data are consistent with the notion that approach and avoidance forces underlie 
the display of immediacy behaviors in social interaction, the notion of behav- 


ioral compensation requires a clarification of its parameters. 


Mehrabian (1967) has defined immediacy 
as the extent to which communication behav- 
‘ors enhance closeness to and nonverbal inter- 

tion with another. According to Argyle and 

ean’s (1965) affiliative conflict theory, both 
approach and avoidance forces underlie the 
immediacy of nonverbal behaviors in social 
.ction, Approach forces include the de- 

sire for social feedback and the gratification 
of affiliative needs. Avoidance forces include 
the fear of being open to scrutiny and rejec- 
tion, Interactants strive to maintain an equi- 
librium level of intimacy (point of mutual 
omfort) that is a function of several immedi- 
behaviors (e.g., physical proximity, eye 
ontact, and smiling). Further, a change in 
one of the components of intimacy on the 
part of one person elicits in the other a com- 
pensatory adjustment in one or more compo- 
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nents. This adjustment serves to maintain the 
equilibrium. 

If, as Argyle and Dean (1965) suggested, 
a nonverbal intimacy equilibrium develops 
between social interactants such that ap- 
proach and avoidance forces are balanced at 
a level specific to the interactants, then it 
might be expected that the intimacy equi- 
librium would be maintained at a more im- 
mediate level between those individuals whose 
approach forces are greater than their avoid- 
ance forces. Moreover, one may assume that 
this ratio of approach to avoidance forces 
would be greater between friends than be- 
tween strangers. Thus, one objective of this 
investigation was to compare the nonverbal 
immediacy of friends and strangers. A second 
objective was to test the notion of behavioral 
compensation. 


Level of Intimacy 


Evidence from a number of studies sug- 
gests that a higher level of nonverbal inti- 
macy is more characteristic of interactions 
involving friends than those involving stran- 
gers. Close distances (e.g, Little, 1965; 
Willis, 1966), little backward lean (eg., 
Mehrabian & Friar, 1969), and a high level 
of eye contact (e.g., Mehrabian, 1968; Rubin, 
1970) are typically associated with the com- 
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munication of positive interpersonal attitudes. 
However, methodological and conceptual con- 
siderations indicate the need for caution in 
generalizing these findings to an analysis of 
the intimacy equilibrium per se. First, al- 
though most researchers have considered only 
a single dimension of immediacy, the inti- 
macy equilibrium is thought to be a function 
of several immediacy dimensions.. Whereas 
positive interpersonal affect may be reflected 
in a high degree of immediacy along one 
dimension (e.g., eye contact), there may be 
a compensatory low degree of immediacy 
along some other unmeasured dimension (s) 
(e.g., body orientation). In order to examine 
systematically the intimacy equilibrium, con- 
sideration of several immediacy behaviors 
Seems to be essential. Second, some studies 
have involved the use of confederate-subject 
pairs wherein the confederate is instructed to 
maintain some preestablished pattern of be- 
havior (e.g., Goldberg, Kiesler, & Collins, 
1969). In addition, some investigators have 
employed projective measures of immediacy 
behaviors (e.g., Little, 1965), while still others 
have observed subjects’ behaviors in response 
to an imaginary, generalized other (eg., 
Mehrabian, 1968). The use of such method- 
ologies has precluded exploration of the 
intimacy equilibrium as a function of the 
dynamics of approach and avoidance forces 
concerning intimacy between two or more 
naturally responding interactants, 


Nonverbal Compensation 


In a recent literature review Patterson 
(1973a) concluded that the empirical evi- 
dence generally supports the notion of behav- 
ioral compensation. Among the evidence cited 
by Patterson are the findings that as the 
distance between persons decreases, they in- 
crease their angle away from a frontal ap- 
proach and reduce their eye contact (eg., 
Argyle & Dean, 1965; Patterson, 1973b). 
However, although Patterson (1973a) af- 
firmed that some of the clearest support for 
the compensation hypothesis comes from the 
relationship between eye contact and dis- 
tance, several investigations of nonverbal ad- 
justments to variations in level of eye gaze 
provide contradictory evidence (Breed, 1972: 
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Kleinke, Staneski, & Berger, 19 Uhayer, 
1969; Schneider & Hansvick, Not: Breed, 
for example, manipulated three ees of 
intimacy through variations in a con’ ‘erate’s 
level of gaze, body lean, and o tation, 
Contrary to the compensation hypo! is, in- 
creased intimacy resulted in recip: cal in- 
creases in eye contact and forward | ns on 
the part of subjects, 

Furthermore, it is questionable whe the 
vast majority of studies can be repre -nted 
as viable tests of the compensation hypo! »esis, 


According to affiliative conflict theory, com- 
pensatory reactions occur following a c/ mge 


in a previously established intimacy eq ilib- 
rium. However, in all of the experiments, 
with the possible exception of tha by 
Schneider and Hansvick (Note 1), sub/ects 


were exposed to a fixed level of nonve bal 
intimacy (e.g., close or far distance) rai er 
than to a change in behavior once an ` ij- 
macy equilibrium had developed. 

Thus, on the basis of Argyle and Dean's 
(1965) equilibrium model, two hypotheses 
were made: (a) During a dyadic interaction 
the level of immediacy is greater between 
friends than between strangers, (b) Following 
a series of interactions between two initially 
naive subjects, a reduction in the gaze of one 
subject effects a compensatory increase in one 
or more of the immediacy behaviors of the 
other subject. Also, the design of the study 
afforded the opportunity to investigate the 
stability of the intimacy equilibrium across 
two interaction sessions. 


METHOD 


Subjects and Observers 


The subjects were 80 female undergraduate volun- 
teers who were scheduled in 
consisted of friends, 


Dyads of friends were arranged by asking 20 initial 


immediacy behaviors. 
of the purpose of the 


a A 
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Experimental Setting and Behavioral 


experiment was conducted in a room (5.4 
m) that was divided in half by a wall partition. 
A bench (.5 X 1.8 m), on which the subjects 
>y side, was situated toward une end of 
Facing each end of the bench was a one- 
5 ror. A videotape camera and microphone 
a laced 3 m in front of the bench. 
following immediacy behaviors were mea- 
i interpersonal gaze, smiling, body orientation, 
sody lean, and distance. The immediacy behaviors 
wed during each of three 5-minute dis- 
essions. Interpersonal gaze was recorded by 
o vers during the first 3 minutes of each 
Each observer, situated behind a one-way 
3 used a hand switch that activated a pen 
| X vulative counter, and timer. Each ob- 
| 1 the switch during the time he 
l he subject farthest from him was 
ihe face region of her coactor. This 
silowed for the recording of the frequency 
tion of both individual gaze (the time a 
son spends looking at her coactor) and mutual 
ze (eye contact). Using the same apparatus, the 
observers recorded the frequency and duration of 
smiles during the last 2 minutes of each session. In 
; addition, the mean durations of individual gaze, 
tual gaze, and smiling were calculated. Body 
f ntation, body lean, and seating distance were 
ermined at the beginning, middle, and end of each 
session. The measures of body orientation and body 
lean were obtained by a third observer, who used 
‘| platform on the center of which was 
caned a movable dial. The surface of the plat- 
iorm was marked off into units of 5°. By moving 
the dial in line with the subject’s shoulders, the 
observer was able to estimate her body orientation; 
by standing the platform upright and moving the 
y dial in line with the subject's torso, body lean 
could be estimated. Whereas body orientation was 
ensured during the live interaction, the body lean 
distance measures were taken from videotape 
cordings. The distance measures consisted of head- 
te-lead distance and were determined by the use 
‘a ruler, The three scores obtained for each mea- 
sure of distance, orientation, and lean were aver- 
aged in order to derive a composite score for each 
dimension, 
A dyad’s level of immediacy was calculated by 
averaging both members’ scores for each of the 
+ following variables: individual gaze, smiling, body 
orientation, and body lean. Mutual gaze and distance 
already represented dyadic scores. Finally, using the 
videotape recordings, an independent observer rated 
(on a 7-point scale) the verbal intimacy of each 
dyad’s discussion during the first session. 


Procedure 


The subjects were solicited to participate in a 
study concerned with “interaction patterns among 
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persons in various social situations.” Upon their 
arrival the subjects were escorted into the experi- 
mental room and seated side by side on the bench 
facing the camera. The experimenter (a male) in- 
formed them that in the experiment they would 
engage in various joint activities during which time 
their behavior would be observed. Each subject 
was given a copy of the same Thematic Appercep- 
tion Test (TAT) picture. They were instructed to 
cooperate in making up a story involving the charac- 
ters shown in the picture. As soon as the experi- 
menter left the room, Session 1 began, and the 
observers recorded the subjects’ immediacy behaviors 
for a 5-minute period. 

After Session 1, the subjects were seated side by 
side at a table and presented with six small-group 
problems that were intended to encourage inter- 
personal familiarity. These problems were selected 
on the basis of their medium-high scale values 
concerning cooperation requirements and intrinsic in- 
terest level (see Shaw, Note 2). The subjects were 
instructed to discuss each problem and agree on a 
common solution. This phase of the study lasted 
20 minutes. 

Then the subjects returned to the bench and en- 
gaged in another 5-minute discussion involving TAT 
pictures (Session 2), during which time their im- 
mediacy behaviors again were recorded. At the end 
of Session 2, the subjects were taken to separate 
rooms and asked to provide some questionnaire data 
prior to continuing with the experiment. In actu- 
ality, for 10 of the 20 dyads at each level of ac- 
quaintance, while one subject completed a question- 
naire (items from the Kuder Interest Survey) de- 
signed only to occupy her time, the other was asked 
to serve as an experimental accomplice during the 
third session. Upon her consent (all of the subjects 
were willing to serve as an accomplice), the accom- 
plice subject was told that during the forthcoming 
discussion she would have to refrain completely 
from looking at her coactor. The experimenter 
stressed the importance of ensuring that her other 
behaviors (e.g., body orientation and lean) remained 
the same as they were during the two previous 
sessions (i.e. as they were under natural circum- 
stances). During the 15 minutes required by the 
naive subject to complete the questionnaire, the 
accomplice subject rehearsed her role while inter- 
acting with the experimenter until she felt relatively 
comfortable in enacting the manipulation. 

Then the subjects returned to the experimental 
room for the third session. They again constructed 
a story about a TAT picture, and the immediacy 
behaviors of both persons were recorded. After 


1 Pilot research suggested that the use of Thematic 
Apperception Test cards maintained subjects’ interest 
and facilitated conversation. Three separate cards 
were used in the study. Two of the cards were pre- 
sented in a counterbalanced order during the first 
two sessions; the third card always was used during 
the final session. 
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TABLE 1 
Mean ĪMMEDIACY BEHAVIORS FoR EACH LEVEL OF ACQUAINTANCE AND SESSION 
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Friends Strangers 

Immediacy behavior Session 1 Session 2 Session 1 
Frequency of individual gaze 18.30 18.83 14.78 
Duration of individual gaze* 37.65 37.43 23.48 
Mean duration of individual gaze* 2.06 2.02 1.54 
Frequency of mutual gaze 8.80 9,50 5.30 
Duration of mutual gaze* 11.68 12.14 5.38 
Mean duration of mutual gaze* 1.19 1.20 -90 
Frequency of smiles 6.03 8.05 6.05 
Duration of smiles* 18.38 21.55 
Mean duration of smiles* 3.23 2.63 3. 
Body orientation” 28.99 23.89 30.60 
Body lean? 3.82 2.22 2.68 
Distance? 54.30 50.40 54.60 


Note. For each cell, n = 20, 
* In seconds. 

b In degrees. 

© In centimeters, 


Session 3, the experimenter discussed with the sub- 
jects the purpose of the study, thanked them for 
their participation, and asked them not to discuss 
the study with their peers. A postexperimental inter- 
view revealed that whereas the subjects who were 
exposed to the reduction in their coactor’s gaze were 
aware of the change in behavior, none associated 
it with the objectives of the experiment. 


RESULTS 
Preliminary Analyses 


The subjects were asked to indicate on a 
6-point scale how well they knew their co- 
actor prior to the experiment. Confirming the 
validity of the acquaintance manipulation, 
all of the subjects in the friends condition 
reported that their coactor was either a close 
or very close friend, whereas those in the 
stranger condition reported that either they 
had never spoken to their coactor or had 
never seen her before. 

To estimate interobserver reliability with 
respect to the immediacy dimensions, data 
were obtained during several training sessions. 
The correlations for 27 estimates of body 
orientation and 20 estimates of body lean by 
the two observers were .98 and .80, respec- 
tively. Estimation of the reliability of judg- 
ments of gaze direction involved pairing the 
four observers with each other during two 
3-minute intervals. Using Exline’s (1963) 
method of computation, interobserver agree- 
ment averaged 86.1%. The correlation between 
ratings of verbal intimacy by the independent 


observer and a second person, based n a 
sample of 11 dyads, was .83. Relia! lity 
estimates were not obtained for distance nd 
smiles. However, the use of a ruler to mea re 
distance ensured the reliability of its mea- 
surement, and the training sessions sug- 
gested that smiles were readily oi served an i 
recorded. 


Level of Intimacy 


The 12 dependent measures were subjecte | 
to a 2 X 2 multivariate analysis of variance. 
The factors were acquaintance (friends and 
strangers) and session (Session 1 and Ses 
sion 2), Acquaintance was the only factor 
that attained significance, F(12, 67) = 3.25, 
p< .001. As predicted, friends were more 
immediate than were strangers. 

A 2 X 2 repeated-measures analysis of vari 
ance was also performed on each immediacy 
behavior. The mean scores are presented ir 
Table 1. Acquaintance was a significant 
source of variance for frequency of individual 
gaze, F(1,78) = 15.19, p< 001, duration of 
individual gaze, F(1, 78) = 19.28, p < .001, 
mean duration of individual gaze, F(1,78) 
= 11.79, p < .001, frequency of mutual gaze, 
F(1, 78) = 16.48, p< .001, duration of mu- 
tual gaze, F(1,78) = 16.48, < 001, mean 
duration of mutual gaze, F(1,78) = 11.48, p 
< .01, and duration of smiles, F(1,78) = 
8.78, $ < .01. Friends engaged in more indi- 
vidual gaze and mutual gaze and spent more 
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time smiling than did strangers. In addition, 
analyses of simple effects of the acquaintance 
-y Session interaction, F(1,38) = 5.48, p< 
revealed that friends smiled more fre- 
than strangers during the second 
W(1,38) = 6.31, p<.05, but not 
se first session. The effect of session 
| significance only for distance, F(1, 
9.09, p < .O1. Subjects sat closer during 
second session than during the first. 
inally, a £ test indicated the absence of a 
« difference between the verbal inti- 
‘riends and strangers (Ms = 3.0 and 
vectively). 
ough the analyses of variance yield 
| format egarding changes in the mean 
| vehaviors across sessions, they 
| e, if any, information regarding 
ve stability of the dyads. Thus 
¿moment correlations were computed 
tween Sessions 1 and 2 for each immediacy 
behavior. The stability coeffiicents for friends 
and those for strangers are shown in Table 2. 
Phe stability of seating distance and body 
ntation was high for both friends and 
gers. However, the stability coefficients 
ior the remaining behaviors were generally 
higher for friends than for strangers. 


ion 


onverbal Compensation 


In considering the notion of compensation, 
we distinguish between interpersonal adjust- 
„ment, a person’s reaction to changes in the 


TABLE 2 


ELATIONS BETWEEN SESSION Í AND SESSION 2 FOR 
EACH ĪMMEDIACY BEHAVIOR 


Immediacy behavior Friends Strangers 
Frequency of individual gaze 33 45 
Duration of individual gaze* .66"* 19 
Mean duration of individual gaze*  .79*** 31 
. Frequency of mutual gaze 34 08 
; Duration of mutual gaze A6* 13 
Mean duration of mutual gaze A2 39 
Frequency of smiles 35 is 
Duration of smiles* .50* —.06 
Mean duration of smiles* 590%. —.28 
Body orientation one 74" 
Body lean* 58** 07 
Distance "97" 7244 


s Correlations for friends and strangers are significantly 
different, p < .05, one-tailed. 
*p <05 
+p <.01, 
+p <00. 
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behavior of another, and intrapersonal ad- 
justment, a person’s reaction to changes in his 
own behavior. In testing the hypothesized 
compensation effect, we used measures of 
interpersonal adjustment. Recall that in Ses- 
sion 3, at each level of acquaintance, for 
half of the dyads one member was instructed 
not to look at her coactor (gaze reduction 
condition), whereas for the remaining dyads 
no manipulation was enacted (control condi- 
tion). The immediacy behaviors of the naive 
subjects in the gaze reduction condition were 
compared with those of the subjects in the 
control condition, For each immediacy be- 
havior, scores in the gaze reduction condi- 
tion represented those of the naive member 
of the dyad, while scores in the control con- 
dition represented the average of both dyad 
members’ scores. Inspection of the data con- 
cerning Sessions 1 and 2 revealed that in 
spite of random assignment, the gaze reduc- 
tion and control groups were dissimilar on a 
number of immediacy behaviors. Therefore, 
the dependent variables were subjected to 
a 2X2 (Acquaintance X Gaze Condition) 
multivariate analysis of covariance. The scores 
for each behavior in Sessions 1 and 2 served 
as the covariates, Contrary to the compensa- 
tion hypothesis, gaze condition was not a 
significant source of variance. Similarly, a 
series of 2 X 2 univariate analyses of covari- 
ance revealed that in none of the analyses 
did gaze condition attain significance. 

In training the accomplice subjects not to 
look at their coactor, it was hoped that they 
would be able to avoid making adjustments 
along other dimensions of their own behav- 
ior. There is some evidence to indicate that 
it is difficult to control for such intrapersonal 
adjustments.? The recording of the immediacy 
behaviors of the accomplice subjects during 
Session 3 permitted an analysis of such intra- 
personal adjustments. (It should be noted 
that all of the accomplice subjects success- 
fully refrained from looking at their coactor 
during Session 3.) Thus, the immediacy be- 


2 For example, in a study involving the manipul 
tion of approach distance, Patterson and Sechy 
(1970) observed that as they approached a subj 
even practiced confederates tended to orient tl 
selves less directly and maintain less eye co 
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haviors (other than those involving visual 
behavior) of the accomplice subjects in the 
gaze reduction condition were compared with 
the immediacy behaviors of the subjects in 
the control condition (i.e., the same control 
group scores as those used in the analysis of 
interpersonal adjustment). A 2 X 2 (Acquain- 
tance X Gaze condition) multivariate analy- 
sis of covariance (with scores in Sessions 1 
and 2 used as covariates) indicated that gaze 
condition was not a significant factor. How- 
ever, a series of 2 X 2 univariate analyses of 
covariance revealed that the effect of gaze 
condition was significant for frequency of 
smiles, F(1,34) = 16.85, p < .001, duration 
of smiles, F(1,34) = 2.92, p < .09, and body 
orientation, F (1,34) = 24.24, p < .001. Not 
only did the accomplice subjects refrain from 
looking at their coactor, but they tended to 
smile less and orient themselves less directly 
than did the control subjects. In short, the 
experimental manipulation of interpersonal im- 
mediacy was more pronounced than originally 
intended, 


Discussion 


The results offer support for the prediction, 
derived from Argyle and Dean’s (1965) af- 
filiative conflict theory, that the level of non- 
verbal immediacy is greater between friends 
than between strangers. The univariate analy- 
ses of variance suggest that this differential 
level of immediacy was primarily a function 
of individual gaze, mutual gaze, and smiling. 
Although these results generally agree with 
those of previous studies, one noteworthy dis- 
crepancy pertains to distance. Unlike other 
research, in which close distances were asso- 
ciated with positive interpersonal attitudes, 
in the present study friends did not sit closer 
together than strangers. Although it was 
noted in the introduction that most of these 
experiments did not entail the interaction of 
naive subjects and thus are not directly com- 
parable to the results of the present study, 
investigations of real-life friendship have also 
yielded results that conflict with the present 
findings. One possible explanation for this dis- 
crepancy is that whereas other researchers 
(eg, Willis, 1966) employed measures of 
standing distance, we employed a measure of 
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seating distance. Due to the fa ut stan 
ing persons have greater maneu\ lity 

seated persons, perhaps standing ince is 
more sensitive measure of affilic orient; 
tion than seating distance and | is mor 
likely to discriminate between ds an 
strangers, Alternatively, interpe: 1 affec 


may be expressed by means of 
mediacy cues. However, the role p! 
given immediacy cue is situational contin. 
gent. In the present context visual -havior 
and smiling were the salient exp ors of 
interpersonal affect, whereas in anot) r con 
text distance and other cues may en “ge as 
significant. 

The lack of a significant difference |. tween 
Sessions 1 and 2 in the immediacy o sub- 
jects’ visual behavior, body orientatio: , and 


d by 


body lean is consistent with the findi: s re- 
ported by Patterson (1973b). Moreo `, al 
though the increase in proximity fror Ses- 
sion 1 to Session 2 is in accord w the 
results provided by Pedersen and shears 
(1974), it is possible that this finding was an 
artifact of the experimental proced: t 


is, having participated in Session 1, t b- 
jects may have chosen closer seating iis- 
tances in Session 2 in order to monitor more 
easily one another’s reference to her TAT 
card. On the basis of the equilibrium modd, 
we might have expected a greater increase in 
immediacy (other than just proximity, and 
particularly among strangers) from Session 1 
to Session 2 as a result of the increase in- 
approach motivation, which generally ac- 
companies the development of a relationship. 
However, given the relatively short period «f 
interaction and the confines of the exper i- 
mental context, the subjects may not have 
had the opportunity, nor perhaps the desire, 
to increase their initial levels of immediac ye 
The stability coefficients computed between 
Sessions 1 and 2 are consistent with affiliative 
conflict theory. Whereas seating distance and ` 
body orientation were highly stable for both 
friends and strangers, the remaining immedi- ~ 
acy behaviors generally were more stable for 
friends than for strangers. If, as affiliative 
conflict theory suggests, the intimacy equi- 
librium is more clearly defined for friends 
than for strangers, it follows that there is L 


$ 
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less need for nonverbal negotiation on the 
part of the former than the latter, Conse- 
immediacy behaviors should show 
:» stability among friends than strangers. 
sults did not support the prediction 
duction in the visual behavior of one 
of a dyad would effect a compensa- 
ncrease in one or more of the immedi- 
behaviors of the other member, The 
search evidence suggests that compensatory 
found following variations in 
he immediacy of interpersonal distance (e.g., 
\ ¿ Dean, 1965) but not following vari- 
the level of individual gaze (e.g., 
ed J2). On the basis of this evidence 
nt findings, one might infer that 
more potent component of inti- 
is eye contact. If so, the lack of 
£ nsatory effect is readily explicable. 
ever, in view of the fact that the accom- 
plice subjects reduced the immediacy of their 
smiling and body orientation as well as their 
saze, it is likely that the cumulative loss in 
nacy was at least comparable to the re- 
tion in intimacy of those manipulations 
volving only a decrease in proximity. Under 
such circumstances, one would expect com- 
‘ory reactions to have occurred, and 

lid not.? 

Our main recourse appears to be to ques- 
tion the validity of the compensation hypothe- 
sis. We suggest that the compensation hy- 
pothesis represents an oversimplification of 
what actually happens during social interac- 

n and that efforts should be directed toward 
dolineating the parameters of those situations 
in which compensation occurs and those in 
vhich it does not occur. For example, as Pat- 
terson (1973a) has suggested, compensatory 
reactions might be expected “anytime an un- 
usual expression of immediacy behavior is 
encountered” (p. 247). Thus, one important 


memb 


reactions àre 


variable may be the intensity of change in < 


immediacy (and perhaps the direction of 
change). However, in the present instance, 
the change in immediacy was anything but 
slight. Alternatively, a major consideration 
may be the variability of meaning attributed 
to changes in the immediacy (particularly 
gaze) of another. For instance, in the present 
context, the subjects may have interpreted 
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the accomplices’ behavior as reflecting an 
increase in concentration on their TAT pic- 
ture (the accomplices typically enacted the 
manipulation by focusing on their card) 
rather than as an attempt to reduce interper- 
sonal intimacy. 

In summary, the present data are consis- 
tent with the view that approach and avoid- 
ance forces underlie the display of immediacy 
behaviors in social interaction and that such 
forces are balanced in the establishment of an 
intimacy equilibrium. However, the evidence 
suggests that the notion of behavioral com- 
pensation requires clarification. 


3 Although the additional reductions in immediacy 
on the part of the accomplices are in potential con- 
flict with the notion of compensation, it may have 
been too awkward for the accomplices to smile and 
maintain a more direct orientation while not looking 
at their coactor. 
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Changes in Leadership Appraisal as a Function of the Stress 


of a Simulated Panic Situation 


Andrew L. Klein 
U.S. Central Intelligence Agency 
Washington, D.C. 


Research by Mintz and Kelley, Condry, Dahke, and Hill left unresolved the 
question of what level of personal threat leads to group incoordination of 
“panic behavior.” The present study set out to resolve this issue and to explore 
the additional relationship between leadership and behavior in the panic situa- 
tion. The dependent variables were Likert-type questionnaire items designed to 
investigate the level of responsibility attributed to a leader by members of a 
group as a function of four variables: (a) two levels of stress (threat of shock 
or of small monetary loss); (b) two levels of leadership authority (elected or 
appointed); (c) two conditions of leadership (“me-last” or “me-first”); and 
(d) group success or failure. The specific questions were adapted from earlier 
pilot work by Sulzer and Sisti. One hundred and forty-four males were run in 
this 2X 2X 2X2 factorial design with a nonfactorial control group. The task 
setting, which required subjects to retrieve their wooden cones through the 
same hole, was designed to be analogous to the kind of situation that would 
occur in a theater fire where only one narrow exit existed. The results support 
the contention of Kelley that group incoordination (panic) increases under 


personal threat. Additional results were: (a) Leaders facilitated achieving the 
goal of safe exit; (b) elected leaders were given more responsibility and were 
seen as more competent than appointed leaders, but only when the stress is com- 
paratively low, as otherwise the difference in evaluation narrows and reverses; 
and (c) success or failure seems to have had little effect on the leader’s eval- 


uation by the group. 


The problem of panic behavior has long 
of practical and theoretical significance. 
Panic has contributed to a needlessly high 
loss of life in natural and man-made disaster 
situations, While our knowledge of other as- 
pects of group behavior has increased sig- 
nificantly in the past two decades, our knowl- 
edge of panic behavior has shown very little 
stantive increase since LaPiere’s (1938) 
intuitive analysis of the problem. 

The literature that exists on panic consists 
of post hoc impressions, recollections, and 
speculations with little substantive material 
amenable to systematic interpretation. In the 
entire literature on panic, there have been 
only three empirical investigations prior to 
this current study. There are three possible 
reasons: (a) Panic behavior is a form of 
group behavior that does not permit rigid, 
precise control to the degree we normally de- 


Requests for reprints should be sent to Andrew L. 
Klein, Central Intelligence Agency, Washington, 
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sire for experimental research in psychology; 
(b) J. P. R. French (1944/1964) did simu- 
late a fire in a locked room containing sub- 
jects. This was in complete violation of cur- 
rent ethical standards and points to needed 
methodological innovation; (c) French’s 
method lacked systematic controls and yielded 
results which are difficult to express in quan- 
titative terms (Schultz, 1964). 

An approach to the study of panic behavior 
with fewer methodological problems is pro- 
vided by a simulation of the sort used by 
Alexander Mintz (1951). Mintz was in con- 
flict with the earlier views on the question of 
the role of perceived danger. Mintz explicitly 
denied that intense fear (or “emotional ex- 
citement”) is responsible for nonadaptive 
group behavior., Mintz emphasized that per- 
sons are merely reacting to the reward struc- 
ture of the situation. As long as escape is 
orderly, each sees that his interests lie in con- 
tinuing to cooperate with others. However, 
when a few individuals begin to create a dis- 
turbance in the use of the exit, the others will 
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recognize the threat to their own escape and 
perceive that their only hope lies in trying to 
push through. Thus, the individual is no less 
rational or moral in the panic than in any 
other situation. He is always in pursuit of his 
own interests. Mintz characterized this pur- 
suit in terms of potentially available rein- 
forcement and punishment contingencies and 
predicted that if the reward structure is un- 
stable or unpredictable for the participants, 
their behavior will reflect this uncertainty and, 
in turn, be unstable and unpredictable. Thus, 
Mintz saw a direct relationship between the 
specifics of reward and the specific behaviors 
of the group. 

Mintz’s experimental paradigm was de- 
signed to show that it is possible, sys- 
tematically, to produce disorganized, uncoop- 
erative, nonadaptive group behavior resulting 
in “traffic jams.” The situation was repre- 
sented as a game where a subject could win 
or lose money depending on how successful he 
was in pulling on a string to remove a wooden 
cone from a narrow-necked bottle, Success 
was defined in terms of succeeding under ar- 
bitrary time limits or avoiding wetting the 
cone as the bottle slowly filled with water 
from the bottom. In a situation where no 
tangible rewards or losses were evident there 
was little evidence for “jam ups.” Mintz 
claimed support for his reward-structure the- 
ory of panic. 

In 1965 Kelley, Condry, Dahlke, and Hill 
challenged Mintz’s methods and conclusions, 
The most important criticism they made was 
that Mintz’s method could not be considered 
to test the effects of danger. Mintz’s subjects 
were promised small monetary rewards, rang- 
ing from 10¢ to 25¢, for escaping from the 
bottle in time, and even smaller fines, from 
1¢ to 10¢ for failing to do so. It is, according 
to Kelley et al., unlikely that the results from 
this situation shed much light on behavior 
under conditions of high stress, 

Kelley et al. stated that a queue is the 
logical escape plan if planning time were 
available. Without planning time, or a leader 
to provide an “instant” plan, one would ex- 
pect the behavior of the group to be a func- 
tion of the most dominant responses in the 
response hierarchy, since it has often been 
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shown that an increase in arousa lds a ree 
version to dominant response p 33 (Cote 
trell, 1971; Zajonc, 1965). They under 
the pressure of perceived threat, i minant 
response of dashing to the exit, ¢ if mal- 
adaptive, would be expected. 

More than a decade after Min Celley, 
and his co-workers set out to exp panic 


behavior, four factors were of inter: 
present author: (a) the perceived vel of 
threat; (b) the size of the collectivi -: (c) 


susceptibility of group members to so al jn- 
fluence; and (d) the availability of “con- 
fidence” responses concerning anticipat ` suc- 
cess of the outcome for the group. 

The attempt by Kelley et al. to con ‘ruc- 
tively criticize earlier work and improve + eth- 
odology was fraught with problems. The oro- 
cedures for establishing a “group” and “p: aic 
and the necessary relationships lacked ‘ace 
validity and ignored Mintz’s findings o° the 


value of face-to-face confrontation in pro: uc- 
ing appropriate relationships. 

The level of perceived threat does seen. to 
be an important factor, and small mone: iry 
loss seems a poor parallel to the threat of 
real physical danger. Thus, the earlier me'h- 
odologies already mentioned are to be refined 
by this study in the hope of clarifying the 
antecedents of “panic behavior.” 

Kelley et al. pointed out Mintz’s lack of 
concrete proposals for dealing with panic be- 
havior. It is believed that relevant situational 
variables must be identified before such pro- 
posals can be made. Leadership variables 
seem to be relevant factors. 


Leadership 


Effective leadership behavior is best defined 
as those acts that are functionally related to 
goal achievement. Acts instrumental to 
achieving the goals of the group include mak- 
ing suggestions for action, evaluating move- 
ment toward the goal, preventing activities ir- 
relevant to the goal, and offering solutions 
for goal achievement. It is in this latter con- 


text that leadership was studied in this ex- 
periment. 


Group Success and Failure 


To the extent that group members are goal 
oriented, it follows that Successful leaders will 
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p be viewed quite differently by their followers 
than less productive leaders, If, as Selznick 
1/1957) says, the purpose of a leader is to “de- 
ihe ends of group existence, to design an 
e distinctly adapted to these ends, 
e that the design becomes a living 
(p. 37), then it is reasonable to as- 
tun devaluation of the leader who is not 
sful or leads an unsuccessful group. 
an, Hollander, and Regula (1969) and 
sisti (1972) found just this result. This kind 
f is, what Hollander and Julian (1968, 
il transactional analysis, is similar in 
t to the reward-structure analysis of- 
Mintz. In transactional terms, the 
ks success (escape from the 
ng stimuli in the present study) 
iccess is a function of the group 
cader. The leader who fulfills ex- 
ions and helps to achieve group goals 
provides a rewarding resource for others in 
exchange for status, esteem, and greater in- 
fluence, This is an extension of the logic-of- 
‘exchange theory as espoused by Thibaut and 
iey (1959). Thus, the leader gives some- 
ig and gets something; and what he gets 
contributes to his legitimacy insofar as he is 
lidated” in his role by his followers. 
iressful situations the relationship be- 
ween perceived legitimacy of the leader and 
success seems emphasized. Sargent and Wil- 
liamson (1966) point out that an open threat 
„to the survival of the group is an important 
‘factor in bringing out leadership tendencies. 
hey reason: “Undoubtedly the most devoted 
towers are attached to leaders who have ef- 
fectively solved a major crisis” (p. 431). A 
is affecting a group increases the proba- 
bility that the group will react favorably to 
an effective action by the leader and reject 
the leader who, by his failure, takes on aver- 
sive qualities, 


Source of Authority: Leader Legitimacy 


A leader has been distinguished from a 
head, in that a leader’s authority is accorded 
him by his followers, whereas the authority 
of a head derives from some external source 
(Gibb, 1954), There is evidence that an 
elected supervisor, who is a leader rather than 
a head, is able to control work output more 
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effectively than an assigned leader because 
his power is seen as having greater “legiti- 
macy” (Raven & French, 1958). 

Thus, the leader’s source of authority af- 
fects the leader’s legitimacy in the perception 
of the group members. Secord and Backman 
(1964, 1974) state that leader legitimacy is 
acquired through a formal recognition of the 
leader’s role. In a bipolar fashion, the only 
other source of authority would be from out- 
side the group as when a leader is designated 
or appointed by an outside authority. Since 
these could be viewed on an ingroup, out- 
group basis a differential value could be as- 
signed to these two different leaders by the 
group members. McGinnies (1970), Gibb 
(1954), Raven and French (1958), Hol- 
lander and Julian (1970), Thibaut and Kel- 
ley (1959), and Sisti (1972) have found sup- 
port for this hypothesis. Julian et al.’s (1969) 
results suggested that election, rather than 
making a leader more secure, made him more 
vulnerable to censure if he either was initially 
perceived to be incompetent or subsequently 
failed to secure a successful outcome as 
spokesman for the group. While this finding 
alone does not sustain a generalization that 
the appointed leader is necessarily more 
firmly entrenched, it does support the conclu- 
sion that the leader’s source of authority is 
perceived and reacted to as a relevant ele- 
ment in the leadership process. 

Thibaut and Kelley (1959) and Blake and 
Mouton (1961) documented the differential 
reaction of followers to their elected or ap- 
pointed representatives, Their results indicate 
that elected leaders (a) have their directive 
attempts reacted to in a Jess hostile manner; 
(b) are initially seen as more intellectually 
competent than appointed leaders; and (c) 
are seen as having more procedural skills 
than their appointed counterparts. 


Leadership Style 


The concept of “leader style” would in- 
clude the differential aspects of action taken 
by different leaders in the same type of situa- 
tion, Virtually all of the research on leader 
style under stress has centered around the 
characteristics identified in the original Lewin, 
Lippitt, and White study (1939) (Hamblin, 


) 
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1958; Hook, 1943; Waring, Dwyer, & Jun- 
kin, 1939/1968; White & Lippitt, 1960). 

A specific aspect of leader style seems rele- 
vant to the situation of high stress and lim- 
ited escape possibilities. A question of im- 
portance is: Does the leader “Jead”? his group 
by actually existing first (“me-first” leader) 
and thus guaranteeing his own safety, or does 
the leader offer to “go down with the-ship,” 
(“me-last” leader) potentially martyring him- 
self for the good of the group? This aspect of 
leadership style is unique and opens a new 
avenue for the exploration of leader—follower 
relations. On the basis of a just-world prin- 
ciple (Lerner, 1965), the group reaction to 
the martyr would be one of devaluation. The 
person who sees someone suffer misfortune 
will be motivated to believe that the un- 

fortunate victim in some sense merited his 
fate, There is supporting data relevant to this 
assertion (Lerner, 1965, 1966; Shaw & 
Sulzer, 1964; Walster, 1966). 
It is possible, therefore, that the captain 
going down with his ship may be seen as 
behaving appropriately, for he was responsible 
for the ship and its passengers. Likewise, the 
group leader in the panic situation should, by 
the same reasoning, be seen as responsible for 
the outcome. A successful outcome “forgives 
all sins” and even elevates the leader who 
risks self-sacrifice (martyr) to hero status 
(Julian et al., 1969; Torrence, 1961; Wal- 
ster, 1966). On the other hand, group failure 
might lead to significantly more condemnatory 
evaluations of the leader with special dis- 
favor for the martyred leader, 


Attribution of Responsibility 


Most past leadership research has focused 
primarily on the leader. Only recently has the 
group members’ evaluation of the leader’s 
role been of empirical interest (Hollander & 
Julian, 1970; Sisti, 1972). An important de- 
terminant of the evaluation is the amount of 
responsibility which the group members at- 
tribute to a group leader for a particular task 
outcome, 

Sulzer (1964) found that the most potent 
determinant in the attribution of responsi- 
bility is tha’ of the perceived causal structure, 

as exhibite ‘n five levels of responsibility 
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proposed by Heider (1958). Heider min- 


tained that positively seen, high-status ver- 
sons are perceived as being more respor sible 
for positive events and more internally di- . 
rected. Thus, the elected (internal ligh- 
status) leader should be (a) more effe- ive 
(Raven & French, 1958) and (b) attrib» ted 
more responsibility than the more negat ely 4 
seen, lower status appointed leader. A © ady 
by Thibaut and Riecken (1965) pred ted 
that leaders initially “caused” by extern ` or 
internal controls (appointment and elec’ on, | 
respectively) should receive differentially at- 
tributed levels of responsibility, It was fo und 


that the behavior of high-status persons 
perceived to be internally caused and 
status persons as externally caused. T 
high status seems to mean elected, intern: ly 
controlled, and more responsible. Low sta as 
is the mirror image. This study adds mre 
data to this issue. 

More recent research has found that a per- 
son will be judged more stringently if the oui- 
come is a failure than if it is a success (e.g., , 
Julian et al., 1969; Walster, 1966). In Julia, 
et al. (1969) it was also shown that electe 1 
leaders were blamed more for the negati e 
group outcome than were the appointed lea. - 
ers. Lerner (1965) reports that an appointe 1 
partner in a dyad is held to be more ri- 
sponsible for negative outcomes and this adds 
support to Heider’s view. 

In the present study an interaction was prc- 
dicted for the effects to task outcome and 
leader style. Successful martyrs are “heroes” 
and high-status (Torrence, 1967) and success- 
ful leaders who conform to expectancies ia 
light of the reward-punishment contingenci:s 
are high-status, and thus these should be seca 
as internally caused leader styles with high — 
attribution of responsibility. Failure and ma’ - 
tyrdom should be very low-status and exter- 
nally controlled perceptions should exist. The 
attribution should be differential in favor of 
the high-status leaders as Thibaut and Rieken — 
(1965) predict. f 
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Hypotheses 


The present study investigated the level 
of responsibility attributed to a leader by | 
members of a group in a simulated panic , 
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shock cuff 


la, Panic Simulator 


female 


lc. Shock Cuff and Ball 


Ficure 1, Panic simulator and component parts, 


situation with four variables: (a) two levels 
of stress (threat of shock or of small monetary 
loss); (b) two levels of leadership authority 
(elected or appointed); (c) group success or 
failure; and (d) two styles of leadership 
(“me first” or “me last”). 

It was predicted that the level of stress 
would be greater in the shock condition, which 
in turn would emphasize the effect of stress 
significantly more than monetary loss on the 
leadership variables. This increased stress 
should increase group incoordination by re- 


ducing cooperative pressures and increasing 
self-interest pressures. The leaders will, there- 
fore, be seen as more valuable (higher at- 
tribution of responsibility) in the high-stress 
conditions than in the low-stress situation. 

Elected leaders were expected to receive 
more attribution of responsibility than ap- 
pointed leaders for the task outcomes and this 
effect would interact with the style of leader- 
ship as the preceding analysis above would 
predict. 
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METHOD 


Subjects 


One hundred and forty-four white college age 
males (18-24 years of age) were induced to partici- 
pate in this study by being offered, in exchange for 
their participation, a chance to win one of four 
cash prizes given away by random drawing at the 
end of the data collection period of the experiment. 
Two $100 checks and two $50 checks were the prizes. 


Apparatus 


Three rooms were used. Two approximately 30 X 
12 foot (9.1 X 3.7 m) rooms served as waiting room 
and questionnaire room, respectively. A third room, 
8X12 feet (2.4 X 3.7 m), was labeled the “escape” 
room where the emergency simulation apparatus was 
placed. 

Figure 1 gives the shapes and dimensions of the 
cones and the “panic simulator” in the escape room 
and shows how the wires were attached to the sub- 
ject, cone, and shock generator. The cones were 
made of balsa wood and the exits stand of 1-inch 
(2.54 cm) plywood. The stand was painted white 
and the cones were various bright and distinctly 
different colors. 

The stand consisted of two platforms. The larger, 
lower platform (24 X 24 1 inches; 61 X 61 X 2.54 
cm) stood on four 6 inch (15.2 cm) Jegs and held a 
Lafayette shock generator with a dial on one side 
that had a scale from 1 to 100 units on its dial. 
Drilled through the center of the platform was a 
series of six .25-inch (.635 cm) holes forming a 
circle at the center of the platform base. These holes 
were for the six wires that led from the shock gen- 
erator, under the platform, through the holes to the 
cones. The second smaller (12 X 12 X 1 inches; 30.5 X 
30.5 X 2.54 cm) platform stood 23 inches (58.4 cm) 
above the lower platform on four legs which were 
bolted to both platforms. In the center of the smaller 
platform a hole was cut of just sufficient size to 
allow passage of one cone at a time. A funnel with 
its tip removed was bolted below the exit hole so 
that the flat-topped cones would not catch on the 
opening. 

The wires from the shock generator through the 
lower platform to the cones were white, multistrand, 
two-wire lamp cord, These wires were secured under 
the lower platform by a knot in the cord, thus pre- 
venting tension during the experiment from discon- 
necting the wires from the shock generator. The 
cord terminated in a female jack fitting. 

Inside each cone the female jack was connected to 
a male jack fitting (see Figure 1b). The wire from 
the male jack passed out the top of the cone, through 
the upper platform exit hole, through a 2-inch (5.1 
cm) diameter red, solid rubber ball, with the wire 
knotted on each side of the ball, and then fastened 
to an elastic cuff (see Fig. 1c). The cuff consisted 
of an elastic strip with “velcro” fasteners on each 
end allowing for adjustability and secure fastening 
of the cuff to the wrist of any subject. Two copper 
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bolts attached to the cuff and to the wires provid- 
contacts for the electric current. The net result > 

a shock cuff for each of the six subjects (four sv: 
jects and two confederates) and a red ball to po] 
on to remove the cones from the emergency si! 
tion. 

The shock generator had a hand-operated swi 
on a 10-foot (3.04 m) cord that allowed remote « 
eration by the experimenter, Thus, with the h: 
switch on, shock passed from the generator, thro: 
the jack in the cones to the shock cuff. Shock co: 
be stopped by turning the remote hand switch 
turning the on-off switch on the generator to | 
off position, or by pulling the cone through t e 
“escape” hole in the upper platform where the 10y 
cord would pull up taut and continued tension cau 
the jacks to separate, breaking the circuit to i 
shock cuff. 

A second 10-foot (3.04 m) remote switch operat: 
a buzzer mounted on the board next to the sho 
generator. 

A 22-item questionnaire was constructed usin 
items from unpublished sources on attribution o 
responsibility (Sisti, 1972; Sulzer, 1964). These stud 
jes were used as pilot work that provided pretested 
questions on the various components of the experi- i 
mental situation, 


Design 


The experimental variables combined in a 2 X 2 X 
2 X 2 factorial design (Stress X Leadership Authority 
X Leadership X Outcome) with a nonfactorial con- 
trol condition analyzed separately. 

Stress. Two kinds of motivation to escape were 
used. Replicating Mintz’s work was a monetary loss 
condition where 25¢ was won or lost. In a condi- 
tion where shock was the threat, a painful sample 
shock (5 mA) was given before the experimental 
trial and the participants were told that the shock 
following failure to escape would be far more painful. 

The control group was designed to allow the test 
of the hypothesis of increased group incoordination 
as a function of the level of fear, threat, or panic. 
Mintz (1951) claimed his threat of small monetary 
loss was adequate to produce “paniclike” behavior. ie 
Kelley et al. (1965) disputed this, 

Leadership authority, Leaders (who were confed- 
erates in the experiment) were either elected or ap- 
pointed, 

Leadership. Leaders provided two kinds of leader- 
ship, either the me-first Jeader who escaped first, 
reducing danger to himself if his leadership were fol- 
lowed, or the me-last leader, who volunteered to go 5 
last, thus having the greatest risk of loss. The control 
condition had no leader. 

Outcome. The groups either succeeded or failed; 
that is, the last person (in the me-last condition, the 
leader; in the me-first condition, the other confed- 
erate) did not make it out when time was up, re- 
sulting in being “shocked” or monetary loss. 
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Procedure 


rhe subjects were randomly assigned to a 1-hour 
experimental session at the time they signed up for 
th- eyperiment. Confederates were also randomly as- 

Upon arrival at the waiting room the four 
subjects for each session found two early ar- 

rival’ —the experimental confederates. When the total 
gre of six had assembled, they were moved into 
esting room where each sat at one of six booths 

„i were lettered A, B, C, D, E, or F. A letter 
natching the booth letter was pinned to each per- 
shict end the experiment was “explained” as a 
study of panic behavior and leadership. 

point a “leader” for the group was either 
} by supposed secret ballot or “arbitrarily” ap- 
d by the experimenter. In either case one of 
vifederates was always manipulated into being 
the | The control group had no leader and this 
st tted. 
rticipants then moved with the experi- 
» the small “panic” room where the ap- 
ıwajted them. The door was closed on the 
owded room. 

Jhe device, described in the apparatus section, was 
called a panic simulator and it was explained and 
demonstrated carefully to the group. Any questions 
concerning its function and method of use were 
answered, 

The shock cuffs were attached and a shock sam- 
ple of 5 mA was given to all subjects. The meter 
cale on the shock generator was pointed out and 
since the needle deflection at 5 mA was about 4 
of the scale on the meter, the subjects were told 

vil the “real” shock would be eight times the in- 
sity of the sample. A knob on the generator 
labeled “shock intensity” was then twisted clock- 
wise. In the money condition the cuffs were attached, 
but there was no shock generator present in the room 
and 25¢ was given to each subject. The subjects 
were told that they could lose this money if they 
lid not fulfill the conditions of the escape, and 
that they could keep it if they did escape. 

in capsule form the situation was as follows: All 
six participants (4 subjects and 2 confederates) were 
cuffed to the wires that lead to the different-colored 
cones which in turn lead through their plugs to 
wires that lead to the shock generator. When a very 
loud (approximately 80-90 dB, SPL) buzzer was ac- 
tivated by remote wire by the experimenter, con- 
versation was all but impossible and the escape was 
to fake place. The appointed or elected leader was 
permitted to explain his plan of escape for the group 
for only 5 sec preceding the buzzer. This buzzer 
had a threefold purpose: (a) It effectively sup- 
pressed conversation making unlikely any attempt 
to usurp leadership or confound the manipulation by 
added comment, (b) it acted as an arousing stimulus 
increasing the stress and, of course, (c) it signaled 
the “escape” period. During the 5 sec one of two 
prearranged plans were espoused by the leader-con- 
federate. In one plan the leader said, “Me first 
[pointing to himself] then you, you, you [around 
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the circle] . . . and you last!,” pointing to the other 
confederate. In the other plan the leader said, “You 
first [pointing to the confederate] then you, you, 
you, [around the circle] . . . and me last!” In the 
first plan the me-first Jeader was set up, and in the 
second plan the me-last leader. In both cases a con- 
federate was always first and last in the sequence. 
This sequence was a parallel of the queue suggested 
by Kelley et al. (1965) as the logical solution of the 
problem of escape through a limiting exit. 

At the start of the buzzer, which occurred imme- 
diately at the end of the 5-sec “leadership” period, 
escape was to occur. During the explanation by the 
experimenter to the subjects it was repeatedly stressed 
that the escape time before their time was up and 
shock (or monetary loss) occurred was very short, 
but left undefined. 

In successful groups the buzzer remained on until 
everyone escaped, regardless of any other factor. In 
the failure groups the buzzer terminated just when 
the confederate (either the me-last leader or the 
second confederate) was the only one left with his 
cone still in the apparatus. The confederate then 
feigned very great pain or sincere disappointment at 
the loss of his quarter. Thus, in actuality, only the 
brief sample shock was ever actually received by 
subjects. 

As quickly as possible, the six participants were 
uncuffed and returned to their respective letter- 
labeled booths in the testing room to fill out the 
questionnaire. 

After completion of the questionnaire the confed- 
erates were introduced and a thorough debriefing 
followed where all ruses were exposed and explained. 
The subjects were assured that the raffle was real 
and that they were now eligible to win. The im- 
portance of not disclosing what had occurred was 
stressed and everyone was asked if they had heard 
about the experiment from anyone who had par- 
ticipated in an earlier group. Only one group had to 
be discarded because of this problem, and a replace- 
ment group was run, 

The subjects were thanked and dismissed, exiting 
through a side door so that the confederates would 
not encounter subjects arriving for the next session, 

Additionally, a control condition consisting of two 
groups was run, These two groups were four-man 
leaderless groups that were required to escape as 
best as they could from a situation of threatened 
shock or small monetary loss. Every attempt was 
made to equate these two groups with their “led” 
counterparts on all aspects of the situation except 
that no leader of any kind was ever mentioned, 


RESULTS 
Manipulation Checks 


Part of the questionnaire used in eygiia- 
tion included three items designed’ to tap 
whether the participants perceiyed the de- 
pendent variable manipulations as intended. 
There were errors in perception at a rate of 
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4.8%. This 95.22% accuracy was deemed 
only of borderline adequacy so the conserva- 
tive alpha level of .01 was chosen to assist 
in assuring the validity of the results. Find- 
ings of differences between means with proba- 
bility levels greater than .01 are considered 
trends only. 


Factor Analysis 


Separate factorial analyses of variance were 
conducted for each question on the question- 
naire and the total questionnaire score for 
each subject. Many questions proved to be 
redundant measures of the same thing. It had 

ot been possible to anticipate completely 

is redundancy because of the disjointedness 
the previous research in this area. 

Previous work in this area predicted that 
there should be a “leader competency” com- 
ponent to the data (Hollander & Julian, 
1970) and also an “attribution of respon- 
sibility” component (Sisti, 1972). With this 
in mind a principal components factor anal- 
ysis with varimax rotation (Veldman, 1967) 
was conducted on the 16 questions that ulti- 
mately constituted the questionnaire. The 
number of items was reduced to 16 from 22 
by discarding the manipulation checks, and, 


TABLE 1 
ANALYSIS OF VARIANCE ON FACTOR SCORES OF 
“LEADER COMPETENCE” 


Source of 
variations df SS MS F 
1 1 2.25781 2.25781 5.4233* 
2 1 00781 00781 
3 1 -63281 63281 
4 1 94531 94531 
12 i. 32031 1.32031 
13 1 07031 07031 
14 1 -00781 -00781 
23 1 38281 38281 
24 1 «00781 «00781 
34 1 19531 -19531 
123 1 -63281 -63281 
124 1 38281 38281 
134 1 -19531 19531 
234 1 63281 63281 
1234. 1 -07031 -07031 
Within 
«replicates 112 46.62500 41629 
Toil 127 54.36719 
Mh, 
a Source of Viðyriation: 1 = Stress: shock versus monetary 
ss; 2 = Lead hip authori cted versus appointed; 


cess versus failure; 4 = Leader style: me- 


te 


= Outcome: 
rst versus me- 
*p < 0i. 


f 


” 
£ 


Ä 
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TABLE 2 


ANALYSIS OF VARIANCE ON FACTOR SCORES 0: 
“ATTRIBUTION OF RESPONSIBILITY” 


É 
eaten df SS MS 
1 1 
2 1 
3 1 
4 1 
12 1 
13 it 
14 J 
23 1 aot s 
24 1 
34 1 S 
123 1 3.78125 7.2 
124 1 .28125 
234 1 1.53125 
1234 1 28125 
Within 
replicates 112 56.25000 50223 
Total EZT =. 71.21875 


a Source of variation: 1 = Stress: shock versus monetary 
loss; 2 = Leadership authority: elected versus appointed 
3 = Outcome: success versus failure; 4 = Leader style: me 
first versus me-last, 

*p <.01 


also, three questions which were found con- 
fusing to subjects and/or open to varied in- 
terpretations by independent judges. 

Two clear factors did emerge and an anal- 
ysis of variance was conducted on factor 
scores of the items involved (Tables 1 and 2). 

Factor 1 was labeled Perceived Leader 
Competence. As can be seen in Figure 2, the 
money-versus-shock manipulation produced a 
statistically significant effect, F(1, TZ) 
5.4233, p < .01, on the perception of leader 
competence. Where monetary loss was the 
threat, the leader was seen as much less com- 
petent than when shock was the threat. 

This important finding lends support to the 
criticism of Mintz’s (1951) methodology by 
Kelley et al. (1965) that monetary loss is a 
very different form of threat or panic inducer 
compared to shock. The subjects decidedly 
do respond to the situation differentially un- 
der these two conditions. 

The more positive evaluation of the leader 
in terms of competence attributed to him in 
stress situations is a partial replication of the 
Sargent and Williamson (1966) study, which 
found that the group reacted more favorably to 
a leader under stress than nonstress and was 
especially true when the leader was success- 


CHANGES IN LEADERSHIP APPRAISAL 


3 
2 
: 0625 
ï 1 
5 -20313 
3 La 
Money Shock 


Levels of Fear Arousal 


2, Leader competence based on factor scores. 


extent, It is reasonable to specu- 
ce subjects saw their leader as gen- 
ssful for in no instance did they 
nly occasionally did another person 
-and he may have “deserved” it, as 
as predicted from the research by Lerner 
(1965, 1966). 
Factor 2 was labeled the Attribution of Re- 
ponsibility factor. Two interactions proved 
nificant in the analysis of the scores. 

The interaction of leadership style and 
source of authority shows an interesting rela- 
tionship between attribution of responsibility 

he election-appointment variable. Hol- 
vader and his co-workers and Sisti (1972) all 
found support for the contention that elected 
leaders receive more responsibility for the 
outcome than appointed leaders. This finding 

s supported, F(1, 112) = 7.294, p < .01, 
by these data; but, in addition, it can be seen 

this relationship is only valid if the 
¿coders style is the me-first one. If the leader 
is a me-last leader, then the attribution of re- 
sponsibility relationship is reversed (Figure 
3), and dramatically so. If the me-first leader 
is perceived by the subjects as fulfilling the 
eader role in an expected manner this find- 
ing isi y interpretable. The reward-struc- 
ture hypothesis predicts that individuals will 
take advantage of an opportunity to secure 
their own safety. The leadership format in 
this study allows just this possibility. If the 
leader did not take advantage of this chance 
to secure his own safety his leadership could 
be perceived as unusual. The me-first leader 
fulfills expectancies and is therefore seen more 
positively. Julian, Hollander, and Regula 
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(1969) anticipate this interaction of style and 
authority when they state that election makes 
the leader more vulnerable to devaluation if 
he is perceived as incompetent or unsuccess- 
ful. The me-last leader can be perceived as 
either a martyr-type leader or as a leader 
who fails to act as anyone else would given 
the same situation. The martyr fails to fol- 
low reward-structure thinking and, therefore, 
may be seen as incompetent or foolish. It 
should be remembered, additionally, that Ler- 
ner’s (1965, 1966) work suggests that martyrs 
are devalued. Thus, the reaction to elected or 
appointed leaders is not just a function of 
their source of authority, but also the style 
of leadership the leader adopts once in power. 
This is also suggested by the diverse reaction 
of followers to various leadership styles in 
Lewin et als (1939) research. The demon- 
stration of this interaction may well explain 
the conflicting findings concerning whether 
leaders selected by election or appointment re- 
ceive high or low attribution of responsibility. 

The second significant interaction, F(1, 
112)= 5.423, p < .01, in Factor 2 is that of 
source of authority, leadership style and level 
of panic or threat (shock versus monetary 
loss). Figure 4a demonstrates that the three- 
way interaction has the same relationship of 
attribution of responsibility, leader style and 
elected versus appointed as in Figure 3 when 
stress is high (shock). If the stress is reduced 
(Figure 4b) the relationship changes, Under 
low stress the elected leader always receives 
higher attribution of responsibility than the 
appointed leader. From these data it is ap- 


3 A appointed 
a A olected 
= 
z aA 
A 
s (] 
æ 
S 1 
c 
s K 
5 
2 
£ -3 
< 
-4 
Leader Martyr 


Leadership Style 


Fıcure 3. Attribution of responsibility based on 
factor scores. (Leadership style.) 
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Leadership Style Under Threat of Monetary Loss 


Ficure 4, Attribution of responsibility based on 
factor scores. (4a, Leadership style under threat of 
shock. 4b. Leadership style under threat of monetary 
loss.) 


parent that (a) “panic behavior” is a function 
of the stress level involved as Kelley et al. 
(1965) stated, (b) that the conflicting find- 
ings of whether more attribution of respon- 
sibility goes to elected leaders or appointed 
leaders is resolvable when it is realized that 
leadership style and level-of-stress are both 
important variables that must be systemati- 
cally manipulated in order to get predictable 
results. This again supports the conclusion 
that there is a weakness in Mintz’s “panic” 
manipulation as a useful variable in producing 
paniclike behavior. 


Control Group 


By pooling all the data in the shock-with- 
leader groups and the data in the monetary 
loss-with-leader groups (df = 63 for each ex- 
perimental group) data from the resultant 
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two experimental groups were analyzed with 
the two control groups (df = 7 for each con- 
trol group) in a one-way analysis of var nce 


(Table 3). 
In terms of fear aroused by the situ. ‘on 


shock was much more threatening thar he 
fear of monetary loss. The least fear aro ed 
was in the leaderless-money-loss group; ‘he 
next least fearful was the led-money vss 
group; next, shock with a leader present as 
fear arousing, and finally, the most fe: ‘ul 
situation had shock as the threat and 


leader present (Scheffé test, p < .001). 
ordering does not completely fit the con pt 
that groups with leaders would have less 
than leaderless groups, but it does, in gene 11, 
support this prediction and that shock is m ʻe 
fear arousing than monetary loss. 

The hypothesis that group incoordinati n 
would increase under higher threat was st 
stantiated by the measure of the number 
“traffic jams” (Table 4). Very clear suppo 
was also found for the hypothesis that le 
groups would have less incoordination, “traffic 
jams,” than leaderless groups. The rank or- 
dering for jam-ups was experimental group 
money; experimental group-shock; contr 
group-shock; control group-money, with di 
ferences significant at the .001 level based o 
a Scheffé test. 


CONCLUSIONS 


This study has provided several potentiai 
contributions to the literature. 

First, the methodological equipment prob- 
lems of studying panic behavior were reduced, 
The “panic simulator” developed here ex- 


TABLE 3 


CONTROL GROUPS Versus EXPERIMENTAL Groups 
ANALYSIS OF VARIANCE AND TREATMENT 
MEANS ON FEAR AROUSAL 


No. 
= Treatment repli- Ran 
Variable M cations orde 
Experimental 
Money 3.5938 64 i* 
Shock 7.0156 64 ar 
Control 
Shock 4.0625 8 4* 
Money 5.7500 8 Ph 
* Differences between variable orders are significant at .001 


as determined by a Scheffé test. 


| 
d 


ae 
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Tintz’s technique and added the addi- 
necessary factor of stress to produce 


tional 


Tp ze behavior, 


od, the study seems to support the con- 
that Mintz’s (1951) manipulation— 
havior-producing monetary loss—does 
duce the same kind of behaviors as 
eat of real bodily harm. It is suggested 
hese findings that future research into 
aic behavior use as high a level of per- 
as ethically possible. 
leadership has been shown to be a 
cnt factor in the behavior of panic- 
individuals. Leadership can reduce 
ipcoordination that can make a minor 
xw tragedy. In adding to the 
der-stress literature this study 
eaders are seen as more competent 
increases. This may account for their 
liveness, 

Fourth, the style of leadership also affects 
the evaluation of the leader’s capabilities. The 
leader who leads in a style that is expected, 

‘ven the circumstances, is seen more posi- 
cly in all respects than the martyr-type 
ider who is, as Lerner (1965, 1966) pre- 

dicted, devalued. It appears that these data 
pport a notion that the negative value to 
subjects of nonfulfillment of leadership 
expectancies outweighs their obvious observa- 
tion that the me-first leader uses his power 
to protect himself and further jeopardize his 
followers. This apparent contradiction in re- 
ward-structure values merits further study, 

Vinally, the use of attribution of respon- 

lity measures has yielded strong elected- 
appointed differences. Hollander and Julian 

1970) demonstrated these differences only 

with difficulty. The use of attribution mea- 
sures seems to provide a more sensitive 
technique than the traditional questionnaire 
items used in the past. In this study, elected 
leaders were, as predicted by Sisti (1972), 
given more attribution of responsibility than 
appointed leaders. This relationship only holds 
under low-stress situations; otherwise the dif- 
ference in evaluation narrows and reverses its 
relationship, 

These findings contribute to a possible reso- 
lution of the contradictory findings in the 
area of attribution of responsibility to the 


not } 


the 


ceived threat 
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TABLE 4 
CONTROL Groups Versus EXPERIMENTAL GROUPS 


ANALYSIS OF VARIANCE AND TREATMENT 
MEANS ON “TRAFFIC Jams” 


No. 


é Treatment repli- Rank 
Variable M cations order 
Experimental 
Money 0.0 64 ise 
Shock A375 64 the 
Control 
Shock 1.00 8 3* 
Money 1.50 8 4* 


* Differences between variable orders are significant at .001 
as determined by a Scheffé test. 


leader. Leadership responsibility evaluation is 
a function of more than merely election ver- 
sus appointment. The level of stress and the 
style of leadership must also be taken into 
account, i 
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» _ Attsinment Value and Type of Reinforcement: A Hypothesized 
Interaction Effect 


Richard L. Sorensen 
University of Illinois at Urbana-Champaign 


Tt was hypothesized that one’s attainment value for a particular activity and 
the type of reinforcement given in pursuing the activity interact to influence 
performance, Ninty~one Grade 6 students were given an instrument designed 

measure their attainment values for different skill areas. Of these students, 
X 28 males and 23 females who had particularly high or low values for either 
e sports or language arts were assigned a two-choice discrimination learning 
task. The stimulus materials for the two tasks were different enough to enable 
them to be cloaked as measuring sports or language ability. All subjects were 
randomly assigned to one of three treatment groups and given either tangible 
reinforcements, verbal praise, or mere knowledge-of-results feedback. The pre- 
dicted Reinforcement Type X Attainment Value interaction was found to reach 
significance on the performance measure. Neither sex nor ability estimates for 
the skill areas were found to signiñcantly influence the hypothesized inter- 
action. Although overall, those with a high value for the task performed sig- 
nificantly better than those with a low value, such an effect was apparently 


modified by the type of reinforcement given. 


ae 
\ rather large body of research has been ings as to the actual effects various reinforcers 
i concerned with the effects of reinforcement on have on task performance. Some studies have 
achievement behaviors of children. Con- found verbal praise to become increasingly 
eration has been given to the effects of-vari- effective with age, whereas others have found 
ous reinforcers on size discrimination (Meyer the opposite effect. Some have found reproof 
& Seidman, 1960; Terrell & Kennedy, 1957), to be most effective overall, and others have 
probability learning (Lewis, Wall, & Aronfreed, suggested that tangible rewards best facilitate 
1963; McCullers & Stevenson, 1960), discrim- performance. Still others have reported no 
x ination learning (Spence, 1970; Witryol, Low- differential performance effects due to rein- 
~ den, Fagan, & Bergen, 1968), and test per- forcement type. Although several studies have 
formance (Hurlock, 1924; Rosenfield, 1972). found older children to prefer verbal praise in- 
t- Both developmental (Stevenson, 1965; Witryol, centives, at least one investigation (Runyon & 
1971; Witryol & Ormsby, Note 1) and social Williams, 1972) has demonstrated that incen- 
; (McGrade, 1966; Zigler & Child, 1969; tive preferences do not affect performance. It is 
Zi jer & Kanzer, 1962) hypotheses have been the thesis of the present study that some of this 
advanced to explain subjects’ differential re- disparity may be accounted for on the basis of 
sponsiveness to reinforcers. an achievement variable that has been largely 
4 From this extensive body of literature, how- ignored: the child’s attainment value for the 
© ever, there have emerged rather disparate find- activity being reinforced. 
Value is an achievement construct that has 
pera i te been given at least some consideration in most 
This study is based on a dissertation submitted by the aaar theoretical statements on achievement. 
author in partial fulfillment of the requirements of the Lewin (1935), Rotter (1954), and Atkinson 
5 ec at the University er Be Sree (1957) are among those who have given value 
d e a À ank A 1 ran ol 5 p a, 
Dweck aa e ae ESAE a prominent theoretical Beaten but for the 
and Robert Linn and Stewart Jones for their helpful most part they have failed to explicate it 
sikgan as members of the dissertation Se thoroughly. Recently, Crandall (Note 2) has 
equests for re] ichard L. : : : i 
Sore LE Deuce Chae Health given more detailed treatment to attainment 
Center, P. O. Box 31, Chester, Illinois 62233. value, a construct she feels has for too long been 
Ps, a 
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considered global in nature. People hold differ- 
ent values for different goals, and they prob- 
ably hold very few values that are the same 
across all skill areas. 

Two studies reported by Battle (1965, 1966) 
are among the few that have given an impor- 
tant emphasis to attainment value. Junior high 
students’ academic performance in two subject 
areas (English and mathematics) were pre- 
dicted on the basis of the student’s minimal 
standard, his/her certainty of reaching the 
standard, his/her expected grade, and the value 
the student placed on doing well in each sub- 
ject area. Both relative attainment value (the 
importance placed on doing well in one area 
compared to other school subjects) and abso- 
lute attainment value (the value of competence 
in one area regardless of other areas) were 
found to be highly predictive of grade per- 
formance. 

Both type of reinforcement and task attain- 
ment value have been demonstrated to influ- 
ence children’s achievement behavior. Further, 
it seems quite possible that one’s attainment 
value for a particular activity could be an im- 
portant determinant of the effectiveness of a 
particular type of reinforcer. The present study 
then, was designed to test the hypothesis that 
attainment value and type of reinforcement 
function interactively to influence one’s level 
of performance. Specifically, it is predicted 
that in performing a task of low value, one’s 

performance will be enhanced more by tangible 
reinforcements or verbal praise than mere 
knowledge of results. Conversely, the perfor- 
mance of those assigned a high-value task 
should be enhanced more by knowledge of 
results than by tangible reinforcement or verbal 
praise, 

METHOD 

Subjects 

The entire sixth grade of one elementary school (91 

stuc- ois) was administered an attainment value ques- 


tionns:re, Based on their responses to the instrument, a 
sample of 51 children (2 iles, 23 females) was selected 
for participation in t} nental phase of the study. 
Student ages ranged í years 8 months to 14 years 


2 months. Overall, ti 
socioeconomic group, 


nts represented a middle 


Pretesting 


vised to assess the ab- 
iety of skills, each of 


A pencil-and-paper test 
solute attainment values « 
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which could be classified into one of three sl. | areas; 
English/language arts, mathematics, and spis. For 
each skill area, the question was asked, “How i»: oortant _ 
is it to you to be good at 2” To minimize 1e pos- 
sibility that ratings would be based on perceive bility, 
the subject was explicitly instructed not to ʻe the 
items on the basis of how good he/she is but si. ly ac- 
cording to how important it is for him/her to « ell in 
a particular area, A 5-point Likert-type scale | lowed 
each item on which the subjects were to rate i r per- 
formance from “very important” (1) to “not v im- 
portant” (5). After completing the attainmen value 
instrument on a separate sheet, the subjects we: asked 
to rate their ability in each of the three skill are» using 
a 5-point scale from “very good” (1) to “not ver, __ od” 
(5). 

Because the subjects were more ous with 


sports and English/ 
5, only these two test 
jects to,either a 


regard to their attainment values i 
language arts than in mathemati 
items were actually used to assi,, 


s n r, Sen 
“high” or “low” attainment-value condition. Su ‘ets 
who showed either uniformly high or uniformly | at- 
tainment values across all skill areas were not se! ted 
for participation in the subsequent experiment. 
Experimental Task 

Because discrimination learning tasks are commo ly 


used in the study of reinforcement effects (eyr., Meyer 
& Seidman, 1960; Spence, 1970; Terrell & ! -nned ; 
1957), such a task was used in the present experime' t, 
The task is most similar to the one used by Spence © id 
Segner (1967). 

The task for each subject consisted of six pairs o! ne 
drawings of human figures that were presented by mc ns 
of a slide projector. For both sports and language ta :s, 
the drawings were of sports officials’ signals, chosen 
the basis of their likely unfamiliarity to the subje 
(from lacrosse, basketball, and football). Those assign d 
to the sports task were told that their ability to lev :n 


new umpire signals was a measure of their sports ability. 


Those working on the language task were told that their 
ability to learn a new sign language (that the signa's 
were claimed to represent) was a measure of their lo \- 
guage arts ability, The only objective difference betwe on 
the two sets of stimulus items was that for the spo ts 
task, the man’s shirt was striped (like an umpire s), 
whereas for the language task it was solid. Although he 


task (discrimination learning) was the same for both ‘he — 


sports and language groups, the slight difference in 
the stimuli enabled the experimenter to describe eac: as 
being a measure of different abilities, 

Each subject performed only one set of stimulus 


items, either sports or language arts. For each pair, the 7 


subjects were instructed that one item was the “right” 
choice and the other “wrong.” It was explained that 
their task was to learn to choose the right one through 
repeated exposure and feedback from the experimenter. 


Treatment Groups 


Based on the results of the attainment value instru- 
ment, each subject was randomly assigned to work on 


À 


9 
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ATTAINMENT VALUE AND TYPE OF REINFORCEMENT 


the set (sports or language) associated with either his/ 
her high-attainment-value area or his/her low-attain- 
ment-value area. Within each high-value or low-value 
group, subjects ndomly assigned to one of three 
reinforcement cond: In the tangible reinforcement 
group, subjects wer arded tokens for each correct 
response that were redeemable for prizes. In the 
verbal praise group subjects were praised by the 
experimenter for each correct response (e.g, “You're 
good,” “You're finc”), In the knowledge-of-results 
group, the subjects were given no verbal feedback from 
the experimenter. Rather than having the reinforcing 
events constitute the subjects’ only source of informa- 
tion about response correctness or incorrectness, it was 
decided to add a lizh: signal for each correct response 
in each of the three treatment groups (see Spence & 
Segner, 1967). in this way, the information value of in- 
correct responses would be unambiguous and constant 
across the treatment groups. Thus the treatment condi- 
tions form a 2 X 3 factorial design: two value groups 
(high and low) and ‘| ce reinforcement groups (tangi- 
ble reinforcement, | praise, knowledge of results). 


Procedure 


ed individually in a room sepa- 
rate from his/be issroom. The subject was seated 
facing the wall which the stimulus materials ap- 
peared. The siide projector was positioned on a table 


Each subject v 


directly behind the subject behind which the experi- 
ment: of the subject’s view. After 
being priate instructions, each subject 
was sho e pairs presented by means of 
4 X 6 inci 2 cm) index cards. These pairs 


ns presented on the subsequent 
task. The subje: instructed to indicate his/her 
choice of the corre.’ item by saying “right” or “left.” 
Tt was insured that cach subject had an accurate under- 
standing of which was left and which was right. In 
rder to expose the subject to the appropriate reinforc- 
ing event, the first and third practice pairs were con- 
sidered correct and the second and fourth were incorrect. 
The light signal and reinforcers were used as they had 
been subsequently on the nonpractice trials to accustom 
the sub to (hem. Further, the variation of the serial 
positior nd the left-right spatial position- 
ing of th: ems were explained and demon- 
strated. Also, vere informed that a blank frame 
with a horizon «indicated the end of the series of 
items, 
The experiment: 


were similar 


ist of six pairs was then adminis- 
tered by means of a 35-mm rotary-tray slide projector. 
Each pair was presented for 5 sec, after which the pro- 
jector advanced to the next pair, with no blank frames 
appearing between. Thus, the subject was required to 
choose between the items in each pair during the time 
that the pair was exposed and not afterward. The cor- 
rect member of each pair was randomly predetermined 
and was the same for all subjects exposed to the sports 
or language task. For each condition, both light signal 
and reward/feedback were given immediately after the 
subject’s response. 

After the exposure of the last (eighth) pair in the list, 
the same list of items was again presented but in a differ- 
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ent serial order and with the spatial positioning of each 
pair reversed. This same list of six pairs (with two order 
variations) was thereafter presented for a total of 13 
trials or until a criterion of 2 errorless trials had been 
reached. After the subject had completed 13 trials or 
reached criterion, he/she was awarded the same tangi- 
ble reward (a large balloon) whether promised or not 
during the experiment. The subject was then escorted 
back to his/her classroom, 


Results 


The performance index used was the number 
of trials to reach a criterion of two consecutive 
trials on which five of the six responses were 
correct. 

Tt seems logical to assume that since the two 
tasks differed only on whether the stimulus 
figures wore a solid or striped shirt, the tasks 
were of equal difficulty. However, several 
analyses were performed to insure that this was 
the case. First, it was determined that overall 
the performance of those working on the sports 
task was not significantly different from those 
working on the language task (¢ = .08, ns). 
Second, to determine if the task factor was 
interacting with any of the other factors, sev- 
veral two-way analyses of variance were per- 
formed, with “task” as the first factor and each 
of the following as the “other” factor: sex 
(male, female), value (high, low), type of rein- 
forcement (tangible, verbal praise, knowledge 
of- results), and ability estimate (high, low). 
In all of the analyses considered, none of the 
factors were found to interact significantly with 
the task factor. Thus it can be concluded that 
the tasks are equivalent. Even though the tasks 
were presented as measures of different abili- 
ties, they did not elicit different performance 
levels overall. 

To test the hypothesis that attainment value 
has an influence on the effectiveness of various 
reinforcers, a 3 (reinforcement type) X 2 (at- 
tainment value) analysis of variance was per- 
formed. The interaction was significant, /*(2, 
45) = 3.82, p < .05. Statistical contrasts re- 
vealed significant differences between the two 
groups given knowledge of results and between 
the verbal praise/high-value and knowledge of 
results/low-value groups (Scheffé tests, $ 
< .05). Table 1 reveals that those working on 
the high-attainment-value task performed opti- 
mally given knowledge of results alone. There 
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TABLE 1 
Mean NUMBER OF TRIALS TO CRITERION 


Attainment value 


Treatment condition Low High 
ibli ard 

ene le rew: as aa 
SD 1,69 4,74 
‘bal praise 

ver Eo 7.80 5.10 
SD 3.42 1.45 

Knowledge of results 
1. p 9.00 5.00 
SD 2.50 1.73 


were no systematic differences in the verbal 
praise or tangible reward conditions. 

A second result was the finding of a signi- 
ficant main effect for attainment value (p 
< .01). Overall, those who performed the task 
that had a high value for them did significantly 
better than those working on their low-value 
task. This main effect was, of course, modified 
according to the type of reinforcement given. 

Among other significant results were that 
those subjects assigned to the high-value con- 
dition were found to have higher ability esti- 
mates for both language (¢ = 3.34, p < .005) 
and sports (¢ = 3.76, p < .001) than those as- 
signed to the low-value condition, That high 
value often means a high estimate of one’s 
ability is not surprising. Although one can only 
speculate as to the causal direction of this rela- 
tionship, it would follow that value indirectly 
influences the estimate of one’s ability (based 

on success/failure experiences). 

Another interesting result with the attain- 
ment value instrument is concerned with sex 
differences. Battle (1966) found boys’ attain- 
ment value for mathematics to be higher than 
girls’ and girls’ attainment value for English to 
be higher than boys’. The same result was 
found in the present investigation based on 
responses of 51 boys and 36 girls originally 
given the instrument. Boys had a significantly 
higher attainment value for mathematics than 
girls (1.72 vs, 1.12; ¢ = 12.29, $ < .001), and 
girls had a significantly higher value for Eng- 
lish/language arts than boys (2.17 vs. 1.92; 
t= 4.08, p < .001). Additionally, boys were 
found to have a higher attainment value for 
Sports than girls (2,39 vs. 2.22; ¿= 2.27, p 
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< .05). That boys had a higher value í- sports 
and mathematics and girls had a high -> valued 
for language arts should come as no rprise, | 
Because most girls tend to have a h` ory of 
social reinforcement for language cor «tence 
and boys are taught to value sports anc nathe- 
matics, one could expect this result. 
Discussion 

One curious and rather unexpecte result 
that deserves comment is the overall | insig- 
nificant main effect found for the type. rein- 


forcement given. That it apparently mac little 
difference in performance whether erbal, 
praise, knowledge-of-results feedback, ov oken 
reinforcements are given is itself notew “thy. 
Earlier studies cited in the literature ften 
suggest that the type of reinforcement iven 


does affect performance. However, the ay s of 
the subjects, the types of tasks given, anc the 
exact reinforcement conditions used wei 
varied as to make any overall conclusi ns 
equivocal, 
No specific hypotheses were advanced c a- 
cerning an overall main effect for attainn 
value. Although this main effect did reach 
nificance, the type of reinforcement given 
parently had a modifying influence. Indepe d- 
ent of reinforcement effects, one might exp ct 
value to have an overall effect. Battle (1906) 
consistently found high correlations betwoon 
students’ values and their grade performance. 
Even more important than a nonsignificant | 
overall effect of reinforcement or a main effect 
of attainment value was the fact that when con- 
sidered in interaction with attainment vali e, 
reinforcement apparently had an effect. At 
least in the case of subjects given knowledge of 
results alone, the present results Suggest that 
attainment value may influence reinforcement 
effectiveness. Subjects with a low value for the 
task performed significantly more poorly than | 
those with a high value in the knowledge. of- 
results group. Knowledge of results is undoubt- 
edly the most commonly used t ype of (non- 
aversive) feedback used in the classroom or any 
learning situation. Because of its prominent 
position in the educational setting, any con- 
clusions that could be drawn regarding its 
optimal use could have some educational im- 
plications. The above finding could yield such 


ig- 
\p- 
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ATTAINMENT VALUE AND TYPE OF REINFORCEMENT 


a conclusion: The value the subject holds for 
the task is a variable that demands consider- 
ation when correctness feedback is given. If it 
can be determined that the subject has a par- 
ticularly low value for the task at hand, one 
might more effectively use another type of 
reinforcement/feedback. On the other hand, if 
the subject holds a high value for the activity, 
knowledge of results may well be the most effec- 


tive reinforcement type. 
That dierent reward systems are used freely 
in classroom/clinical settings cannot be dis- 


puted. Although in the clinical setting the par- 
ticular reinforcement used is (ideally) the one 


found most effective in producing and main- 
taining | desired behavior, in the classroom, 
tradit onvenience has dictated the rein- 
forces Í choice (usually correctness feed- 
back). here can be little question that the 


attainment value held by those being rewarded 
has gone largely ignored. Before instituting an 
operant conditioning program in a classroom or 
establishing a clinical token economy system 
in a group setting, it may be wise to consider 
the attainment values held by the participants 
in «uch programs. Similarly, before continuing 
to give the traditional knowledge-of-results 
feedback to students in any learning situation, 
an examination of the students’ value for the 
task at hand could prescribe a more effective 
type of reinforcement. 

O’Leary and Drabman (1971) have sug- 
gested a number of factors that may critically 
influence the success of a token reinforcement 
program including the teacher, the child, the 
parent, and the system of reinforcement. The 
results of the present study may suggest that 
this list be lengthened to include the attainment 
value for the task held by the child. 

Overall then, it appears that the system used 
in the present study for assessing attainment 
value is a legitimate one. Through a procedure 
of simply soliciting self-ratings, one can ap- 
parently derive an estimate of a behavioral 
construct that is related to performance and 
ability estimates. In any event, attainment 
value as it is presently assessed deserves fur- 
ther investigation and validation in future 
studies of achievement behavior. 
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Egotism in Males and Females 


Walter G. Stephan, David Rosenfield, and Cookie Stephan 
University of Texas at Austin 


This study investigated the attributions of males and females for success and 
failure at a competitive game. It was found that males competing against males 
or females and females competing against females took more credit for success 
than they gave successful opponents and blamed themselves less for failure 
than they blamed their opponents for failing. Females competing against 
males, however, did not make these egotistical attributions. When the subjects 
accounted for their opponents’ outcomes, male actors were given more credit 
for success and were. blamed less for failure than female actors. It is suggested 
that females playing against males were not egotistical because they did not 
believe they were as competent at a masculine task as were their male opponents. 


In situations where ego involvement is high 
and opportunities for social comparison are 
present, people tend to take credit for their 
successes and deny blame for their failures. 
This attribution pattern, labeled egotism 
(Snyder, Stephan, & Rosenfield, 1976), has 
been obtained in a number of studies (i.e., 
Streufert & Streufert, 1969; Wolosin, Sher- 
man, & Till, 1973; also see the critical review 


by Miller & Ross, 1975). Three recent studies 
usin s and females have obtained 
results ig that females may be less 
egotistical males. In a study by Nicholls 
(1975), sixth-zrade boys and girls succeeded 


or failed on an angle-matching task, It was 
found that girls took less credit for their suc- 
cess and more blame for their failure than did 
boys. Levine and Reis (in press) and Feather 
and Simon (1973) have obtained a similar 
pattern of attributions for college men and 
women when they worked on anagrams tasks. 


Anoth found that although females 
took le for success, they assumed less 
blame foi re than did males (Feather, 
1969). 


Tt is possible that these sex differences were 
not due to differences in egotism but rather 
were due to differences in the way the tasks 
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were viewed by males and females. If the 
females had lower expectancies for success 
than the males, as many studies have indi- 
cated (Crandall, 1969; Deaux & Emswiller, 
1974; Feather, 1969; Feather & Simon, 
1973; House, 1974; Montanelli & Hill, 1969), 
sex differences in attribution would be likely 
to occur, but the differences would not be a 
consequence of sex differences in egotism, 
Differences in egotism could be established by 
demonstrating that members of one sex take 
more credit for success than observers give 
them and assume less blame for failure than 
observers give them. Although there are 
studies in which the attributions of actors and 
observers for success and failure have been 
compared, none have used subjects of both 
sexes. The studies that have compared the at- 
tributions of actors and observers have indi- 
cated that males tend to make egotistical at- 
tributions (Snyder et al., 1976; Streufert & 
Streufert, 1969; Wolosin et al., 1973), The 
results for females are less consistent. Beck- 
man (1970) and Ross, Bierbrauer, and Polly 
(1974) found evidence for egotism, whereas 
Beckman (1973) and Stephan (1975) ob- 
tained mixed results. 

Although this last set of studies also sug- 
gests that females may be less egotistical than 
males, two differences in the methodologies of 
these studies may account for the discrepan- 
cies in their results. First, all three of the 
studies comparing male subjects used com- 
petitive tasks, whereas none of the above 
studies comparing female actors and ob- 
servers involved competition. It is possible 
that the ego involvement and the implicit so- 
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cial comparison induced by competition would 
lead females as well as males to make egotisti- 
cal attributions. Second, the male observers in 
these studies participated in the tasks, but the 
female observers did not perform the tasks. 
It is likely that more egotism would be ap- 
parent when the attributions of actors are 
compared with those of observers who are 
working on the same task (involved observ- 
ers) than when the actors’ attributions are 
compared with those of uninvolved observers. 
This is because involved observers are more 
likely than uninvolved observers to compare 
themselves to the actor and, hence, should be 
more likely to make ego-enhancing attribu- 
tions. 

The primary purpose of the present study 
was to examine the ego-enhancing tendencies 
of males and females in identical situations to 
determine whether there are indeed sex dif- 
ferences in egotism. A secondary purpose of 
this study was to compare the attributions of 
female actors for their own behavior to those 
that male actors make for their behavior. 
Based on the findings of Nicholls (1975), 
Levine and Reis (in press), and Feather and 
Simon (1973), it was expected that female 
actors would take less credit for success and 
more blame for failure than males. 

Finally, within this design, it is also pos- 

sible to compare the attributions that ob- 
servers make for the outcomes of females to 
those that they make for the outcomes of 
males, Deaux and Emswiller (1974) found 
that observers attributed the success of males 
on a masculine task to skill while attributing 
the success of females on that task to luck. 
Deaux and Taynor (1973) obtained a similar 
pattern of results when observers were asked 
to account for a good performance by a male 
or a female, However, when the observers 
were asked to account for a poor perform- 
ance, males were viewed as being less com- 
petent than females, In both of the experi- 
ments, the observers were not directly af- 
fected by the outcome of the actor’s behavior 
and were, therefore, relatively uninvolved. 
The present experiment provides an op- 
portunity to determine if there are differences 
in the attributions that involved observers 
make for the outcomes of male and female 


actors. 
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METHOD 

Overview 

Using the procedure developed by Snyder & al, 
(1976), male and female subjects played one ther 
at a competitive matrix game. The outcome the 
game was manipulated so that one half of th. sub- 
jects won and the other half lost. Subjects we: ‘hen 
asked to assess the role of skill, luck, task dif’ alty, 
and effort in determining both their own ov. mes 
and the outcomes of their opponents. 
Subjects 

One hundred fifty-five male and female unde: ad- 
uates at the University of Texas at Austin p tici- 
pated in this study in partial fulfillment of th: re- 
quirements for an introductory psychology © irse. 
The data from 11 of the subjects were disc: led. 
Three subjects were omitted from the analyse be- 
cause of errors in the false feedback, 7 because hey 
expressed suspicion of the false feedback duriny de- 
briefing and 1 because he did not understand th. de- 
pendent measure. After dropping these subjects | om 


the analyses, the number of subjects per cell vai od 
from 16 to 19, 


Procedure 


Subjects arrived in groups of four, Either a r 
or a female experimenter explained that various 3- 
pects of decision making were going to be examir d. 
The subjects were told that each of them would b 
playing against one of the other three subjects ii a 
number matrix game that simulated real-life decisi: n- 
making situations. They were also told that in or er 
to make the game more realistic and interesting t! y 
would receive a penny for every point they had at 
the end of the game. 

The experimenter then showed the subjects a san- 
ple matrix (see Figure 1) and explained the game 
Player A could choose any one of the three row, 
and Player B could select any of the three colum: s. 
The cell at the intersection of the chosen row and 
column determined both players’ outcomes on tat 
move. Player A received the number of points abcve 
the diagonal in that cell and Player B received ‘he 
number of points below the diagonal. The subjects 
were told there would be a total of nine move: in 
the game, three moves on each of three different 
matrices, Each player started the game wit 50 
points and was told that he/she would lose nothing 
if he/she finished the game with fewer than 0 points. 
The subjects were given the three matrices onc at a 
time. It was suggested that they attempt to outguess 
their opponent on each of their three moves for that 
matrix. The subjects believed that they were making 
their selections at the same time as were their op- 
ponents. In order to make the subjects’ outcomes 
salient, they were asked to record the number of 
points they and their opponent had received after 
completing each move, 


> 
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EGOTISM IN MALES AND FEMALES 


wbjects were assigned to one of four com- 


Fhe 
petition conditions: a male subject playing a male op- 
` ponen‘, a female subject playing a female opponent, 
a m2'™ subject playing a female opponent, or a fe- 


mal ¿ject playing a male opponent. The last two 
condit ns were differentiated so that the attributions 


of m. ¿s for mixed-sex games could be analyzed 
separ’ icly from the attributions of females for mixed- 
sex mes. The experimenter told the subjects whom 


th. would be playing against and then took them 
aie cubicles, ostensibly to prevent them from 
s of their opponent, The subject was 
that he/she would be Player A. A 
the number of moves the subject expected 
t as administered at that point. 
subject was given false feedback about the 
Tr of the alleged opponent. Those assigned to 
ie su cess condition were given feedback that indi- 
ted they won seven of the nine moves in the game 
(Co ing only Moves 2 and 6 to the opponent). Sub- 
jec in the failure condition were told that they had 
Jost even of the nine moves, winning only Moves 
FLO, 
ter the game had been played and the final scores 
computed, each subject was asked to complete a 
questionnaire evaluating the extent to which each of 
fi factors affected the outcome of the game. The 
factors were skill, luck, task difñculty, and effort 
(Heider, 1958; Weiner, Frieze, Kukla, Reed, Rest, & 
Rosenbaum, 1971). For each factor, the subjects first 
checked whether the cause helped, hindered, or had 
no effect on their outcome. They then marked the ex- 
tent of the effect, if any, on a 7-point scale that 
1 from very little effect (1) to very great effect 
(©). Che subjects completed this questionnaire twice, 
once to explain their own outcome and once to ex- 
plain the outcome of their opponent. The subjects’ 
explanations of their own outcomes served as the 
measure of the actors’ attributions for their out- 
comes. The subjects’ explanations of their opponents? 
outcomes served as the measure of the observers’ at- 
ibutions for the actors’ outcomes. 

The order in which the two questionnaires were 
given was counterbalanced within each cell, After 
completing the attribution questionnaire, the subjects 
completed a final questionnaire assessing their mood, 
This questionnaire contained three items concerning 
how the subjects reacted to the game. The items were 
11-point scales marked at each endpoint with the 
following bipolar adjectives: happy-sad, excited- 
very interesting game-very uninteresting 
game. Finally, each subject was debriefed, and his or 
her suspicion was assessed by asking a series of in- 
creasingly direct questions concerning the subject’s 
responses to the outcome feedback on the games. The 
subjects were also asked if they knew their opponent 
personally ; none did. 


RESULTS 


Four-way unweighted-means analyses of 
variance were performed on the final question- 
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naire data, the four factors being sex of ex- 
perimeter, sex of subject, sex of opponent, 
and outcome (success or failure), Since the 
order in which the questionnaires were ad- 
ministered was counterbalanced, order was 
not included in this analysis. There were no 
significant differences attributable to the sex 
of the experimenter. 

The analyses of the final questionnaire re- 
vealed that subjects assigned to the success 
conditions relative to subjects in the failure 
conditions reported that the game made them 
more excited, F(1, 128) = 22.15, p < .001, 
happier, F(1,128) = 54.77, p < .001, and 
was more interesting, F(1, 128) = 45.52, p < 
.001. The results from these mood items pro- 
vide strong evidence for the impact of the 
outcome manipulation. Subjects in the success 
condition clearly enjoyed the game more than 
subjects in the failure condition. Before the 
matrix games were played, males expected to 
win significantly more games than did fe- 
males, F(1, 128) = 5.71, p < .05. It was also 
found that the subjects had higher expectan- 
cies of success when the experiment was con- 
ducted by a male experimenter than when it 
was conducted by a female experimenter, 
F(1, 128) = 11.05, p < 01. 


Attribution Data 
Five-way unweighted-means analyses of 
variance were performed on the attribution 
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TABLE 1 


ATTRIBUTIONS TO SKILL, EFFORT, 
Task DIFFICULTY, AND Luck 


Dependent variable 


Task 
difti- 
Attributions Skill Effort culty Luck 
Male subjects 
Own outcome 
Male opponent 
Own success 4.5 4.3 1.3 28 
Own failure 1.2 £ 6 39 
Opponent’s outcome 
Male opponent 
Opponent’s success 2.7 2.0 KESA 
Opponent’s failure 3.0 A Bis 39 
Own outcome 
Female opponent 
Own success 41 3.2 A 3.6 
Own failure 2.0 1.1 3 42 
fprponsatis outcome 
emale opponent 
Opponent’s success 3.0 3.2 9 49 
Opponent’s failure 3.2 24 L2 33-2 
Female subjects 
Own outcome 
Male opponent 
Own success 3.3 3.3 73.6 
Own failure 34 5 Toii 
Opponent’s outcome 
Male opponent 
Opponent’s success 3.8 3.2 1:0" 3:2 
Opponent’s failure 1.4 24 8 26 
Own outcome 
Female opponent 
Own success 2.9 3.3 hy DA @ 
Own failure 1.0 5 A 5.0 
Opponent’s outcome 
Female opponent 
Opponent’s success 2.5 9 BSS 
Opponent’s failure 24 1.1 6 30 


Note. The more positive the number, the more the outcome 
was attributed to that fact ubjects who succeeded made 
attributions for their own success and their opponent's failure, 
whereas subjects who failed made attributions for their own 
failure and their opponent's success, 


data, the fifth factor being role (actor ob- 
server). The fifth factor is a within-subjects 
factor, since the subjects contributed two 
pieces of data to the analyses, their attribu- 
tions for their own success or failure and their 
attributions for the failure or success of their 
opponent. 

The subjects’ attributions to skill, effort, 
task difficulty, and luck were transformed to 
indicate whether they felt the factor facili- 
tated or inhibited their ultimate outcome 
(Snyder et al., 1976). If subjects felt that a 
factor contributed to their outcome, its effect 
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was scored positively. If they felt tha’ the 
factor inhibited their ultimate outcome, ` was 
scored negatively. Hence, factors that -ere 


seen as helping successful subjects or hi: ‘er- 
ing unsuccessful subjects were scored ; >si- 


tively to indicate that they were respon: ‘ble 
for the subjects’ outcomes. Factors that vere 
seen as hindering successful subjects or ` 1p- 


ing unsuccessful ones were scored negati ¢ 
because they were perceived as wo! 
against the subjects’ outcomes, The mean. ‘or 
skill, effort, task difficulty, and luck are 
sented in Table 1. 

In order to obtain an overall measur: of 
egotism, the individual measures were c 
bined to form a composite index reflecting he 
emphasis placed on internal as opposed to 
ternal factors. When actors emphasize 
ternal as opposed to external factors in 
counting for their success (and they do so o 
a greater degree than do observers), th 
are making ego-enhancing attributions. Wh: 1 
actors emphasize external factors over iu 
ternal factors in accounting for their failure 
(and they do so to a greater degree than d+: 
observers), they are making ego-defensive : 
tributions. The composite index was co - 
structed by adding the subjects’ attributio: s 
to the internal factors (skill and effort) an | 
subtracting from that sum their attribution: 
to the two external factors (task difficult ’ 
and luck), Positive scores on this index indi 
cate a greater emphasis on internal factors 
over external ones, whereas negative numbers 
indicate a greater emphasis on external fac- 
tors over internal ones. 


TABLE 2 


COMPOSITE INDEX OF 
INTERNAL-EXTERNAL ATTRIBUTIONS 


Attributions for Attributions fo: 
own outcome opponent's outco!e 
Sex of Op- Öp | 
subject- Own Own ponent’s ponent's 
opponent success failure success faili 
Male-male 4.8 —2.4 =2 
Male-female 3.2 —14 2 
Female-male 1.3 A 2.7 
Female-female 1.3 —3.8 —2.9 


Note. Positive numbers indicate predominantly inter 
tributions, and negative numbers indicate predominantly ¢X- 
ternal attributions. Subjects who succeeded made attributions 
for their own success and their opponents’ failure, while sub- 
jects who failed made attributions for their own failure and their 
opponents’ success. 


a Se ee 


Results on ihe Index of Internal-External 
Attributions 


Since egotism exists to the degree that ac- 
tors attribute their success more internally 
and their failure more externally than do ob- 
servers, it is reflected in the interaction be- 
tween outcome and role. In this study two 
additional factors, sex of subject and sex of 
opponent, were included to determine their 
effects on egotisin. The interaction of sex of 
subject, sex of opponent, outcome, and role is 
the appropriate test to examine the effects of 
the sex of the actor and the opponent on 
egotism) The interaction among these vari- 
ables was significant, F(1, 128) = 4.24, p< 


.05. An analy: of the individual measures of 
skill, luck, t: ifficulty, and effort revealed 
that this efi as due primarily to the sig- 
nificant int: n for skill (p < .05), As can 
be seen in > 2, males playing against 
males, male: ying against females, and fe- 


males playing against females all attributed 
their success more internally than the success 
of their opponents, and they attributed their 


fa ernally than the failure of 
thei: F(1, 128) = 6.47, p< .05, 
F(i, i , P< .05, and F(1, 128) = 
9.03, p - pectively. However, females 
playing again. males did not display this 
egotistical pai..:n (F < 1.0)2 


There were three additional significant ef- 

fects from the five-way analysis. First, success 

_/ was attributed more internally than failure, 
" F(1,128)= 27.68, p < 001. This effect was 
significant for the subjects’ attributions to the 
asures of skill (p < .01), ef- 


indi: 

fort | ind luck (p < .01). Second, 
there w ificant Sex of Subject X Sex 
of Oppon: :teraction, F(1,128) = 5.16, 
$ < 05. Ma nade more internal attribu- 


tions when pl:»ing females than when they 

= played males, while females made more in- 
À ternal attributions when playing males than 
when they played females. This effect was 
significant for effort (p < .05). Third, a sig- 
nificant Role X Outcome interaction indicated 

that overall the subjects made egotistical at- 

| tributions, F(1, 128)= 12.06, p < .001. The 
| subjects made more internal attributions for 
I their own success than they did for their op- 
ponents’ success, but they made more ex- 
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ternal attributions for their own failure than 
they did for their opponents’ failure, The 
effect was significant for skill (p < .05), 
effort (p < .01), and luck (p < .05). 


Actors 


In order to compare the attributions that 
the males made for their own outcomes to 
those that the females made for their out- 
comes, contrasts were drawn from the overall 
five-way analysis. These contrasts examined 
only the attributions the subjects made for 
their own outcomes. A significant Sex of Sub- 
ject X Sex of Opponent X Outcome interac- 
tion was found in this analysis, F(1, 128) = 
4.95, p<.05. An examination of the cell 
means (Table 2) shows that this interaction 
was due to the fact that males playing males, 
males playing females, and females playing 
females all attributed success more internally 
than failure, F(1, 128) = 25.53, p< 01, 
F(1, 128) = 9.20, p < .01, and F(1, 128) = 
11.30, p < .01, respectively. Females playing 
females did not attribute success more in- 
ternally than failure (F < 1.0). The only 
other significant effects from this analysis 
were an outcome main effect, F(1, 128) = 
33.98, p < .01, and a marginally significant 
Sex of Subject X Outcome interaction, F(1, 
128) = 3.58, p < .06, indicating that females 
tended to attribute success less internally and 
failure less externally than males. 


Observers 


The final question to be examined is 
whether there were differences in the attribu- 
tions that the involved observers made to 
account for the outcomes of male and fe- 
male actors. Contrasts of the attributions that 
the subjects made to account for their op- 
ponents? outcomes were used to test for these 
differences. There was a significant Sex of 
Opponent X Outcome interaction, F(1, 128) 
= 3.94, p<.05. The success of male op- 
ponents was attributed more to internal fac- 


1 An analysis of covariance, using expectancy as a 
covariate, did not affect the significance levels of any 
of the reported findings. This was due to the fact 
that expectancies were not significantly related to the 
attributions made by the subjects. 
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tors than the success of female opponents, 
whereas the failure of male opponents was 
attributed more to external factors than the 
failure of female opponents. 


Discussion 


The results of the analyses comparing the 
attributions the subjects made for their 
own outcome with those they made for their 
opponent’s outcome showed that males play- 
ing males, males playing females, and females 
playing females all made egotistical attribu- 
tions. That is, they all took more credit for 
their success and assumed less blame for their 
failure than they gave their opponents. Only 
females playing against males failed to make 
egotistical attributions. 

These results can be explained in terms of 
Heider’s (1958) “naive analysis of action.” 
He suggested that causal attributions must 
satisfy two criteria: “(1) The reason has to 
fit the wishes of the person, and (2) the 
datum has to be plausibly derived from rea- 
son” (p. 172). In the present situation, it is 
likely that the subjects wished to protect and 
enhance their egos. They could achieve this 
goal by taking more credit for success and less 
blame for failure than they gave their op- 
ponents for their success or failure. 

The subjects’ perceptions of their relative 
competence at this type of task may have 
determined whether egotistical attributions 
were reasonable, Because the task was a com- 
petitive mathematical game, it is likely that 
it was perceived as being a masculine task 
(Broverman, Vogel, Broverman, Clarkson, 

& Rosenkrantz, 1972). This suggestion is sup- 
ported by the fact that males expected to do 
better than did females. The males appar- 
ently believed that they had the ability to do 
well on the task. Thus, it would have been 
reasonable for them to take more credit for 
success and less blame for failure than they 
gave their opponents. Females, on the other 
hand, apparently felt that they were less 
competent at this task than males. If so, it 
would have been unreasonable for females to 
take more credit for success and less blame 
for failure than they gave their male op- 
ponents. Hence, they should not have made 
egotistical attributions when their opponents 
were males. The females playing females, how- 
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would have been justified in thinking 
that they were as competent as their op- 
ponents. Thus, they should have been free to 
make egotistical attributions for their suc- 
cess and failure. Differences in perceived com- 
petence can also explain the parallel set of 
findings for the actors, which indicated that 
males playing males, males playing females, 
and females playing females attributed their 
success more internally than their failure, 
whereas females playing males did not, 

Two additional factors may have ‘con- 
tributed to the lack of egotism displayed by 
the females playing males. First, the females 
may have been less ego involved in the out- 
come of the game than the males. If the task 
was perceived as a masculine one, then the 
females were probably less concerned with 
doing well on it than the males were. Sec-’ 
ond, the females in this condition lacked an 
appropriate comparison other. A study by 3 
Zanna, Goethals, and Hill (1975) indicates 5 
that females tend to compare themselves to ~ 
other females but not to males. It^is likely 
that social comparison of abilities facilitates ~ 7 
the egotistical attribution process. Thus, the 
absence of an appropriate other to whom the 
females playing males could compare them- 
selves may have reduced the tendency to 
make egotistical attributions. 

The results for the observers indicated that 4 
the outcomes of male and female opponents |“ 
were viewed very differently. Both male and £ 
female observers gave male opponents more = * 
credit for success than they gave female op- 


ever, 


Te 


d 
| 
) 


ponents, and they tended to blame males less 
for failure than females. The results for suc- 
cess replicate the findings of Deaux and 
Taynor (1973) and Deaux and Emswiller 
(1974), who used uninvolved observers. The 
results for success and failure in the present 
study and the results for success from the d 
Deaux and Emswiller study can be inter- © 
preted in terms of the perceived competence 
of the male and female actors. Since the 
tasks used in these two studies were prob- 
ably perceived as being masculine tasks, it is ¥ 
likely that the male actors were thought to 4 
have more ability than the female actors. | 
Therefore, success by males would be ex- 
pected to be attributed more to internal fac- 
tors than success by females, and failure by | 


$ 


males would be expected to be attributed less 
to internal factors than failure by females. 
The results for failure in the present study 
differ from those obtained by Deaux and 
Taynor (1973). They found that males were 
perceived as being less competent than fe- 
males when the stimulus persons performed 
poorly in an interview. There are a variety of 
differences between their study and the pres- 
ent one, including differences in the indepen- 
dent and dependent variables and the fact 
that their observers were uninvolved. It is 
difficult to determine the degree to which each 
of these differences might have contributed 


to the liscrepancies between the results. 

In cœ lusion, the results from the present 
study ost that females may not be less 
egotist than males. Although females did 
got make egotistical attributions when they 
-competed against males, they did make ego- 
tistical attributions when they competed 


against females. These findings suggest that 
females have egotistical tendencies, but these 

J tendencies may not be apparent when they 
»make attributions for their outcomes on a 
ma-culine task. Females may feel less com- 
petent than males when their outcomes are 
relatively unimportant to them, particularly 
if they cannot compare their own performance 
with the performance of other females. 


REFERENCES 


Beckman, L. Effects of students’ performance on 
teachers’ and observers’ attributions of causality. 
Journal of Educational Psychology, 1970, 61, 76-82. 

Beckman, L. Teachers’ and observers’ perceptions of 
causality for a child’s performance, Journal of 
Educational Psychology, 1973, 65, 198-204. 

Broverman, I. K., Vogel, S. R., Broverman, D., 
Clarkson, F. E., & Rosenkrantz, P. S. Sex-role 
stereotypes: Current appraisal. Journal of Social 
Issues, 1972, 28, 59-78. 

Crandall, V. C. Sex differences in expectancy of in- 


tellectual and academic reinforcement. In C. P. 

Smith ‘4, Achievement-related motives in chil- 

dren York: Russell Sage Foundation, 1969. 
Deaux, K., & Emswiller, T. Explanations of success- 


ful performance on sex-linked tasks: What’s skill 
for the male is luck for the female. Journal of Per- 
sonality and Social Psychology, 1974, 29, 80-85. 

Deaux, K., & Taynor, J. Evaluation of male and fe- 
Ihale ty: Bias works both ways. Psychological 
Reports, 1973, 32, 261-262. 


EGOTISM IN MALES AND FEMALES 


1167 


Feather, N. T. Attribution of responsibility and val- 
ence of success and failure in relation to initial con- 
fidence and task performance. Journal of Person- 
ality and Social Psychology, 1969, 13, 129-144, 

Feather, N. T., & Simon, J. G. Fear of success and 
causal attributions for outcome. Journal of Per- 
sonality, 1973, 41, 525-542. 

Heider, F. The psychology of interpersonal relations, 
New York: Wiley, 1958, 

House, W. C. Actual and perceived differences in 
male and female expectancies and minimal goal 
levels as a function of competition. Journal of Per- 
sonality, 1974, 42, 493-509. 

Levine, R., & Reis, H. Fear of failure in males: A 
more salient factor than fear of success in females, 
Sex Roles, in press, 

Miller, D. T., & Ross, M. Self-serving biases in the 
attribution of causality: Fact or fiction? Psy- 
chological Bulletin, 1975, 82, 213-225. 

Montanelli, D. S., & Hill, K. T. Children’s expecta- 
tions and performance as a function of two con- 
servative reinforcements. Journal of Personality 
and Social Psychology, 1969, 13, 115-128. 

Nicholls, J. Causal attributions and other achieve- 
ment related cognitions: Effects of task, outcome, 
attainment value, and sex. Journal of Personality 
and Social Psychology, 1975, 31, 379-389, 

Ross, L., Bierbrauer, G., & Polly, S. Attribution of 
educational outcomes by professional and nonpro- 
fessional instructors. Journal of Personality and 
Social Psychology, 1974, 29, 609-618, 

Snyder, M., Stephan, W. G., & Rosenfield, D. Ego- 
tism and attribution. Journal of Personality and 
Social Psychology, 1976, 33, 435-441. 

Stephan, W. G. Actor vs. observer: Attributions to 
behavior with positive and negative outcomes and 
empathy for the other role. Journal of Experi- 
mental Social Psychology, 1975, 11, 205-214. 

Streufert, S., & Streufert, S. C. Effects of conceptual 
structure, failure, and success on attribution of 
causality and interpersonal attitudes. Journal of 
Personality and Social Psychology, 1969, 11, 138- 
147. 

Weiner, B., Frieze, I, Kukla, A., Reed, L., Rest, S., 
& Rosenbaum, R. Perceiving the causes of success 
and failure. New “Ne General Learning Press, 
1971. 

Wolosin, R. J., Sherman, s. J., & Till, A. Effects of 
cooperation and competition on responsibility at- 
tribution after success and failure. Journal of Ex- 
perimental Social Psychology, 1973, 9, 220-235, 

Zanna, M. P., Goethals, G. R., & Hill, J. F. Evaluat- 
ing a sex-related ability: Social comparison with 
similar others and standard setters. Journal of 
Experimental Social Psychology, 1975, 11, 86-93. 


(Received March 5, 1976) 


Journal of Personality and Social Psychology 
1976, Vol. 34, No. 6, 1168: 1176 


A Social-Learning Analysis of Similarity 
Effects in Imitation Research 


Kay Bussey and David G. Perry 
University of Queensland, Brisbane, Australia 


This research tested the hypothesis that the degree to which an observer and 
model are subject to similar reinforcement contingencies is a potent determinant 
of imitation. Children first participated in a premodeling phase in which the 
correspondence between the consequences received by them and an adult for 
performing similar and diverse responses was manipulated. The children then 
watched the model perform new responses that incurred reward, punishment, 
or no reinforcing consequence. On a test for imitation of these responses, chil- 
dren who had experienced reinforcing contingencies similar to the model's im- 
itated more of the model’s responses that had incurred either reward or no 
consequence, but fewer of the responses that had incurred punishment, than 
children whose contingencies had been uncorrelated with, or opposite to, the 
model’s, A second experiment indicated that the tendency of the former group 
to imitate responses incurring no consequence persisted when subjects could not 
infer the lack of consequence to imply reward, It is suggested that in natural- 
istic settings people associate similarity to a model with sharing reinforcement 
contingencies with the model, and this process might mediate effects of similar- 


Sharing Reinforcement Contingencies with a Model: 


ity on imitation. 


Both children and adults tend to imitate 
similar rather than dissimilar models (Hartup 
& Coates, 1967; Maccoby, 1959; Rosekrans, 
1967; Stotland & Patchen, 1961). Bandura 
(1969) recently offered a theoretical account 
of the similarity effect in terms of social-learn- 
ing principles. In his view, during socializa- 
tion, children encounter some people with 
whom they share analogous reinforcing con- 
tingencies. This condition occurs when a child 
and another person receive the same conse- 
quences from the social environment for per- 
forming a given response. Because socializing 
agents arrange common outcomes for people 
who share certain attributes (sex, age, etc.), 
children learn to associate similarity cues with 
the condition of analogous reinforcing con- 

tingencies. 

If similarity cues achieve this signal func- 
tion, then it is easy to appreciate a child’s 
imitation of a similar model when he sees the 


The authors thank the Department of Education, 
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model incur positive consequences for hi be- 
havior, This is because the child can afer 
that he, too, will be rewarded for perfo: cing 
the behavior. But why should a child in ate 
a similar model in the absence of any res, nse 
consequences to the model? This is in act 
the finding of several studies: Similarity ro- 
duces a generalized tendency to imitate hat 
persists even in the absence of explicit vi ari- 
ous reward. This can be explained if it is 
assumed that when a child observes a model 
performing in the absence of vicarious con- 
sequences, the child infers that the mod:! is 
performing responses that he has foun. on 
the basis of prior reinforcement experi ices 
in similar situations to be appropriate to the 
occasion. It follows that to the extent © per- 
son sees himself as sharing similar reinforcing 
contingencies with a model, he should infer 
that the model’s behavior is also likely to be 
appropriate for himself. Consequently, in situ- 
ations in which direct and vicarious conse- 
quences are absent and cannot be used to 
infer the appropriateness of one’s behavior, 
an individual turns for behavioral example to 
models with whose behavior he believes the 
appropriateness of his own to be correlated. 
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4 7 A SOCIAL-LEARNING ANALYSIS OF SIMILARITY EFFECTS 


a 

{ Dus principal purpose of this research was 
7 to manipulate experimentally the degree to 
Se 

f 


* which subject and model share analogous rein- 
fore’ug contingencies and to assess the impact 
o variable on children’s imitation. The 
lvpothesis was that children who per- 
heir behavior to be under the same 
ing contingencies as a model’s would 
shey à imitation, even in the absence of 
eyplicit vicarious reward. 
tion of the preceding hypothesis 
w indicate that the perception of analog- 
-inforcement contingencies increases 
ition, This would be an important find- 
t would not necessarily tell us that 
itally established similarity cues 
mitation because subjects make 
f: assumptions about analogous re- 
cnt contingencies. One way to test 
atter possibility is by assessing the ef- 
yo £ cts of vicarious consequences on children’s 
f imitation of similar (e.g., same-sex) and dis- 
similar (e.g., cross-sex) models. If children 
believe that a consequence (reward, punish- 
ment) accruing to a similar model is espec- 
ialy likely to be accorded to themselves 
should they perform the response, then their 
spontaneous imitation of similar models 
be particularly influenced by manip- 
iations of vicarious consequences.* 

This hypothesis was tested by examining 
the effects of vicarious consequences on 
children’s imitation of same- and opposite-sex 

models. In several recent studies, it has been 

und that children attend more closely to 

(Siaby & Frey, 1975) as well as learn more 

about the behavior of models that are con- 

| sistent with their own sex-role (Grusec & 

Brinker, 1972; Perry & Perry, 1975).? One 

© interpretation of such a finding is that imita- 

j tive learning is facilitated by perceptions of 
| 
| 
| 


main 
ceived 
reinf 


Confirm. 


ing, b 


inf 


t 


similarity (Kohlberg, 1966; Slaby & Frey, 
1975). The present experiment systematically 
varied sex of subject and sex of model as 
well as response consequences to the model. 
Thus it provided a test of the hypothesis 
that children’s imitation is more strongly in- 
fluenced by vicarious consequences admin- 
istered to a same-sex model than by vicarious 
consequences given to an opposite-sex model. 


a 
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A final aim of the research was to test 
the hypothesis that the very strong influence 
that manipulations of vicarious consequences 
exert on imitation is attributable in large part 
to the extent to which a subject believes that 
a consequence accorded to a model is also 
likely to accrue to himself if he should per- 
form the behavior, Although several theorists 
have stressed the importance of vicarious con- 
sequences for imparting information regarding 
the relative permissibility of modeled re- 
sponses for the subject himself (Bandura, 
1965, 1969; Liebert & Fernandez, 1970), no 
experiment to date has directly tested the 
validity of this assumption. 

The first experiment explored these issues. 
Children first participated in a premodeling 
phase in which the degree of correspondence 
between the outcomes they and an adult re- 
ceived for performing given responses was 
manipulated. Subsequently during a modeling 
phase, children watched the adult make new 
responses, some of which received a vicarious 
consequence and some of which did not. Fi- 
nally, children were tested for spontaneous 
imitation (and recall) of the responses the 
adult made during the modeling phase. It was 
hypothesizezd that the more the subject and 
the model were subject to analogous reinforc- 
ing contingencies, the more the subject would 
imitate the model even in the absence of a 
vicarious consequence. Furthermore, the ef- 
fects of vicarious consequences were hypothe- 
sized to be greatest for subjects who share 
analogous reinforcing contingencies with the 
model. Finally, the effects of vicarious con- 


1A study by Rosekrans (1967) failed to find 
stronger effects of vicarious consequences when sub- 
jects were exposed to a similar model, but, as Rosc- 
krans pointed out, it is likely that the vicarious con- 
sequences manipulation was ineffective. 

2 These effects appear to be stronger for boys and 
children of either sex with masculine sex-role 
preferences. 

Maccoby and Jacklin (1974) dispute the claim 
that children are more prone to imitate same-sex 
models, citing a number of imitation studies in which 
no Sex of Subject X Sex of Model interaction was 
found. Although only a hypothesis, it may be the 
case that such an interaction is found only in 
studies in which subjects are exposed simultaneously 
to displays by same- and cross-sex models. The 
three studies cited here were of this sort. 
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sequences were hypothesized to be greatest 
for subjects exposed to a like-sex model. 

It was suggested above that a major rea- 
son underlying the effects of vicarious conse- 
quences on imitation is the assumption by 
subjects that they and the model are subject 
to the same reinforcing contingencies in the 
experimental situation. If this is so, then a 
control group of subjects who forgo the pre- 
modeling phase but are placed directly into 
the modeling phase should behave similarly 
to a group of subjects whose reinforcing out- 
comes are perfectly correlated with the 
model’s in a premodeling phase. This means 
that adequate tests of the hypotheses outlined 
previously cannot be made by simple com- 
parisons between a no-premodeling condition 
and a premodeling condition in which a high 
degree of correspondence between subject and 
model reinforcement contingencies is made ex- 
plicit. Rather, the hypotheses must be tested 
via the inclusion of experimental treatments 
designed to undermine subjects’ confidence 
in the assumption that there necessarily exists 
a perfect correlation between these two 
events. Nevertheless, a no-premodeling con- 
dition was included in the design to provide 
a direct test of the hypothesis that unless in- 
formed explicitly to the contrary subjects as- 
sume in experimental settings that the con- 
dition of analogous reinforcing contingencies 
obtains. 


EXPERIMENT 1 
Method 


Participants. Subjects were 128 boys and girls from 
the third grade of a school in an upper-middle-class 
suburb of Brisbane, Australia. An adult female and 
an adult male served as models, and an adult female 
served as experimenter. 

Procedure overview. The modeling part of the ex- 
periment was in two phases. During an initial pre- 
modeling phase, the subject and model indicated 
choices on a preference task and received conse- 
quences (reward, punishment) from the experimenter 
for their choices. In an analogous contingencies 
condition, subject and model received the same con- 

sequence for similar choices but different conse- 
quences for different choices. In an uncorrelated 
condition, there was no relationship between subject 
and model contingencies. In an opposite contingen- 
cies condition, subject and model received different 
consequences for similar choices and the same con- 
sequences for different choices. A fourth group did 
not undergo this phase. Subsequently, subjects un- 
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s 
derwent the main modeling phase. Here :=bLjects ob. Ù 
served the model make a series of choice:, one third 
of which incurred reward, another third © inishment, 
and another third no consequences. Fin 'y, imita- 
tion and recall of responses display iring the 
main modeling phase were measured. H the chil- 
dren in each group were run with the ic’ ile model, 
and the other half with the male model. 

Stimuli. Stimuli consisted of 42 color ‘des, each 
of which depicted two objects of a simil- sort (e.g, 
horse and cow, apple and banana). Tw slide sets 
were used, 24 during the premodeling p} and 18 
during the main modeling phase. With’ cach set 
the order of slides had been randomly « ‘crmined, 
and the same order was used for all subje 4 

Procedure. In the premodeling phase, experi- 
menter explained that she was interested learning 
people's preferences for things and that would 
ask the child and the adult to indicate { each of 
a series of slides which of two items was eferred. 
Both model and subject were to indicate th » choices 
by depressing one of two buttons on esponse 
box, the left button for the left-hand iter and the 
right one for the other one. When both m. del and 
subject had decided on their choices (wh. n were 
displayed to the experimenter via lights on : panel ; 
before her), each was requested to indicate h choice ` 
by pointing to it with a pointer. The pu ose of 
the button pressing was to permit control «ver the 
model's responses. On each trial, the subject choice 
was displayed to the model via a light on hi. panel. 
On half the trials, in an unpredictable but p. deter- 
mined order, the model stated a preference >r the 
same item the subject chose, whereas for th- other 
half he indicated a preference for the ot) one. 
This equated premodeling groups on simil: ‘y of 
choice behavior for model and subject. 

For each slide, the model was asked to indicate 
his preference first, He was either rewarded © pun- 


ished, and then the subject gave his preference aud 
was either rewarded or punished. For both model 
and subject, reward consisted of the experimenter 
saying, “Yes, you’ve made a good choice,” whereas 


punishment consisted of, “No, you've made 1 bad 
choice.” The consequences administered to model 
and subject varied according to treatment. T} con- 
sequences accruing to subjects are spelled ov as a 


function of experimental group, choice behav 
model consequence in Table 1. In the analogc 
tingencies condition, when model and subje 
cated the same choice, they received the sar 
sequence (half the time, reward; half the time, 
punishment). When they made different responses, 
one was rewarded and the other one punished (half 
the time, one way; the other half, the reverse). 

In the uncorrelated condition, the model’s choice 
and consequence did not reliably indicate the ap- 
propriateness of the response for the subject. Now, 
when mode] and subject made either the same of 
different responses, they received the same conse- 
quence half the time and different consequences the 
other half. In each case, both model and subject re-) 
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TABLE 1 
SCHEDULE OF CONSEQUENCES ADMINISTERED TO SUBJECTS IN THE PREMODELING PHASE 
PR cai sabja Relationship of model and subject contingencies ri'en 
choice beha Analogous Uncorrelated Opposite 
Same choice 
Model rewarded Reward Half reward; half punishment Punishment 
Model punished Punishment Half reward; half punishment Reward 
Different choic 
Model rewarded Punishment Half reward; half punishment Reward 
Model punished Reward Half reward; half punishment Punishment 


ceived reward half the time and punishment the 
other half. 

Finally, in the opposite contingencies condition, 
when model and subject made the same response, 
one received reward and the other received punish- 
ment (with an equ umber of rewards and punish- 
ments accruing t 1). When they made different 
responses, they b received the same consequence 
(half the time, r the other half, punishment). 

The procedures o» ined above controlled for two 
variables that wer tential contaminators of treat- 
ments. First, the uber of trials on which model 
and subject receiy : the same consequence (regard- 
less of choice behavior) was equated across the three 
experimental groups. This controlled for the extent 
to which model and subject experienced similar af- 


fectiv temporal contiguity, which has 
been su t an important determinant of 
empathic vicarious consequences (Aron- 


the frequency with which the 
| (or punishment) for making 
or the model was equated across 
treatments. Previow ‘esearch has demonstrated that 
reinforcement to a subject for behavioral similarity 
to a model is ano(her important determinant of 
imitative behavior (Baer & Sherman, 1964; Grusec 
& Brinker, 1972). 
When the f 
perimenter proc 


freed, 1968 
subject receiv 
the same re: 


slide set was exhausted, the ex- 
ded to the main modeling phase. 


Tt was explained that from then on the model would 
be aske: es for all the remaining slides 
first anc ubject would be asked. The 


hand item half the time and 
ìe other in a single predeter- 
‘able order. One third of his 
choices were rews: 2, another third were punished, 
and another third eived no consequence. In the 
last cases, the experimenter remarked, “We'll go on 
to the next one.” Randomization schedules ensured 
that across subjects each modeled response incurred 
each type of vicarious consequence equally often. 
The decision to use a within-subject manipulation 
of vicarious consequences in the main modeling phase 
was motivated by a concern to avoid a problem 
that hampers between-subject manipulations of this 
variable. Specifically, when the frequency with which 
an agent dispenses a given consequence varies across 
treatments, potential confounds that involve sub- 
jects’ perceptions of the information value of the 


model chos 
the right-han 
mined but unp 


agent’s reinforcers (Cairns, 1967), their perceptions 
of dispositional characteristics of the agent (Babad, 
1972), or their perceptions of potentially threatening 
elements in the experimental situation (Morris, Mar- 
shall, & Miller, 1973) arise. Thus a within-subjects 
treatment of this variable was used. Following this 
phase, the model took a seat toward the back of 
the room and became absorbed in another task. 

For the imitation test, subjects were shown the 
second slide set with the instructions, “For each 
slide I want you to tell me which picture you like 
better.” No consequences were given after the chil- 
dren’s choices. 

The recall test followed. Subjects were shown the 
second slide set with instructions to tell for each 
slide which item the model had liked better, and 
subjects were promised tokens (exchangeable for 
small prizes) for correct answers. In an attempt to 
assess recall of vicarious consequences administered 
in the main modeling phase, half the subjects in 
each experimental group were also asked to report 
what the experimenter had said to the model follow- 
ing each of his choices. Only half the subjects did 
this in case the task interfered with recall of modeled 
responses. 

After responding to questions designed to check 
the effectiveness of the experimental manipulations, 
subjects were told they had performed very well 
and had earned enough tokens to choose a small 
prize. These were distributed to children when test- 
ing was completed, 


Results 


Imitation measure. A 4 X 2 X 2 X3 anal- 
ysis of variance involving three between- 
subjects factors (treatment condition, sex of 
subject, and sex of model) and one within- 
subjects factor (vicarious consequence) was 
performed on subject’s imitation scores, This 
analysis yielded highly significant main ef- 
fects of treatment, F(3, 112) = 4.68, p< 
.005, and vicarious consequences, F(2, 224) 
= 53.13, p < .001. However, the interaction 
of these variables was also reliable, F (6, 224) 
= 11.38, p < .001. The means for this inter- 
action are given in Table 2. 
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The pattern of this interaction supports the 
hypotheses of the study in every respect. 
First, note that control subjects who did not 
undergo the premodeling phase displayed imi- 
tative behavior very similar to that of sub- 
jects for whom reinforcing contingencies in 
the premodeling phase were positively corre- 
lated with the model's. This supports the 
contention that in experimental contexts chil- 
dren generally assume that they are subject 
to the same reinforcement contingencies as 
the model. Note also that the effects of vi- 
carious consequences are by far the strongest 
for these two groups of subjects. In fact, the 
efficacy of vicarious consequences is a direct 
function of the degree of overlap between 
model and subject contingencies, with dif- 
ferences among vicarious consequences means 
steadily decreasing as one moves from these 
two groups to the uncorrelated group to the 
opposite group. The hypothesis that children 
subject to reinforcing contingencies similar to 
the model’s would imitate the model even in 
the absence of a vicarious consequence was 
also supported. This hypothesis was statis- 
tically tested by first combining the data of 
the no-premodeling and analogous contingen- 
cies groups and then testing predicted con- 
trasts among treatment means when no vi- 
carious consequences were administered,.* The 
contrast of the combined groups versus 2 X 
the uncorrelated group proved significant, F 
(1, 112) = 5.74, p < .025, as did the contrast 
between the combined groups and 2 X the 
opposite group, F(1, 112) = 11.34, p < .001. 
However, the contrast between the uncorre- 
lated group and the opposite group was not 
reliable (F < '! 
The only oti 
ance to attain 


-ffect in the analysis of vari- 
lificance was the three-way 


\BLE 2 
IMITATION MEANS FOR INTERACTION OF TREATMENT 
AND VICARIOUS CONSEQUENCE 
Vicarious consequence 
Relationship of model No con- Punish- 
and subject contingencies Reward sequence ment 
No premodeling control 4.09 3.50 1.81 
Analogous 4.31 3.68 1.18 
Uncorrelated 3.28 2.90 1.93 
Opposite 2.21 2.62 2.62 
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TABLE 3 À 
IMITATION MEANS FOR INTERACTION OF SEX q 
or SUBJECT, SEX oF MODEL, ^9 
VICARIOUS CONSEQUENC? 
Vicarious conse) 
Group Reward No consequence. \nishment 
Boys 
Male model 3.72 3.41 1.81 
Female model 3.22 2.56 2.09 
Girls 
Male model 3.37 3.06 2.06 He 
Female model 3.59 3.69 1.59 
interaction of sex of subject, sex of © del, and 
vicarious consequence, F(2,224) = 95, p< 
.005. Means for this interaction arc n Table 
3. The strength of this interactio: derived 
from the fact that vicarious consequ «ces af- 
fected the imitation of same-sex moc ls more 
than that of opposite-sex models. Di: erences _ 
between children’s imitation of vic .iously | 


ee S 
rewarded responses and vicariously ; inished 
responses are larger, for both boys ard girls, 


when the model is of the same sex as the 
child, 

Recall measures. Correctly recalled «hoices 
of the model were subjected to a simila: four- 


way analysis of variance. No effect a’ ‘ained 


significance. Half the subjects were alse asked 
to recall the consequence administered ù> the 
model for each trial. Correctly recalle. con- 


sequences were also subject to a four-way 
analysis. The only effect attaining significance 
was the vicarious consequences main effect, 
F(2, 96) = 38.03, p < .001. Children were 
much better able to recall correctly the « onse- 
quence administered to the model if the :nodel 
had been either rewarded (M = 4.30) ot 
punished (M = 4.28) than if he had failed 
to incur a consequence (M = 2.16) for his 
behavior. 

Because a major hypothesis of the study in 
volved treatment differences in imitation whet 
no vicarious consequences were administered, 


®The error term for these contrasts was derived 
from pooling the between-subjects and within-sub: 
jects error terms from the analysis of variance, aní 
the significance of the contrasts was tested with 
more conservative number of degrees of freedon 
associated with the between-subjects error term. 4, 
a 


; 


A SOCIAL-LEARNING ANALYSIS OF SIMILARITY EFFECTS 


it seemed worthwhile to examine the nature 
of the children’s mistakes when they reported 
that a consequence had followed a modeled 
response when in fact one had not. This was 
accomplished by calculating, for each of the 
four treatment groups, the percentage of er- 
rors on no-consequences trails for which sub- 
jects mistakenly reported that reward had 
followed the modeled response. These figures 
were similar across groups and ranged from 
65% to 75%, indicating that on occasions 
when subjects erroneously reported a vicari- 
ous consequence the odds were about 2:1 in 
favor of the subject reporting the consequence 
as reward rather than as punishment, 
Manipulation checks. Subjects responded to 
four 7-point sca, with 7 indicating a greater 
degree of th: vant attitude in each in- 
stance. These questions were: (a) “How much 
do you like (/* model)?” (b) “How much 
are you like e model)?” (c) “Do you 
think the san sorts of things happen to 
(the model) to you?” and (d) “Is (the 
model) good at making choices?” Three-way 


ana! ce involving treatment, sex 
of sul: x of model were applied to 
the data cts involving the treatment 
variable ar bed here. 

Analysis of ponses to Question a in- 


dicated an inte: ction of treatment and sex 
of subject, F(3, 112) = 2.87, p < .05, that 
derived from the fact that whereas boys in 
the analogous contingencies condition liked 
the model better than boys in the other con- 
ditions, girls in this condition actually liked 
the model least, We offer no explanation for 


this rc oint out that the pattern 
rules ou rpretation of the imitation 
data in te: uttraction to the model. A 


highly signifi main effect of treatment 
was found for . estion b, F(3, 112) = 4.15, 
p <.01, Children in the no-premodeling and 
analogous contingencies groups reported them- 
selves as more similar to the model (combined 
M = 4.7) than those in the uncorrelated 
(M = 3.5) or opposite (M = 3.9) conditions. 
Since these data were collected after the imi- 
tation test, it is possible that subjects’ re- 
sponses were influenced by their imitative 
performance. A highly significant treatment 
effect for Question c also emerged, F(3, 112) 
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= 17.57, p < .001. Children in the no-pre- 
modeling and analogous contingencies groups 
more strongly believed that what happened 
to the model also happened to them (com- 
bined M = 5.48) than did those in the un- 
correlated (M = 3.47) or opposite (M = 
2.94) groups. This indicates that the experi- 
mental manipulations were effective. No sig- 
nificant effect was obtained in the analysis 
of the final question, indicating that similarity 
of reinforcing contingencies was not associ- 
ated with increases in perceived competence 
of the model. 


Discussion 


The results strongly support the hypothesis 
that the degree to which an observer believes 
he is subject to the same reinforcement con- 
tingencies as a model is a potent determinant 
of his imitation of that model. Not only does 
this variable appear to be responsible for the 
effects of vicarious reinforcement on imita- 
tion, but it also exerts powerful effects on 
imitation when no vicarious consequences are 
dispensed. The effectiveness of this variable 
appears to be independent of the observer's 
attraction to the model, the model’s compe- 
tence, prior reinforcement to the subject for 
behavioral similarity to the model, and prior 
contiguous pairing of affective states in model 
and observer, 

It is worth considering the possibility that 
the effects of our major independent variable 
were due to differences in subjects’ strivings 
for cognitive consistency. Stotland and Pat- 
chen (1961) suggested that when individuals 
conceive of themselves as similar to a model, 
they strive to incorporate other attributes of 
the model in order to maintain cognitive con- 
sistency. Because our experimental groups 
differed in the extent to which subjects per- 
ceived themselves as similar to the model, 
this possibility must be entertained. The 
major difficulty with this interpretation, how- 
ever, is the fact that the “similarity” variable 
interacted strongly with the vicarious conse- 
quences manipulation. This interaction is pre- 
dicted on the basis of social learning theory 
but not on the basis of cognitive consistency 
theory, in which case subjects should endeavor 
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to imitate a similar model regardless of the 
consequence accruing to him, 

Perhaps even more direct support for 
Bandura’s (1969) theory derives from the 
result that vicarious consequences affected 
the imitation of similar (same-sex) models 
more than that of dissimilar (cross-sex) 
models, This supports the view that pre- 
experimentally established  model-observer 
similarity affects imitation in the manner that 
Bandura suggests. 

Tt is of interest that our experimental ma- 
nipulation designed to make subjects believe 
that they were subject to opposite reinforce- 
ment contingencies to those of the model ac- 
tually produced something of a reversal of 
the typical vicarious consequences effect, Imi- 
tation in this condition was generally low, but 
subjects here imitated fewer of the model’s 
choices that had been rewarded, and more 
that had been punished, than children in any 
other treatment group. It is very possible that 
in real-life contexts, children do encounter 
models who receive consequences opposite to 
those typically accruing to themselves for 
performance of certain behaviors. This possi- 
bility was raised in a study by Thelen and 
Soltz (1969) in which vicarious reward to a 
white model seemed to cue young black chil- 
dren that they would be punished for per- 
forming the same behavior, though this effect 
might be limited to particular classes of be- 
havior, such as aggression (Liebert, Sobol, & 
Copeman, 1972), One goal of future research 
should be to delineate variables that affect 
children’s expectations that consequences ac- 
corded models of different classes will or will 
not occur to themselves for performing the 
same behavior. 

We hypothesized that subjects in an analo- 
gous reinforcement contingency condition 
would imitate the model even in the absence 
of explicit vicarious reward because they have 
learned that similarity cues and analogous 
contingencies indicate that the model is proba- 
bly behaving in a fashion that is appropriate 
for themselves as well. Strictly speaking, this 
hypothesis was confirmed, but it is possible 
that the effect was dependent, to a degree, 
on the fact that subjects frequently miscon- 
strued the absence of a vicarious consequence 
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nsistent 


to indicate reward. This result is « 

with findings reported by Spence (1966, 
1970), who suggested that in the al- «nce of 
evidence to indicate otherwise, subj often 
assume nonreaction to signal implici »eward, 
Subjects in the analogous contingen +s con- 
dition may have imitated these onses, 
expecting that they, too, would incur <e tacit 
approval from the experimenter for ; rform- 
ing them. Although this interpretati 1 does 
not lessen the validity or significanc’ of the - 
results, it does suggest a possible li tation 
on the generality of the tendency fo people 
to imitate similar models. Specifically imita- 
tion of similar models in the absenc of vi- 
carious consequences might be limited > occa- 
sions when a nonreacting agent of re force- 
ment is present during the modeling splay, 
since only when an agent is present ¢ 2 sub- 
jects interpret no consequences as ii lying 


reward, 
Our initial hypothesis was not restric ed to 
this circumstance, however, Therefore. © sec- 
ond experiment was conducted to det mine 
whether subjects would continue to sh v en- 
hanced imitation of an individual with vhom 


they have shared reinforcement conting cies 
when the agent of reinforcement is not vres- 
ent during the modeling phase. This « ndi- 
tion eliminates the possibility of subjects <on-, 


struing the absence of vicarious consequ:nces 

as tacit approval (Crandall, Good, & © ran- 

dall, 1964; Walters, Parke, & Cane, 1965). 
EXPERIMENT 2 


Method 1 
Participants. Subjects were 48 boys and gi- en-1 
rolled in the third grade of a school locate: in a) 


lower-middle-class suburb of Brisbane, Ausivalia. 
The same adult male as in Experiment 1 but « dif- 
ferent adult female* served as models. The same’) 
adult female served as experimenter. M 

Procedure. The procedure again consisted of a pred 
modeling and a main modeling phase. Only two of) 
the four treatment conditions were chosen for com- 
parison in this study. These were the analogous and) 
uncorrelated contingencies conditions. Half the chil-) 
dren of each sex were randomly assigned to ont 
condition and the other half to the other, and for 
half the subjects in each case the model was male 


| 


*The authors thank Briony M. Kemp, who volun] 
teered her time to fulfill this role. 
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and for the other half, female. For both treatments, 
the premecling phase was conducted in exactly the 
same mava»: as in Experiment 1. Stimuli were iden- 
tical to t of Experiment 1. 

At the ¿tset of the main modeling phase, the ex- 
perimente jicated, as in Experiment 1, that from 
this poin! he model would be asked his preferences 
for the ng slides first and then the subject 
would ' | his. However, the experimenter in- 
dicate had to make a telephone call and 
wou! from the room while the model 
pe nstructed the model to proceed 
+} tray, one by one, and to indicate 


eferred by pressing the appropriate 
panel and by pointing with the 
left the model and subject alone 
and the model proceeded as directed, 


pointe 
in th 


makir ime choices as in Experiment 1. 

Upon ©: upletion of this phase, the experimenter 
returned, nd tests for imitation and recall of 
modeled c! ices were conducted as in Experiment 1. 
Children :en returned to their classrooms, 

Res 


imitation scores were analyzed 
2 X 2 analysis of variance, with 
tr «iment condition, sex of subject, and sex 
of sodel as factors. The only effect attaining 
sigi ificance was a highly significant main ef- 
fect of treatment, F(1, 40) = 129.96, p< 
001. Children imitated a model who had been 
su! o the same reinforcement contingen- 
cle ; themselves much more often (M = 
12.46) than they imitated a model whose 
contingencies hod been uncorrelated with their 
ow uf ). There was, in fact, very 

in the two distributions of 
s his provides clear support for the 
hy; cbesis being tested. Children’s recall data 
were subjected to a similar analysis, but no 
effect reached significance. 


CONCLUSIONS 


These experiments furnish strong evidence 

‘© împortance of sharing a model’s re- 
spoiise-ou‘come contingencies for facilitating 
the effects of vicarious consequences on imita- 
tion and for facilitating imitation in the ab- 
sence of response consequences to the model. 
The latter effect would not seem to depend 
on subjects’ interpreting nonreaction from an 
ont of reinforcement as imparting approval. 
Presumably, in the course of socialization 
children associate analogous reinforcing con- 
tingencies with discriminable attributes of 
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certain social models. Because these attributes 
are often shared by the subject himself, simi- 
larity cues come to signal that the condition 
of analogous contingencies holds. It should 
be noted, however, that this is not the only 
process through which similarity comes to 
function as a cue for imitation. A complemen- 
tary process would involve a history of rein- 
forcement for imitation of models who possess 
similar attributes to oneself. Studies have 
demonstrated that social learning through ob- 
servation of a model is enhanced through a 
previous reinforcement history for imitation 
of the model (Grusec & Brinker, 1972; Perry 
& Perry, 1975). Taken together, the findings 
provide a more complete understanding of 
the way in which similarity cues come to 
serve as discriminative stimuli for imitative 
behavior. 

In review, the present research reports an 
additional variable that is a powerful deter- 
minant of imitative behavior, namely, the de- 
gree to which the subject believes that he 
shares the same reinforcement contingencies 
as the model. This variable may well mediate 
the effect that perceived similarity to the 
model exerts on imitation, although similarity 
cues probably require discriminative stimulus 
properties in other ways as well. The research 
furnishes evidence against an interpretation 
of similarity effects in terms of cognitive con- 
sistency strivings. Finally, the research pro- 
vides direct evidence in favor of the theo- 
retical notion that a major cause underlying 
the effects of vicarious consequences on imita- 
tion is the subject's belief that he and the 
model are subject to the same reinforcement 
contingencies. 
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Changes in Group Members’ Decision Preferences During 
Discussion: An Illustration with Mock Juries 


James H. Davis, Garold Stasser, Craig E. Spitzer, and Robert W. Holt 
University of Illinois at Urbana-Champaign 
The c ng personal decisions of mock jurors were assessed by periodic polling 
durin eration. Mock juries informed that they would subsequently be publicly 
accou le (queried by a panel of experts) differed considerably in their pattern 
of « hange over time from mock juries whose deliberations would remain 
À priv: r one thing, public juries moved much faster away from a guilty con- 


ser ur the end of deliberation, while private juries exhibited little change in 


their ncy to move with approximately equal speed toward guilty or not-guilty 
ver l hese and other dynamic changes in personal verdict preferences of jurors 
wert ed by the construction of state matrices and associated statistics, 
pern relevant comparisons over time and between conditions. Interpretations 
of vere also aided by subject reactions to postsession questionnaires. 


ions of members during 
sk-oriented groups have 
typically pro. icult for observers to assess 
and record aù rately., Observers may fail to 
ascertain thc precise position of group mem- 
bers on an issue, rate correctly the contribu- 
tions of the indis ıl members, etc. These 
pr plicated by the presence 
of intr: ng group members who 
fail to « voice vote” as often as 
their peers vevertheless may exert a 
decisive influ: he group’s outcome. 

Many natu: curring groups find it 
useful to have rev of the changing opinions 
of group member- (e.g., board meetings or 
juries) and frequently create this record 
through the periodic polling of group members. 
Such a polling device may serve, additionally, 
to focus debate, clarify lines of disagreement, 
and generally expedite task performance. Thus, 
whether scheduled at the discretion of a leader 
or prescribed at fixed intervals, the repeated- 
response format offers a means of character- 
izing fluctuating opinion within a group whose 
task is consistent with such behavior. 


The chan; 
free discussi 
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One aim of the research reported here was 
to study unfolding opinion or changing decision 
preferences in small groups by developing a 
conceptual and analytical system appropriate 
to a dynamic task. Theory and empirical re- 
search on group performance (see summaries 
by Davis, 1969; Davis, Laughlin, & Komorita, 
1976; Hackman & Morris, 1975; Hoffman, 
1965) have largely taken a static form hereto- 
fore. Predictions of group decision outcomes 
have typically made assumptions about the 
way social interaction processes acted on the 
initial distribution of opinion to produce 
particular decisions or solutions (e.g., social 
decision schemes, Davis, 1973). There are, of 
course, many kinds of interpersonal exchanges 
during discussion, but we are here concerned 
only with the kind of interpersonal process 
played through or related to the group decision: 
task. Juries are among those groups known to 
display the behavior with which we are con- 
cerned, for example, the periodic polling 
preceding a verdict. 


Mock Juries 


From the available evidence, it seems that 
the initial majority within a jury determines 
with high probability its verdict (Davis, 
Bray & Holt, in press). Moreover, precise 
assumptions about the order of majority 
(a 3 majority, a % majority, etc.) and the 
behavior of nonmajorities have been used to 
predict the verdict distribution of mock juries 
(eg, Kerr, Atkin, Stasser, Meck, Holt, & 
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Davis, 1976). The precise form of the 
accurately predicting model may differ with 
local conditions, such as a mock trial where 
jurors initially favor a not-guilty (Davis, Kerr, 
Atkin, Holt, & Meek, 1975) or a guilty verdict 
(Davis, Kerr, Stasser, Meek, & Holt, in press). 
(See Davis, Bray, & Holt, in press, for a survey 
of empirical studies of juries.) At a more basic 
level, and irrespective of any theories and their 
predictions, we would simply like to ascertain 
how majorities (or whatever) unfold during 
the course of group discussion. The investiga- 
tion to be reported studied personal opinion 
fluctuations during the deliberation of six- 
person mock juries. This study also afforded 
an opportunity to investigate the effects of 
another variable: public accountability. The 
attention that the press has given to prominent 
criminal trials has sometimes extended to the 
interrogation of ex-jurors. Not only have some 
interviewees revealed their own voting record 
and what induced them to change (or not 
change) their initial opinion, but they have 
occasionally commented upon the behavior of 
others during those deliberations. Some ob- 
servers of this public display may be repelled 
by the lack of privacy and the attendant possi- 
bilities for coercion and revenge. Others might 
argue that some accounting (not necessarily 
at the hands of the press) for juror voting 
behavior is desirable and can serve as an in- 
ducement to alert participation and responsible 
action. 
Whatever its merits prove to be, we sought 
to determine the effects of an expectation for 
postdeliberation interrogation within the con- 
text of a mock trial. Some subjects anticipated 
accounting for their voting record before a 
“panel of experts studying jury decision 
making,” while others did not have that 
anticipation. 

In summary, the general goal of this experi- 
ment was to study the behavior of juror deci- 
sion preferences during deliberations after a 
video-recorded rape trial (Davis, Kerr et al., in 
press). We were particularly interested in time- 
dependent changes in the distribution of votes 
themselves. The ordered pair (re, rxe) will 
hereafter represent the number of jurors who 
favor a guilty and not-guilty verdict respec- 
tively. For example, consider such issues as 
the symmetry with which an evenly split jury 
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of six persons (ro, rnc) = (3,3) ma subse- 


quently approach a guilty as opposed |: a not- 
guilty decision, and the inclination io -emain 
deadlocked. How quickly does (4, 2) 5S 
or (1, 5) — (0, 6), etc.? Does a (6, 0 (0, 6) 
distribution lead quickly to a verdic sport, 
or is there some vacillation, perhaps d: laying 
some pattern? How does the anticipa: of a 
public accounting influence moveme: from 
one opinion distribution (or “stat `) to 


another? For example, anticipated p 


may serve to harden majority/minorit_ ‘ines, 
whereas privacy could foster less defens: ness 
and contribute to a relative flexibility i juror 
positions. 

These and a number of other relate: ues- 
tions can be attacked by studying the | ling 
record. However, to do this systematic: v we 


require a somewhat more explicit and } cise 


means for characterizing juror ting 
dynamics. 
Social Decision Processes 

Imagine that decision-making groups and 
individuals alike face a task that defines set 
of mutually exclusive and exhaustive re on- 


sive alternatives, Aj, As, ...,A,. For a g oup 
of r persons, the members’ response prefero ces 


(E ROETE may be distributed in 
Nn p— IN 3 
m= ( gs ) different ways if we tr at 


alternatives but not members as distinguish- 
able. That is to say, there are m possible si :t- 
tings of persons into response alternatives. 
These are the m solutions to rı + ry +++ rn 
= r. For the special case of this study, r = 6 
eye? igh 
tinguishable distributions of guilty/not-guilty 
opinions. Those applications where z and r are 
large can yield a very large number, m, of 
logical possibilities. 

We seek now a descriptive device that per- 
mits a more detailed picture of the state of 
member response preferences from moment to 
moment.' To this end, let the m-distinguishable 


and = 2, and thus ( 


1 Godwin and Restle (1974) also made use of intra- 
group response distributions defined as states in their 
analysis of group concept attainment. However, theit 
application treated both persons and response prefer- 
ences as nondistinguishable whereas we regard pre- 


ee 


GROUP MEMBER OPINION CHANGE 


distributions be states, $;, into and out of which 
a group ry move. Define s;; to be the proba- 
bility the» group is in state S; on trial ¢ and 
state $; ial t+ 1, where 4 = 1,2)... T, 
Pers; (11) Ta meer Tg LORS SONES 
The si; e conceived as the entries of the 
m Xv stic matrix, S, given by 
8: L + Sm 
SATSES Sim 
Say, 0.08. 2 SA Sami (1) 
Smi Sm? *** Smm, 
where 
m 
E sy = 1.00, 
j=l 
The :ubscript # indicates a pair of trials 


1) and thus 7#= 1, 2, ..., T—1. 
There were m = 7 states for our mock juries: 
Sı = (6,0), S:= (5,1), ..., 57 = (0, 6). 

The mat:'x Sp describes transitions between 
states, but we do not at this point make any 
assumptions that the social decision process 
we seek to describe is Markovian. (In the 
typical experimental setup with small groups, 
the use of Sy is also unlike typical learning 
theory applications in that the occupancy of 
any slate, Sa is an observable event, and the 
si; may in principle be directly estimated by 
counting.) Observe in particular we do not 


now a= me that Sv is constant for all #’. In 
fact o constancy of Sv is an important 
empiical issue, to be determined as a useful 


feature of the research context. If Sv # Si, 
we must be especially attentive to any orderli- 
ness in the way Sy changes. Later, we intro- 
duce statistics for assessing various features 
of state matrices. 


METHOD 
Subjects 


Fro iotal sample of 955 subjects (584 male and 
371 female), 870 were randomly assigned to six-person 
juries, while the remaining 85 subjects participated 
individually. Subjects participated either as part of an 
extraclass exercise for an introductory social psychology 


ferences within the group as distinguishable and persons 
as nondistinguishable, (For a discussion of distinguish- 
able and nondistinguishable categorizations of events 
see Feller, 1957, or a standard tex: on combinatorics, 
€g., Hall, 1967), 
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course or to fulfill a requirement for an introductory 
general psychology course. 


Design 


Juries deliberated in either the private condition, in 
which deliberation was private, or the publicly account- 
able condition, in which jurors were told that in addi- 
tion to a permanent record (videotape) of their discus- 
sion, they might be questioned at a future date about 
their arguments, opinions, and the jury’s verdict. 
Individual “control” subjects never participated in 
group discussion, Some of these individual controls 
anonymously completed summaries of testimony 
(private), and others (publicly accountable) also com- 
pleted summaries but identified themselves so that 
they could later be contacted and questioned concerning 
their opinions and personal decision. 


Procedure 


Each 2-hour session consisted of two parts. During 
the first part, the subjects assembled as a large group 
of 30-60 persons in a classroom equipped with television 
monitors. The sequence of events for this part of the 
session was as follows. (a) Upon arrival, each subject 
was given a voir dire questionnaire (eliciting preliminary 
information on opinions concerning rape, ¢tc.), a 
personal opinion questionnaire, and a mark-sense 
answer sheet along with instructions, and requested to 
answer the voir dire questionnaire. (b) The mock trial 
(prerecorded on audio- and videotapes) was based on 
a slightly edited transcript of an actual rape case 
(Norris, 1965). (c) Subjects completed the personal 
opinion questionnaire immediately after watching the 
mock trial. They responded to several items, whose 
response scales were of different lengths and anchored 
in different ways, in addition to the customary guilty/ 
not-guilty dichotomy, (d) Subjects were assigned to 
smaller experimental rooms for the second part of the 
session. Any subjects left after randomly composing 
the largest possible number of six-person mock juries 
participated as individual controls. 

Experimental rooms were equipped with one-way 
mirrors and an intercom system. All instructions to 
juries during the second part of the session were read 
over the intercom. In the publicly accountable condi- 
tion, curtains at the one-way windows were open and 
light levels were adjusted so that subjects could see a 
video camera in the control room beyond. In the private 
condition, the curtain was tightly drawn over the one- 
way window. 

Juries were required to select a foreman, using what- 
ever means they wished. Deliberation instructions then 
explained the polling procedures, the decision rule, and 
the conditions that constituted the experimental manip- 
ulations. There were two types of polls—public group 
polls and private individual polls. ‘The foreman was, as 
customary, to call for a public group poll as often as he 
felt necessary, recording the time and each juror’s 
response (guilty, not guilty, or no response) on a vote 
record sheet. The private individual polls were taken 
at 1-minute intervals during deliberation in response 
to a signal over the intercom, Jurors responded pri- 
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TABLE 1 
STATE MATRICES, S;, FOR Five BLOCKS OF TRIALS, IN THE PUBLIC 
AND PRIVATE EXPERIMENTAL CONDITIONS 
i i Public conditi 
Trial “Wa juries v= 72 juries) 3 
Block a 1 C S$: S& 8 & $: $i 8: & & Ss S: Sr 
S 93: <0 1.00 
AEE i 06 91 03 
Ss OL 95>" COS S08 04 .94 02 
1 Sa BS: 2 13'5 02 08 87 04 02 
Ss 05.90.05 01 89 .08 02 
So 03 97 91 9 
Sr 1.00 90 
Sı 1.00 1.00 
$: 06 94 %3 97 
$: 02 0790- .06 P O01 
2 Sy 01 03 84 10 .02 06 81 13 y 
Ss Of 209. F.19- 02 02° 91 .07 
Ss 04 .89 07 02 98 
Sr 03 97 00 
$ 1.00 99 OL 
Sy 10.90 08 387 _ .05 
$ 03. .95:_..02 Ol 94 05 
3 Dy OL 96 03 A2 84 04 
$s 04 90 04 02 02 -92 .06 
So 03 .90 07 02 88 10 
Sr O22 198 02 8 
Sy 1,00 1,00 
Sa 410 90 1.00 
Sy 92 04 02 9% 04 
4 Se 03. 93: 03: Air} a22 , 
Si 92: ~ 08 89 09 2 
So 02 98 03 84 } 
Sr 1,00 1 
Sy 1,00 1.00 
$: 04 92 03 1.00 
Sa 02 95 02 98 02 
5 Se 98.02 08 83 08 
$s .94 06 95 05 
Se .97 .03 90 19 
Sı 1.00 01 09 
Nole. $1 = (6,0); §2 = (5.1); $2 = (4.2); $e = (3,3); S; 4); $s = (1,5); $1 = (0,6). a 


Entries, [4j]. are averages of relative frequencies with which mock juries moved from $« on trial £ to $j on £ + 1 for six 


blocks. (Zero entries have been left blank.) 


vately, guilty or innocent, on their individual answer 
sheets. A jury’s verdict was established by unanimous 
agreement on two consecutive public group polls 
separated by at least three private individual polls. 
Juries in the publicly accountable condition learned 
that the video recordings being made would be reviewed 
by a panel of psychologists and legal experts who were 
studying the accuracy of juries such as theirs. More- 
over, they might later be invited to discuss the argu- 
ments and opinions they had expressed. Juries in the 
private condition were told that their deliberations 
were private. Juries failing to reach a verdict within 
30 minutes were asked to declare themselves hung. 
Next, juries that had reached a verdict completed a 
final jury task, which consisted of responding (unani- 
mous agreement) on a 9-point “degree of guilt.” 


ial 


After deliberation, jurors completed privately a post- 
verdict questionnaire containing items from the earlier 
personal opinion questionnaire as well as items eliciting 
ratings of the believability of the witnesses; sociometrie 
ratings of fellow jurors’ participation and influence; 
ratings of satisfaction with the jury’s decision process 
and final decision; judgments of assigned, actual, and 
ideal decision rules; and a manipulation check. Par- 
ticipants were then thanked, debriefed, and excused. 

Procedures for the individual control subjects were 
comparable to those for juries. In place of deliberation, 
individuals summarized what they considered were the 
salient facts of the case on a summary sheet. Experi- 
mental manipulations comparable to those for juries 
were accomplished by asking individuals in the publicly 
accountable condition to provide their name, social 
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and phone number on the summary 

t they could be contacted later to discuss 
jes and decision with a panel of psycholo- 
ul experts interested in the accuracy of 
dings. Individuals in the private condi- 
| anonymous summaries, Finally, indi- 
neted a  postreflection questionnaire 


security nu 
sheet so tl 
their sum 
gists and 
judicial } 
tion con 
vidu: 


identice ie postverdict questionnaire for jurors 
except s pertaining to group interaction were 
omitt 
RESULTS 

Jury Mem + Preference States 

On ©. à trial, a jury was in one of seven 
possi!) states as defined by members’ per- 
son inion about the defendant’s guilt: 
Si= (rc, xa) = (6,0), (5,1), (4,2), (3,3), 
(2, (1,5), or (0,6). Changes in jurors’ 


verdict preferences observed during delibera- 
tiv were summarized for each pair of adjacent 
tials’ = t,t-+ 1 by the 7 X 7 “state matrix,” 
Sp, where the ijt’-th entry, [Sye], is the rela- 
tive frequency with which a group was in S; on 
trial Z and in Syon trial + 1. (The [Siv] were 
-veraged for the six trial blocks, and thus for 
iis data set we now let # = 1,2,...,5 be 
‘he trial block.) The state matrices, Sv, for the 
interstate transitions observed in the public 
and private experimental conditions of this 
study are given for six trial blocks in Table 1. 
We stress at this point, however, that the 
use of the term state and the use of square 
matrices to trace the traffic among them in no 
way implies (or denies) the assumption of an 
iderlying Markov process, either stationary 
nonstationary. The current use of matrices 
resembles that familiar from information 
theory, sociometrics, and applied statistics—to 
name only a few. The use of such an array 
retains and displays patterns of relations 
within a data set and enables a number of 
questions to be addressed by direct inspection. 
However, it is desirable to characterize a state 
rie by an index, like any sample of ob- 
servations, permitting the overall comparisons 
across time and between conditions. Any such 
general index depends in part upon what 
ieatures of the matrix are of interest. Since 
we are concerned about the temporal constancy 
{Su and the symmetry (i.e., around the main 
diag f Sv) of movement toward guilty or 
not guilty, we consider first an index, yv, to 
describe the rate of opinion change at the 
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Fiavre 1. The mean value of yz" is given for 6-trial 
blocks. If ye* is near .00, there is little movement toward 
either guilty or not guilty, and increasing y” indicates 
increasing movement. The asterisk indicates these are 
“pseudovalues” as recommended by Mosteller and 
Tukey (1968). 


t'-th block, where 


yw=LULsw, foralixyz. 2) 
sig 


If yr = .00, all juries have remained in the 
same state from trial ¿ to ¿+ 1, and all main 
diagonal entries, [suv], are unity. When yv is 
large, opinion change is rapid. Note that the 
index yv applies to a data set as a whole (i.e., 
in this study to all mock juries within an 
experimental condition). However, the sam- 
pling distribution of the observed value, the 
statistic w, is unknown, and it was thus 
necessary to use the jackknife (Mosteller & 
Tukey, 1968) to estimate: variation and test 
hypotheses. (See Mosteller & Tukey, 1968, for 
a discussion of these and related statistical 
matters, and Spitzer, 1976, for an economical 
computational routine.) The jackknife yields 
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6-TAIRL BLOCKS 

Ficure 2. The mean value of 6y* is given for 6-trial 
blocks. If 6,* is near 1.00, there is a tendency to equal 
movement toward guilty and not guilty, but a value 
greater than 1,00 indicates faster opinion change 
toward not guilty than guilty. The asterisk indicates 
these are “pseudovalues” as recommended by Mosteller 
and Tukey (1968), 


a quantity called a “pseudovalue,” ye*, which 
may be interpreted in precisely the same way 
as the index, yv, originally defined by Equa- 
tion 2. (More specifically, there were 66 juries 
im each condition for which complete data 
records existed. These in turn were divided 
into 11 subsamples of 6 each from which 11 
pseudovalues were calculated for each trial; 
the degrees of freedom, for the analyses of 
variance reported below, were based on these 
subsamples, and thus were [conservatively] 
small.) 

Mean observed values of yv*, averaged for 
six trial blocks, are given in Figure 1. Sub- 
stantial change over trials is evident by in- 
spection, but the two experimental conditions 
appear to yield rather different patterns of 
activity. Indeed, submitting the pseudovalues 
to an analysis of variance revealed a significant 
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effect for trial blocks, (4, 80) 24,15 
(p < .001); experimental condition, /’(1, 20) 
= 23.30 (p< .001); and their inte ction, 


F(4, 80) = 75.79 (p < 001). Interc:. dition 
comparisons of means (Scheffé tests) + each 
trial block showed a significant avera’ differ- 
ence at Blocks 2, 3, and 4 (F’ = 83.6. 29.98, 
and 228.41, respectively, and p < .00' ‘or all 
three) but not at the first and last locks. 
Thus, considering only these three iddle 
points, the private condition juries ap, ar to 

te of 


undergo, first, a rapid increase in the 
opinion change and a steady decline ther ifter, 
The public condition juries, on the othe; ‘and, 
followed a nearly opposite course; here 
appears to have been a large drop in p. ition 


change activity followed by a large and sady 
increase until the last trial block. It i only 
the convergence at the last trial bloc! that 
prevents nearly mirror image reversals «` the 
trends within the two experimental condi ons, 

vith 


A second question concerns the rate 
which mock juries move toward not , iilty 
relative to guilty. For example, coul the 


normative and legal safeguards against on- 
victing innocent defendants serve to crea : an 
asymmetry in opinion change (i.e., asym fry 
about the main diagonal of Sj-)? A se: ond 
index was thus constructed to describe he 
overall rate of relative movement in the wo 


directions: 

eH A+R YE s/A+ DS sy). (3) 

i<} i>i 

Again, the observed values, ôv, were jack- 
knifed, and for the reasons given above, the 
averages of the resulting pseudovalues, 6..*, 
have been plotted in Figure 2. A value of }.* 
near 1.00 indicates nearly equal movement 
toward guilty and not guilty.? The data po ats 
displayed in Figure 2 are generally near 1.00 
or larger, indicating faster opinion chenge 
toward not guilty than guilty, in both condi- 
tions. This result indeed does not contradict 
the idea that the normative and legal protec- 
tion extended in principle to defendants may 


2 The choice of index is to some extent arbitrary. 
There are a number of properties desirable in an index 
of this sort (expectation under interesting hypotheses, 
etc.). However, the use of 6, as defined by Equation 3, 
seemed desirable since it is the simplest of several 
pertinent statistics we studied. 
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guide juror behavior in (mock) practice. 


Figure 2 also suggests average changes in the 
relative rite of opinion movement to be a 
function ol trial block and experimental condi- 


tion, and this is confirmed by significant effects 
due to experimental condition, F(1, 20) 
(p < .001); trial block, F(4, 80) 
h 001); and their interaction, 


F(4, 80) = 48.00 (p < .001), The mild U- 
shaped i nd of the private condition actually 
represent; little change over time, and the 
initially : .d monotonic increase observed in 
the pu’ .c condition is not so strikingly 


different, except for the very large increase at 
Block 4. Scheffé tests confirm that the differ- 
ence between condition means at Block 4 is 
the only between-conditions comparison that 
is significant, F’ = 221.56, p < .001. 

In view of such differences between the 
public and private state matrices, it is not 
surprising that a significantly higher propor- 
tion of jurors in the public (.88) than private 
(.11) condition indicated on a postverdict 
questionnaire that they expected to be invited 
to an interrogation session later, X*(1)=488.21, 
p J01. Such manipulation checks for a 
proportion of public (.87) and private (.05) 
individuals also showed a significant expecta- 
tion difference, X#(1) = 51.33, p < .001. 

Additional relationships within Sy». Addi- 
tional information concerning interpersonal 
dynamics of verdict preferences may be ob- 
tained from the observed state matrices, Sv, 
given in Table 1. For example, most entries 
are zero, and the nonzero entries cluster 
around the main diagonal of the various 
matrices. This suggests that great “leaps” 
from divided opinion to the unanimous con- 
sensus (technically required for a verdict) were 
rare. The size of main diagonal entries, [Sie] 
shows that changes in opinion states were 
generally quite slow. When change did occur, 
it was most likely to be to the state “next 
door” ; this suggests that jurors rarely changed 
as subgroups—otherwise there would have 
been larger entries in cells two or more states 

vay within the same row. The state 
S4 = (3, 3) holds particular interest in that it 
represents an even division between guilty and 
not-guilty opinion. The corresponding entry 
Siw indicates the stability of this “confronta- 
tion” in that it estimates the probability of 
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juries remaining evenly divided from trial to 
trial. In both conditions $v is slightly smaller 
from block to block than the other values 
along the main diagonal of $., indicating the 
(3, 3) split was less stable than other states—a 
surprising result in view of conformity effects 
to be expected. In comparison, the private 
condition appears to yield somewhat more 
stability for the (3, 3) split, 


E Sw = 4.56, 


tml 


than the public condition, 


5 
X sw = 4.07. 
tæl 
We might next inquire about the direction 
of movement out of the crucial $4 = (3, 3) 
state and the relative dependence of this 
activity upon experimental condition. First of 
all, the movement toward not guilty was some- 
what greater than toward guilty; for example, 
the sums of the relevant cells were .59 and .34, 
respectively for the public condition. Similarly, 
the corresponding sums were .35 and .08 for 
the private condition, showing that the latter 
change in rate is slower.’ Such a result is 
consistent with the results using overall indices 
and potentially extends them in a detailed 
way. 


Other Comparisons 

Despite the different patterns of juror 
opinion change within the two conditions over 
trials, the overall observed distributions 
(Po, Pxa,Pu) of final verdicts were quite 
similar between the two conditions. Table 2 


4 More precisely, the movement toward not guilty 
was somewhat greater than toward guilty in that 


Bot 68 


EL Se = DE L Sy = 34 


pmi jmb va 
for the public condition, and 


6 7 b I 

D E sye = 35> E E tyr = .08 
jai Y e À paja 

for the private condition, showing that the latter change 
in rate is slower. 
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TABLE 2 
RELATIVE FREQUENCY OF FINAL OUTCOMES FOR SIX-PERSON MOCK Juries 
AND PARALLEL INDIVIDUAL SUBJECTS 
“Sea 3 Mock juries Individuals 
Not Not 
Condition Guilty guilty Hung N Guilty guilty 
Private Ol 25 Ad 72 64 36 42 
Public 25 29 A6 72 67 33 42 
Total 28 27 45 144 65 35 + 
summarizes these outcomes for both mock given separately in Table 3 for mals and 
juries and those individuals who acted alone females. The pattern of results is g+ erally 
throughout. Treating the entries for juries consistent, and with few exceptions an; ‘nter- 
(left side of Table 2) as a 2 X 3 contingency actions are due to “slope” differences “ather 


table, X*(2) = 6.46, ns (Postverdict personal 
opinions of ex-jurors also did not differ signifi- 
cantly between the public and private condi- 
tions.) Clearly, we would reach rather different 
conclusions about the effects of the private- 
public accountability issue if we had attended 
only to the final verdicts without accompany- 
ing data on the pattern of opinion change 
during deliberation. However, other conditions 
might exist where the two opinion change 
patterns could lead to different verdicts, for 
example, if deliberation were for some reason 
concluded earlier. 

The observed distribution of guilty and not- 
guilty (Pa, Pya) preferences of isolated public 
and private individuals (right side of Table 2) 
did not differ significantly. However, the pro- 
portion of guilty verdicts between correspond- 
ing individual and jury verdicts (i.e., Po vs. 
Pa) did differ significantly from each other 
(respectively, .64 vs. 31, s = 3.47, p < .001, 
for the public, and .67 vs. .25, z= 4.48, 
Ż < .001, for the private condition), confirming 
a result reported earlier by Davis et al. (1975) 
and Kerr et al. (1976). 


Personal Reactions of Jurors and Individuals 


A pre- and postdeliberation questionnaire 
solicited subjects’ ratings concerning various 
aspects of the trial. Table 3 contains post- 
deliberation mean ratings of the degree of 
certainty placed in the correctness of one’s 
personal verdict, likelihood of the defendant’s 
guilt, believability of victim’s testimony, and 
believability of defendant’s testimony. 

Sex of respondent sometimes interacted with 
other main effects, and thus mean values are 


than reversals in direction of differer es, A 
2X2X2 (Public vs. Private Co) dition 
X Sex X Pre- vs. Postgroup) analy s of 
variance was carried out separately for mock 
juries and individuals. Major results are ziven 
at the bottom of Table 3. 

We may summarize the findings repor‘ ed in 
Table 3 by observing that private-con: ition 
mock jurors were less certain of their pe: onal 
verdict, less inclined to regard the defer lant 


as likely to have committed the crim: less 
likely to rate him guilty, less believing o: the 
victim’s testimony, but more believing o the 


defendant’s testimony—trends intensifie in 
female subjects. Essentially the same patern 
was observed for both individuals and jurors, 
although the differences were sometimes rer 
small for one or the other. 


Discussion 


At the outset, we rationalized our interest 
in postgroup expectations of members in part 
by analogy with real trials, especially wcll- 
publicized ones, and the potential for influence 
and coercion through embarrassment, revenge, 
etc. It goes without saying that one-to-one 
generalizations to particular trials past or 
future is naive. Any study, whether observing 
role-playing undergraduates or actual per- 


* Females were also more likely than males to favor 
a guilty verdict. Davis, Kerr, Stasser, Meek, and Holt 
(in press) and Kerr et al. (1976) have observed the same 
result, using basically the same rape trial, and no 
further discussion here seems warranted in view of trial 
content and randomized subject assignment. However, 
sex was treated as a factor in analyses of variance when 
appropriate, 
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TABLE 3 
MEANS OF POSTDELIBERATION PERSONAL RATINGS ON SCALE OF 1-7 or FEELINGS ABOUT 
J ASPECTS OF THE MOCK TRIAL AND THE VERDICTS 
Private condition Public sandia 
Item Male Female Total Male Female Total 
I 
y of personal verdict 2.84 2.93 2.88 2.85 2.79 2.83 
ery certain) 
ood defendant did crime* 3.87 3.38 3.70 3.86 2.98 3.49 
tremely likely) 
HE 4.07 3.55 3.88 4.11 3.26 3.76 
ilty. 
Belle ability of victim> 3.06 2.93 3.02 3.13 2.66 2.94 
1 ery believable) 
vability of defendant® 3.61 3,98 3.74 3.73 4,20 3.92 
ry believable) 
ialis x 
\ty of personal verdict* 3.17 2.89 3.05 2.65 2,06 2,43 
ery certain) 
ihood defendant did crime 4,04 3.16 3.64 3.85 3.31 3.04 
extremely likely) 
ilt rating 4.05 3.00 3.90 4.08 3.31 3.79 
| = guilty) 
Lelievability of victim 2.87 2.63 2.76 2.83 2.73 2,79 
(1 = very believable) 
Relievability of defendant* 3.00 3,32 3.14 3.48 4.07 3.171 


(i very believable) 


. Sample size in each condition was 432. Significant F ratios involving experimental condition are given below, 
or Time X Condition, P (1, 862) = 11.31, p < .001. After deliberation, private jurors rated the defendant as more likely to 


« committed the crime than did public jurors, 


For Sex X Condition, F (1, 864) = 3.82, p œ .05. Females rated the victim as less believable in the private than public condition, 


For condition, F(1, 866) = 4.21, p <.04. 
‘ For condition, Fit! 79) = 5.88, p < .02. 
» For condition, F(1, 76) = 7.32. p < .03. 


f s is obviously limited by setting, 
uation, norms, etc. No single data set is 
sufficient to serve alone as a guide for social 
policy. (See discussions by Davis, Bray, 
& Holt, in press, and Gelfand & Solomon, 
1974, 1975, for further cautions on the perils 
cling devotion to the unembellished em- 
pical datum.) We see the main value of 
re-carches such as this to be in their implica- 
tions for general theory. i 
‘The state matrices, Sy, permitted the 
extension of social decision concepts to the 
description of opinion change during the 
deliberation period preceding final decision. 
The S, observed were thus queried in several 
ways, including the construction of the in- 
dicts, yr and ôr, to describe the overall 
behavior of the interaction/decision dynamics. 
Mean values of these two indices (Figures 1 
and 2) appear to behave rather differently over 
‘rials according to whether or not jurors were 
of a subsequent accounting. Manipu- 
lation checks confirmed that condition assign- 
ment indeed was significantly and highly 


associated with ex-jurors’ expectation of a 
later interrogation. í 

The results displayed in Figure 1 (which 
summarizes the overall rate of change in 
opinion whether toward guilty or not guilty) 
suggest that there is an initial (Block 1) 
opinion adjustment in both conditions, per- 
haps by those with vaguely defined preferences 
and/or those who yield quickly before counter- 
arguments. The private-condition mock jurors 
change even more rapidly at the second block 
but decline steadily thereafter. Private- 
condition jurors may adopt an early opinion 
and feel less inclined to change with time. 
They perhaps feel less need than public condi- 
tion jurors to debate, examine, and generally 
test testimony interpretations, recall, etc. 
since they do not expect subsequent challenge. 
The public-condition jurors display over the 
middle three trial blocks (the points which 
differ significantly between conditions) a 
roughly opposite trend. Public condition 
jurors perhaps reacted to the expected scrutiny 
by playing the role of a particularly conscien- 
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tious juror. They consequently attempted to 
avoid “premature” opinion change, and with 
repeated refutation, counterarguments, etc., 
felt increasingly free to alter their opinion. 
Thus, the two contrasting patterns of inter- 
state opinion change are consistent with at 
least one interpretation of the differential 
expectations of postexperimental examination. 

‘The second state-change index, ĉr, measures 
the relative rate of change toward guilty or 
not-guilty verdicts (Figure 2). Both conditions 

displayed on the average about the same mild 

asymmetry (i.e. in general, 6,* > 1.00, ap- 

proximately), and in the “norm-likely” direc- 
tion of faster change toward not guilty than 
guilty—with one massive exception. At Trial 

Block 4 there was a very rapid change by the 

public condition jurors. The doctrine that a 

“thousand guilty go free...” expresses a 

value so widely shared that we should perhaps 

not be surprised the late deciders, when ex- 
pecting subsequent interrogation, incline faster 
toward not guilty than guilty verdicts. 

The pattern of subject responses to selected 

questions about the trial activities (Table 3) 
suggests a generally more confident and 
decisive juror in the public condition. Relative 
to private condition subjects, ex-jurors and 
individuals alike from the public condition 
responded on the average that they were more 
certain of their verdicts, more confident that 
the defendant committed the crime or was 
guilty. These subjective reactions may be due 
to a greater “interaction investment” in the 
public condition, which displayed greater 
opinion change later in deliberation, or may 
have been due to differing expectations directly 
fostered by the manipulation. Unfortunately, 
several relevant mean differences were quite 
small, though “direction-consistent,” and thus 
only one of a corresponding pair of comparisons 
was significant. (See the note at the bottom 
of Table 3.) At present, we lack the supple- 
mentary details concerning the deliberation 
dynamics which would permit a more confident 
interpretation. 

Despite the differences in social dynamics 
described above, the public and private condi- 
tion juries tended to end with quite similar 
verdict distributions. To be sure, there were 
fewer guilty verdicts in the public condition, 
but the overall discrepancies were not large. 
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On the face of it, the public-private » nipula- 
tion, which apparently exerted a strong effect 
upon the form of intragroup opini hange 
processes, did not exert a similarly str effect 
upon outcome. Yet, the outcomes 1 t have 
been quite different should deliber is for | 
some reason have been concluded « sr (or 
later). For example, if a foreman wer o halt 
discussion and poll members (for win the 
social decision dynamics were as © -erved 
here) at the fourth trial block, wo could 
anticipate observing quite a differen’ ct of 
verdict outcomes. The point at which jury 
happens to be on the unfolding social « ision 
process, then, is perhaps an importani leter- 
minant of its verdict. Obviously, we | \nnot 
be sure about the generality of these sults 
without examining juries whose a {ted 


deliberation intervals vary, etc. 
Unfortunately, group researchers hav: typi- 
cally tended to concentrate upon perfor: ance 
(outcomes) with little attention to the pi ced- 
ing social interaction, or their focus upon cial 
dynamics and interpersonal exchanges h: : not 
been accompanied by a similar concern vith 


the outcomes of that interaction. (See 1 vis, 
1969, and Kelley & Thibaut, 1969, for ore 
comprehensive discussions of these iss «s.) 


One of the troublesome problems contribu. ing 
to the heavy emphasis on outcomes per sin 
group performance research has been he 
difficulty of obtaining a clear empirical pict ire 
of changing interpersonal relations, subgro ip 
patterns, etc., in such a way that they could 
be related to the group decision or solution. 
The state matrices, Sv, as constructed 
used here may permit at least a partial solu ion 
to this problem. Hopefully, a major deve op- 
ment of the future will be the extensior of 
essentially static theories of group performance 
(e.g., Davis, 1973) to a more dynamic form 
compatible with the kinds of data we have 
considered above. 
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Bystander Reactions to a Violent Theft: Crime in Jerusaiem 


Shalom H. Schwartz and Avi Gottlieb s 
Department of Sociology, University of Wisconsin—Madison 


Seventy-two male Israeli students were exposed to a violent crime in the course 
of a bogus discussion. Their awareness of other bystanders’ lack of reaction 
to the emergency (social influence) and others’ awareness of their actions 
(evaluation apprehension) were crossed in a 2 X 2 factorial design. An “alone” 


condition in which the subject was the only bystander controlled for the effects 
of others’ mere presence (diffusion of responsibility). Helping was reduced by r, 


diffusion of responsibility and slowed by negative social influence but was in- 
creased by evaluation apprehension, Differences traceable to social influence 
appeared prior to those from the other processes, Implications of these findings 
for the measurement of helping and the interpretation of bystander decision 


making are discussed. 


The widely decried unresponsiveness of 38 
witnesses during the violent murder of Kitty 
Genovese (Rosenthal, 1964) was the major 
stimulus to social-psychological research on 
bystander behavior in emergencies. Yet none 
of this research has confronted subjects with 
an emergency caused by a violent crime. The 
victims’ need has commonly been generated 
by impersonally caused accidents (e.g., fall- 
ing bookcases, Latané & Rodin, 1969), physi- 
cal illness (e.g., nervous seizures, Darley & 
Latané, 1968), or nonviolent crimes (e.g., 
theft, Moriarty, 1975). Bystanders have been 
exposed to some violence, but only in the form 
of fighting between children (Latané & Dar- 
ley, 1970) and between participants in a 
psychodrama (Borofsky, Stollak, & Messé, 
1971). 

Bystander responses to impersonal and non- 
violent emergencies are important, of course, 
but they may differ from responses to violent 
crime. Intervention in violent crime is likely 
to be viewed as more dangerous, especially 
when the attacker is still present, when ex- 
tended and unpredictable involvement with 
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the first author. 
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Wisconsin, Madison, Wisconsin 53706, 


authorities (e.g., police, courts) is possible, 
when fear of becoming the next victir may 
tress, 


induce particularly strong emotional 


f3 


and when violence-precipitated need m iy be & 


perceived as especially vivid or imme liate. 
The current study sought to detc:mine 
whether processes postulated to explain by- 
stander intervention in past studies apy ly in 
a case of violent crime. 

Latané and Darley (1970) have pro»osed 
three analytically distinct processes that may 
account for the oft-observed tendency o: by- 
standers to inhibit each others’ responsiv- ness 
in emergencies: social influence, audience in- 
hibition, and diffusion of responsibility, Char- 
acteristics of these processes are discussed 
below. We contend that many of the st idies 
interpreted as demonstrating the effects of 
one or another process have confounded two 
of the processes or even all three. A second 
aim of the present research is fo isolate the 
effects of all three intervening processes in a 
single study by systematically varying con- 
ditions conducive to each, 

The process of social influence requires that 
the focal bystander be aware of the reac- 
tions of other bystanders. The reactions of 
others provide informational cues about the 
seriousness and reality of the emergency and 
normative cues about appropriateness of re- 
sponses. Latané and Darley (1970) have em- 
phasized inhibition of response through visual 
observation of the inaction of others. Logi- 
cally, however, communication of others’ re- 
sponses through nonvisual channels should 
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also suffice as a source of social influence; 
and social influence may promote intervention 
as well as inbibit it, depending on the nature 
of the responses perceived (e.g, Bickman, 
1972). 

The process that Latané and Darley 
(1970) labeled audience inhibition requires 
that focal bystanders believe others are aware 
of their responses. It refers to the bystanders’ 
concern over others’ expectations and evalua- 


~ tions of their behavior and the desire to opti- 


mize these evaluations. This process might 
preferably be called evaluation apprehension 
to avoid any directional connotation. This 
apprehension may either foster or inhibit in- 
tervention depending on what expectations 
the bystander believes others hold. 

Diffusion of responsibility requires only 
that a bystander think others who might re- 
spond to the emergency are also present. Feel- 
ings of responsibility can then be diluted be- 
cause it may be less necessary for any one 
person to act; and any anticipated blame for 
nonresponse may also be reduced because it 
wou'd be shared by others. Increases in the 
numer of bystanders to an emergency, espe- 
cially competent bystanders, have been inter- 
preted as conducive to diffusion of responsi- 
bility (e.g., Darley & Latané, 1968; Schwartz 
& Clausen, 1970). If the additional bystand- 
ers are unable to respond (e.g., Bickman, 
1971), or if responsibility is explicitly as- 
signed to the focal bystander (Tilker, 1970), 
however, the diffusion process is unlikely. 

From the above discussion is becomes clear 
that a pure diffusion-of-responsibility effect 
can only be isolated when the bystander be- 
lieves that others are present and that neither 
she/he nor they are aware of each others’ 
responses. This implies that firm conclusions 
about diffusion of responsibility cannot be 
drawn from most of the laboratory and field 
studies in which the number of bystanders 
has been varied because bystanders have been 
aware of one another to some degree (e.g., 
Clark & Word, 1972; Latané & Darley, 
1968; Latané & Dabbs, 1975; Latané & 
Rodin, 1969; Piliavin, Rodin, & Piliavin, 
1969; Smith, Smythe, & Lien, 1972). 

Even the now classic study explicitly in- 
tended to manipulate only conditions condu- 
cive to diffusion of responsibility (Darley & 
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Latané, 1968) was not designed to control 
fully against the occurrence of social influ- 
ence and evaluation apprehension. Bystanders 
to the victim’s nervous seizure had an addi- 
tional 4 minutes to respond after an auto- 
matic circuit switcher began the next partici- 
pant’s turn. Since the order of turns was 
varied (Darley & Latané, 1968, p. 379, Foot- 
note 2), the subject may have expected either 
to hear another bystander (social influence) 
or to be heard herself by the others (evalua- 
tion apprehension) during this period, de- 
pending upon whose turn followed the vic- 
tim’s. In the replication of this study by 
Schwartz and Clausen (1970), the naive sub- 
ject’s turn always followed the victim’s, a situ- 
ation conducive to evaluation apprehension, 

In both of these studies, there was evidence 
for pure diffusion, however, since lone fe- 
male bystanders responded significantly more 
frequently than those who believed four others 
were present even before the circuit switched. 
In the replication with males by Horowitz 
(1971), other subjects were always scheduled 
to be heard next, a situation conducive to 
social influence; and no pure diffusion effect 
was isolated. 

The final emergency study that meets the 
awareness requirements for isolating diffu- 
sion of responsibility with an increase in the 
number of bystanders was done by Bickman 
(1971). In this study female bystanders re- 
sponded more slowly to the victim of a fallen 
bookcase when in the presence of another 
able bystander than when alone, although the 
proportions responding did not differ. In sum, 
unequivocal evidence for the diffusion of re- 
sponsibility process is sparse, and it pertains 
exclusively to female bystanders. 

Pure social influence and evaluation appre- 
hension effects also do not abound in the by- 
stander literature. In most of the studies pur- 
portedly demonstrating social influence, by- 
standers witnessed the emergency in each 
other’s presence. Hence, they may also have 
been affected by concern regarding others’ 
expectations and evaluations (e.g., Clark & 
Word, 1972; Latané & Dabbs, 1975; Latané 
& Darley, 1968; Latané & Rodin, 1969; Ross 
& Braband, 1973; Smith, Vanderbilt, & Cal- 
len, 1973). Another of Bickman’s studies 
(Bickman, 1972), again with female subjects, 
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may provide the only evidence for a pure 
social influence effect. Bystanders who heard 
a scream and crash responded significantly 
more quickly when they thought another by- 
stander from whom they were separated com- 
municated that she thought there was a defi- 
nite emergency. The study was structured so 
that the other bystander could not be aware 
of the subject’s responses. To the extent that 
this was correctly perceived, evaluation ap- 
prehension should have been minimized. 

Little work has been done on isolating 
evaluation apprehension effects. Beaman, 
Fraser, Diener, Kelem, and Westford (Note 
1) manipulated an appropriate variable— 
whether the focal bystander could be seen 
by another bystander. Unfortunately, how- 
ever, this manipulation was only partially 
crossed with whether the focal bystander 
could observe another bystander’s unrespon- 
siveness. The absence of a condition in which 
the focal bystander neither observed nor was 
observed by the other prevents any clear in- 
terpretation of their findings. 

Although audiences may inhibit helping 

because self-consciousness leads people to de- 
lay action until they are sure of themselves, 
as Latané and Darley (1976) suggested, the 
following reasoning leads us to expect that 
evaluation apprehension will promote helping 
in many circumstances: Data from Heberlein 
(Note 2) indicate that the vast majority of 
people feel some obligation to help others who 
are in trouble. This is supported by the very 
high rate of helping by lone bystanders in 
emergency studies. People tend to believe that 
other witnesses to emergencies share their 
sense of obligation, as corroborated by Bick- 
man’s (1971) finding that 14 of 15 subjects 
expected another able bystander to help. As- 
suming that bystanders project their own ex- 
pectations to others, they are likely to view 
intervention as the response socially expected 
of them, so that evaluation apprehension 
should promote helping. 

Three hypotheses about bystander inter- 
vention may be derived from the preceding 
discussion, each reflecting a different inter- 
vening process. First, mutually unaware by- 
standers to an emergency help less than lone 
bystanders (diffusion of responsibility). Sec- 
ond, bystanders help less when they are aware 
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that another bystander is unresponsive than 
when they are unaware of the oth: |- 
er’s reactions (social influence—negative), 
Third, bystanders help more when other by- 
standers present are aware of their reactions 
than when their reactions are unknown to 
other bystanders (evaluation apprehension: 
positive). 
The third hypothesis may be complicated 
by mutual awareness conditions in which both 
social influence and evaluation apprehension 4 
are likely because the others’ failure to re 
spond might suggest negative expectations, 
The one interpretable comparison in Beaman 
et al. (Note 1) indicates significantly less help 
with exposure to the combination o a- 
tion apprehension and negative social influ: 
ence than with exposure to the latter alone, 
Whether evaluation apprehension ¢ 
helping under mutual awareness, or 
it interacts with negative social influence 
further depress helping, may depend up 
which process takes hold first in a particula 
emergency. Because conditions conducive to 
all three processes are manipulate: 
study, we can examine possible differences in 
the timing of their impacts on behavior. 
In a short report that recently came to out 
attention, Latané and Darley (1976, pp. 18 
20) describe an experiment to which our ow 
was conceptually similar in that we bo 
tried to isolate the three bystander processé 
by manipulating channels of communication 
Their analysis did not treat the three rrot 
esses independently, however, th f 
significant inhibiting effects only f 
tions of two or more processes. Our reanalys 
indicates that all three independent effect 
were small and insignificant. Possible reason 
for the small size of effects in th 
may include the limited variability of respons 
(common for serious, unambiguous emergem 
cies; cf. Clark & Word, 1972, 1974), the us 
of a single additional bystander (also insuff 
cient to produce a reliable impact in Darlé 
& Latané, 1968), and the use of mostly mal 
subjects. 
Two other points require mention regarding 
the current study. Schwartz and Clause 
(1970) found that the number and ability 0 
other bystanders influenced helping, presuma 
bly through diffusion of responsibility, onl 


BYSTANDER REACTIONS TO A VIOLENT THEFT 


-cts who scored below the median 


among su 
on a sca~ that measures the tendency to 
deny res, nsibility.t This scale was included 
here in attempt to replicate this effect. 
Finally ccause this study was performed in 
Israel, | can provide some data on the cross- 
cultur lidity of inferences regarding by- 
stand esses, 
METHOD 
Design 

Mal rgraduates at the Hebrew University, 
Je were recruited to participate in a dis- 
cu “developing a high school sex education 
cui ubjects were paid the equivalent of 


s. The subject’s awareness of other 
hy ¿«ctions to attack (aware-unaware) 
ër tanders’ awareness of the subject’s re- 
m e-unaware) were crossed in a 2X2 
lesigr sbject’s awareness varied the likelihood of 
soci’ iafiuence, and others’ awareness varied the 
"\elihood of evaluation apprehension. An alone con- 
the subject was the only bystander 
as uscd to isolate the effects of diffusion of re- 
vsibility. After stratification at the median on 
r onsibility denial scores, 72 subjects were ran- 
dc aly assigned to one of the five experimental con- 
ditions. 


n in which 


rocedure 


Th veriment was conducted at night in an iso- 
impus building whose doors were locked to 
: ance plausibility. The subject was led down a 
corridor past rooms labeled to suggest there would 
be six participants (two in the alone condition), 
was told that the person in Room 1 had already 
rrived, and was placed in Room 2. He was given 
a responsibility denial questionnaire and a set of 
hv ‘h questions purportedly important for assessing 
his views on sex education. The latter was intended 
to screen out people with serious physical or emo- 
tional problems; however, none were identified. 
During questionnaire completion the research as- 
sistant interrupted the subject to show him a tape 
recorder found in the hall and ask if it belonged 
to him, When he picked up the completed question- 
naires, the assistant mentioned that the recorder be- 
longed to the person in Room 1 (the future victim). 
Thus subjects learned that the object of the later 
theft belonged to the victim. 
Subjects next donned earphones to receive a pre- 
corded explanation of the general procedures for 
the discussion and the rationale for physical separa- 
tion of the discussants (to reduce inhibitions and 
stimulate discussion of a potentially embarrassing 
topic), They then filled out questionnaires on their 
own sex education and pressed a signal button that 
rang a bell when they were done. This signaling 
gave practice in one potential mode of reporting. 
Before proceeding the assistant checked the intercom 
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and signal system and administered the manipulation 
of others’ awareness in the appropriate conditions, 
as explained below. 

The prerecorded instructions resumed with a de- 
tailed explanation of procedures that subjects also 
followed on a written outline. First, each partici- 
pant in turn was to discuss a topic chosen from a 
list provided for 2 minutes, starting with the person 
in Room 1 (the victim), then the subject, and so 
on. Following a break the discussion would continue 
in the same order, with each person talking for 24 
minutes on a new topic. In this second phase each 
talk would be followed by 24 minutes for everyone 
either to join in a free exchange of remarks about 
the immediately preceding speaker's views (subject 
aware of others) or to jot down remarks (subject 
unaware). A third phase of open discussion among 
participants would follow. Subjects were assured 
that no one else would be listening to their dis- 
cussion. To this end the intercom system would 
switch automatically, and the researcher would re- 
ceive the participants’ recommendations only in writ- 
ten form at the end of the discussion, 

Mutual awareness of reactions was manipulated 
by the descriptions of how the intercom system 
worked, which were given in the recorded instruc- 
tions and in the remarks made by the assistant. 
Awareness referred to the ability to hear any physi- 
cal movement (e.g., scraping of a chair), calling out, 
or ringing of a bell from rooms in which the micro- 
phone was on. 

Subject’s awareness. Subject’s awareness of other 
bystanders’ reactions determined whether or not so- 
cial influence might be operative. In the aware 
condition, all mikes were to be on throughout the 
three phases. Everyone was cautioned to remain 
quiet during other persons’ turns to avoid noise in- 
terference. Thus any sounds of helping actions by 
others would presumably be heard by the subject. 
Since the fictitious bystanders made no sounds 
throughout the emergency, subjects were exposed to 
social influence opposed to intervention. 

In the not aware condition, only the mike of the 
person whose turn it was to talk would be on during 
the first two phases. During the 24-minute period 
following each turn in the second phase, all mikes 
would shut off. The victim’s mike would remain on, 
however, since, as the assistant explained, Room 1 
had special equipment also being used for other re- 
search, Thus, during and after the attack, only the 
victim could be heard, and the subject could not 
know how others were reacting. He was therefore 
not exposed to social influence. 

Others’ awareness, The subject’s perception of 
whether other bystanders were aware of his reac- 
tions determined whether he was likely to experi- 
ence evaluation apprehension. 

In the aware condition, when all mikes were on, 


1 Previously called ascription of responsibility, this 
scale is now labeled responsibility denial (Schwartz, 


in press). 
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other bystanders could, of course, be aware of the 
subject’s reaction, When only the speaker’s mike 
was scheduled to be on, others were made aware as 
follows: During the intercom check the assistant dis- 
covered that the subject’s mike did not shut off 
automatically. He asked the subject to write a note 
describing this problem for a technician—thereby re- 
inforcing perception of the others’ awareness condi- 
tion, The assistant then concluded that the discus- 
sion could go on as long as the subject was not 
noisy. The subject therefore believed that he him- 
self could be heard by others throughout the emer- 
gency. 

In the not aware condition, when only the speak- 
er’s mike was on, other bystanders could, of course, 
not be aware of the subject’s reactions. In the “sub- 
ject aware” condition, however, when all mikes were 
scheduled to be on, others’ awareness was blocked 
as follows; During the intercom check the assistant 
discovered that no one could hear the subject ex- 
cept Participant 1, unless all the other mikes were 
off. The assistant therefore arranged the automatic 
system to shut off the other mikes during the sub- 
ject’s turn, enabling the subject to participate in 
all but the free exchanges. The assistant asked the 
subject to leave a note describing this problem for 
the technician. Thus, during and after the attack, 
the subject believed that he himself could not be 
heard by other bystanders, since all the other mikes 
were on. 

Alone. Only the subject and the future victim 
participated in this condition, and both mikes were 
on all the time. Diffusion of responsibility to other 
bystanders was therefore not possible, in contrast 
to the preceding conditions, 

After answering questions about procedures, the 
assistant announced that he was starting the auto- 
matic switching device and that he was going down 
the hall to the research director’s office where he 
could be contacted, if necessary, by pressing the 
signa] button, The assistant then started a tape with 
prerecorded voices for the various participants and 
a silent period during the subject’s turn. The dis- 
cussion proceeded through the first phase, the break, 
and 43 sec into the future victim’s talk beginning 
the second phase. At this point, the victim inter- 

rupted his talk to answer a knock at his door. Sub- 
jects overheard a conversation between the victim 
and a male stranger that quickly escalated into a 
fight. The intruder apparently grabbed the victim’s 
tape recorder, and the victim tried to stop him. 
Arguing, blows, and the crash of a person falling 
were heard, followed by the slamming of a door. 
The intruder had apparently escaped with the vic- 
tim’s recorder, leaving him moaning on the floor. 

The moaning, interspersed with barely audible re- 
marks, continued intermittently for another 24 min. 

The assistant recorded the subject’s response la- 
tency for ringing the bell, calling out, and/or leav- 
ing the room, waiting 1 additional minute before 
appearing, He then asked the subject what had hap- 
pened and what he was doing. Subjects who failed 
to respond within 5 minutes were contacted in 
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their own room, After ostensibly checkin that the 
victim was okay, the assistant grad: debriefed 
the subject. Finally, subjects complete: wo written 
questionnaires, one to check manipulat and per- 
ceptions and the other—not to be seer the as. 
sistant—anonymously soliciting the sub ` evalua- 
tion of the ethics of the study on Alf of a 
psychology department committee. 
RESULTS 

Sixty-five percent of the bysta: ‘ers re- | 
sponded to the emergency: 22% att-npted a i 
direct helping response by trying enter 
the victim’s room; 14% emerged fr~ their. 
room and sought to alert the rese chers; | 
29% rang their bell to alert the res: -rchers, 
None tried to approach another by anders? 
room, Among those who responded, ype of 
response was unrelated to experiment: ` treat- f 
ment. The quickest response was onl; 12 sec. 
after the intruder’s entry into the rov a, and 
the slowest was 195 sec. Those who i: led to, 
respond were assigned an arbitrary 55-sec | 


latency, equivalent to the time at wh h theg 
experimenter entered their room anc à full’ 
minute later than the slowest respons: 


Perceptions of the Situation and the 
Manipulations 


Perceptions of the communications «stem 
and of the experimental arrangement: were 
checked both in the verbal debriefing ani” 
in the written questionnaire, Four persons j 
were excluded from the sample prior to anal- | 
ysis because they thought that the tape re 
corder belonged to the intruder. This misun- 
derstanding implied that they, unlike other 
subjects, witnessed a provoked crime, All of 
the manipulations of awareness were corectly | 
perceived by 70% of the participants, and} 
an additional 19% made only partial, minor 
errors (e.g., could hear others’ verbal re. 
sponses but not their ringing of the bell). 
Misperceptions of awareness were not related 
to experimental treatment. One person incor- 
rectly thought that other bystanders were 
present, All misperceivers were retained in the 
analysis as a conservative measure. y 

As might be expected when persons are eX 
posed to a threatening emergency, many Tê- 
ported some suspicion. These reports W- 
questionably reflect both real doubts and 
defensive justifications of inaction or inde x 
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"cure 1, Cumulative proportion of subjects responding over time: Diffusion of responsibility, 


cision. Only five persons who immediately 
and spontaneously told the experimenter that 
they disbelieved the attack and who betrayed 
no hint of belief were excluded from the anal- 
‘is, one from each experimental condition, 
~cli-reported suspicion, measured on a 100- 
point scale, was related to helping for the 
total sample, but controlling for suspicion did 
not influence the effect of the manipulated 
variables, because suspicion was unrelated to 
experimental treatment. 


Effects of Independent Variables 


Dicholomized responsibility denial did not 
interact with any of the manipulated varia- 
bles in their effects on helping, nor did re- 
sponsibility denial scores correlate signifi- 
cantly with any index of helping. Responsi- 
bility denial is therefore not considered 
further in the analyses reported below. 

Diffusion of responsibility. To test for this 
theorized process, responses of lone bystanders 


were compared with those of bystanders who 
were in the presence of mutually unaware 
others. Diffusion of responsibility showed a 
substantial effect: Ninety-two percent of 
those who were alone intervened versus 45% 
of those in the mutually unaware condition 
(xê = 4.27, p < 05). Figure 1 shows the cu- 
mulative proportion helping in each condition 
as a function of time following the attack, 
These curves indicate that diffusion of re- 
sponsibility had virtually no impact on be- 
havior during the first 55 sec and only began 
to influence responses thereafter. 

To assess the timing of effects reflected in 
the curves, the following analysis was per- 
formed: The lone bystander sample and the 
mutually unaware sample were each dichoto- 
mized at their respective median response 
times. A Mann-Whitney test, used to compare 
the distribution of responses in the two pre- 
median subsamples, revealed no difference, 
indicating no effects of diffusion of respon- 
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Ficure 2. Cumulative proportion of subjects responding over time: Evaluation apprehension 


sibility among early responders, A comparison 
of the two postmedian distributions indicated 
a significant effect due to diffusion of respon- 
sibility (U = 2.5, p < .02); ? lone bystanders 
responded sooner, 

Responses in the four conditions created by 
crossing the subject’s awareness of other by- 
standers’ reactions with others’ awareness of 
the subject’s reactions were analyzed to test 
for the remaining theorized processes. Since 
evaluation apprehension and social influence 
did not interact in their impact on helping 
(x° = .28, ns, for interaction; cf. Goodman, 
1964), the two are treated independently. 

Evaluation apprehension, Bystanders were 
significantly more likely to respond to the 
emergency when others could be aware of 
their reactions (74%) than when others were 
unaware (39%: x° = 4.87, p < .05), regard- 
less of whether they themselves knew of the 
others’ inaction. Responses plotted over time 
(Figure 2) reveal a pattern similar to that 
found for diffusion of responsibility: Differ- 
entiation between the two distributions did 
not begin to appear until sometime after the 
assault (approximately 90 sec in this case). 
Statistical tests reinforced the inference sug- 
gested by the curves that evaluation appre- 


hension influenced late rather than ea v re- 
sponding. Dichotomizing each sample >t its 
median response time revealed no sigu icant 
difference between the two conditions i > dis- 
tributions of those who responded befo: > the 
median time, The distributions for the post- 


median groups were, however, marginally dif- 
ferent (U = 42, p < .10). 

Social influence, Although bystanders who 
were exposed to negative social influence re- 
acted less frequently (48%) than those who 
were not aware of the others’ reaction (68%), 
this difference was not reliable (y? = 1.31, 
ns). The cumulative curves of responses for 
this process (Figure 3) reveal a reversal of the 
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pattern for both diffusion of responsibility and © 


evaluation apprehension, Before 55 sec had 
elapsed, not one of those subjects exposed to 
negative social influence had helped, whereas 
32% of the subjects who could not know of 
others’ responses had already taken action. 
For the remainder of the time, the rates of re- 
sponse were approximately the same, Statisti- 
cal tests on the dichotomized distributions 
showed a difference due to social influence for 
$ 

2 All significance levels in this article are {wo-tailed 

unless otherwise indicated. 
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those who responded before the median time 
(U =", p < .02) but no difference for the 
postr in groups. A Mann-Whitney test on 
the sample indicated that those subjects 
expo». to negative social influence responded 
mo vly (z = 2.02, p < .02, one-tailed), 
Th cial influence appears to have slowed 
int ‘jon significantly even though it did 
no vely block it. 
rceptions 


the debriefing participants com- 
ritten questionnaire in which they 
to describe their thoughts and 
fe during the emergency, indicate 
r they felt that various characteristics 
ituation encouraged or inhibited their 
w, and check a list of thoughts and 
Four ideas were mentioned by at 
% of the participants in response to 
an open-ended question about their thoughts 
igs: doubts about authenticity 
), sense of obligation to help (37%), 

iy don’t others react?” (37%), - and 

i oughts about actions to take (27%). Sense 
ot obligation was mentioned significantly 
iore (p < 01) and doubts were mentioned 
significantly less (p< .05) by those who 
helped than by those who did not. It must be 
noted that these responses, as well as those 


and feel 
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discussed below, may partly reflect retrospec- 
tive justifications of behavior. 

Diffusion of responsibility. Three quarters 
of those subjects in the conditions used to 
uncover this process thought that the presence 
or absence of other bystanders had no in- 
fluence on their behavior. Those subjects in 
the alone condition were somewhat more 
likely to believe they had been influenced 
than those in the mutually unaware condition 
(38% vs. 9%). Of those who acknowledged 
influence, all but one subject believed that 
adding bystanders would inhibit helping. 
Spontaneous thoughts about their own sense 
of obligation were mentioned by 62% of 
those subjects who were alone versus 27% of 
those subjects who were mutually unaware. 
In response to the list of thoughts, 36% of 
the subjects in the mutually unaware condi- 
tion but none in the alone condition checked 
thoughts about reduced responsibility. 

Diffusion of responsibility was possible in 
all four conditions with multiple bystanders. 
Comparisons of subjective perceptions in the 
alone condition with the combined multiple 
bystander conditions revealed the same pat- 
tern of differences found with the mutually 
unaware condition; and the difference for 
thoughts about reduced responsibility was 
significant (p < .01). Thus, although the 
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TABLE 1 


ANALYSES OF VARIANCE IN HELPING USING RESPONSE 
LATENCY AND SPEED SCORES 


F values 
Response Speed 
Source df latency score 
Diffusion of responsibility 1,22 5.41* 1.60 
Evaluation apprehension 1,46 4.28" 1.92 
Social influence 1,46 340 4.66* 


*p <.05. 


majority of participants denied any effect for 
adding bystanders, those subjects who recog- 
nized an effect thought that it promoted dif- 
fusion of responsibility. Furthermore, self-re- 
ported thoughts and feelings reflected the ex- 
pected variations in sense of responsibility. 
Evaluation apprehension. Fully 88% of the 
participants denied that others’ awareness of 
their own reactions had any effect. The per- 
ceived effect of the mere presence of others 
was moderated, however, by whether or not 
these others could possibly be aware. Fifty- 
nine percent in the aware versus 22% in the 
unaware conditions (p < .01) said they were 
influenced by the presence of others, but this 
influence was perceived as inhibiting helping! 
Although the actual enhancement of their 
helping by others’ awareness apparently went 
undetected by participants, self-reports indi- 
cate that others’ awareness increased their 
thoughts about their own sense of obligation 
(41% vs, 17%) and about others expecting 
them to help (22% vs. 4%). Interestingly, 
those subjects whose reactions were unknown 
to others were significantly more likely to 
report thinking that the attack was ruining 
the discussion (39% vs. 11%, p < .05), sug- 
gesting that they felt more detached toward 
the emergency. 

Social influence. There were some signs 
that the impact of negative social influence on 
behavior was recognized by the participants. 
Of those subjects able to know that others 
were inactive, 54% said they were inhibited 
by this knowledge. Among those who did not 
know what others were doing, 32% stated 
that their lack of knowledge inhibited them 
and 12% stated that it promoted helping. 
Moreover, knowing versus not knowing about 
others’ inaction was significantly related to 
thinking spontaneously that others’ reactions 
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made helping inappropriate (64% vs 
p < .05). Spontaneous thoughts 


obligation tended to be more frequct among 
those who did not know what oi:ers were 
doing (44% vs. 16%, p < .10). 
Discussion 

Measurement of Helping 

Previous bystander studies have -ported 
helping responses in various ways, mt com- 
monly using proportion responding, -sponse 
latency in seconds, and speed sci con- 
structed by reciprocal transformatio:: of re- 
sponse latencies. Justifications for c\ ice of 
dependent variable have referred to ceting 
statistical assumptions and to minimi: 1g the 


impact of the arbitrary time assigned > non- 
responders, This choice also determine , how- 
ever, how the distribution of respons: : over 
time will influence the outcomes of para- 
metric analyses. The use of speed sci res is 
likely to yield significant effects if a liffer- 
entiation of responses occurs early, ¿ven if 
this differentiation remains constant or de- 
creases thereafter. This is because speed 
scores give more weight to early as o} d to 
late variation in responses, and the: = eem- 
phasize the difference between late re onse 
and nonresponse. Analyses of respon:> la- 


tencies, on the other hand, are more lik: `y to 
yield significant results when a different’ ition 
among those who respond to the emer; ency 
occurs gradually or when the difference is 
primarily due to the relative proportions re- 
sponding, since those who fail to respond are 
typically given an arbitrary, long response 
latency. 

The use of parametric tests with latency 
and speed-score data from the current study 
illustrates these patterns (Table 1), Both dif- 
fusion of responsibility and evaluation ap- 
prehension, whose effects emerged relatively 
late and led to differences in proportion help- 
ing, yielded significant effects only with la- 
tency as the index of helping. 

By contrast, the effect of social influence, 
which emerged early but did not lead to dif- 
ferences in proportion helping, was significant 
only for speed scores. Given that the metric 
chosen influences conclusions from parametric 
tests, and in the absence of a metric with cleat 


. 
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psychological meaning, nonparametric tests 
may be most reasonable both practically and 
intuitively at this time. Although proportions 
responding are clearly meaningful, finding dif- 
ferences in effects over time requires tests 
that consider order of response, such as the 
Mann-Whitney. It is also desirable to pro- 
vide response curves that can clarify the na- 
ture of these effects. 


Inferences and Interpretations 


This study has demonstrated that all three 
processes by which bystanders to emergencies 
presumably influence each others’ responses 
do indeed have independent, unconfounded 
effects on helping. The findings for diffusion 
of responsibility and social influence also ex- 
tend the generalizability of effects beyond 
past research because these processes were ob- 
served (a) among male bystanders; (b) in 
response to a violence-precipitated, personally 
threatening emergency; and (c) in a cross- 
cultural context. 

The presence of diffusion of responsibility 
for males in this study contrasts with its ab- 


sence in Schwartz and Clausen (1970). These 
authors noted that some of their additional 
bystanders were female, and they suggested 


“that the introduction of female bystanders 
may increase the incentive of males to inter- 
vene, counterbalancing the effects of diffusion 
of responsibility” (p. 308). Our findings con- 
firm their expectation that “the addition of 
male bystanders alone [would reduce help- 
ing] by males” (p. 308). 

Even though negative social influence sig- 
nificantly inhibited responses during the pe- 
riod shiorily following the emergency, it did 
not reliably reduce the final proportion help- 
ing. The absence of a social influence effect 
on the proportion of subjects who ultimately 
helped may reflect the fact that some of the 
subjects in the condition in which they could 
not hear others’ reactions were nonetheless 
exposed to a measure of negative social in- 
fluence, According to self-reports, 44% of the 
subjects in this condition conjectured that no 
one else had helped. Thus these subjects may 
have been influenced negatively by the pre- 
sumed behavior of others, Situations in which 
others’ reactions are unknown are necessarily 
ambiguous with respect to social influence. 
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Among those who were able to hear others’ 
reactions, 80% correctly reported that no 
one else helped. Yet a substantial proportion 
of those subjects in the negative social in- 
fluence condition ultimately helped (48%). 
The pressures contrary to intervention were 
apparently insufficient to deter these indi- 
viduals, but their decision was probably more 
complicated and therefore slower because they 
had to arrive at an interpretation of events 
at variance with the information available to 
them before responding. We should note that 
this study provides no evidence on the effects 
of positive social influence. 

The current study provides the only evi- 
dence for an independent impact of evalua- 
tion apprehension on helping. The differences 
in self-reported thoughts and feelings noted 
above draw attention to three possible mecha- 
nisms underlying this effect: (a) The absence 
of an audience produced a sense of detach- 
ment from events that lowered the emotional 
impact of the emergency and its power to 
activate norms. (b) The presence of an au- 
dience made the personal sense of obligation 
more salient, perhaps through heightening ob- 
jective self-awareness (Duval & Wicklund, 
1972). (c) The presence of an audience added 
the force of perceived social expectations to 
that of self-expectations. The latter might 
occur even when others are believed not to be 
responding if they are perceived by the by- 
stander as inactive because they are waiting 
upon his/her own response, 

As indicated by the absence of an interac- 
tion between subject’s awareness and others’ 
awareness, evaluation apprehension enhanced 
helping regardless of social influence.This find- 
ing contrasts with the results of two earlier 
studies (Latané & Darley, 1976; Beaman et 
al., Note 1) in which the presence of an audi- 
ence in combination with negative social in- 
fluence reduced helping. Considering the 
mechanisms proposed here, perhaps subjects 
in the no-audience conditions in the earlier 
studies felt less detached because they were 
one of only two bystanders. Moreover, in the 
negative social influence condition in Latané 
and Darley (1976), the other bystander ap- 
parently returned to his work after noting 
the emergency and could be seen to show no 
interest in the subject. The confederate in 
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Beaman et al. (Note 1) “retained a nonreac- 
tive demeanor.” This apparent disinterest 
may have blocked or weakened the second 
and third mechanisms. Disinterest on the part 
of an inactive other might undermine per- 
ceiving the other as waiting for and holding 
positive expectations for the bystander to re- 
act. Further research is required to identify 
the conditions under which various mecha- 
nisms produce evaluation apprehension ef- 
fects. 

Although evaluation apprehension was un- 
doubtedly low in the no-audience conditions 
of past studies, it was probably not entirely 
absent. In each case bystanders may have 
been concerned about the expectations of the 
victim and/or the experimenter who knew of 
their presence. Moreover, bystanders may 
have recognized that they could expose them- 
selves to the evaluations of others by inter- 
vening. These sources of evaluation apprehen- 
sion may be minimized by experimental de- 
sign, but another source—anticipation of 
evaluations by those to whom bystanders may 
have to report their actions in the future—is 
present even in alone conditions. 

Findings by Schwartz and Clausen (1970) 

that diffusion of responsibility effects were 
moderated by responsibility denial scores and 
that increasing the number of bystanders in- 
fluenced the type of response (direct vs. in- 
direct) were not replicated in the present 
study. Perhaps this is because both of these 
earlier effects were found with females but 
not males, Furthermore, the absence here of 
moderation by responsibility denial may be due 
to the use of a translated scale in a different 
cultural context; and the lack of effects on 
response type may be traceable to the sub- 
stantial differences in the emergencies and the 
types of help called for, The high rate of in- 
direct help may reflect the potentially high 
cost of intervention in this study (cf. Piliavin 
et al., 1969). 

The cumulative response curves as well as 
the statistical analyses for the various condi- 
tions indicate that the impact of the three 
processes on intervention emerged in a par- 
ticular sequential order, First came social in- 
fluence; diffusion of responsibility and evalu- 
ation apprehension followed approximately 
together. Because earlier studies did not vary 
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the conditions conducive to these »-rocesses 
orthogonally, additional data on co: parative 
timing are unavailable. Further arch is 
needed to validate or modify the p ‘ern we 
have uncovered. 

We have observed the timing of © -rt help- 
ing responses, but we do not know vnen the 
decisions leading to these respor were 
made. Assuming that the timing of — ‘erven- 
tion reflects the timing of decisions, wever, 
our findings support the generally epted 
view (cf. Latané & Darley, 1970; S :wartz, ` 
1970) that the first issue confrontin a by- 
stander is to decide upon the reality © need 
and appropriateness of help, a decisio: partly 
based upon the situational definition p vided 
by others (social influence). Follow: x this 


comes a decision regarding one’s ow- per- 
sonal obligation, which may be redu ed by 
the presence of others (diffusion of re ponsi- 
bility) and may be influenced by the > per- 
ceived expectations (evaluation apy ehen- 
sion). The latter, we note, may enhar e the 
sense of personal obligation. 
Ethics 

The ethical problems with deceit wer com- 
plicated in this experiment by the ri :s of 
emotional shock from witnessing a viol: ¢ at- 


tack and by self-deprecation for failu e to 
help. In order to detect undesirable in 
that might require halting the researc! , we 
supplemented the debriefing with an ar ony- 
mous “ethics questionnaire” administered 
after payment was made. The following re- 
sponses reflect the reactions of all 72 par- 
ticipants. 

Current feelings were measured on a set of 
90-mm self-descriptive scales. Mean responses 
for all seven negative feelings (upset, angry 
at experimenter and at self, depressed, em- 
barrassed, nervous, guilty) fell below the seg- 
ment marked “slightly” on the scale, both for 
helpers and nonhelpers. Positive feelings 
(calm, proud, pleased with myself) ranged 
substantially higher, There were no signifi- 
cant differences due either to response or t0 
experimental condition. Traces of any strong 
emotional disturbance were absent in these 
data. 

Asked point blank whether the experiment 
should be permitted to continue, one person 
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said no, ` ¿t unfortunately he gave no reason, 
None sa: he regretted participating, although 
one ex} resentment at being deceived. 
The o egative response to an open-ended 
inquiry hether “there was anything un- 
ethics’ is experiment” criticized depriv- 
ing | the interesting discussion for 
whic vad come; that is, the ethics of 
usin ful cover story were questioned, 
but of exposing people unwittingly 
to € ng emergency were not men- 
tioned, Fir iy, none believed this experience 
would the likelihood of their helping a 
real v the future, whereas seven be- 
lieved id increase their helping. 

Conc about the ethics of a study can- 


not be a 'ayed completely by participants’ re- 
sponses io anonymous ethics questionnaires. 
The © zoing feedback from the debriefings 
and om these questionnaires emboldened us, 
however, to continue the research to its con- 
clusi 
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Women’s somatic preferences in judging the male and female body were ob- 
tained and related to their own personality and background characteristics. 
One hundred and fifteen female undergraduates made paired comparison pref- 
erence ratings of a set of 15 male and a set of 15 female profile silhouettes 
which varied in chest/breast, buttocks, and leg size. The group of women as a 
whole selected a male silhouette of moderate size (thickness) with small but- 
tocks as the most attractive male physique. A moderate-sized male silhouette 
with a somewhat larger chest was also favored. This finding gave only partial 
support to the cultural belief that women prefer larger chests in men since 
the large-chested or “Atlas”-type physique received only slight endorsement. 
A moderate-sized female silhouette with small buttocks was chosen as the most 
preferred female figure. In addition, an evenly proportioned and moderately 
built female silhouette was also highly valued. Larger buttocked male and fe- 
male silhouettes were clearly disliked but less definite preference patterns were 


obtained with regard to leg size. 


The role of physical appearance in person 
perception has received considerable emphasis 
in recent research. While many studies have 
focused attention on the importance of facial 
qualities in determining physical appeal, few 
have investigated the importance of body 
characteristics. Nevertheless, the amount of 
money which has been spent on body-building 
studios, diet plans, and reducing salons sug- 
gests that attractive somatic appearance is 
important to people of both sexes. At present, 
however, little is known about which body 
characteristics serve as desirable somatic stim- 
uli for adult males and females. 

Within the psychological literature, discus- 
sions of somatic preferences have tended to 
appear either as part of case history reports 
or to help establish credibility for some theo- 
retical position. Tridon (1922/1949), for ex- 
ample, reported that breast-fed males, 
thought to have experienced oral gratification, 


We wish to express our gratitude to Debra 
Cowart, Amy Isenhower, and Anthony Cherry for 
their valuable assistance throughout this study. 

Requests for reprints should be sent to Sally Bell 
Beck, Department of Psychology, Butler University, 
Indianapolis, Indiana 46208. 
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showed a later preference for womo: with 
well-developed breasts. On the othe hand, 
Gorer (1948) has maintained that sc: eduled 
feeding experiences and oral frustratic: led to 
later preferences for large-breasted worn. As 
Scodel pointed out in 1957, almost no atten- 
tion had been given to the systematic study | 
of female bodies which are preferred by adult | 
males or to the psychological variables which 
are associated with males’ preferences. Sco- 
del’s (1957) observation could also have been 
made regarding the somatic preferences of | 
female observers. 

In 1968, Wiggins, Wiggins, and Conger, in | 
a controlled laboratory setting, investigated | 
heterosexual somatic preferences of adult? 
males. They reported that preferences for 
particular female body types were associated 
with certain personality and background 
characteristics of the male perceivers. In ad- 
dition, there was partial support for the ten- 
dency by society, and the men themselves, t0 
classify males as Breast Men, Buttocks Men, 
or Leg Men. 

Since there has been evidence that males 
differ in their attraction to particular female 
body parts, it is likely that differences in fe- 
male preferences for certain male and female 
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body parts also exist. While the stereotyped 
strong muscle man (“Atlas physique”) com- 
posed of >road chest and small buttocks has 
often by presented as the ideal male body 
by our ;Jiure, women’s preference for that 
or oth Je physiques has not been clearly 
establ . Jn an informal interview setting, 
Smit! asked 100 females to name the 
male characteristics which they found 
to | xually arousing. While 39% of 
th mer selected the male buttocks as 
erotically active, and 15% favored male 
slimnes ‘ly 1% reported sexual arousal 
when th ewed muscular chests and shoul- 
ders, an ne reported sexual arousal when 


viewing muscular arms. These preferences 
were widely discrepant with the cultural be- 
lief and wth the results of 100 interviews by 
Smith (1975) in which males were asked to 
select male body parts which they thought 
were most sexually arousing to women. In a 
recent study by Lavrakas (Note 1), the cul- 
stereotyped muscle man (“Atlas 
p jue”) also held little attraction for 
woven asked to judge front-view male silhou- 
eties. However, preference for a moderate- 
sized “tapering-V” male physique was re- 
ported by women who presented themselves 
as more traditionally feminine. 

© greater emphasis placed on shoulder 
aud chest width by females of the Lavrakas 
(Note 1) study than that reported for females 
intervi Smith (1975) may have been 
influenced by Layrakas’ use of front-view 
silhouette visual stimuli. While somatic pref- 
erences related to body width can be readily 
determined from viewing a front-view draw- 
ing of the body, preferences which may be 
based on protrusions to the ventral side (e.g., 


wl } 
ed 


chest) or to the dorsal side (e.g., buttocks) 


of the body are difficult. This study, like that 
of Wiggins et al. (1968) which explored the 

malic preferences of males, employed profile 
silhouette figures as stimuli. Profile figures 
were chosen not only to make direct compari- 
sons possible between the somatic preferences 
of females found in this study with those 
reported for males (Wiggins et al., 1968), but 
siso to maximize the subjects’ ability to 
clearly discern variations in body part size 
which involve protrusions to the front and to 
the rear of the body. 
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The goals of this study were threefold: 
first, to discover which sizes of the body 
characteristics of chest/breast, buttocks, and 
legs are desirable somatic stimuli to females; 


second, to investigate the personali 
background characteristics ere Ras 
which are related to their preferences: and 
third, to compare the somatic preferences of 
women with those which have been reported 
for men (Wiggins et al., 1968). 

In order to assess the relative attractive- 
ness of various male and female physiques to 
women, all subjects were asked to make 
paired-comparison preference ratings of 15 
profile male silhouettes arranged into all pos- 
sible pairs, and 15 profile female silhouettes“ 
arranged in all possible pairs. In addition, 
personality and background information was 
gathered for all females who gave somatic 
preference ratings. 


METHOD 
Subjects 


The subjects were 115 female undergraduate vol- 


unteer students from an introdu 
ctory psycholo; 
course at Butler University. TE e 


Stimuli 


The stimuli consisted of 15 male profile silhouettes 
and 15 female profile silhouettes. The 15 female sil- 
houette figures have been fully described in Wiggins 
et al, (1968). The construction of all 15 male pro- 
file silhouettes with regard to figure height, profile 
width, and body dimension values was designed to 
match those for females.1 In constructing the male 
silhouettes, each of three body dimensions (chest, 
buttocks, and leg size) varied independently of other 
body parts and with reference to a “standard” male 
physique (DeCoursey, 1961, p. 207). These three 
body dimensions (chest, buttocks, and leg size) 
could assume one of five area size values: large 
(+2), moderately large (+1), standard (0), mod- 
erately small (—1), or small (—2). The four values 
other than zero represented equal stepwise incre- 
ments or decrements in area size from the standard 
figure. Twelve of the 15 figures varied only on one 
body dimension. The other three figures were a 
proportionately “large” overall figure, a proportion- 
ately “moderate” overall (standard) figure, and a 
proportionately “small” overall figure. In order to 
ensure that the male silhouette stimuli had been 


1 We wish to thank Nancy Wiggins, who provided 
illustrative samples of female silhouettes and infor- 
mation regarding construction procedures to the first 
author. 
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STRONGLY MODERATELY MILDLY MILDLY MODERATELY STRONGLY 


NO 
PREFERENCE 


FIGURE 1. Silhouettes of the large chest (+2) figure and the standard male figure. (Rating cale 
appears at bottom.) 


drawn in proportion to the body dimensions varied, 
all figures were checked by planimeter readings.2 

The 15 male silhouettes were arranged into all 
possible pairs of males, and the 15 female silhouettes 
were arranged into all possible pairs of females, 
yielding a set of 105 stimulus pairs for each sex. 
Within each set of pairs, each silhouette was located 
half of the time on the left and half of the time on 
the right. The pairs were then randomly arranged 
and photographed to be presented as 35-mm slides,? 
In Figure 1, one of the male stimulus pairs is pre- 
sented together with the rating scale; a female stim- 
ulus pair has been presented in Wiggins et al, (1968). 

In Figure 1 the standard male figure (DeCoursey, 
1961) appears on the right, and the comparison figure 
with a large * chest is on the left. 


Procedure 


All subjects participated in two experimental ses- 
In the first session they rated the silhouettes and 
ed a background questionnaire. In the second 

* responded to personality and attitude 
ribed later. In rating the silhouettes, sub- 
required to indicate for each slide the 
1 they preferred and the degree of their 
the 7-place rating scale shown in Figure 

f the two experimental sessions care- 
those used in the Wiggins et al. 


‘round, and Attitudinal 


imin istered the Edwards Personal 
EPPS) (Edwards, 1954) and 


the MMPI, which was scored for the origin. | clinical 
scales and the following content scales; /\cademi¢ 
Achievement, Competition, Emotional Tr. naturity, 
Femininity, Graduate School Potential, 1 c:dership, 
Authority Problems, Self Sufficiency, Soci: trover 
sion, Need for Affection, and Denial of Maic Occupa- 


tions (Dahlstrom & Welsh, 1968), In adition a 
biographical inventory was given which requested in- 
formation on topics such as college major, ` obbies, 
reading habits, dating, drinking, smoking, socioeco- 
nomic and family background, birth order, and infant 
feeding experiences. Subjects were also asked to indi- 
cate their attitudes toward various body pacts by 
making semantic differential (Osgood, Suci, & Tannen- 
baum, 1957) ratings using three bipolar adjective 
scales (good-bad, weak-strong, active-passive). Sub- 
jects were asked to rate six concepts: “cl of a 
man,” “buttocks of a man,” “legs of a man,” “breast 
of a woman,” “buttocks of a woman,” and “legs ofa 


2 The help of the engineering firm of Warren T. 
Hobson and Associates, Inc., of Indianapolis, Indiana, 
in obtaining the planimeter readings was greatly ap- 
preciated. 

To control for any viewing order effect, half of 
the subjects saw the entire set of 105 male slides 
first; the other half saw the female slide set first. 

The words large and small, as used throughout 
this study, refer to dorsal to ventral body thickness 
or silhouette profile width (e.g., “large” legs are 
“thick” legs). 


WOMEN'S PREFERENCES IN MALES AND FEMALES 


\ woman.” (sur interspersed buffer items were also 
¢ rated: “lips of a woman,” “eyes of a woman,” “lips of 
aman,” an! “eyes of a man.” 
ESULTS AND DISCUSSION 
Group rage Scale Values 
Thi erage scale values for each of 
the 1 le and each of the 15 female sil- 
houet he group of 115 female subjects 
are by averaging across the sub- 
jec ə scale values (Wiggins et al., 
i968). 4 up average scale values and as- 
sociated ard deviations for each stimu- 
+ lus ied in Table 1. 
Accor to Table 1 the women of this 
study are nost attracted to both male and 


female phy iques which have small buttocks 
(—2). Sil’ ouettes of both sexes with larger 
buttock: (+2, +1) received negative ratings. 
The ke of large buttocks by women is 
-consistent with the dislike of large female but- 
‘tocks reported by males (Wiggins et al., 
19¢ 
h regard to chest size, the women as a 
while respond favorably to a moderately 
lary er chest (+1) in the male silhouette. Only 
slight preference, however, is shown for the 
large chest (+2). Women’s attraction to 
moderately larger chests (+1) in males is in 
vent with mens’ preference for larger 
veasts in females (Wiggins et al., 1968), but 
is in sharp contrast to women’s dislike of 
breasts (+2, +1) in silhouettes of their 
wr ‘. Moderately small (—1) is the only 
fevale breast size, in addition to those in- 
cor. vated as part of the evenly proportioned 
figuie (standard), to receive positive ratings 
from the women subjects. 

Larger (+2, +1) and smaller (—2, —1) 
“legged silhouettes of both sexes are acceptable 
to women just as the various female leg sizes 
_ had been to males (Wiggins et al., 1968). The 

only exception noted is a slight dislike by 
women for the small-legged (—2) male sil- 
houette. Women prefer moderate overall body 
size (standard) to either overall largeness or 
‘overall smallness in both males and females. 
This is consistent with the heterosexual pref- 
erences of males viewing female silhouettes in 
the Wiggins et al. (1968) study. 
Considering the large standard deviations 
which are associated with the scale values of 


je 
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TABLE 1 


GROUP AVERAGE THURSTONE SCALE VALUES FOR 15 
MALE AND 15 FEMALE SILHOUETTE FIGURES 


Male figures 


Stimuli M SD 
Large —.63 87 -130 59 
Standard 39 43 55 45 
Small —94 8i —.34 99 
+2 Chest/Breast 13374 — 26  '90 
+1 Chest/Breast 03 Al =—12 68 
—1 Chest/Breast —88 .52 30 44 
—2 Chest/Breast —.94 70° —34 65 
+2 Buttocks —j12 60 —1,03  .55 
+1 Buttocks —.01 58 —29 43 
—1 Buttocks 61 60 03 | AZ 
—2 Buttocks 90>, 00 1 84 
+2 Leg AT 53 31.54 
+1 Leg 21 46 43.55 
—1 Leg 07 39 44 46 
—2 Leg —=16 46 Soe 52 
Nolte, N = 115. 


Table 1, especially those for the overall large 
and overall small silhouettes, it is likely that 
individual differences in preference do exist 
but are being masked by the use of group 
averages. Furthermore, the nonadditivity of 
scale values (i.e., the scale value for the over- 
all small silhouette cannot be predicted by 
adding the scale values for the silhouettes with 
various small body parts) indicates that pref- 
erences for the size of particular body parts 
depends on the total body configurations. 


Factors Underlying Stimulus Preferences 


In order to determine whether the three 
body dimensions of chest/breast, buttocks, 
and legs are the actual dimensions used by 
female subjects in making preference judg- 
ments, two principal component analyses of 
stimulus scale value intercorrelations (one for 
the 15 male stimuli and one for the 15 female 
stimuli) were performed. A complete discus- 
sion of this procedure is found in Wiggins et 
al, (1968). Varimax rotation to simple struc- 
ture was applied to the first four components 
of each of the two analyses. Loadings of the 
15 male stimuli and of the 15 female stimuli 
on the two sets of related principal com- 
ponents are given in Table 2. As in the Wig- 
gins et al. (1968) study, only those loadings 
of .33 or greater are presented. 

The data of Table 2 suggest that essentially 
the same components do underlie females’ 
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TABLE 2 


PRINCIPAL COMPONENTS UNDERLYING PREFERENCE RATINGS OF 15 MALE AND 15 Fr 


SILHOUETTES 


Rotated principal components 


Male figures 


Femai 


Stimuli u HI 


Iv il 


Lar; 

Standard 
Small 

+2 Chest 
+1 Chest 
—1 Chest 
—2 Chest 
+2 Buttocks 
+1 Buttocks 
—1 Buttocks 
—2 Buttocks 
+2 Leg 

+1 Leg 

—1 Leg 

—2 Leg 


% Common Variance 


— 56 


50 
43 


31 17 


—.47 


42 : 
43 
—.48 


—42 
— 47 


66 
S4 


14 


Note. N = 115, 


somatic preference ratings of both male and 
female stimuli. They are based, like those for 
males, on chest/breast, buttocks, and leg 
dimensions. Womens’ perceptions of both male 
and female stimuli are based on a chest/breast 
component (male component 4, female com- 
ponent 3), a buttocks component (male com- 
ponent 3, female component 2), and two inde- 
pendent leg components (male components 1 
and 2, female components 1 and 4). The 
emergence of two independent leg components 
rather than a single bipolar component was 
also noted for males (Wiggins et al., 1968), 
Wiggins et al. (1968) argued that two com- 
ponents appeared since the leg does not lend 
itself to variations in size as readily as the 
chest/breast and buttocks. In addition, they 
recognized that simultaneous variation of the 
entire leg could mask possible perceptual dif- 
ferences in preference which might be based 
on independent variations of thighs, calves, 
and ankles, An additional possibility is that 
variation in leg size, especially for thighs, may 
be difficult to judge due to its contiguous 
spatial relationship to the buttocks. 


Correlates of Somatic Preference 


In order to determine which personality and 
background variables are related to women’s 
somatic preferences, Thurstone scale values 
for each of the 15 male stimulus figures and 


each of the 15 female stimulus fig ires were 
correlated with data obtained fron: the per- 
sonality and background inventories Jescribed 
earlier. The personality and background cor 
relates of somatic preference are pr: ented in 
Tables 3 and 4, 

The following descriptive composites for 
women are based on the data from ables 3 
and 4. Data for males were obtained »y Wig 
gins et al. (1968) and have been inscrted in 
brackets throughout the composites to permit 
direct cross-sex comparisons. 

Overall figure size. Women who prefer over 
all largeness in males are physically active and 
enjoy competition in sports. They present 
themselves as less traditionally feminine (low 
M-F Female scores) and have little interest 
in the performing arts which involve dance, 
music, etc. [Wiggins et al., 1968, found thal 
males who prefer largeness in females express 
a need for achievement.] No significant rela" 
tionships were found between somatic prefer- 
ences and personality variables for thos 
women who select largeness in females. 

Selection of moderate overall body size 
(standard) in the opposite sex, for both 
women and men, seems to be related to 4 
heterosexual orientation (desire to appeal to 
and please the opposite sex). Women wh? 
prefer moderate-sized males present them- 
selves as more traditionally feminine an 


=. 
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I choose carcers in home economics which are 
related to the home and family. They are 
conscienti is and dedicated individuals who 
try to fir the tasks they begin (Endurance) 
and vali: neatness and orderliness (Order) in 
both t personal and work habits. Their 
total y n of response suggests that they 
adher« closely to the more traditional 
cultu type of women as “good wives 
and | Their male counterparts are 
al uly oriented, but, unlike the 
w i; men are not orderly and or- 
ganized ¿iv personal and work habits.] 


In ca nen who select moderate over- 
all bods in the female may be less tradi- 
tionally fem sine since they do not reject oc- 
cupations t aditionally chosen only by men 


(no Deni»! of Male Occupations). 
Wo who prefer overall smallness in 
mal e reserved in their social interactions 


(not Exhibitionistic) and have been reared 

10 have at least a college educa- 
i lales who prefer overall body small- 
ne: in females come from upper-class back- 
gro nds and are comfortable with those in 
autority.] To some extent these findings sup- 
port the conclusions of Moore, Stuckard, and 
Srole (1962), that body thinness is a highly 
` -hysical characteristic among members 
ol upper class. Preference for overall body 
“smallness in the female figure is shown by 
women who are pursuing careers in the per- 
forming arts (performing arts major) such as 

e, music, theater, etc. Their preference 

fo. somatic thinness may be partly influenced 
by < desire to emulate the extremely thin 
female body of the ballerina or high fashion 
model, 

Chest/breast size. Women selecting larger * 
chested males are socially outgoing (not so- 
cially introverted), and express the need to 
achieve academically. In pursuing their goals, 
however, they may show a lack of self-reliance 
and depend on others to make decisions or 
select some course of action (low Autonomy). 

„Under stress conditions these women tend to 
convert their anxiety into physical symptoms 
(Conversion Hysteria), One might speculate 
that their attraction to the larger chested male 
rellecis a sensed need to be protected, espe- 
cially under stress, by the culturally stereo- 
typed “Atlas” or large-chested man. [Men at- 


a 
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tracted to larger breasted women are socially 
independent and nonnurturant. They have a 
strong heterosexual orientation and date fre- 
quently.] Like these men, women who select 
larger female breasts are heterosexually ori- 
ented, These women are outward directed (not 
Intraceptive), and value versatility and fre- 
quent change (Change) in their lives. 

A general pattern of passivity, indecision, 
and interest in religion is found in female and 
male subjects who select smaller chests/ 
breasts in the opposite sex. Women who like 
smaller-chested males seem to dislike physical 
activity and participation in sports. When 
faced with choices, they defer (Deference) to 
the wishes of others. These women report that 
they are religious people and have been reared 
by college-educated mothers. [Men preferring 
smaller breasted women are mildly depressed, 
lack achievement motivation, and are indefinite 
about their career plans, They hold funda- 
mentalist religious beliefs and come from 
working-class families.] Women who select 
smaller breasts in females are not emotionally 
excitable or overactive women (not Hypo- 
manic). They are also less heterosexually ori- 
ented and present themselves as less tradition- 
ally feminine. These women accept careers 
often thought to be more traditionally ap- 
propriate for males (no Denial of Male Oc- 
cupations), and show the potential to succeed 
in graduate school. It may be noted that 
women preferring smaller chests/breasts come 
from families with social and educational 
advantages, while men preferring smaller 
breasted women come from working-class 
families. 

Buttocks size. The selection of larger but- 
tocks by both women and men may be re- 
lated to a concern about orderliness. Women 
who prefer males with larger buttocks have 
little concern for order and neatness (low 
scores on Order) in their personal and work 
habits. While these women present themselves 
as more traditionally feminine, they do enjoy 


5 Throughout the composites of correlates of so- 
matic preferences, the word larger refers to those 
responses made to both the large (+2) and moder- 
ately large (+1) stimulus figures for the body part 
being discussed. The word smaller includes responses 
made to the small (—2) and moderately small (—1) 
stimulus figures. 
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some active participation in sports. They are 
not emotionally excitable or overactive women 
(not Hypomanic) and are introverted in their 
social relations with others (Social Intro- 
version). [In contrast to their female counter- 
parts, men who choose women with larger 
buttocks show a high need for orderliness, 
neatness, and organization, These men are de- 
pendent in social situations and tend to be 
self-abasing.] Women who prefer larger but- 
tocks in females tend to be disorderly in their 
personal and work habits (low scores on 
Order), They do not desire academic achieve- 
ment and report that they experience diffi- 
culty when dealing with authority (Authority 

Problems). 

Selection of smaller buttocks in the op- 

posite sex by both women and men appears to 
be associated with perseverence and accom- 
plishment. Women attracted to the smaller 
male buttocks are motivated to achieve in 
many areas, show a potential for graduate 
training (Graduate School Potential), are 
emotionally mature (no Emotional Immatur- 
ity), and are highly competitive. These women 
have a need to be leaders (Leadership), and 
indicate that they have little interest in re- 
ligion, [Their male counterparts value them- 
selves and do not feel the need to be the 
center of attention in a group. These men per- 
severe until their tasks are completed.] Pref- 
erence for smaller buttocks in the female sil- 
houette is found in women who desire to 
achieve academically. They were reared by 
mothers who also valued academic achieve- 
ment since they had college educations. These 
women report that they date infrequently, 
often choose careers which require training in 
science (pharmacy majors), and have little in- 
terest in reading magazines related to home- 
making. 

Leg size, Women and men who choose 
larger legs in the opposite sex seem to be in- 
wardly directed people. Women who prefer 
larger male legs tend to focus on interpreta- 
tion of their own thoughts and feelings (In- 
traceptive). They are less traditionally femi- 
nine and less heterosexually oriented. [Men 
who like larger female legs also focus on their 
own thoughts and feelings and are restrained 
in social s ns. They are nonaggressive 
males who tend to be self-abasing.] Larger 
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i 
legs in females are selected by women why \ 


respond in traditionally feminine ways. Whi? 
larger female legs are acceptabie to wome 


with liberal arts and science (‘.AS) majors, 
they are unacceptable to thos: in the per 
forming arts—especially to dance majors, 
The patterns of personality :orrelates as 
sociated with preference for sn ‘ler legs ip 
the opposite sex are quite differen’ for women 
than they are for men. The wo: en are not 
dominant, not self-sufficient, nee” to belong, 
(Affiliative), and are socially — <troverted, 
[The men, in contrast, are exhibi. onistic, af 
filiative, and wish to participat socially.” 
Women preferring smaller legs in emale sil 


houettes are characterized by the’ need for 
affection and dislike of change in heir live 
(Change). In situations of stress th y tend to 
convert their anxiety into physica! . ymptoms 
(Conversion Hysteria) and often re: ‘er them: 
selves incapable of action. 
Attitudes Toward the Body A 
With regard to attitudes toward he malt 
body as measured by the semantic di“ -rential! 
women who prefer larger male b ly parti 
generally rate those body parts vorablj 
(good, strong, active); those wi. prefer 


smaller male body-part size gener: ‘ly pet 
ceive the male body in a more > egative 
way (bad, weak, passive), There is, however 
a notable exception to the above generaliza- 
tion. The preference for smallness in the malè 
buttocks is not related to either a liking for 0! 
a dislike of male body parts, but rather to per- 
ceived strength of the female body. This rela 
tionship is consistent with the overall pictutt 
of self-confidence presented by women whi 
choose the smaller buttocks in males. No one 
pattern of relationships between preferencë 
for a particular size of female body parts an 


€ The correlations between somatic preferences and 
Semantic differential ratings have been submitted 10 
the National Auxiliary Publication Service. Se 
NAPS Document No. 02860 for pages of supple- 
mentary material. Order from ASIS/NAPS c/o Mi-i 
crofiche Publications, 440 Park Avenue South, NeW 
York, New York 10016. Remit in advance for each 
NAPS accession number. Make checks payable t0 
“Microfiche Publications.” Photocopies are $5.00. 
Microfiche are $3.00. Outside of the United States 
and Canada, postage is $2.00 for a photocopy oF 
$1.00 for a fiche, 
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attitudes toward female body parts is found 
for women. 
CONCLUSION 

The data presented in the preceding com- 
posite that women who prefer overall 
large: the male body are less tradi- 
tiona minine people who enjoy physical 
activi | participate in sports. Moderate 


ove siz: in the male (standard) is selected 
t ho present themselves in a more 
feminine way with interests that 


O 


tradition 


center d the home and family. And fi- 
nally, male smallness is chosen by 
women , are socially reserved and come 
from upper-class backgrounds, where thinness 


is a hig dy valued physical characteristic 
(Moore i al., 1962). 

The orrelates related to preference for par- 
tici nale body parts suggest that those 
women who select larger male chests, like 

‘lect smaller male buttocks, desire 

e especially when they pursue aca- 

uc goals, Women who select smaller male 

crests, like those who select smaller male legs, 
ae somewhat passive and indecisive people. 

Tariables related to preference for particu- 
lar female body parts suggest that women who 
prefe: larger female breasts and legs have 
personality and interest patterns thought to 
be more traditionally feminine. Women who 
select smaller female breasts and buttocks are, 

less traditionally feminine indi- 

; desire to achieve in many ways 

\ pursue academic goals. The above pat- 

as suggest that the women of this study 

may associate the more ample female body 

with a culturally stereotyped picture of the 

woman as “wife and mother,” while the 

smaller (thinner) female body is associated 
with greater personal and career freedom. 

While according to Wiggins et al. (1968) 
prefer a female figure which has 

cly large breasts, our women most 

prefer both male and female figures which 
have smaller buttocks. Findings for the group 
as a whole give only partial support to the 
cultural belief that women prefer larger chests 
in men since the large-chested or “Atlas” 
ale figure received only slight endorsement. 

Comparisons of our data with the findings 
reported for females by Lavrakas (Note 1) 


me 
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(that front-view tapering-V male silhouettes 
were preferred by women who were more 
traditionally feminine) are somewhat difficult 
to assess. Three types of silhouettes used in 
this study would have a tapering-V configura- 
tion if viewed from the front: a standard 
physique with a smaller buttocks, a standard 
physique with a larger chest, and an overall 
standard form. There is considerable lack of 
agreement in the personality and background 
characteristics which describe the groups of 
women preferring any one of these types. The 
first type of tapering-V silhouette (composed 
of a standard form with smaller buttocks) 
was the most preferred male silhouette by our 
women. Women who selected that form do not 
conform to the more traditional expectations 
for women by our culture. These women are 
emotionally mature, competitive, dedicated to 
academic and other types of achievement, and 
exhibit leadership in many areas. The second 
type of tapering-V silhouette (composed of a 
standard form with a larger chest) was se- 
lected by women who also desire academic 
achievement and are socially extroverted, but 
are less self-reliant. The third type of taper- 
ing V (composed of the evenly proportioned 
male body) was selected by women who do 
present a more traditional pattern of femi- 
nine interests. Since there are several types of 
tapering-V male physiques, and there are per- 
sonality pattern differences for women at- 
tracted to each of the types, generalizations 
based on broad somatic category labels such 
as “tapering V” or “pear shaped” are confus- 
ing and may mask important personality and 
background differences associated with prefer- 
ences for each of the body types. 

The nonadditivity of scale values from Table 
1 (i.e., the scale value for overall smallness 
cannot be predicted by adding the scale values 
for small chest, small buttocks, and small 
legs) suggests that preference for the size of 
particular body parts depends on the total 
body configurations, Similarly the pattern of 
personality and background characteristics of 
women who select a male silhouette composed 
of a small chest, large buttocks, and small legs . 
are likely to be different from the character- 
istics of women who prefer a small male chest 
when paired with small buttocks and large 
legs, etc. 
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In addition to the needed replications of 
single body-part variation involving women of 
different ages, cultural and educational back- 
grounds, etc., future research should also in- 
vestigate the correlates associated with sil- 
houettes which present body configurations in 
which more than one body part is varied at 
the same time. A comparison of the somatic 
preferences obtained when the same subjects 
view both front-view and profile silhouette 
stimuli should also be explored. While men’s 
somatic preferences of the male body have 
not yet been reported, they are currently 
being analyzed in our laboratory. 
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Effects of Being Observed on Expressive, Subjective, and 


Physiological Responses to Painful Stimuli 


Robert E. Kleck, Robert C. Vaughan, Jeffrey Cartwright-Smith, 
Katherine Burns Vaughan, Carl Z. Colby, and John T. Lanzetta 
Dartmouth College 


Two experiments explored the effects of observation by another on responses 
to painful stimuli, It was anticipated that the intensity of pain-related non- 
verbal expressivity decreases under observation, while indices of arousal (skin 
conductance and self-report) increase. In Experiment 1, subjects’ expressive 


k 
responses to shock were attenuated when subjects were observed as compared 
to when they were alone, but the anticipated augmentation of arousal did not 
w occur. Rather, the attenuation of expressive behavior was accompanied by a 
general decrease in subjective and autonomic responses to the painful stimuli. 
A second experiment replicated the results of the first study and, in addition, 
found no evidence for a differential impact of sex of observer on the three 
measures of arousal. An interpretation is discussed for the effect of observation 
on expressive behavior and for the relationships observed among expressive, 
autonomic, and subjective indices of pain. 
Å Nonverbal di-otays of pain are both a re- Others have stressed that expressive re- 
sponse to aversive stimuli and a primary sponses may be mediated by general strat- 
y means of eliciting caretaking responses from egies which operate across a wide range of 
others (e.g., Sternbach, 1968; Szasz, 1957). contexts and arousal states. One such perva- 
Important as such behaviors may be to insur- sive strategy that may be acquired during 
ing the well-being of the organism, relatively socialization is the tendency to inhibit or at- 
little is Known concerning either the variables tenuate expressive behavior in the presence 
which affect the expressive encoding of pain of others (Izard, 1971). While parents may 
or the relationship of expression to the sub- permit infants intense facial and vocal expres- 
jective and phy<iclogical components of pain sions of arousal, such displays appear to be 
+» arousal, The pr t research attempts to ex- increasingly punished with age, particularly 
plore both of the ues. if the child is male (Block, 1973; Holter, 
| Variability in the facial and bodily expres- 1970). In the studies reported here, indi- 
sions of pain has frequently been observed viduals were exposed to painful stimuli either 
across persons and situations (Beecher, 1959). in social isolation or while under observation 
The factors responsible for this variability, by another. Consistent with Izard’s socializa- 
however, have been more difficult to pinpoint. tion argument, it was anticipated that indi- 
Ekman n (1969) propose display viduals inhibit their expressive displays when 
a rules as | diators, that is, principles under observation. 
or norms a during socialization which Assuming that expressive inhibition does 
prescribe the vior appropriate to a par- occur in the presence of others, an important 
© ticular social co::oxt in response to a given question concerns the relationship between 
arousal state, these expressive changes and subjective and 
autonomic components of pain arousal. There 
is considerable theoretical disagreement on 
i This research was supported in part by National this issue as it relates to emotion generally, 
[> Science Foundation Grant GS 29-122 and in part by and the empirical literature is both relatively 
Lo Filene Endowment to Dartmouth Col- small and ambiguous in its implications 
Jeffrey Cartwright-Smnith is now at Vassar College. (Leventhal, 1974). A number of theorists 
Requests for reprints should be sent to Robert E, (@-8- Darwin, 1872; Izard, 1971; James, 
Kleck, Department of Psychology, Dartmouth Col- 1890) have argued that the general rule is for 
lege, Hanover, New Hampshire 03755. consistency among the various components of 
X, 1211 
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arousal and, further, that modulations in ex- 
pressive behavior can serve to instigate paral- 
lel changes in the associated subjective and 
autonomic components. This view is sup- 
ported by several recent studies which have 
found positive relationships between modula- 
tions in facial expressive activity and both 
the quality of arousal states (Laird, 1974) 
and their subjective and physiological inten- 
sity (eg., Lanzetta, Cartwright-Smith, & 
Kleck, 1976). 

While agreeing that the general rule may 
be for consistency among the various com- 
ponents of arousal, other investigators (eg. 
Gellhorn, 1970; Jacobson, 1967) note that 
occasionally the suppression or dampening of 
expressive behavior may be associated with 
increases in both subjective and autonomic 
indicators of emotion. However, the condi- 
tions that produce a suppression-induced aug- 
mentation of arousal have not been identified. 
One plausible hypothesis is suggested by 
Tzard’s notion that individuals are socialized 
to inhibit emotionally expressive behavior, Tf 
this inhibition is achieved primarily through 
the punishment of expressive activity by so- 
cializing agents, then an individual exposed 
to emotionally provocative stimuli in the 
presence of others might be expected to ex- 
perience anxiety conditioned to these social 
cues through their previous association with 
punishment. Thus, the presence of an ob- 
server should both attenuate expressive be- 
havior and augment arousal, 

The studies reported here test this hypoth- 
esis by examining the effects of the presence 
of an observer on expressive responses to 
painful stimuli and on subjective and au- 
tonomic reactions to a stressor, In addition, 
the studies explored the question of individual 
differences in nonverbal expressions of pain 

as a function of social context. 

Snyder (1974) has developed a self-report 
measure which differentiates persons with re- 
spect to their tendency to observe and con- 
trol their own expressive behavior. According 
to Snyder, the high self-monitor is concerned 
with the social appropriateness of his behav- 
ior and actively manages it in response to 
social cues, whereas the low self-monitor is 
less responsive to such cues and more respon- 
sive to internal states. Persons high on the 
self-monitoring dimension should thus sup- 


press their expressive pain ponse to a 
greater extent when observed ‘° in low self. 
monitors, 

EXPERIMENT 1 
Method 

Design overview. Individuals presele to be high 
or low on the self-monitoring dimer \ were ex. 
posed to three levels of shock intensi nder each 
of two conditions, one in which they pr: ied them. 
selves to be alone, the other, under obs: tion by a 
female peer. The subjects experienced © repeti- 
tions of each shock level in the alone the ob- 
served periods, with the order of the pe: counter= 
balanced across subjects, The resulting d isa3X 
3X2X2X2 analysis composed of thr vels of 
shock intensity (zero, medium, and hiv), three 
repetitions of each level, within each of ty experi- 
mental conditions (alone and observed), w) ich were 
Presented in two orders (alone first or bserved 
first). Equal numbers of high and low sel’ onitors 
were assigned to each order of presentation. 

Subjects. The subjects were 20 male un gradu- 
ates who were recruited for an experiment cerned 
with physiological responses to stress.1 Ha of the 
subjects had scored in the uppermost qua ‘e and 
half in the lowest quartile of a distributio: of 130 
Scores on the Self-Monitoring Scale admini red in 
an introductory psychology course, Subjc were 
either paid or given course credit for their p»ticlpa- 
tion. 

Procedure. The subject sat 2.4 m from a : r-pro- 
jection screen, and, as skin resistance and she. ': elec- 


trodes 2 were attached, it was explained that ine ex- 
periment would consist of three series of 10 shock 
trials. The start of each 10-sec shock stimulus would 
be signaled by a red slide, and each 20-see intertrial 
interval by a blue slide projected on the screen, On 
each trial, shock intensity would be increased. over 
the 10-sec period from 0% to 100% of a preset 
level, “with considerable variability in the maximum 


1 This number does not include four subjects who 
were discarded from the sample; one because of 
equipment failure, one because he preferred not to be 
observed, and two because of their Suspicions con- 
cerning some aspect of the experimental manipula- — 
tions. 

2The electrode for a Tursky Electrostimulator 
(General Control Co., 1220 Soldiers Field Road, Bos- 
ton, Mass.) was placed 1.8 cm above the subject's 
left outward-facing ankle. Zinc cup electrodes (3.14 
cm*; filled with a zinc sulphate, bactoagar, and water 
contact medium) for a Fels Dermohmmeter (Model 
22A, which uses a 70-nA constant current) were posi- 
tioned approximately 2 cm apart on the palmar sur- 
face of the subject’s nondominant hand, and an 
Esterline Angus graphic ammeter (Model A) was 
employed to keep a continuous record of skin re- 
sistance, 


reached on any one trial.” The subject was told that 
he could terminate shocks he felt were too painful 
by using a switch placed near his dominant hand, 

The subject was instructed to rate the painfulness 
of each shock on a scale from O (no sensation) to 
100 (extremely painful), by speaking into a micro- 
phone provided. Folding partitions were placed 
around the subject, ostensibly to prevent distraction, 
but in reality to enhance his sense of visual isola- 
tion from the experimenter. A camera concealed be- 
hind the projection screen permitted continuous 
monitoring of the subject’s face without his aware- 
ness. 

The experimenter then entered the control room 
to initiate the first series of trials. Unknown to the 
subject, the purpose of this series was to establish 
a shock intensity which would reliably elicit a facial 
display of pain. Commencing with a stimulus of 2 
mA, shock intensity was increased by 2 mA per trial 
until a facial response to pain was consistently 
elicited approximately 7 sec into the 10-sec shock 
trial, This was the maximum level of shock used 
for the duration of the experiment. For the re- 
mainder of the first series, stimulus intensity was 
randomly varied except that the final trial was al- 
ways a maximum-intensity shock. 

Following the intensity determination series, the 
experimenter returned to the subject’s room and in- 
dicated that there would be a slight modification in 
the procedure. The subject was instructed to label 
the last shock he had received, which in all cases 
was a maximum-intensity shock, as an 80, and to 
rate all subsequent shocks in relation to this ad- 
justed scale, * was done to insure that all sub- 
jects would be using the scale in approximately the 
same manner. 

At this point, half the subjects were exposed to 
the observer manipulation, in which a female con- 
federate arrived at the laboratory. As the experi- 
menter explained, she was “a fellow graduate student 
who had assisted in the pretesting of the shock ap- 
paratus with the condition that she be able to re- 
turn at a later date to watch the experiment being 
conducted.” Permission was then asked of the sub- 
ject to allow the experimenter’s friend to observe 
from behind a one-way window as the subject con- 
tinued to receive shocks. After the experimenter 
escorted th federate, whom subjects never saw, 
to an observation room, he removed the screens sur- 
rounding the subject and opened the curtains in 
front of the one-way window. 

The experimenter then initiated the first experi- 
mental block of 10 trials involving three levels of 
shock: zero, medium, and high. A rationale for zero- 
level shocks was provided by suggesting to the sub- 
ject that some trials would be characterized by a 
level of shock so low as to be nearly imperceptible. 
High shock was the maximum-intensity shock se- 
lected in the previous series. Medium shock, which 
was set at 50% of the maximum intensity, was in- 
cluded primarily to confirm the subject’s expectation 
that many rather than a few levels of shock were to 
be used; it was expected to evoke little or no facial 
behavior. With the exception of the first trial, which 
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was always a medium-level shock, stimulus intensity 
for each trial was randomly determined for each 
subject with the constraint that each shock level be 
given three times. 

At the conclusion of this trial block, observed sub- 
jects heard the confederate depart, after which the 
experimenter reentered the room, closed the one-way 
window curtains, and replaced the screens, Subjects 
who had not yet been observed overheard the con- 
federate’s arrival and were exposed to the manipula- 
tion. The experimenter then initiated the final block 
of trials, which consisted of a new random ordering 
of three repetitions of the three shock levels. After 
this last series, the experimenter conducted a brief 
interview which probed for possible suspicion regard- 
ing the experimental arrangements. 

Judging procedure. Videotapes of each subject’s 
facial behavior during the two experimental trial 
blocks were edited such that all intertrial intervals 
were replaced by blank tape. The initial trial in each 
block was eliminated (leaving three repetitions of 
each shock level), and trial blocks from all subjects 
were randomly distributed with the constraint that 
no two blocks from the same subject be contiguous. 

One male and three female graduate students were 
paid to make judgments of “individuals videotaped 
in a laboratory situation.” Their task was to make 
ratings, using a 101-point scale, of the level of dis- 
tress shown by each subject on each of the shock 
trials. There were 30 practice trials conducted em- 
ploying three stimulus persons selected from a pilot 
study to reflect the range of expressive behavior 
shown by the subjects in the present study. 

Each judge saw all 20 stimulus persons. None was 
informed as to the experimental hypotheses or the 
observer manipulation, and all were told that the 
subjects were unaware of being videotaped. Like 
subjects, judges were led to believe that many rather 
than three levels of shock were used. 


Results and Discussion 


Judge reliability. Interjudge reliability was 
determined by ranking the 20 subjects on the 
basis of the ratings made by each judge and 
by computing Kendall’s coefficient of concor- 
dance, W (Siegel, 1956, pp. 229-238). When 
viewing subjects in either condition, judges 
were in substantial agreement: W (alone) = 
714; W (observed) = .744. Both scores are 
significant beyond the .001 level. 

Expressive behavior. It was predicted that 
subjects exposed to painful stimuli are less 
nonverbally expressive, as measured by 
judges’ ratings of their distress, when under 
observation than when alone. It was further 
expected that the magnitude of the expres- 
sive attenuation is positively related to stim- 
ulus intensity. Zero-level shocks were expected 
to elicit little or no pain-related facial be- 
havior in the unobserved condition which re- 


TABLE 1 


Expressive, AUTONOMIC, AND SUBJECTIVE RESPONSES 
TO SHOCK UNDER OBSERVATION AND NoNOBSERVATION 
IN EXPERIMENT 1 


Shock level Observation Nonobservation 
Judged distress* 
Zero 
M 18.48 19.50 
SD 7.57 8.53 
Medium 
M 20.77 20.33 
SD 11.49 12,35 
High 
M 32.92 40.23 
SD 16.75 22.80 
Skin conductance change? 
Zero 
M —5.65 (—.23) —7.21 (—.25) 
SD 12.48 (.61) 9.21 (.50) 
Medium 
M 10.21 (.33) 12.84 (.54) 
ae 17.33 (.60) 19.99 (1.19) 
High 
M 41.98 (1.52) 50.87 (1.92) 
SD 26.93 (1.74) 25.42 (2.50) 
Self-reports of paine 
Zero 
M 2.68 2.50 
SD 7,80, 7.73 
Medium 
M 42.15 45.95 
SD 17.13 16.35 
High 
M 80.30 84.00 
SD 15.23 15.73 


e rmed (after Lykken & 
cores in parentheses are raw conductance 


© The higher the number, the higher the mean self-reported 
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aged to yield 
trial and enter 
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in response to shock 


ession under observation, High 
the other hand, were set on 
ability to elicit pain expres- 
"nce of an observer was ex- 
greatest impact on these 
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selves to be under observation t!.» when they 
presumed themselves alone, a di* rence which 
appears to increase with stim: intensity, 
However, the anticipated Shock evel x Ex. 
perimental Condition interactio: (239S 
2.39, p < .11, only approaches nificance, 
Separate analyses for each shoc) level pro- 
vide tentative evidence that sui cts sup- 
pressed their expressive responses ` nder ob- 
servation more in response to high ck, E(1, 
16) = 3.87, p = .06, than to medix. : or zero 
shock, each with an F less than 1. | 
Autonomic and subjective meas: ` A ra- 
tionale was developed in the introd. ‘ion for 
the prediction that attenuation of ressive 


behavior in response to the presenc of an 
observer is associated with intensific ‘ion of 
autonomic and subjective components of pain 


arousal. As with expressive behavior, : Con- 
dition X Shock Level interaction — is ex- 
pected: observation would result in -reater 
arousal and distress for high-level shoc- than 
would nonobservation, but this dii rence 
would be minimal for zero-level shock 

Three readings were taken per trial of the 
subject’s palmar skin conductance: at shock 
onset (P1) and at points 12 and 16 « c fol- 
lowing shock onset (P2 and P3, respect: ely). 
For each trial, a phasic change scor was 


computed as the difference, measured ii mi- 
cromhos, between P1 and the average o! P2 
and P3. Scores were then standardize: by 
applying the Lykken and Venables (1971) 
Tange correction formula, expressing the score 
for each trial as a percentage of the largest 
phasic change in skin conductance observed 
for that subject. Data transformed in this way 
were entered into a repeated measures analy- 
sis of variance, The significant main effect 
of experimental conditions indicates that, con- 
trary to Prediction, subjects were less aroused 
by shock when observed (M = 15.51) than 
when alone (M = 18.84), F(1, 16) =-4.34, 
$ < .05. Moreover, as the means for the sig- 
nificant Condition x Shock Level interaction 
indicate, the magnitude of this response de- 
crement increased with shock intensity, F (2, 
32) = 5.32, p < 01, Means and standard de- 
viations for both the raw conductance and the 
range-corrected scores are shown in Table 1. 

In order to examine the possibility that 
tonic or baseline levels of arousal were dif- 
ferent across conditions, conductance scores at 


i 


G 
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shock onset (P1) were averaged across the 
nine tris « for the observed and alone condi- 
tions se) ¿rately for each subject. Because of 
the Jay of skin conductance responses, 
measu nts taken at the point of shock on- 
set di flect the effects of the stimulus 
and } or maximum adaptation of the 
base! he previous shock stimulus. A 
sim is of variance of these scores 


rev lificant difference across experi- 
ions in tonic arousal levels with 
ver than alone, F(1, 16) = 6.27, 
(alone) = 12.95 pQ*; M (ob- 
9 pO, This suggests that the 


phasic ; were not the indirect product 
of a gi calming effect of observation. 
Self-: ported ratings for each shock were 


entered nto a repeated measures analysis of 
varianc. Means and standard deviations are 
prese’ed in Table 1. The significant main 
effect »xperimental conditions reveals that 
the observer manipulation resulted in lower 
Se of pain averaged across shock 
les (4, 16) = 5.5, p < .02; however, the 
Experimental Condition X Shock Level inter- 
action again only approaches significance, 
F (2, 32) = 2.49, p < .10. 

&elf-monitoring data. Our expectation that 
high self-monitors are more responsive than 
low seli-monitors to the observation manipu- 
lation was not confirmed. The Shock Level x 
Condition X Self-Monitoring interaction does 
not appr significance for any of the de- 
pe ures. 

^l Dough Experiment 1 provides tentative 
evi once consistent with the expectation that 
observation leads subjects to inhibit the in- 
tensity of their expressive displays, the results 
are counter to the notion that such inhibition 
is paralleled by increased arousal. The results 
for skin conductance and self-report are thus 


oF 


inconsistent with predictions based on a con- 
ditioned anxiety notion. For both measures 
the of the observer was to diminish 


rather than augment arousal to shock stimuli. 


EXPERIMENT 2 


A second experiment was conducted to ex- 
amine the reliability of the findings and to 
test the generality of the results across sex 
of observer. In this replication the nature of 
the observer manipulation was modified in two 
Tespects: the observer was made an age peer 


1215 


of the subject and both male and female ob- 
servers were employed. The first of these 
modifications was expected to increase the 
likelihood that subjects would suppress ex- 
pressive responses when under observation 
compared to Experiment 1 in which they had 
been observed by a higher status individual. 
No specific hypotheses were formulated re- 
garding the impact of sex of observer on ex- 
pressive behavior. 

Izarď’s (1971) concept of a general social- 
ized inhibition of expressive activity suggests 
that sex of observer does not have a differ- 
ential impact on the tendency to suppress 
responses to pain, From the perspective of 
Ekman and Friesen’s (1969) analysis of dis- 
play rules, however, it could be argued that 
different display strategies are elicited by male 
and female observers. For example, it may be 
that subjects in Experiment 1 were inhibiting 
their expressive responses in accordance with 
a prevailing machismo display rule, but that 
this rule is made more salient in the presence 
of a female than a male. Thus, one might an- 
ticipate a diminished effect of observation 
when the observer is male as compared to 
female. 


Method 


Design overview. The design was a 6X2X2X2 
analysis of variance composed of six shock trials, 
in each of two experimental conditions (alone and 
observed), which were presented in two orders (alone 
first or observed first), with one of two types of 
observer (male or female). 

Subjects. The subjects were 40 male undergradu- 
ates who received course credit for an experiment 
concerned with the psychophysiology of electric 
shock, 

Procedure. Experiment 2 was similar to Experiment 
1 with the following modifications: 

1. Each trial consisted of a 5-sec warning signal 
and a 2-sec, constant-intensity electric shock fol- 
lowed by a 15-sec intertrial interval. 

2. In the intensity determination series, shocks 
were increased in 2-mA steps over trials until, in the 
experimenter’s judgment, the subject showed a facial 
display of pain. This intensity was the only one used 
in the remainder of the experiment. 

3. In the two experimental trial blocks, six shocks 
were delivered per series. 


8 Four subjects were dropped from this group: 
three because no shock level could be established 
which reliably elicited facial behavior and one be- 
cause he expressed suspicion regarding the presence of 
a camera, 
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TABLE 2 


EXPRESSIVE, AUTONOMIC, AND SUBJECTIVE RESPONSES 
TO SHOCK UNDER CONDITIONS OF OBSERVATION AND 
NoNoBSERVATION IN EXPERIMENT 2 


Response class Observation  Nonobservation 


istress* 

D WS 16.21 24.47 

SD 14.06 16.53 
Skin conductance’ 

Mo 17.35 (.612) 20.52 (.738) 

SD 12.98 (613) 11.96 (572) 
Self-reports of pain® 

M 50.86 

SD 13.11 


a The higher the number, the higher the mean judge rating 
of subjects’ distress on a 101-point scale, 

» Unparenthesized scores are transformed (after Lykken & 
Venables, 19/1). The scores in parentheses are raw conductance 


change scores in micromhos. x 
«The higher the number, the higher the mean self-reported 


pain on a 101-point scale, 


4. Using index cards, subjects rated shock pain- 
fulness only during the two experimental blocks. 
Their judgments were anchored with respect to the 
first shock of the first experimental block, which 
they were instructed to rate as 50 on a 101-point 
scale. 

5. The observer was either a male or female friend 
of the experimenter, 

6, Subjects filled out a postexperimental question- 
naire designed to assess their reactions to the pro- 
cedure and to probe for suspicion regarding the ex- 
perimental manipulations. 

Judging procedure. Three males and two females 
of college age were paid to serve as judges. They 
were told that the stimulus persons they were to 
See were alone at the time the tapes were made, 
unaware of the camera, given a visual signal 5 sec 
prior to shock onset, and exposed to several stimulus 
intensities. The order of presentation of each stimulus 

person’s trial blocks was the same as the order in 
which they had been administered. Judges made 
practice ratings of = subject run in a pretest of the 
experimental proce ure, 


Results 


Judge reliab: 
computed in tk 
ment 1. Whe: 


. Interjudge reliability was 
nanner described in Experi- 
viewing subjects in either 
condition, judg: were in substantial agree- 
ment: W (alc ~)= .847; W (observed) = 
875. Both sco we significantly different 
from zero, p < : 

Expressive change 


d sex of observer. 
The ratings of the five judges were averaged 
to yield a single expressive sore per trial and 
entered into a repeated measures analysis of 
variance. Results are shown in Table 2, The 
experimental conditions effect is highly sig- 
nificant, F(1, 32)= 19.5, p < .001, reflecting 


the fact that subject showed lss evidence 
of discomfort when observed (%/ = 16.21) 
than when alone (M = 24.47). Sex of ob- 
server does not interact with rimental 


condition, however, indicating th! subjects 
inhibited their expressive respon: s to pain 
equally before male and female observers, 
F< 100. 

Autonomic and self-report meg s. Skin 
conductance records were scored in romhos 
at six points within each trial: prior 
to shock onset (P1); 2 sec prior io shock © 


onset (P2); shock onset (P3); anc 6, and 
8 sec after shock onset (P4, P5, ond Pó, 
respectively). Because the latency >f skin. 


conductance responses is approximai: y 2 to 
3 sec, PI was taken as the baseline ‘or the 


phasic response to the shock warnin,; light 
and subtracted from the average of | 2 and 
P3. This analysis revealed that the response 
to the signal was not different across \ xperi- 
mental conditions: M (alone)= 20; ïi (ob- 


served) = .20; F < 1.0. Given this resv't, the 
average of P1, P2, and P3 was treated 1s the 
baseline for the phasic response to sho: « and 
subtracted from the average of P4, P , and 
Pó. This change score was then standa: dized 
using the Lykken and Venables (1971) vange 
correction formula described in Expe! ment 
1, and the corrected scores were entere«\ into 
a repeated measures analysis of vari nce, 
The means and standard deviations of both 
the standardized and uncorrected scores are 
summarized in Table 2. As in Experiment 1, 
there was a significant main effect of ex- 
perimental conditions, indicating that sub- 
jects were less responsive to the shocks when 
under observation than when alone, F(1, 
32)= 7.23, p < .02. Sex of observer had no 
effect on phasic skin responses (F < 1,0). 
An analysis of the average baseline arousal 
(P1, P2, and P3) indicates that, contrary to 
Experiment 1, the presence of an observer 
did not significantly affect tonic levels of 
arousal: M (alone)= 20.82; M (observed) 
= 21.12; F < 1.0. 

Self-reports of shock painfulness were 
entered into a repeated measures analysis of 
variance and were found to be significantly 
lower when subjects were observed than 
when they were alone, F(1,32) = 35.6, p < 
.001. These means and standard deviations 
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appear in Table 2. As was the case for the 
~ other dependent variables, sex of observer had 
no effect on self-report of pain. 


GENERAL Discussion 


The results of Experiments 1 and 2 are 
consistent with the notion that adults con- 
trol their nonverbal displays of emotion in 
the presence of others. On the basis of their 
facial behavior, individuals were judged to be 

. in less distress when under observation than 
when alone, These expressive changes may 
reflect an acquired inhibitory response linked 

' to the presence of others or a specific display 
rule regarding the expression of pain when 
under observation, Whatever the source of 
the attenuation, it is clear that subjective 
and physiological components of arousal co- 
varied across experimental conditions with 
the expressive changes. Individuals not only 

_ inhibit expressive displays of pain when ob- 

served but also report less pain and show 
reduced phasic skin conductance responses. 

It appears plausible to argue that the 
subjective and autonomic measures parallel 


—*¢ 


the ci induced in expressive behavior 
because general calming influence which 
observation had on the individuals. This 


possibility is best addressed by an examina- 
tion of the subjects’ tonic or baseline levels 
of arousal across experimental conditions. 
As noted in the Results sections, tonic levels 
of autonomic responding were somewhat 
higher under conditions of observation in the 
first study and not significantly different 
across conditions in the second study; thus, 


the data provided no support for this inter- 
pretation. 
) The. resu lso appear to offer no sup- 


port for the theoretical proposition, elabo- 
rated by Allport (1924) and others, that 
the inhibition of expressive behavior in emo- 
tionally provocative situations is accom- 
panied by intensified internal arousal. They 
are consistent, however, with those theo- 
jretical views that assign a regulatory role 
to facial behavior in emotional experience 
(eg., Darwin, 1872; Izard, 1971; James, 
1890; Tomkins, 1962) as well as with some 
recent empirical evidence relevant to such 
views (e.g., Laird, 1974; Lanzetta et al., 
1976). In the Lanzetta et al, studies, in- 
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dividuals instructed to intensify or attenuate 
facial responses to painful stimuli showed 
parallel changes in autonomic and subjec- 
tive indicators of distress. The present studies 
replicate the attenuation effects found by 
Lanzetta et al., but do so under conditions in 
which the subjects were not instructed to 
alter their expressive behavior. Indeed, post- 
experimental interviews indicated that sub- 
jects were generally unaware of the inhibi- 
tory effect of the observer. It appears, then, 
that expressive behavioral changes, whether 
produced by voluntary compliance with in- 
structions or induced by modifications in 
social cues, can evoke congruent changes 
in autonomic arousal and subjective reactions 
to pain stimuli. 

Although the specific mechanisms respon- 
sible for these effects are still a matter of 
conjecture, the facial feedback assumption 
provides a plausible explanation for the re- 
sults of our earlier studies, as well as the 
data of the present experiments. Further, it 
suggests a possible explanation for other 
findings in the literature, for example, the 
influence of coactors on responses to stress 
and subcultural differences in pain tolerance. 
In regard to the former, several studies of 
the effects of coactors (e.g., Amoroso & 
Walters, 1969) have reported that the pres- 
ence of companions has a calming influence 
but only when the companions themselves are 
calm. These results are consistent with the 
facial feedback assumption if one postulates 
that the effect a companion has, whether 
calming or arousing, is mediated by the di- 
rect effect which he has on our expressive be- 
havior. That is, to the extent that the pres- 
ence of others attenuates or enhances our 
expressive responses to an emotional stim- 
ulus, our autonomic and subjective responses 
to that stimulus vary in parallel. 

The facial feedback interpretation is also 
consistent with data from studies of expres- 
sive and physiological responses to pain 
across a number of subcultural groups in 
the United States. Zborowski (1952, 1969) 
found that while Jewish and Italian in- 
dividuals were relatively uninhibited in their 
response to painful stimuli, Old Americans 
and the Irish preferred to hide their pain. 
Sternbach and Tursky (1965) corroborated 
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Zborowski’s observations and, further, found 
parallel differences across these various sub- 
cultures in pain tolerance and in psycho- 
physiological responses to pain (Tursky & 
Sternbach, 1967). Thus, it is possible that 
in training their children to be expressively 
stoic, Old American and Irish parents in- 
directly attenuate the arousal which their 
children experience to stressful stimuli. 
As noted earlier, and in a previous article 
(Lanzetta et al, 1976), a number of in- 
vestigators accord a prominent role to ex- 
pressive behavior in their theories of emo- 
tion. Some assume a direct neurally mediated 
link between expressive responses and emo- 
tional experience (e.g., Gellhorn, 1964), 
while others assume an intervening cognitive 
evaluative process which is triggered by un- 
explained physiological arousal, and which 
is responsive to expressive and contextual 
cues as a source of information to account 
for the physiological response (e.g., Laird, 
1974), A less well developed but potentially 
more fruitful approach assumes that expres- 
sive cues become conditioned to physiological 
arousal via temporal contiguity; that is to 
say, expressive responses as well as autonomic 
activity are unconditioned responses to emo- 
tionally arousing stimuli, but the former in- 
variably precede, and often overlap, the latter 
responses. Thus expressive cues may come 
to function as conditioned stimuli for auto- 
nomic responses or become part of a stimulus 
complex (which includes the unconditioned 
that is conditioned to the autonomic 
c Expressive responses would thus 
ire a measure of control over autonomic 
> with the magnitude of the control 
: the parameters of the previous 


conditioning ex >eriences. Current data do not 
permit choice among alternative 
theoretic: , and further research is now 
under way cing some of the implica- 


tions of th : oning view. 
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duplicated the experience of subjects in the 
other conditions. These experimental treat- 
ments were administered in a room apart from 
the classrooms. Two weeks later, the drawing 
materials were again placed in the children’s 
classrooms, and unobtrusive measures of post- 
experimental interest were recorded, As pre- 
dicted, subjects who had agreed to engage in 
the activity in order to obtain the award sub- 
sequently spent significantly less time playing 
with the materials than did subjects in either 
of the other two conditions. Relative to their 
uniform baseline levels of interest, subjects 
in the expected-award condition showed a sig- 
nificant decrease from baseline to postexperi- 
mental observations, while subjects in the 
other two conditions showed no significant 
change, 
This finding has been subject to consider- 
able further scrutiny. On the one hand, its 
generality has been confirmed across a sub- 
stantial number of procedural variations and 
subject populations. Deci and others (cf. Deci, 
1971, 1975), for example, have obtained 
comparable results by paying college subjects 
to solve puzzles. Similarly, Lepper and Greene 
(1975) found that the presence of adult sur- 
veillance as well as the expectation of reward 
could decrease children’s subsequent interest 
in playing with a target activity. On the other 
hand, it is clear that anticipated rewards will 
not undermine intrinsic motivation if they are 
not salient to the subject (Ross, 1975). Thus, 
in its current formulation (Lepper & Greene, 
1976), the overjustification hypothesis applies 
to (a) activities of at least some initial in- 
terest to a subject, (b) conditions which make 
Salient to a subject the instrumentality of 
engaging in a particular activity as a means to 
some extrinsic end, and (c) measures of sub- 
Sequent engagement in situations where sub- 
jects do not expect extrinsic rewards, 


Overjustification Studies and T. oken 
Economies 


One important question raised but not 
answered in the present overjustification lit- 
erature is its relevance to the systematic use 
of tangible reinforcement procedures in ap- 
plied settings (Kazdin & Bootzin, 1972; 
O'Leary & Drabman, 1971). At first glance, 
the finding that salient expected rewards may 
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undermine intrinsic interest seems liie a direct 
attack on the token economy ablishment 
(cf. Levine & Fasnacht, 1974). wever, a 
typical token reinforcement program and a 
typical overjustification study diffe: from each 
other in a number of potentiali; mportant 
ways. A consideration of these differences will 
help to elucidate the rationale for tie present 
study. 

The most obvious difference is be( sen sub- 
jects selected for low rates of appropiate be- 
havior versus subjects selected for hish rates 
of appropriate behavior. Since the ov-yjustifi. 
cation hypothesis presumes at least some in- 
trinsic interest in a target activity, its omain 
would seem to exclude token economie: whose 
subjects will not engage in approprivie be 
havior without extrinsic reinforceme: (e.g, 
Ayllon & Azrin, 1965). To the extent that. 


subjects in token economy programs £:e com- 
monly selected for relatively (vs. absı lutely) 
low rates of appropriate behavior, however, i 
the applicability of the overjustification hy-" 
pothesis remains an empirical issue. 
A second major difference betwee 
economies and overjustification studies is of 
potentially greater theoretical interest: the 
use of multiple-trial reinforcement procedures 
and the demonstration of a reinforcement ef- 
fect via such procedures, These essential fea- 
tures of token economy programs have been 
notably absent from Overjustification experi- 
ments. Indeed, two recent studies have sought 
to demonstrate that an Overjustification effect 
would not occur when multiple-trial rein- 
forcement procedures were employed. In one 
of these studies, Feingold and Mahoney 
(1975) found that five children increased 
(rather than decreased) their performance of 
a play activity after having been reinforced 
for playing with it. In the other study, Reiss 
and Sushinsky (1975, Experiment 2) found 
that “preferences” among songs established 
by discrimination training procedures trans- 
ferred to a recognizably similar posttest situa- 
tion, Both studies appear to demonstrate that 
overjustification effects should not be expected 


token 


1 An extensive discussion of some relevant concep- 
tual and definitional issues in the overjustification 
literature is available elsewhere (Lepper & Greene 
1976). 
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cannot answer the 
r reward procedures 


eased, decreased, or had 
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ingent reward. 
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the effects of rewards on subsequent 
ae are mediated by the information 
onvey concerning (a) a person’s per- 
motivation for engaging in the activ- 
the person’s competence at the activ- 
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sarily depend on the context and manner in 
which the rewards are presented (cf. Bem, 
1972; Feingold & Mahoney, 1975; Lepper & 
Greene, 1976). As one example, rewards con- 
tingent upon superior task performance, which 
provide an individual with significant infor- 
mation about his competence or ability at an 
activity, should be less likely to produce a 
decrement—or more likely to produce an 
increment—in subsequent intrinsic interest 
than rewards contingent upon task engage- 
ment per se (Deci, 1975; Lepper & Greene, 
1976; Ross, 1976). Thus, an appropriate test 
of the overjustification hypothesis requires a 
comparison across experimental conditions in 
which other relevant factors have been held 
constant. 

In addition, to provide data relevant to any 
hypothesis concerning intrinsic motivation, it 
is of central theoretical importance to dis- 
tinguish between two classes of experimental 
settings: those in which extrinsic rewards are 
potentially available for the target activity 
versus those in which extrinsic rewards are 
not perceived to be available (Lepper & 
Greene, 1976; Lepper et al., 1973). There 
are, of course, many reasons why a person 
might engage in a target activity; however, 
the focus of the oyerjustification hypothesis is 
on intrinsic as opposed to extrinsic reasons. 
This distinction has been operationalized by 
defining intrinsically motivated behavior as 
that which occurs in the perceived absence of 
extrinsic rewards (Deci, 1971; Lepper et al., 
1973; Ross, 1975). In the face of sufficiently 
powerful extrinsic rewards, individuals will 
often engage in activities in which they have 
no intrinsic interest. Therefore, assessments of 
subsequent intrinsic interest must be obtained 
in situations where subjects do not expect 
either tangible or social rewards to be con- 
tingent upon engagement in the activity. 

Furthermore, to preclude such expectations, 
some provision must be made to minimize the 
confounding influence of potential artifacts, 
such as demand characteristics, subjects’ reac- 
tivity to experimental procedures, and experi- 
menters’ expectancies (Johnson & Bolstad, 
1973; Levine & Fasnacht, 1976). In social 
psychological experiments, failure to keep 
personnel “blind” to subjects’ differential 
treatments may be sufficient grounds for out- 
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right rejection of a manuscript submitted for 
publication. In token economies, by contrast, 
personnel often have a justifiable “vested in- 
terest” in maintaining appropriate behaviors 
after tangible reinforcement has been with- 
drawn (O'Leary & Drabman, 1971). In per- 
haps the typical case, a teacher is trained in 
systematic observation techniques and con- 
tingent social approval behaviors as part of 
the token economy (Kazdin, 1973). In such a 
situation, it seems unreasonable to assume 
that the effects of this training will disappear 
when an attempt is made to reinstate pre- 
treatment baseline conditions, or that this 
situation provides an appropriate test of in- 
trinsic interest (Lepper & Greene, 1976). 

In short, a proper test of the relevance of 
the overjustification hypothesis to token econ- 
omies requires a between-groups design and an 
experimental setting in which time spent with 
a target activity can reasonably be attributed 
to intrinsic motivation. 


The Present Study 


This kind of setting was created in two ele- 
mentary classrooms by providing a time of 
day during which the entire class would par- 
take of a set of four activities with a common 
focus. Within the set, individual children were 
free to choose which activities they would 
engage in. As long as no differential contin- 
gencies were imposed among the activities, we 
presumed that an individual’s relative intrinsic 
interest in a particular activity was reflected 
in the time he or she spent playing with it, 
The dependent Measure, then, was the amount 
of time spent playing with particular activi- 
ties within the prescribed set. Given this situ- 
ation, it was possible to introduce and later 
withdraw a system of reinforcement contin- 
gencies, and to observe its effects on children’s 
immediate and subsequent interest in the vari- 
ous activities, 

: After baseline observations to determine 
initial relative preferences, children were ran- 
domly assigned to one of four groups, three 
experimental and one control. F or subjects in 
each of the three experimental groups, two of 
the four available activities were designated 
on an individual basis as “targets.” In the 
high-interest Stoup, the target activities were 
the two which each child had played with 
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most often during the baseline ='ase, In th 
low-interest group, they were tb- two whic 
each child had played with leas: = ‘ten durin 
baseline. In the choice group, tioy were th 
two chosen by each child indiy ally afte 
the baseline phase. During th. ireatmer 
phase of the study, then, each erimenta 
subject was differentially rewarde ‘ar speni 

ing time with his or her two targe ctivitie 
For subjects in the control grot ill fou 
activities were designated as targ Durin 
the treatment phase, these children cre con 
tingently rewarded for time spent v any o 
the four available activities, Thus, «ll four 
groups of children received conting. \t rein- 
forcement during the treatment pha. of the 
study. The difference between expe imental 
and control subjects was that expe mental 
subjects were differentially rewarded “or play- 
ing with their target activities, whe) as con- 
trol subjects were nondifferentially y warde 
for playing with any of the four : -ailable 


activities. 


Two features of this design are -ritical, 
First, since target activities were diff; ent for 
different subjects within the same experi- 
mental group, it was possible to keep ll per- 
sonnel in the study from knowing ar- child's 
experimental condition, Second, the re differ- 
entially rewarded group provided a asis of 
comparison for each of the three expe) mental 
groups, controlling for effects of tine and 
repeated measurement such as practice effects, 


satiation, and regression to the mean, Post- 
treatment data from each of the three experi- 
mental groups could therefore be compared 
with data from a group of subjects A 

i 


experiences had been otherwise identical. 

We expected to demonstrate a reinforce-| 
ment effect during the treatment phase in all 
three experimental groups. This effect would 
be indicated by a significant increase in time : 
spent with their target activities by each | 
group of experimental subjects, relative to the 
time spent with appropriately matched of 
yoked activities by the subjects in the control 
group. We could then discover whether the 
same procedures that had produced a rein- 
forcement effect would also produce an oyei- 
justification effect after the contingencies had 
been withdrawn. This effect would be indi- 
cated by a significant decrease in time spent 

a 
t 


experimental sub- 
spent with appro- 
d activities by con- 


‘with target activities 
hy jects, relative to the 
g| priately matched or 
ef trol group subjects. 


M 


' Subject Population 


an elementary school 
mathematics program. 
During “math time,” ild worked on one of 
"F100 levels into which iementary mathematics 
} curriculum had been A distinctive feature 
of the program was it cit reliance on an elab- 
orate system of extr vards, including a bi- 
weekly “Awards Asser when certificates and 
trophies were presente } who had earned them 
since the last assem! his system of extrinsic 
tewards provided a natural way to deliver 
contingent extrinsic r ment during the treat- 
ment phase of the st 

Two classrooms se the immediate experi- 
mental setting. Subj e 44 fourth and fifth 
graders, selected afte seline phase according to 
a procedure describe The school population 
was predominantly r socioeconomic levels, 
with 40% of the f ceiving welfare. Of the 
4 subjects, 19 had surnames, 17 were An- 
Blos, 7 were Black, re was Asian. The mean 
total arithmetic grad valents on the California 
Test of Basic Skill fourth graders and fifth 
graders, respectively, 3.72 and 4.11 in the Oc- 
tober preceding the sy, and 4.51 and 5.54 in the 
May following the st 


Experimental Settir 


.| The study was cond 
f with an ongoing indivi 


Experimental Materials 


a 


fie required the constant availability of 
iis sre ae activities, comparable in initial interest. 
“math eee was met within the context of a 
Rsucture o uke four activities shared a common 
manipulative format, each consisting of a set of 
(instructi ve materials, a sequence of “task cards” 
lected aite students how to use the materials) se- 
aining “ extensive pretesting, and a folder con- 
(a) geoboards (peg boards with rubber bands, to 
and lat r); (b) Dienes blocks (cubes, 
(c) Kb cs cubes in different numerical bases) ; 
and size, to materials (items varying in color, shape, 
ideas); ‘and Promote logical thinking and set theory 
metrical sha (d) tangrams (puzzle pieces in geo- 
„Each child pes, to be matched to various templates). 
Peach activit Ae provided with his own folder for 
Sheet with y. Each folder was covered with a “log” 
and “tim, columns headed “date,” “time started,” 
Place hee pastel Each activity sheet had a 
0 be Recorded: fate on which it was completed was 
Cards, and acti There were sufficient materials, task 
choose to en; ivity sheets to allow any child to 
time, gage in any activity at virtually any 
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Procedure 


Five school days before the first day of the base- 
line phase, the experimenter (the second author) in- 
troduced “some new math games” to each of two 
classrooms, in her capacity as administrator of the 
school’s ongoing math program. For a regular, 4- 
hour period each day, she explained, four activities 
would be available for children to play with on their 
own, to help her “find out which games to use in 
our math program.” One activity was introduced in 
some detail on each of 4 consecutive school days. 
After this training period, a 5th day was devoted to 
administering a simple questionnaire which asked 
about children’s previous exposure to and anticipated 
liking of the four different activities. 

Baseline. During the next 13 school days, children 
played with the four activities with no differential 
reinforcement contingencies, except with the con- 
straint that each child was to try each of the activi- 
ties at least once. At the end of this baseline phase, 
the amount of time children had spent with each ac- 
tivity was calculated. Children were then blocked 
into groups of four on the basis of the extent to 
which they had concentrated their time on their two 
most preferred activities and, within blocks, ran- 
domly assigned to conditions. A total of 51 children 
had been present in the two classes during the base- 
line period. Four children were dropped from the 
study, at the teachers’ request, because of various 
learning problems; an additional three children, 
showing the most extreme concentrations of time on 
their two preferred activities, were eliminated to 
produce a balanced sample of 44 subjects. The sam- 
ple included 11 subjects in each condition, 5 fourth 
graders and 6 fifth graders, or 5 boys and 6 girls. 

Treatment. On the first day of the treatment phase, 
the experimenter handed out a sheet of paper to each 
child and asked the class to read it silently. She 
then went over this sheet, point by point, with the 
class, as follows: 


[The experimenter] is very proud of the way you 
have all been helping her. Now, she is going to 


help you! 


1. If you work with certain games for 3 hours, 
[the experimenter] will give you credit on your 
math award sheet for having completed 1 math 
level. [The award sheet was the basis on which 
children were given certificates and trophies at 
their Awards Assemblies.] (If you work 6 hours, 
you will get credit for 2 levels. If you work for 
9 hours, you will get credit for 3 levels.) But, 
you'll only get credit if you write down correctly 
the time you start and stop, just as you have been 
doing. [Children had been recording data on their 
log sheets.] 

2. [The observer] will be in this room every day 
during math lab to check on how well you keep 
track of the time you play with the games. If she 
finds that you do not record the time you played 
with the game correctly, you will not get any 
credit for playing with that game. 
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3. [An assistant] will keep track of how long you 
play with the games on a chart. [The experimenter 
held up before the class a 28 X 22 inch (.71 X 
+56 m) bright orange posterboard chart with the 
title, “New Levels Completed for Math Awards 
Assembly.” Each child’s name was entered down 
one side, and headings across the top indicated 
how many hours and levels were completed.] When 
you come to class on Wednesday and Friday, you 
will see how long you have played and how much 
longer you have to play to get credit for a level, 


4. As soon as you reach the 3-hour point on the 
chart, your teacher will give you a paper to bring 
to your math teacher. This paper will tell your 
math teacher to enter Math Lab on your award 
sheet in your math profile, 


5. Next [the experimenter] is going to give you a 
sheet with your name on it and the list of games 
for which she will give you credit toward your 
math award(s). 


At this point, the experimenter handed out a sheet 
in the format of a personalized letter to each child 
in the class. After the date and greeting, it said 


You may play with any game you want to. 
Starting today, you will get 1 math level’s credit 
for every 3 hours you play with: [the subject’s 
target activities]. To help you remember, these 
folders have a big green X in the corner. [Target 
folders had been marked Prior to class except as 
explained below.] 


Subjects in the high-interest group were given the 
two activities with which they had spent the most 
time during baseline as their target activities. Low- 
interest group Subjects were given the two activities 
with which they had spent the least time during 
baseline. For control subjects, all four activities were 
listed; thus, for these subjects, there remained no 
differential contingencies among the four activities. 
Subjects who had been assigned to the choice group 
were given a sheet with two blank lines, When all 
the sheets had been handed out, subjects in the 
choice group were asked to step outside the class- 
room with [the first author]. Meanwhile, the rest of 
the class proceeded to go to the math lab area as 
usual, where they found that the upper right corner 
of the log sheet on the cover of the folders for their 
target activities had been marked with a large green 
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about their choices by themselves, 2+2 then to wi 
their choices on the sheet and bri) to [the fj 
author] at the back of the classroo \s each chi 
did so, the now-designated target folde-s were mark 
with a large green X and all four fo 5 were giy 
to the child to bring over to the math -b area. Fro 
this moment on, children in all fo experiment 
groups were treated alike. 

The treatment phase continued for more day 
For the first 3 of these days, the clas ym teache 
went over the experimental instruction the begi 
ning of the math lab period. The c providiy 
feedback to the children was updated y Tuesda 
and Thursday evening, so that upon arrivin 
in class Wednesday and Friday mor: childre 
would typically take note of their p. 

Withdrawal. On the first day of t! thdraws 
phase, the experimenter said that others 1e schoo 
were resenting the “unfair advantage” th this an 
the other class were enjoying, and thy’ she wa 
inclined to agree that it would be unfair ‘o continy 
such an advantage any longer. Therefor’, she wa 
taking down the chart as a sign that wor cing wil 
math lab activities would no longer earn credit fo 
math levels. In addition, the upper righ: corner of 
the log sheet on the cover of all the folder had beu 
clipped off, removing the discriminative ci to what 


had been target activities. On the other ‘and, she 
continued, there was no reason to remov the math 
lab materials from the classrooms; in fact, the 
teachers had been really happy with the math Jab 
Period. Children could continue to hei» her by 
working as before, or by making up their «wn activ: 
ity sheets, which she would use in other <- ssrooms, 

After the 13th day of the withdrawal phase, all the 
folders were removed from the Classrooms, The next 
day, the experimenter administered a questionnaire; 
telling children that their preferences wouid deter 
mine which activities stayed in their rooms. with tht 
other activities destined for other classrooms, Befort 
children left school on the day of this Jast question 
naire, the folders for the two activities which eadi 
class had “voted for” were returned to the mati 


lab areas, 


Data Collection 


The dependent variable of primary interest was 
the time spent by each subject with each of the at 
tivities. During the baseline phase, children wert 
instructed to record the date and time whenever they’ 
took one of their folders from or back to the mall! 
lab area, on the log sheet on the cover of the folder 
Their recording behavior was monitored carefully 
and social approval was delivered contingently when 
appropriate recording behavior was observed. Al! 
though this procedure produced acceptably reliable 
data (83% to 91% agreement with data recorded by 
a classroom observer), the time and effort necessary 
to achieve the result were inordinate. Accordingly; 
from the first day of the treatment phase to the end 
of the study, time data Were recorded by a classroom 


observer as well as by the students. In addition 10 
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time each child spent with 
was also possible to deter- 
he children completed the 
ad to estimate the accuracy 
ne, allowing an examination 
e experimental treatments on 


these data indicat 
each of the activi 
mine the rate at 
various activity s 
of the work they | 
of possible effects < 
performance, 


ESULTS 


Both within «1d across phases, the four 
activities varied ss to how difficult they were 
to complete accurately (percent correct) or 
‘teed (rate completed). To compensate for 
hese differences among activities, several em- 

Be. derived weighting systems were ap- 
Pana i performance data. Nevertheless, 
ae ‘iy analyses did not reveal any system- 
ees in accuracy or rate of per- 
e i p target activities, either between 
ae etween phases. Nor did subjects’ 
kral S the final questionnaire, scores on 
cee r tests, or progress in the school’s 
A os program show evidence of dif- 
Dike: etween groups. Although differences 

aes ea on any of these global mea- 
not surpris have been of some interest, it is 
techie. 6 that none were found, since the 
designed ; Contingencies in this study were 
with partic aet the amount of time spent 
was done cular activities, rather than what 
he with them. Significantly, there were 


no reliable differences between groups or be- 
tween phases in the mean time spent per day 
on all four activities combined; instead, the 
effects of our experimental manipulations were 
apparent only on the measure of greatest 
interest, the amount of time children spent 
with their target activities. 


Within-Group Comparisons 


A primary concern of the study was the 
demonstration of a reinforcement effect for 
each of the three experimental groups during 
the treatment phase. The relevant data are 
presented in Figures 1-3. These figures show 
the mean time per day spent with target ac- 
tivities during the three phases of the study 
by subjects in each of the three experimental 
groups, as well as comparable data for con- 
trol group subjects. For each of the three ex- 
perimental groups it can be observed that: 
(a) the mean time per day spent with target 
activities during the baseline phase, though 
variable from day to day, was essentially 
stable across days; (b) time on target during 
the treatment phase was greater than it had 
been during the baseline phase; and (c) re- 
moval of the contingency resulted in a de- 
crease in time on target from treatment to 
withdrawal. Both these shifts occurred for all 
but 2 of the 33 experimental subjects. In 
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Ficure 2, Mean time per day spent with target activities by high-interest group subjects d 
comparable data for control group subjects. 


addition, ¢ tests for correlated means estab- 
lished that all these within-group differences 
between phases were statistically significant 
at or beyond the p < .01 level.? Thus, the 
differential contingencies produced a clear 
reinforcement effect in each of the three ex- 
perimental groups.* 

It should be noted that the data for con- 
trol group subjects in Figures 1-3 represent 
three different sets of data from the same 
subjects. Each set of data was generated by 


2 All significance levels are based on two-tailed 
tests. 

The pattern of data for the low-interest group 
(Figure 1) is precisely that typically found in operant 
studies in applied settings. In fact, 1 of the 11 chil- 
dren in this group failed to respond to the treatment 
manipulation, a state of affairs quite familiar to 
researchers in this field (Kazdin & Bootzin, 1972; 
O'Leary, Becker, Evans & Saudargas, 1969). For the 
other 10 children in the group, the mean time per 
day with target activities during treatment was 22.4 
minutes with a standard deviation of 8.2 minutes. 
For the “deviant” child, the mean time per day 
with target activities was only 1.2 minutes. Conse- 
quently, this subject’s data were excluded from the 
low-interest group for purposes of subsequent be- 
tween-groups comparisons. In addition, 3 subjects left 
school before the end of the study. Since all of them 
were in the control group, every effort was made to 
use all of their data whenever possible. Thus, for 
high-interest and choice group comparisons with con- 
trol subjects during withdrawal, analyses are based on 


following the rule that had determine: which 
activities would be target activities v one 
of the experimental groups. For exam) e, the 
control group data in Figure 1 show t} mean 
time per day that these subjects spe — with 
the two activities with which they sp at the 
least time during the baseline phase, ti “‘low- 
interest” activities of the control jects. 
Similarly, data for control subjects in ‘igure 
2 show the time they spent with their ` tially 
“high-interest” activities. The data for -ontrol 


subjects in Figure 3 show the time they spent 
with two activities yoked to the choices of 
target activities made by choice group sub- 
jects, in terms of the ranks of the chosen ac- 
tivities within each choice group subjects’ 
baseline preferences, Since subjects were 
blocked before assignment to conditions, each 
experimental subject was matched to the con- 
trol subject in the same block. 

It should be clear that the data in Figures 
1 and 2 for control group subjects are not 
independent, since for each subject the activi- 
ties plotted in one figure are necessarily the 
two of the set of four not plotted in the other 
figure. What is represented twice in these two 
figures, then, is basically a single effect: Dif- 
ferences in the likelihood of playing with dif- 
ferent activities during baseline tended t0 
diminish by the end of the study for subjects 
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attained statistica nificance. 

4 the other hand, subjects in each of the 
ih el groups spent less time 
erat er target activities during the with- 
sawal phase than they had during the base- 
a For the low-interest group this 
Fiough y was not statistically significant, al- 
$ interpr baa of _Figure 1 suggests that 

fia s FTE of this lack of significance 
ice i allowance for the restricted range 

Gther two € Initial baseline and zero. For the 

ment dro Sean groups, this posttreat- 
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group subjects, and ¢(10) = 2.38, p < .05, for 
high-interest group subjects. 

Within the logic of a within-group design, 
of course, these data may be viewed as evi- 
dence consistent with the overjustification hy- 
pothesis. However, the nondifferentially rein- 
forced control subjects’ data suggest the possi- 
bility that posttreatment decrements in in- 
terest were, at least in part, the result of bore- 
dom, satiation, or some other process ensuing 
directly from the high amount of time spent 
with target activities during the treatment 
phase. These explanations suggest a negative 
relationship between previous time spent with 
particular activities and subsequent interest 
in those activities. Specifically, correlations 
of time spent with target activities should be 
negative between treatment and withdrawal 
phases and/or between baseline and treat- 
ment phases combined and the withdrawal 
phase. Both these correlations were computed 
for each experimental group, the three experi- 
mental groups combined, and for each of the 
three sets of target activities for control group 
subjects. All of these 14 correlations were 
positive or no more negative than r = —.05; 
thus, they do not support this class of expla- 


nations.* 


4 In principle, of course, these correlations are not 
inconsistent with the possibility that all children 
proportionately lost interest in their initially pre- 
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TABLE 1 
X MEAN MINUTES PER DAY SPENT WITH TARGET ACTIVITIES BY EXPERIMENTAL SUBJECTS RING 
EACH PHASE AND COMPARISONS WITH CONTROL GROUP SUBJECTS 
Phase F: | 
i Target Activities Baseline Treatment Withd 
High-probability during baseline . A 
Teqperinental (HI) 19.3 25.6 14 
Control (C) 17.8 15.0 13.0 
Chosen after baseline 
Experimental (CH) 14.5 219 68 
Control (C)* 13.8 14.8 16.1 
Low-probability during baseline = 
t Experimental (LO) 6.7 22.4 3.5 
Control (C) 61 10,6 117 
ł tests } 
HI vs, C (n = 11)! 1.57 2,83* Er 
CH vs, C (n = 11) <i 2.46" 3.55* 
LO vs, C (m = 10)» 1,15 3.54" 2,54* 
a Target activities were yoked to ranks of preferences of activities chosen by experimental subjects. 
» Comparisons during the withdrawal phase are based on two fewer subjects per cell. 
wh Son, 
Between-Groups Comparisons cell means on which they are based, re pre- 
; sented in Table 1,° 
Tt should be emphasized, however, that During the baseline phase, of co: se, ex- 
within-group coriparisons and between-groups perimental and control subjects did nt differ 
comparisons are asking different questions. In jn the time they spent with the -tivities 
the present study, our primary concern was which would subsequently be desig ited as 
whether any of the experimental treatments targets for experimental subjects. Du ing the 
that produced a reinforcement effect would treatment phase, experimental gro’ > sub- 
also produce an overjustification effect, as jects spent significantly more time wih their 
indicated by a significant decrease in time target activities than did the subjects in the 
spent by experimental group subjects with control group (p < .05 or less in each case): 


their target activities, after the withdrawal 
of differential contingencies, relative to the 
time spent by control group subjects with 
appropriately matched or yoked activities, 
Since each experimental subject was matched 
with a control subject by the blocking pro- 
cedure described earlier, ¢ tests for correlated 
means were used to compare each experi- 
mental group with the control group during 
each phase. These comparisons, as well as the 


ferred activities over time (Johnson, Note 1); this 
sort of explanation, however, does not account for 
the relative effects of the three experimental treat- 
ments. Moreover, studies designed explicitly to con- 
trol for satiation have consistently found overjustifi- 
cation effects which could not be attributed to ef- 
fects of increased task engagement (e.g., Calder & 
Staw, 1975; Lepper & Greene, 1975; Ross, 1975; 
Ross, Karniol, & Rothstein, 1976). 


During the withdrawal phase, subjects in two 
of the three experimental groups spent signifi- 
cantly less time with their target activities 
than did control group subjects maiched or 
yoked to them. Although high-interest group 
subjects’ time on target was significantly De- 
low their baseline level, ¢(10) = 2.38, P< 
.05, they did not differ from control subjects 

(£ < 1). Subjects in the low-interest group 

and in the choice group, however, spent sig- 
nificantly less time with their target activities 
than did control subjects matched or yoked 
to them, £(7) = 2.54, p < .05, and #(8) = 


Each of the nine comparisons in Table 1 Was 
also made separately for boys and girls and for 
fourth- and fifth-graders. In each of the four sub- 
groups of subjects, the pattern of means is strikingly 
similar to that of Table 1 (see Greene, 1974). 
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e-trial contingent re- 
are capable of pro- 
jecrements in engage- 
reinforced activities. 
over the course of a 

» the removal of de- 
token programs, are 
verjustification effects 
-trial, “noncontingent” 
Reiss and Sushinsky 
led. Instead, together 
studies of the subse- 
ple-trial reinforcement 
Colletti, Ersner-Hersh- 
ison, in press; Colvin, 
ad, & Lobitz, 1976; 
, & Ross, 1968), the 
strate that the use of 
ward procedures to pro- 
ement in target activities 
yncomitant decreases in 
situations where neither 
xtrinsic rewards are per- 


The present fir 
some conditions, 1 
inforcement proce 
ducing posttreatm 
ment with previc 
These results, obt 
13-day period foll 
monstrably effect 
clear evidence ti 
are not limited to 
reward proceduri 
(1975) have c: 
with data from 
quent effects o 
procedures (Bro 
field, Hershfield, 
1973; Johnson, 
Meichenbaum, I 
Present results d: 
powerful systema 
mote increased en 
may also produc 
task engagement, 
tangible nor soci: 
ceived to be availa 

These data are sufficient to indicate that 
the simple distinction between single-trial 
— multiple-t i! reinforcement procedures 
ae provide an adequate account of the 
te sie under which overjustification ef- 
oe be obtained. Unfortunately, how- 
A ane differences in results across the three 
Eba ental conditions do not allow an un- 
* differen conclusion concerning the effects 
iat | cae in subjects’ levels of initial in- 
7 n the present situation, On the basis 
. Previous laboratory data on overjustifica- 
Bien Calder & Staw, 1975), one would 
ikel verjustification effects to be more 

y aak subjects rewarded for engage- 
ae Ctivities of relatively higher (vs. 
vane lower) initial interest. Instead, dur- 

ithdrawal phase, high-interest sub- 
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jects showed no less interest than control 
subjects, while low-interest subjects showed 
significantly less subsequent interest than 
control subjects. In part, this reversal stems 
from the unanticipated general drift exhibited 
by control subjects over the course of the ex- 
periment, in which initial preferences tended 
to diminish over time. Therefore, interpreta- 
tion of these particular differences is more 
problematic than would have been the case 
had control subjects’ preferences remained 
constant throughout the study. On the one 
hand, this difference between the high- or low- 
interest groups may reflect a genuine effect 
of the interest manipulation; on the other 
hand, it may equally well reflect the relative 
insensitivity of the present design for assess- 
ing changes which parallel the temporal trends 
in the control condition, 

In particular, the results of a recent doc- 
toral dissertation (Colvin, 1973) indicate that 
some caution is appropriate in interpreting 
the present failure of the high-interest condi- 
tion to produce a decrement in subsequent 
interest relative to control subjects, since Col- 
vin’s study provides a conceptual replication 
of the high-interest and control conditions in 
the present study and obtains clear evidence 
of significant overjustification effects on both 
within-group and between-groups comparisons, 
Colvin’s subjects were elementary school chil- 
dren who had been selected for their demon- 
strated interest in art. They were asked to 
participate in the study on a voluntary basis 
and then randomly assigned to one of two 
experimental conditions. Token reinforcement 
systems were introduced after baseline ob- 
servations had determined the children’s rela- 
tive preferences between two sets of different 
art materials. In one group, children were con- 
tingently reinforced for time spent with their 
preferred materials; in the other group, chil- 


6 These differences between group means were 
paralleled by differences in the number of individual 
subjects in each group whose time with target ac- 
tivities decreased from baseline to withdrawal. This 
decrease occurred for 8 of 11 high-interest subjects 
versus 6 of 9 control subjects’ “high-interest” activi- 
ties, 7 of 10 Jow-interest subjects versus 3 of 8 con- 
trol subjects’ “Jow-interest” activities, and 9 of i 
choice group subjects versus 3 of 9 control subjects’ 
activities yoked to their choices. 
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dren were noncontingently reinforced after 
each session. During this treatment phase, 
contingent token rewards proved highly effec- 
tive in altering subjects’ choices, Then, after 
4 weeks, the token systems were abruptly 
terminated for both groups. Although the two 
groups did not differ in the percentage of time 
they spent with their preferred materials dur- 
ing the baseline phase, the group which had 
been contingently reinforced while the token 
systems were in operation spent significantly 
less time with these materials during an ex- 
plicit “extinction” phase than did the group 
which had been noncontingently reinforced. 
In the latter group, moreover, initial activity 
preferences were maintained throughout the 
experiment. These findings provide evidence 
that token reinforcement procedures can pro- 
duce decreases in posttreatment engagement 
with previously reinforced activities among 
subjects selected for an initial high interest 
in the activities, 

At the same time, it should be noted that 
Colvin’s study and the present one differed 
in several respects, For example, his procedure 
included two sets of art activities while ours 
included four sets of math activities, and his 
reinforcers were art and play materials while 
ours were rewards associated with the school’s 
ongoing math Program. In addition, Col- 
vin’s subjects were volunteers from a Univer- 
sity School while ours were a “captive audi- 
ence” from a public school Population of 
relatively lower socioeconomic levels, It re- 
mains for further investigation to determine 
the significance, if any, of these differences in 
populations or procedures between the two 
studies. 

Interestingly, compared to the data from 
the high-interest and low-interest conditions, 
subjects in the choice condition of the pres- 
ent study showed dramatic decreases in sub- 
Sequent interest, which were apparent in both 
within-group and between-groups comparisons, 
Indeed, during the 13-day withdrawal phase, 
choice subjects? Posttreatment engagement in 
the previously rewarded activities showed no 
Overlap with baseline data from this same 
group or posttreatment engagement levels 
pa by control subjects, In this light, 

portant to note that the experimental 
Procedure which distinguished this condition 
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involved more than the provisio» of an op. 
portunity for subjects to select target ac- 
tivities for which rewards were to be pro- 
vided, Specifically, in this cond’: on subjects 
were asked individually to thin- about the 
activities and the reward progra” . and were 
asked to make a public stateme | of their 
desire to receive rewards continge-: upon en: 
gagement in particular activities, h a pro- 
cedure, which elicits an overt mitment 
from the subject and explicitly ects the 
subject’s attention to the conseque es of his 
behavior (cf. Collins & Hoyt, 197. Kiesler, 
Nisbett, & Zanna, 1969; Zanna, per, & 
Abelson, 1973), is reminiscent. of p cedures 
employed in previous laboratory de. :onstra- 


tions of overjustification (e.g., Lepp: et al, 
1973), in which subjects are asked ‘o make 
an explicit acknowledgment of the instru- 
mentality of their engagement in a p ticular 
activity prior to undertaking the activity." 
Certainly, the possibility that vari» ions in 
the manner in which token reward p ograms 
are presented may affect subsequent `nterest 


following the withdrawal of a relative y long- 
term reward program deserves furt or em- 
pirical investigation (cf. Feingold & M ‘honey, 
1975). It would seem particularly inportant 
to note that there may be two oper: ‘onally 
distinguishable components or stages to a 
subject’s response to reward manip ations. 
Most research has focused on param: ers of 
reward (or tasks, or subjects), in the hope 


that a set of boundary conditions might be 


7 Note that the salience of instrumentality can and 
should be distinguished from the salience of particu- 
lar rewards or contingencies. The distinction is ana- 
logous to Kruglanski’s (1975). distinction between 
explanations of voluntary actions versus explana- 
tions of events or occurrences, In the former cast, 
naive attribution is Purposive or teleological, parti- 
tioning reasons into ends and means; specifically, 
distinguishing between endogenously (intrinsically) 
and exogenously (extrinsically) motivated actions. 
In the latter case, naive attribution is causal, dis- 
tinguishing between personal and environmental ex- 
Planations (cf. Heider, 1958; Jones & Davis, 1965). 
The overjustification hypothesis, of course, ad- 
dresses the means-end distinction rather than the 
Person-environment distinction (cf. Lepper, Greene, 
& Nisbett, 1973, p. 130). Therefore, the theoretically 
crucial issue is what makes instrumentality salient) 
rather than what makes particular stimuli salient 
(cf, Kruglanski, 1975, pp. 402-405), 
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of a variety of extrinsic constraints, includ- 
ing “enforced rehearsal” (Rosenhan, 1969), 
surveillance (Lepper & Greene, 1975), ex- 
ternally imposed deadlines (Amabile, De- 
Jong, & Lepper, 1976), and salient, expected 
rewards (e.g., Deci, 1971; Lepper et al., 
1973; Ross, 1975), as well as multiple-trial 
contingent reinforcement procedures (Colvin, 
1973; the present study). Similarly, the pre- 
dicted consequences of these manipulations 
have been assessed by diverse dependent mea- 
sures, including qualitative indices of task 
performance (e.g., Garbarino, 1975; Krug- 
lanski, Friedman, & Zeevi, 1971; Lepper et 
al., 1973) and questionnaire instruments (e.g., 
Calder & Staw, 1975; Kruglanski, Alon, & 
Lewis, 1972), as well as behavioral measures 
of subsequent intrinsic interest (e.g., Deci, 
1971; Lepper et al., 1973; Ross, 1975), It 
would appear, therefore, that overjustifica- 
tion effects and contrast effects should be 
characterized as constituting two conceptually 
distinct domains which partially overlap. 
Within the overjustification domain, behav- 
ioral effects which eventuate from multiple- 
trial contingent reinforcement procedures 
may be accurately described as instances of 
behavioral or incentive contrast. Conversely, 
within the contrast domain, the overjustifica- 
tion hypothesis affords one plausible theo- 
retical account for at least some of the effects 
with human subjects (cf. Johnson et al., 
1976). 

Further study of the possible role of cog- 
nitive mediation in contrast effects would 
seem to be warranted by the current literature 
on attempts to achieve generalization from 
token economy programs. A recent scholarly 
review of this literature (Kazdin, 1975) of- 
fered the following appraisal: 


Amid the enthusiasm over the progress already made 
in token economy research and the exciting trends, 
there remains a major 


tively few advances in 


void. There have been rela- 
developing a behavioral tech- 


nology which can be used effectively to maintain be- 
havior and to ensure transfer of training to settings 
where contingencies are not rigidly programmed. (p. 
263) 

ear implication of an at- 
e is to favor strategies 
ef- 


In this context, the cl 


tributional perspective is 
for achieving generalization of treatment 


fects that focus on subjects’ cognitions about 
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their reasons for engaging in target behaviors, 
rather than strategies that focus on program- 
ming the posttreatment environment (Greene, 
1974; Kopel & Arkowitz, 1975; cf. Kazdin, 
1975; Kazdin & Bootzin, 1972). From this 
perspective, generalization to nonprogrammed 
Settings is more likely to occur when training 
procedures induce subjects to make endoge- 
nous rather than exogenous attributions 
(Kruglanski, 1975) about their reasons for 
engaging in target behaviors. Exemplary of 
such procedures are: (a) the use of minimal 
and naturally available rather than overly 
powerful and arbitrary reinforcers (eg., 
O'Leary, Drabman, & Kass, 1973), (b) the 
use of various “fading” techniques in which 
extrinsic rewards are gradually phased out 
(e.g., Drabman, Spitalnik, & O'Leary, 1973), 
and (c) the use of self-control and self-rein- 
forcement techniques to replace or supplant 
externally imposed reinforcement programs 
(e.g., Brownell et al., in press; Drabman et 
al., 1973), 

In the present study, of course, theoretical 
objectives dictated some departures from in- 
formed token economy practice (cf, Kazdin, 
1975; O'Leary, in Press), which should be 
acknowledged in any attempt to evaluate the 
applied significance of the Present findings, 
For example, to provide a situation in which 
either increases or decreases in relative in- 
terest would be apparent, the present study 
employed “normal” subjects and a limited set 
of four mathematics activities of comparable 
initial interest, Similarly, to assess the effects 
of the reward system 
tentially confounding factors (e.g., feedback 
about one’s competence) held constant—the 
present reinforcement contingencies were 
based on time Spent with target activities 
rather than a performance-based criterion, 


egy for 
gains to unprogrammed settings, Had the 
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feelings of personal responsibility for th 
behavior via extended “fadin- Procedure 
for example, the results might wll have beg 
different. Considerable caution 10uld ther 
fore be exercised in extrapolai og from ti 
present study, and the present cults shoul 
not be taken as evidence that itively di 
signed token reinforcement systems will a 
ways, or eyen typically, prodr lecrement 
in subsequent intrinsic interest, 

At the same time, however, ‘he presen 
study also differed from most ing toker 
programs in its aim to distin betweer 
intrinsically and extrinsically ated be 
havior, Consequently, there was.» attempt 
made to maintain treatment gair: by sub: 
Stituting either contingent social a voval ot 
other naturally ayailable reinfo for the 
withdrawn token system; on the contrary, 
every effort was made in the presen’ study to 
eliminate such extrinsic incentives ¢/.ring the 
Posttreatment phase. Indeed, to « omplish 
this objective, classroom personnel were kept! 
“blind” to subjects’ treatment nditions 
throughout the study, and were ex; citly in- 
structed not to provide any differe:’al social 
reinforcement contingent on a childs choice 
of activities. From a theoretical poin: of view, 
these were the departures from typical token 
economy practice that merit the greatest at- 
tention. Given the applied objectives of token | 


reinforcement Programs, their implementation 
has rarely provided the theoretically appropri- 
ate conditions for testing the overjustification 
hypothesis ( Greene, 1974; Lepper & Greene, 
1976). But if the same reinforcement pro- 
cedures may have quite different effects on 
intrinsic as Opposed to extrinsic motivation, 


research presumed to evaluate token rein- 


forcement programs should include measures 
capable of assessing intrinsic as well as eX 
trinsic motivation, 


Conclusions 


A proper test of the relevance of the over- 
justification hypothesis to applied token re- 
inforcement programs requires a between- 
groups design and an experimental setting in 
which time spent with an activity can rea- 
sonably be attributed to intrinsic (vs. oS 
trinsic) motivation, The present study in- 
cluded these features in an applied reinforce- 
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Determinants of Intrinsically Motivated Behavior 


Zur Shapira 
University of Rochester 


iment was conducted to contrast and reconcile two cognitive theories of 
m and to investigate the expectancy determinants of intrinsically mo- 
havior. Subjects were presented with seven challenging tasks which 
heir level of difficulty. The subjects were asked to select their most pre- 
to work on and then to rank order the remaining tasks on their prefer- 

: working on them, Subjects were either told or not told that they would 
1,50 for successfully accomplishing the task. Subjects who could earn 
1 successful completion of the task chose relatively easy tasks (though 
isiest), while subjects who could get no money for performing the task 
her difficult tasks, The results indicate that when a salient monetary 
available, people behaved largely (though not solely) as predicted by 
y oriented expectancy-valence theory such as Vroom’s, whereas when 
ial reward was available, people’s choices and preferences could be most 
accounted for by an expectancy-valence theory utilizing an intrinsic 
unction which relates in a positively accelerated manner to task difficulty. 


Most cognitive heories of motivation (e.g., 
‘Atkinson, 1964 win, 1938; Peak, 1955; 
| Rotter, 1954; ‘i inan, 1959; Vroom, 1964) 
postulate two m. > determinants of motivated 
| behavior: expect: -y and valence. Expectancy 
simply means su! ive probability or a degree 
of belief that a ain action is followed by 
Specific outcomes. Valence (cf. Lewin, 1938) 
ìs the psychologic... value that is attached to 
an outcome. Thus + person’s motivation to do 
\! particular beh: vior is a function of the 
Valence of the ou comes which might follow 
the behavior, ard the person’s subjective 
probability that the outcomes actually follow 
the behavior. The expectancy-valence models 
of motivated behavior which have generated 
the greatest amount of empirical research are 
those of Atkinson (1964) and Vroom (1964). 


Wroom’s Model 


» z 
oons (1964) expectancy theory is a 
ae g model of work motivation 

ch utilizes the concepts of expectancy and 
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valence. There are two propositions in Vroom’s 
model. The first proposition allows one to 
assign valence to outcomes of a behavior (these 
are called first-order outcomes). Vroom 
hypothesized that the valence of a first-order 
outcome is determined by the value of other- 
outcomes (called second-order outcomes) to 
which the first-order outcomes can lead. For 
example, the valence of getting a college degree 
(a first-order outcome) is determined by its 
possible consequences such as being admitted 
to a graduate school, getting a better job and 
a higher salary, and so on. More specifically, 
Vroom asserted that the valence of a first- 
order outcome is a function of the valence of 
the second-order outcomes which it can lead 
to and the instrumentality of the first-order 
outcome for the achievement of each second- 
order outcome. 

TInstrumentality takes on values from +1 to 
—1, where +1 represents certainty that the 
second-order outcome follows from the first- 
order outcome, and —1 represents certainty 
that the second-order outcome cannot follow 
the first-order outcome. Mathematically, the 
valence of an outcome is said to be a mono- 
tonically increasing function of the sum of the 
product of valences of all second-order out- 
comes times the instrumentalities of the first- 
order outcomes for the attainment of these 
second-order outcomes. This is described in the 
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following equation: 


V= ME Vw] j= yen C) 


where Vy is the valence of outcome j, and Jj, 
is the instrumentality of the first-order out- 
come j for the attainment of the second-order 
outcome $. 

The second proposition in Vroom’s model 
allows one to determine the motivation to do a 
particular behavior. Motivation (which Vroom 
calls force toward action) is determined by the 
sum of the products of the valences of the 
different first-order outcomes times the ex- 
pectancies that the specific action being con- 


sidered results in that outcome. Mathe- 
matically, this is described by: 
P= fC (EsVi)] i=1,...,0, (2) 


ja 


where F; is the force toward action, and By 
is the expectancy (or subjective probability) 
that action 7 leads to outcome j, and V; is the 
valence of that first-order outcome which was 
calculated using the previous equation. 

The model further assumes that people 
choose the action which has the strongest 
positive (or weakest negative) force, which is 
similar to the notion, in decision theory, that 
people choose in a way which maximizes their 
subjective expected utility. 


Atkinson's Model 


Atkinson (1964) suggested a theory of 
achievement motivation which is also within 
the expectancy-valence framework. The basic 
multiplicative relationship in his model is 
P.X I, where P, stands for the subjective 
probability of success on the task, and Z, is the 
incentive value associated with success on the 
task, 

Atkinson asserted that there are two con- 
flicting tendencies operating in people who 
are faced with achievement-related tasks. The 
first is the tendency to approach success TA 
and the second is the tendency to avoid failure 
(Tas). The resultant tendency to approach or 
avoid achievement-oriented activity (T4) was 
postulated by Atkinson to be the sum of these 
conflicting tendencies, that is, T4 = T+ 


ZUR SHAPIRA 


(—T,,). According to Atkinson, «he tendeng 


to approach success in any sp: case (7) 
is determined by the product of he motive fo 
success (M,, a personality varie le) and th 
basic multiplicative relation P, > Similarh 
the tendency to avoid failure i uny speci 
case (Tas) is determined by th- product 
Mar (the motive to avoid fai ) times 
multiplicative relation which c ponds {g 


the basic multiplicative relation 
introduced above, that is, the p 


hich was 
ability 


failure (Ps) times the incentive val- of failu 
(—I;). Thus, 7, = M,(P, X I | Ty = 
Mar(Ps(—J,)]. Atkinson assun hat thé 
incentive value of success is neget related 
to the probability of success, tha: -, people 
desire these successes which they are | ist likely 
to attain. He suggested a specific li: ar fune 
tion to describe the relation betwer he two 
that is, Z= 1 — Pa a postulate hich h 
called a “special assumption” (1964 p. 243) 
Since Pp=1—P,, and y= Py, Ñ 
simple substitution yields the follow og ca 
tion describing the resultant motivat on: 7 

Ta = (Ms—Mar)[P.X (1—3). J 
This function gets its maximum ai P, = 5 
when Ms > Mar. Hence, Atkinson >redictol 
that people who are high-need achi- crs (fo 
whom Ms> Mur) choose moder: ly dif 
ficult tasks since in these situations thé 
motivation (i.e., their resultant tendency) i 


highest. 


Comparison of the Models 


The two models predict different results i 
certain situations. Assume that we have í 
series of similar tasks which vary in thei 
degree of difficulty, Assume also that there atl 
equal monetary rewards associated with suc 
cessful performance on the tasks. Suppose thal 
We present these tasks to people in an attemp! 
to find out on which tasks they would be most 
motivated to work. If all the monetary out 
comes which follow the completion of the tasks 
at each difficulty level are the same, then thë 
valences for the completion of each task a" 
equal. Hence, Vroom’s model predicts that 
people select the task with the greatest forcë 
toward action. The greatest force towa 
action is associated with the easiest task 


a 


valences are the same, then the 
rong the alternatives lie in their 
ind the higher the expectancy 
icr the task) the greater the 
sue the specific alternative. 
ther hand, postulated that 
lue is inversely related to the 
probability í cess. His model predicts that 
motivatior h is determined by the 
multiplicat ' the probability of success 
times the ve value is the highest for 
intermedia! Mcult tasks, for people who 
are high | achievement. In sum, in 
certain situ: Vroom’s model predicts that 
people choi sy tasks, and Atkinson’s 
model predi it they choose moderately 
difficult task 
The main « 
Atkinson’s m 
ceptions of va 
model does nx 
ship between { 
expectancy, I 
decision mod 
model assum: 
expectancy) a 
on the other 
value of succ 
function of it 
Deci (1975), ti 
formulations 


since if al 
differences 
expectanci 
(ie, the « 
motivation 
Atkinson, o) 
the incenti\ 


rence between Vroom’s and 
is lies in their different con- 
c or incentive value. Vroom’s 
pecify any particular relation- 
valence of an outcome and its 
line with the requirement of 
we can infer that Vroom’s 
at the two (i.e., valence and 
dependent. Atkinson (1964), 
|, argued that the incentive 
1a task is an inverse linear 
vectancy. As pointed out by 
lifferences between these two 
torm be explained in terms of 
intrinsic versus ¿xtrinsic motivation. 
_Intrinsically :notivated behavior is opera- 
tionally defined as choice behavior which is 
exhibited for no apparent external reward. The 
Psychological basis of this is in people’s need 
to be competent and self-determining (cf. de 
Charms, 1968; Deci, 1975; White, 1959). In 
other words, people behave in ways which they 
think allow them to feel competent and self- 
feternining in relation to their environment. 
“<trinsically motivated behavior, on the other 
hand, is behavior directed toward some ex- 
ternal Teward such as money, so people are 
&xtrinsically motivated if they engage in 
activities to get external rewards. 
T Vroom’s model is one of extrinsic motivation. 
Bice take on their valence in accord 
ive oe er outcomes (rewards) which they 
ktos rpa Cn for achieving. Several 
wa lave proposed adding an intrinsic 
or to Vroom’s model (cf. Campbell, Dun- 


EXPECTANCY DETERMINANTS 


1237 


nette, Lawler, & Weick, 1970; Galbraith & 
Cummings, 1967; Porter & Lawler, 1968), yet 
the basic formulation of the model is ex- 
trinsic in orientation. 

In contrast to Vroom, Atkinson’s (1964) 
model is primarily one of intrinsically moti- 
vated behavior. He defined incentive value 
purely as an intrinsic source of valence, that 
is, the incentive value is related to the relative 
degree of success on the task irrespective of 
other rewards which may follow. There is no 
extrinsic aspect in Atkinson’s (1957) original 
formulation, though Feather (1961) has pro- 
posed the addition of an extrinsic factor to 
the model. aah 

Hypothesis 1 is set to test (a) whether 
people’s choice behavior in a situation where 
there is a salient monetary reward could be 
better accounted for by Vroom’s model than 
Atkinson’s model and (b) whether, if ex- 
trinsic rewards are not available, Atkinson’s 
model predicts people’s choice behavior better 
than Vroom’s model. 


Hypothesis 1 


If a person is engaged in a challenging 
activity and if the person can choose the level 
of task difficulty that he/she wishes to pursue, 
then a person who can get no monetary re- 
wards for performing the activity should choose 
a more difficult task to work on than a person 
who can get monetary rewards for performing 
the activity. 


Task Difficulty and Intrinsic Motivation 


Atkinson (1964) postulated that the rela- 
tion between J, and Ps is a linear one (i-e., that 
T, =1-—P,), without providing any justi- 
fication for this assumption. Indeed, he called 
it a special assumption and claimed that “the 
ultimate value of this assumption is to be 
judged in terms of its testable implication” 
(1964, p. 242). The main testable implication 
is that the resultant tendency to approach 


achievement-oriented activity (for high-need 


achievers) is described by a symmetrical, 
inverted U function of the probability of 
success, which reaches its maximum at P= Ashe 
The evidence regarding this implication is not 
conclusive: some studies supported it (ef. 
‘Atkinson, 1974), while other studies did not 
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(cf. de Charms & Dave, 1965; Karabenick, 
1972; Kukla, 1975; Weinstein, 1969). The 
special assumption of the identity between J, 
and (1 —P,), however appealing and par- 
simonious, has not been confirmed empirically. 
A possible reevaluation is presented here. 
Suppose that you are engaged in working on 
a challenging task which has different degrees 
of difficulty. Let the degrees of difficulty be 
described in terms of the probability of success 
on the tasks. Assume also that there are no 
external rewards associated with this activity. 
Suppose that the probability of success drops 
from .5 to .4. The challenge in solving the 
difficult task (P, = 4) is greater than the 
challenge in solving the easier one (P, = .5). 
Imagine now that the probability of success 
on a similar task drops from .2 to .1, Again, the 
challenge and anticipated satisfaction in 
solving the more difficult task, denoted 
Ty(P, = .1), is greater than that of the easier 
task, denoted Jy(P, = .2). However, the addi- 
tional satisfaction that can be gained by suc- 
cessfully accomplishing the P, = .1 task versus 
the P,= .2 task, namely, Iy(P, = .1) — 
Ty(P; = .2), is greater than the additional 
satisfaction that can be gained by successfully 
accomplishing the P, = .4 task versus the 
P.=.5 task, that is, Zy(Ps= 1) —Iy(P,=.2)> 
Iy(P, = 4) — Ty(P, = .5). This inequality 
means that more challenge and satisfaction are 
associated with overcoming more difficult 
tasks, and that the challenge and anticipated 
satisfaction increase in a more accelerated way 
than the linear relation suggested by Atkin- 
son’s incentive value formula. Mathematically, 
intrinsic valence (denoted Jy) is a function of 
the probability of success and is described by 
the following power funetion: 
Iy=cl=P)'¢>0, k>1, (4) 
where ¢ is a constant which depends on the 
unit of measurement, and k is a parameter 
which may have different values for different 
people. Such an intrinsic valence function 
corresponds to the nature of intrinsic motiva- 
tion as it is conceptualized by the terms of the 
heed to feel competent and self-determining 
(ci. de Charms, 1968; Deci, 1975; White, 
1959), more than does achievement theory. 
As Deci (1975) has stated, “People seem to be 
engaged in the general Process of secking and 
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conquering challenges which are optimal fo 
them” (p. 62). Achievement t did no 
“let” people seek their own opt hallenge, 
rather, it stated that there is í one point 
of optimal challenge (ie, wh P,= 5) 
which is identical for every per who is a 
high-need achiever. The present mulation 
“allows” people to seek their ( level of 
optimal challenge and does not ; ceiling 
on the process of seeking an optim iallenge, 
The motivation model suggeste: cre is a 
decision-making model, similar Vroom’ 
and Atkinson’s models. Note, he r, that 
the present Jy function relates « to the 
intrinsic motivation aspects of task -havior 
The resultant intrinsic force (F, toward 
working on a particular task j, is d rmined 
by the multiplication of the prob: ‘lity of 
success on that task times the intrin alence 
of successfully accomplishing the tas! lathe- 
matically, this is represented by: 
Fj = P; X Iv; 6) 
substituting c(1 — P;)* for Iy; yields 
APSE A (0) 

where P; is the probability of succes on the 

specific task j. 

This formulation of F; determines family 
of functions which depends on the pr ii 
of success (P;) and on k. It attains ii» maxi- 


mum at P;=1/(k-+1), and as & incre*ses the 
median of the F; family of functions approaches 
the point P; = 0, although it never reaches 
that point.! 

The current formulation is a genera! repre- 
sentation of the relation of intrinsic valence 
to task difficulty. Atkinson’s model is a special 
case of this formulation, since when $= 1, 
Atkinson’s incentive value formula and the 
current intrinsic valence function are identical. 

Hypothesis 2 is set to test whether people’s 
choice behavior in situations where they can 
set their own goals and where they can get no | 


‘If the expected intrinsic value of failure was incor. 
Porated into the model, it would still yield the same 
prediction, It can be shown that the function 

F = (Pj(1— P)*— P(t P)] 


and F; described in Equation 6 reach their maximum 
at the same point. 


ards can be accounted for by 
odel. If not, the more general 
s useful in that it allows us 
he distribution of & across people, 
ter account of the intrinsic 
spects of people’s choice 


extrinsic r 
Atkinson’s 
formulatio: 
to determi: 
and hence, 
motivations 


behavior. 

Hypothesi: s also set to test Atkinson’s 
prediction. t cople’s preference functions 
are symm’! or example, assume a tašk 
which ha lternatives: Pa = .2 and 
ii 8 ing to Atkinson (1964), 
people are ir nt between the two alter- 
natives, sinc X (=P= 2X eS 
8X .2= P — Py). 


Hypothesis 2 


engaged in a challenging 
the person can choose the 
ity he/she wishes to pursue, 
can get no extrinsic rewards 
the activity, then he/she 
termediate difficulty and is 
tasks which are comple- 
ther with respect to their 


If a persor 
activity in wl 
level of task di 
and if the per 
for performir 
chooses tasks 
indifferent be 
mentary to o 
level of difficu 


METHOD 
Overview 


i Subjects were sented with seven puzzle tasks 
} of varying difcu:iies. They were asked to choose 
which one task the; were most interested in working on 
and then to rank order the remaining tasks on their 
preferences for woring on them, All the tasks involved 
working on a mec} | spatial-relations puzzle in an 
attempt to reproduce, with the puzzle pieces, a con- 
figuration which had been drawn on paper. The seven 
T conte rations differing with respect to 
ma ificuity, ranging from very easy to very 
ES te were cither paid or unpaid. Paid subjects 
a A that they would receive $2.50 if they Wear. 
a Aa produce in 15 min. the configuration which 
k ve ose, No money was mentioned to the subjects 
No-pay condition. 


Subjects 


——— a 


ee were 60 undergraduate students (30 males 
Partici eo) at the University of Rochester who 
Bilan: pr the study to gain credit toward the 
Tequire: pa of an introductory psychology research 
Wo nia They were randomly assigned to one of 

perimental conditions, 
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Apparatus 


The subjects were engaged in working on a puzzle 
called Soma, produced by Parker Brothers, which 
seems to be intrinsically motivating for college students. 
The puzzle is made up of seven pieces of plastic, cach 
of which is made to look like it is composed of three or 
four 1-in. cubes. There are a total of 27 1-in. cubes 
in the seven pieces. These pieces can be fitted together 
tọ form millions of configurations, including a cube 
3 in. on each side. 

During the experiment the subjects were asked to 
choose one configuration from seven which were dis- 
played to them. The configurations were of varying 
degrees of difficulty. The degree of difficulty was 
indicated by a statement which was written at the 
bottom of each configuration. It said “X% of the 
college students who have worked on this configuration 
have successfully reproduced it within 15 minutes.” 
The percentages (i.e., the X values) stated were 97% 
(very easy), 82%, 66%, 51%, 34%, 18%, and 3% 
(very difficult). The percentages stated were based on 
data collected in a pilot study. 

To minimize the influence of the shapes of the 
configurations on the subjects’ choices, two sets of 
configurations were used. Each set of configurations 
was presented in a random order to half of the subjects 
in each condition. 


Procedure 


Each subject was run individually in one experi- 
mental session which lasted about 30 min. When a 
subject reported for the experiment, he/she was greeted 
by the experimenter, taken to the experimental room, 
and seated at a table. The experimenter sat opposite 
the subject, who was told that the experiment was a 
problem-solving study and that he/she would be 
asked to reproduce one configuration, using the puzale 
pieces. The experimenter showed the subject how to 
assemble two very easy configurations. ‘Then, the sub- 
ject was asked to choose the configuration he/she 
wished to work on, out of the seven configurations which 
were displayed. The subject was also told that he/she 
would have 15 min. to work on the configuration. 
The subjects in the pay condition were told that if 
they succeeded in solving the configuration, they 
would receive $2.50. ‘ 

‘After the subject made his choice, the experimenter 


said: “In a few minutes you will actually start working 


on this configuration. Meanwhile, there is another 
do: Assume that this 


thing that I would like you to n 1 
configuration which you have just chosen is not avail- 
able and that only the remaining six configurations are 
available. Which of these six configurations are you 
most interested in working on?” Subjects then indi- 
cated their second choices and were asked to rank order 
the remaining five configurations, in the order that 
they would choose to work on them. Then the subjects 
were given the first configuration. which they had 
chosen, along with the puzzle pieces, and they were 
told that they would have 15 min. to work on the 


configuration. 


-50 
(n=18) 


40 


Pay Condition 


Percentage 
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No-Pay Condition ———_— i 


Probability of Success 


Ficure 1, Distribution of the subjects’ first choices, 


‘The time the subject took to complete the configura- 
tion was measured with a stopwatch. If he/she was 
unable to complete a configuration within 15 min., 
he/she was stopped and was shown how to do it. After 
that, the subject was asked to respond to a short 
questionnaire which was designed to measure the per- 
ceived difficulty of the task and the subject’s satis- 
faction from working on the task, The subjects were 
also asked to indicate whether their perceived prob- 
abilities of success were different than the probabilities 
presented to them, On completion of the questionnaire, 
subjects in the pay condition were paid their earninsg, 
and subjects in the no-pay condition received an 
unexpected reward of $1. The subjects were then 
debriefed about the goals of the study, 


Dependent Measures 


The subjects’ first choices constituted the main 
dependent yariable in the study. It was assumed that 
each subject chose the task on which he/she was most 
motivated to work. The entire rank order of the tasks 
was assumed to reflect the subject’s preferences and 
was used to estimate each subject's preference function. 

The subjects’ responses to the questionnaire were 
used to measure their satisfaction from working on the 
task each had done, the subjective difficulty of the 
tasks, and the differences between their perceived and 
the stated probabilities of success, 


REsutts 


Fach subject chose the task on which he/she 
was most motivated to work. The first hy- 


pothesis stated that subjects in the ; 
tion choose easy tasks, since by doin: so they 
would have the best chances of wi ing thd 


monetary reward, whereas subjects i the no 
pay condition choose tasks that v mort 
intrinsically rewarding. The perce: age ol 


times each task was chosen first, for cach olf 
the groups, is presented in Figure 1. 


Hypothesis 1 


To test the hypothesis that subjecis in the 
no-pay condition choose more difficult tasks 
than the subjects in the pay condition, the two 
groups were compared by means of the Mann 
Whitney U test. Significant differences werg 
found in the anticipated direction (U = 111.9, 
$ < 001, two-tailed test), thereby supporting 
Hypothesis 1. 


Intrinsic Valence and Probability of Success 


Hypothesis 2 predicted that while engaged 
in a challenging activity with no external res 
wards, a subject prefers a task of intermediate 
difficulty and is indifferent between tasks with 
complementary probabilities of success. To test 
the first part of Hypothesis 2, the distribution 
of the subjects’ first choices in the no-pdy 
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sted to determine whether it 
) a symmetrical distribution 
51%, This would be the case 
tasks of intermediate dif- 
xon Signed Rank test was 
und that the median of the 
ices distribution was signif- 
n 51% (T= 69, p< 001, 


condition was t 
could come 
with a medi 
if subjects < 
ficulty. The 
used and it v 
subjects’ first 
icantly lower 
two-tailed tes 
~ Therankor 
< subject could 
j his/her predic 
ment theory j 
- function for : 
| of Hypothes: 
between a su! 
achievement 
puted for eacl 


i the tasks provided by each 
tested whether it matched 
reference function. Achieve- 
ts a symmetrical preference 
ividuals. To test this part 
he rank order correlation 
; observed rank order and 
predicted order was com- 
ject in the no-pay condition, 


The 40 correla coefficients were then trans- 
formed to Z ies using Fisher’s r to Z 
transformation cc Hays, 1963, p. 530). The 
analysis show hat the average correlation 
did not differ ficantly from zero (3 = .42), 


and therefor 
tions, as refle 
did not supp: 
Next, the 
tank order as 
the intrinsic 
for each subje: 
k a predicted pr 
; 


subjects’ preference func- 
n their ranking of the tasks, 
ypothesis 2. 
clation between a subject’s 
s/her predicted order using 
nce function was calculated 
the no-pay condition. First, 
rence function was generated 
for each subject according to the value of the 
parameter & ch was determined by the 
first choice. Then, correlations between the 
Predicted and observed preference functions 
Were computed ond transformed to Z values. 
t he analysis showed that the average correla- 
tion was significantly greater than zero 
e= 2.05, p< .001, two-tailed test). The 
Subjects? preference functions were therefore 
Accurately predic (ed when the & values for the 
different subjects were used to construct an 
Individual preference function for each subject. 
MS supported ihe general intrinsic valence 
Power function as determining people’s prefer- 
“ce functions and choices. 


Ee subjects’ first-choice distribution could 
_. used to estimate the parameter k of the 
= preference function. Based on the 
Varig, tribution of the first choices, the 
4 Ac X= 1/(k+1) is likely to have a beta 

ution with the parameters @ and £, 
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since 0 < 1/(k + 1) < 1 for k > 0. The mo- 
ments method (cf. Johnson & Kotz, 1970) was 
used to estimate the parameters of the beta 
distribution that fitted the subjects first-choice 
distribution. The estimated parameters were 
a = .65 and 8 = 1.69. Thus, X = 1/(k + 1) ~ 
B(.65, 1.69) with a mean of .28 and a median 
of .23. The corresponding & values were a 
mean of 4.45 and a median of 4.37. Also, ac- 
cording to the beta distribution of 1/(& + 1), it 
was determined that 78% of the population 
have k values which are greater than 1, that is, 
a positively accelerated intrinsic valence 
function, 


Performance Analysis 


These data were gathered to validate the 
stimuli, that is, to examine whether tasks of 
greater difficulty were actually more difficult 
to accomplish. Table 1 presents the numbers 
and percentages of the subjects who succeeded 
in solving the puzzle which they chose and 
the average time of performance for those 
subjects who successfully accomplished their 
task. There is a monotonic trend (except for 
the 82% no-pay condition) between the degree 
of success on the task and the task difficulty. 
Subjects in both groups succeeded less on the 
more difficult tasks. These tasks also took 
longer to accomplish. 


Analysis of the Postsession Responses 


All the subjects reported that their perceived 
probabilities of success were identical to the 
probabilities that were presented to them. 
In addition, the subjects’ responses to the 
postsession questionnaire were used to examine 
the correspondence between the task’s ob- 
jective and subjective difficulty, as well as 
their interest in and satisfaction from per- 
forming the activity. The responses are also 
summarized in Table 1. With regard to the 
question “why did you choose the specific 
configuration?”, 36 out of the 40 subjects 
in the no-pay condition and 17 out of the 20 
subjects in the pay condition indicated that 
the degree of task difficulty was the major 
reason for choosing the specific configuration. 
Subjects who chose difficult tasks mentioned 
challenge as the reason for choosing a difficult 
task, while subjects who chose casy tasks 
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TABLE 1 
‘TASK PERFORMANCE AND ATTITUDES OF SUBJECTS IN Pay AND No-Pay CONDITI 
Objective difficulty 
Condition 3% 18% 34% Si% ` 66% 82% 01% 
No pay t 
Nibe of successful subjects 3 10 4 3 2 0 2 
Percentage of successful subjects 38 55 67 100 100 0 Es 
Average performance time (min.) 13.60 11.50 8.30 8.00 6.00 — 75 
Interest in task" 6.75 639 6.17 6.34 5.50 3.00 00 
Subjective difficulty of task* 6.38 6.06 5.34 5.00 4.50 6.00 00 
Effort exerted® 650 6.00 534 6.00 6.00 7.00 00 
Pay 
Number of successful subjects 0 — 1 1 2 JZ 
Percentage of successful subjects 0 = - 90 50 50 hom 
Average performance time (min.) — _ 8.00 7.00 7.50 6.40 0 
Interest in task® 700 — 650 6.50 6.75 6.11 0 
Subjective difficulty of task® 7.00 — 5.00). 35:50- = “4:75 4.55 10 
Effort exerted* 6.00 — 600 650 475 4.78 10 


s These were 7-point scales. The data reported here were transformed so that the score reported here is 8 minus t! 


All the scores presented here are mean scores, 


(in the pay condition) indicated that they 
chose them to maximize their chances of getting 
the monetary reward. The relation of the task 
chosen to money was indicated by one subject 
from the pay group who chose a relatively 
difficult task (i.e, where P, = .34), “If the 
$2.50 were not there,” the subject said, “I 
would have chosen the most difficult one,” 
Subjects generally rated the more difficult 
tasks as more difficult and more interesting 
and reported that they worked harder on them, 


Discussion 


The question of concern in this study was 
how the difficulty of tasks relates to subjects’ 
motivation for doing the tasks in the presence 
or absence of extrinsic rewards, Vroom’s 
theory led to the prediction that subjects 
choose the easiest task when equivalent 
extrinsic rewards are attached to the com- 
pletion of each task. The present experiment 
provided support for Vroom’s prediction in 
that subjects who were offered $2.50 for 
solving the puzzle of their choice selected 
relatively easy tasks. They did not however 
choose the easiest task. Thus, it seems that 
subjects tend to choose easy paths to extrinsic 
rewards, though they may desire some in- 
trinsic satisfaction from doing challenging 
tasks, so they select tasks which have a fairly 
high probability of success but provide 
slightly more than minimal challenge. How- 
ever, another extrinsic factor may have led 


AW score, 


the subjects not to choose the eas task. 
It is possible that the subjects were c cerned 
that the experimenter’s opinion of the: would 
be related to the relative difficulty of ^ne task 
which they chose, The present data © es not 
favor one explanation over the ot! r, and 
hence, the possibility that the deviat a from 
Vroom’s prediction stemmed from © o sub- 
jects’ concern of the experimenter’: opinion 
of them may be plausible and shold be 
investigated experimentally. 

The second hypothesis was concern | with 
the relations between task dificul, and 
motivation in the absence of extrinsic 1 wards. 
It was reasoned that only in the no-pay 
condition is it possible to have an uncon- 
founded investigation of the determinants 


of intrinsic motivation. Subjects sctually 
chose tasks of more than intermediate di fficulty 
(the median was 20.83%, probability of suc- 
cess). This finding lent support to the proposi- 
tion that intrinsic valence can best be described 
as a positively accelerated function of task 
difficulty rather than a linear function as 
Atkinson suggested. In other words, given } 
the choices (and thus the motivation) of the 
subjects, and assuming that choices are 4 
function of expectation times valence, the 
valence function which allows us to account 
most accurately for the choices is a positively 
accelerated function of task difficulty. Such E 
function implies that a person who is intrinsi- 
cally motivated chooses fairly difficult tasks 
F 


EXPECTANCY DETERMINANTS 


rather thin moderately difficult tasks as 


predicted >y Atkinson’s achievement theory. 

Although the current data do not support 
Atkinson’: ‘heory, previous data reported 
by Atkin id seem to support his theory. 


is probably due to differences 
ntal procedures. In the current 


This differ: 
in the exp 


experiment, nost all sources of external 
motivatior removed in the no-pay 
condition er, in the experiments which 
Atkinson d to support his position, 


al/competitive rewards in- 
cedure. For example, in the 
Ring Toss experiment (cf. Atkinson & 
Litwin, 196 jects took 10 shots with a 
ring at a wo veg target. The distance from 
which they « to take the shots was defined 
as a measu ask difficulty. The median 
distance of (2.99 m; with a possible 
range of 1 t t.,.30 to 4.57 m), was inter- 
preted by ti authors as a confirmation of 
achievement t ory. However, since there was 
no a priori  -iinition of what constituted 
< an intermedi difficult task, that is, where 
the probabi of success was 50% (cf. 
Atkinson & vin, 1960, p. 61), it is not 
obvious whi this actually constituted a 
confirmation ichievement theory. Further, 
since subject: anding together through- 
out the experi nt, interpersonal competition 
Was not contr od and may have contributed 
to the results, .s de Charms put it, “Under 
j these condition. [i.e., in the Ring Toss ex- 
periment], the .atching group may have had 
| 4 strong effect the behavior of the subjects 
"and the incentives for doing well may have 
been Strongly affected by the desire for 
group approval” (1968, p. 220). It is possible, 
therefore, that previous studies which seem 
to have Supporied achievement theory may 
Bye been confounded by interpersonal com- 
Petition. K 
"The present approach should be viewed 
‘8 a possible modification of achievement 
theory, Tn many studies within the achieve- 
a theory paradigm, subjects who were 
i g need achievers chose tasks with prob- 
| E of success between 25% and 40% 
* Heckhausen, 1968, p. 156). If the param- 
E. i ad present Ty formula varied be- 
oo and 3 for the subjects in these 
*Petiments, the present formulation predicts 


there we 
herent in i 
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that they choose tasks where 25% < P. < 40%, 
Also, the present formulation helps in clarify- 
ing the asymmetrical preferences of subjects, 
which were obtained in other studies (cf. 
Kukla, 1975). Thus, a simple modification 
of the incentive value formula may prove 
to be useful in explaining deviations from 
achievement theory predictions regarding sub- 
jects’ choices and preferences. 

The weaknesses of the present procedure 
should be indicated. First, the subjects’ 
possible concern for the experimenter’s opinion 
of them may have had an effect on their 
choices especially in the pay condition. Second, 
the possible discrepancies between the stated 
and perceived probabilities of success were 
checked after the subjects made their choices, 
Although no differences were found, checking 
the perceived probabilities of success before 
the choices were made is preferable and could 
have strengthened the interpretation of the 
results. 

Finally, while the present study helps 
clarify the nature of intrinsic motivation, it 
does not attempt to explain human choice 
behavior solely as a function of intrinsic 
motivation. Extrinsic rewards also play a 
crucial role in choice behavior, as we saw, 
and motivation to do a task is a function of 
both intrinsic and extrinsic motivation. How 
the two types of motivation are combined is 
still an unanswered question, although they 
were shown to be interactive rather than 
additive (cf. Calder & Staw, 1975; Deci, 1971). 
The quantitative description of intrinsically 
motivated behavior in the present study is a 
step toward quantifying the interactions and 
trade-offs between intrinsic motivation and 
extrinsic rewards. This issue should be in- 
vestigated in future research. 
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operating interracial 


up on the helper’s attraction for the recipient of hi: help were investigated 


2 X2 factorially designed experiment. The 
and race of the recipient served as additional 4 dependent variables. Sub- 
who were military servicemen from souti 
greater satisfaction with the group experience and greater at- 

competent group. %nember when they were induced to help 
help was provided by another member of the group. Subjects 
faction.Wwith the group experience and developed greater 
upmaates when their group was successful than did sub- 
n failing groups. Each of the effects held for both white 

»Of help. The implications of these findings for the manage- 
fation process are discussed. 


gnificantly 


A 

Rese ni g the effects of inter- 
group con Aude change, on the one 
hand, anı «$ of cooperation on inter- 
personal Á, on the other, has resulted 
in an en pon cooperative experiences 
as a poten’. ;/oterminant of cross-racial re- 
spect and | d racial attitude change. A 
Series of reci yublished studies have ad- 
dressed the ¢ n of whether this relation- 


nder some conditions of co- 
inder others. Blanchard, Adel- 
(1975) have demonstrated 
perative endeavors are success- 
| liking for groupmates, both 
, is greater than it is when 


ship is stron 
operation tha 
man, and /Ci 
that when/ c 
ful, resped 
black and, hit 
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eration and assistance of the staff of the 
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« of Behavioral Science for the University of 


ýs for reprints should be sent to Stuart W. 
ħrtment of Psychology, University of Colo- 
er, Colorado 80302, 


ccess or failure of the 


small towns and rural areas, 


cooperation is accompanied by group failure, 
In the absence of strong financial incentives 
for group success, Weigel and Cook (1975) 
found that members of cooperating interracial 
groups which were characterized by high levels 
of member participation in decision making 
liked and respected their black and white 
groupmates more than did members of groups 
in which the decision-making prerogatives 
were left to a group leader. 

It has been recognized that members of 
disadvantaged minority groups may enter 
newly desegregated situations with perform- 
ance handicaps. In educational settings it has 
been found that the academic achievement of 
minority students was lower than that of their 
white counterparts (Coleman, Campbell, Hob- 
son, Mood, Weinfield, & York, 1966). Initial 
achievement differences between black and 
white students have also been found in those 
studies which show improved academic per- 
formance by black students following desegre- 
gation (Samuels, 1971; Slone, 1968; Banks & 
DePasquale, Note 1; Laird & Weeks, Note 2; 
McCullough, Note 3; Morrison & Stivers, 
Note 4; Zdep & Joyce, Note 5). 

The reanalysis of the U.S. Office of Educa- 
tion Study (Coleman et al., 1966) conducted 
by the U.S. Commission on Civil Rights 
(1967) indicated that in desegregated class- 
rooms (more than 50% white), black students 
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with close white friends tended to have both 
higher achievement scores and higher college 
aspirations than did black students with no 
close white friends» One may alternatively in- 
terpret this finding to mean that having white 
friends leads to higher\ achievement among 
black students or that higher achievement 
(i.e., competence) leads to more white friends. 
The latter interpretation is consistent with 
the results of a third study in this series con- 
cerning the determinants of interpersonal at- 
traction in cooperating interracial “groups. 
These findings, which are described below, 
suggest that the initially lower achievement 
levels that may characterize some members of 
socially disadvantaged groups could hinder the 
development of interethnic respect and liking 
in newly desegregated settings. 

Blanchard, Weigel, and Cook (1975) in- 
vestigated the effect of a group member’s level 
of competence at performing his assigned tasks 
on the development of liking and respect for 
him in the context of a cooperating interracial 
group. The authors found that white subjects, 
drawn from an area where antiblack attitudes 
are strong, liked and respected a less com- 
petently performing black group member sig- 
nificantly less than a more competently per- 
forming black group member, However, no 
parallel effect was observed when white group 
members rated white comembers. These latter 
findings contrast with others in which lessened 
attraction for less competent members of one’s 
own ethnic group have been found (Aronson, 
Willerman, & Floyd, 1966; Berkowitz, Levy, 
& Harvey, 1975; Gilchrist, 1952; Helmreich, 
Aronson, & LeFan, 1970; Iverson, 1964; Shaw 
& Gilchrist, 1955; Stotland & Hilmer, 1962). 
However, in these studies, the competence 
level of the stimulus person had not been 
presented during a period of extensive, face- 
to-face interaction in a cooperating means- 
interdependent group as was the case in the 
Blanchard, Weigel, and Cook research. Other 
research has demonstrated that cooperative 
interaction often leads to respect and liking 
among group members (Cook, 1969; Crom- 
bag, 1966; Deutsch, 1949; Raven & Eachus, 
1963; Sherif, Harvey, White, Hood, & Sherif, 
1961; Wilson & Miller, 1961; Cook, Note 6). 
Considered together, these studies suggest that 
the cooperative experience attenuates the 
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negative impact of a member’s incom ‘ence, 
at least with regard to peers from onc’; own 
ethnic group. . 

The problem remains that the positive fea- 
tures of face-to-face cooperative int ction 
fail to overcome the effect of lesser -ompe- 
tence on interpersonal attraction for 2 © ember 
of a disliked ethnic group. Blanchard, \ ‘cigel, 
and Cook (1975) interpreted this fin og to 
suggest that less competent behavio vhen 
performed by a member of an ethnic ;roup 
against which one is prejudiced tends i con- 
firm the stereotypical expectations h: | by 

“such prejudiced persons. Once the ste: type 
is copfrmed, it may be difficult for the reju- 
diced péxson to revise his initial judg: at in 
the light of gther, more positive inform tion, 

This line of esearch is of special int st to 
the social management of racial desegi tion, 
Newly desegregat situations in the » cas of 
employment and @specially educati are 
often structured in a \mManner which :osters 
competition, Research fig? ca of 
interracial contact and attitude chang. would 
indicate that the combinatX0" of (` initial 
differences in task competențe and com- 
petitive interaction should ryduce ne po- 
tential for harmonious interra cial ae 
and the development of mutua,\ ™ tan 
liking. Organizing persons in wo duca- | 
tional settings into cooperating ethnic 
groups may provide one means o igating | 
these negative outcomes. 

It is reasonable to expect that if} 02e mem- 
ber of a cooperatively structured ; eh ae 
forms his duties incompetently, oi aden 


members may attempt to help him in 
further the goals of the group. That 
tively structured means-interdepender 
foster intermember helping behavior 1 
found both among ethnically homo; 
groups (Buckholdt & Wodarski, Note 


among ethnically heterogeneous group a 

son, Blaney, Sikes, Stephan, & Snap P», T t 

DeVries & Edwards, 1974; Weigel, * rte 
Ww 


Cook, 1975). However, it is unclear 

helping behavior of this sort, especial} 
the recipient is less competent, will po 
or negatively affect the development 


n 
traction by the helper for the recipie k M 
a prejudiced person helps a less co at 


performing representative of a gre 


which he is prejudiced, one might entertain 
the possibi that such helping behavior 
would eithe tend to further confirm the 
helper’s nege “ve stereotypes and result in less 
liking and 1-pect or (b) result in greater 
liking and re t. 

Although onsiderable literature is ac- 
cumulating o. ihe effect of helping upon the 
recipient’s lil for the helper, there is little 
evidence ad ng the above question. There 
has been on: ly of the effects of helping on 
the develop of attraction by the helper 
for the recip which suggests that the per- 
formance of ping behavior may increase 
liking for i ecipient (Jecker & Landy, 
1969). The stigators found that persons 
who were in to perform a personal favor 

| subsequent, loped greater attraction for 
the recipient the favor than did persons 
who were nc luced to perform a personal 
favor, While i- se results pertain only to the 
effects of intrs ‘hnic helping, they do suggest 
tthe possibili hat the induction of inter- 


ethnic helpi: 
overcoming 
terpersonal 
competent pe 
racial groups 
to investigate | 
In the cont 
group success 
Sequences for 
traction, As st: 
strated that \ 
groups which 
both white an: 


vy provide one strategy for 
egative consequences on in- 
ction attributable to less 
mance in cooperating inter- 
present study was designed 
question. 
t of the cooperating group, 
lure might have special con- 
e outcome of helping on at- 
d earlier, it has been demon- 
‘ite members of cooperating 
e successful like and respect 
black groupmates more than 
do white members of failing groups (Blanch- 
ard, Adelman, & Cook, 1975; Blanchard, 
Weigel, & Cook, 1975). It has also been found 
that members of successful groups develop 
greater feelings of personal satisfaction with 
the experience than do members of failing 
| groups (Blanchard, Weigel, & Cook, 1975). 
These findings suggest that the feelings of 
Satisfaction produced by group success are as- 
sociated with the other group members, re- 
ulting in attraction for them. Such an inter- 
pretation is consistent with the views of other 
Investigators (Byrne, 1971; Byrne & Wong, 
1962; Clore & Byrne, 1974; Lott & Lott, 
965). If helping a less competent groupmate 
elps the group succeed at its task, there may 
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be an increase in personal satisfaction which 
generalizes to the helped person. Conversely, 
if one’s group fails in spite of the help pro- 
vided to a less competent groupmate, it might 
increase the frustration and focus the at- 
tribution of cause for the group’s failure on 
the helped, less competent groupmate, 

When a prejudiced white group member 
helps a less competently performing group- 
mate, the recipient’s race may further qualify 
the effects of helping. This qualification may 
take any one of several forms. The first is that 
the race of the recipient will have no bearing 
on the effects of helping on the development of 
attraction for the recipient. Such an outcome 
would be expected if one assumes that the 
cooperative group experience “individualizes” 
groupmates to such an extent that the im- 
portance of race as a determinant of liking 
is diminished. This possibility has been sug- 
gested by the results of previous studies 
(Blanchard, Adelman, & Cook, 1975; Blanch- 
ard, Weigel, & Cook, 1975; Weigel & Cook, 
1975). 

The second possibility is that the potentially 
positive effects of helping on attraction may 
be blocked when the recipient is black and the 
helper is white and prejudiced. Such an effect 
for the race of the helped person might be 
anticipated if one assumes that negative at- 
titudes toward a racial group would be called 
into play by the exposure of a prejudiced 
white to a black groupmate, It is unclear how 
likely this possibility is, While this effect of 
prejudice is commonly assumed, experimenters 
have repeatedly demonstrated that there is no 
one-to-one correspondence between attitude 
toward a disliked group and reactions to in- 
dividual representatives of that group (e.g., 
Wicker, 1969). Instead, the consistency with 
which behavior reflects attitude is determined 
by many situational and contextual variables. 

The final possibility is that in the context of 
negative racial attitudes the potentially posi- 
tive effects of helping might be enhanced. 
There are two theoretical orientations in terms 
of which such a result might be anticipated. 
From the viewpoint of dissonance theory, a 
prejudiced white group member who helps a 
black groupmate is performing counteratti- 
tudinal behavior. So long as the white helper 
feels that he has freely provided help to a 
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member of a social group against which he is 
prejudiced, one might expect greater liking for 
the black recipient of help than would other- 
= wise take place, The second theoretical ori- 
entation on the basis of which increased liking 
for black recipients of help might be an- 
ticipated is ambivalence-amplification theory 
(Katz, 1970). Katz has suggested that preju- 
diced persons retain a guilt-evoking recogni- 
tion that the targets of their prejudice are 
often unfairly treated. This ambivalence pro- 
duces an amplification of the reactions of 
prejudiced persons toward representatives of 
the group against which they are prejudiced. 
Friendly reactions become even more friendly 
and unfriendly reactions more unfriendly. 

Given these divergent theoretical expecta- 
tions, it would appear that under conditions 
of cooperative interdependence no clear pre- 
diction can be made regarding the effect of 
race of recipient on the level of attraction de- 
veloped for him by the helper. 

The above considerations led to the for- 
mulation of three hypotheses: 

1, A member of a cooperating interracial 
group who helps a less competently perform- 
ing groupmate will develop greater attraction 
for him than will a member of such a group 
in which help for a less competently perform- 
ing groupmate is provided by a third group 
member. 

2, A member of a cooperating interracial 
group which is successful will develop greater 
attraction for his groupmates than will a 
member of a group that fails. 

3. The magnitude of the effect of helping 
a less competently performing group member 
on the development of attraction for him is 
greater when the group succeeds than when 


it fails. 
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METHOD 


Subjects 


Research subjects were 64 white airmen who had 
recently finished basic training and were stationed at 
Lowry Air Force Base, Denver, Colorado. Airmen 
volunteered for a “special project” knowing only that 
they would be paid $5 for participating, that they 
would be relieved from other duties, and that others 
had found the project interesting and enjoyable. Our 
interest in studying the determinants of interpersonal 
attraction in interracial groups assumes some negative 
orientation of the white subjects toward blacks. How- 
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ever, the fear of sensitizing the participants to the ex. 
perimenter’s interest in the race variable precluded 
the use of a racial attitude inventory as a basis for 
subject selection, Instead, geographical origin served 
as an indicator of antiblack attitudes. White airmen” 
from small towns in the southern part of the United 
States were selected from the volunteer pool since 
public opinion surveys have indicated that such at. 


titudes are more prevalent in the rural h (Brink 
& Harris, 1963; Greeley & Sheatsley, 1971; Mayko- 
vich, 1975). 
Experimental Task 

The cooperative task which provided th context in 3 
which the experiment was conducted wes a manage- 


jame. This 
a railroa 


ment training activity called the Railroa 


1.5-hour activity consisted of operatir 

business which received orders to s variety o. 

products between 10 cities in 500 ca: lifferent : 
vail- 


types. Effective management depende: 
ability of appropriate cars in the pro i fill 
each order, since the different products yequired 


different types of railroad cars, The three pants | 
filled shipping orders; kept records on p tosses, 
maintenance charges, fines, and the lo f oll 
railroad cars; and informed the rai] 

of each financial transaction ove: i 
experimenter served as “station the 
participants learned (a) that some ciii i only 
certain products and therefore requiree n 
types of railroad cars, (b) that suflici wnberssof 
cars had to be maintained wherc e likely to 
be needed, and (c) that moncy be saved by 
shipping products in cars that be needed at 
the station of destination, the veness of the 
business increased markedly, The t $ was cond 
ducted during a series of five rinute sessions calied 

task and interv 


workdays. Observation during |! 
conducted after the task’s corn) 
dicated that the participants fo 
enjoyable and challenging. 
There were three positions in the railroad business. 
The communications officer received and heudled 
work orders, kept records of railroad car maintenance 
charges, and reported each financial transaction to the 
station master. The equipment officer kept records of 
the location of each of the 500 railroad cars used in 
the activity, suggested the type of car to be used based 
on the type of product to be shipped, and reported 
the number of appropriate cars available at the city 
of origin. The shipping officer inspected a map of the 
railroad routes in order to report to the group the 
number of days in transit for each shipment, and kept: 
records of all profits and losses the team oto 


tion consistently in- 
id the activity to be 


The research subject occupied the position of com: 
x 
1 This subject selection procedure is the same as 
that used twice previously in this line of research! 
(Blanchard, Adelman, & Cook, 1975; Blanchar 
Weigel, & Cook, 1975). 
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nications officer. The two other Positions were 

led by one white and one black experimental con- 
federate. 

The stu'y was presented to the participants as 


of the research and development for a new pro- 


m desig: to train persons to work effectively in 
plated arc 
) Independen: Variables 
‘The thr: pendent variables in the experiment 
Were (2) per : helping versus observed helping, 
) group failure, and (c) the race of the 
i$ compete) cup member. Subjects were assigned 


h of the eight cells in the 2 X2X2 


helping conditions: Either the sub- 

© help a less competently perform- 
\personal helping) or the subject 
group member help the less com- 
the 3-person group (observed 

nt of the helping manipulation was 
t to help the less competent con- 

1s called for but, at the same time, 
feel that his help was voluntary. 
on involved (a) experimenter en- 

z in general, introduced during the 
ictions for n sement task; (b) general re- 


tests for help | © less competent confederate; and 
0) subtle sugge. £ by the second confederate that 
ping would bi sropriate.2 

‘Since one coni te always performed his duties 


mitting numerous errors, work- 
osses in profits, and incurring 
1p for these errors, the helping 
nstant across conditions regard- 
clp. The helping behaviors in- 
ed checking less competent confederate’s 
metic, solving blems related to the manner in 
the less com. tent confederate’s task should be 
over some of the less com- 
ities. The performance of the 
ate improved slightly over the 
ient activity, Detailed scripts 
learned specified not only how 
hould be made by the con- 
of the less competent group 
vhen helping was to be in- 


competently, 

g slowly, causi 
les against the 
portunities wer: 


of the mana 
lich the confederai 

When each erro 
wf te enacting the 


ing of the activity subjects 
l receive a $5 bonus if their 
he average profits of past 
received the bonus while 
the end of each of the 5 
reported the team’s prog- 
of the first 3 workdays 
cess and failure condi- 
; Day 2, $800 behind; and 
imente d. Under the Success condition the 
7 T reported that the group was now “just 
* e end of the fourth workday and 
BF) e the end ob ane Be and final 


tik aL u 
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? Slight modifications were made in the duties of 
the subject and the competent confederate in order 
to facilitate the helping manipulation, The duties of 
the competent confederate Were structured such that 
he would easily appear too busy with his own tasks 
to provide assistance to the less competent confederate 
under the treatment condition where the subject was 
to provide the help. In addition, when the subject was 
to be the helper, a simple record-keeping task was 
added to his role as communications officer. This task 
provided the subject with extra information which 
could be utilized to help the less competent confed- 
erate, 

® There were several instances of helping during 
each of the five 15-minute workdays. During the first 
workday the less competent confederate requests that 
the helper (either the subject or second confederate) 
slow down the order-filling process because he is 
visibly falling behind. The helper takes over one of 
the less competent confederate’s tasks for the duration 
of the activity—counting the number of days in 
transit on a map of the railroad routes. On two oc- 
casions the Jess competent confederate is unable to 
discern how some of his computations are to be per- 
formed and requires help in finding the answer in the 
written instructions, At the end of the first workday 
the competent confederate Suggests that “We might 
help you check your figures.” During the second 
workday either he or the Subject, depending upon the 
experimental condition, checks the less competent 
confederate’s computations after every work order 
is filled. On two occasions this checking results in the 
discovery of mistakes which the helper corrects, The 
less competent confederate commits two errors during 
the record-keeping period which occurs at the end 
of each workday, The helper assists in the correction 
of these errors which had resulted in penalties against 
the work group, During the third workday the check- 
ing after each work order continues, At the end of 
the third workday the less competent confederate re- 
quests help in computing losses which the group had 
incurred from failing to fill some work orders; in the 

process the helper discovers some missing computa- 
tions and assists with their correction. During the 
fourth workday the helper assists with certain types 
of computations with which the less competent con- 
federate has difficulty. The helper also checks over all 
the final computations. During the final workday no 
further errors are committed by the less competent 
confederate and he requires no further assistance, At 
the end of the session the less competent confederate 
says to his helper; “You really helped me out with 
my job. I had a lot of problems with it at first.” 

4 After subjects had completed the Postexperimental 
questionnaires, those in the failure condition were in- 
formed that the experimenter had made an error in 
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The third independent variable was race. For half 
the subjects, the less competent groupmate was the 
black confederate. For the other half, it was the white 
confederate, Because of the great cost and time in- 
volved in training confederates, only one pair was 
used, It is therefore possible that the race variable 
was confounded with the appearance and person- 
alities of the confederates. Any effects attributable to 
this confounding were minimized by the extensive 
role training which the confederates received. The 
role prescriptions specified the appropriate display of 
affect as well as the content of the confederates’ ex- 
changes with the research subject. The possibility 
that differences in the appearance of the confederates 
might influence the results was minimized by the Air 
Force uniform which the confederates wore,® 


Dependent Variables 


After the management task had been completed, 
participants were separated and a postexperimental 
questionnaire was administered under the guise of 
collecting information “to help us evaluate the per- 
formance of the crew.” Subjects were told that they 
would not see their groupmates again. Administration 
of the postexperimental questionnaire was prefaced 
with both written and oral guarantees of confi- 
dentiality and anonymity. Stimulus persons were 
identified by name, not by race. 

The attraction measure consisted of 20 items and, 
with minor exceptions, was identical to that used 
previously by Blanchard, Adelman, and Cook (1975) 
and Blanchard, Weigel, and Cook (1975). Nine of 
these items were adjective ratings from the evaluative 
dimension of the semantic differential. These ratings 
included such dimensions as friendly—unfriendly, 
honest-dishonest, and valuable-worthless, Five of the 
items asked for ratings of respect and liking. For ex- 
ample, subjects were asked whether they would like 
to be friends with the stimulus person. The final six 
items were designed to specify real-life choices for the 
subject. For example, subjects were asked how much 
they would like to work with the stimulus person 
again on a military project and how much they would 
like to spend time with the stimulus person off duty. 
Eleyen-point response continua were used for each of 
the 20 items, The total attraction scale exhibited high 
reliability (Cronbach's, 1951, a =.95), 


Subjective Experience Variables 


Short measures designed to assess the subjects’ per- 
ceptions of various characteristics of the research ex- 
perience were included in the posttest battery. A 
three-item measure of perceived group success (Cron- 
bach’s «= .87) and a one-item measure of perceived 
helping provided one means of assessing the effective- 
ness of the manipulation of the independent variables, 


computing the group’s profits. Each subject in this 
condition was then awarded the $5 bonus for win- 
ning. 
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Measures of several other subjecti verience 
variables were also included in the he t they 
might provide insight into the process terlying 
any effects on attraction due to the help: sipula- 
tion, There were two classes of subjective perience ~ 
variables which proved important for the ‘terpreta- 
tion of the findings regarding the relatio ship be- 
tween the manipulated variables and inter; ynal at- 
traction: (a) measures designed to asse: e sub- 
ject’s feelings about his personal partici; n, and 
(b) measures designed to assess the subject vercep- 
tions of the less competent groupmate. An those 
measures concerning the subject's feelings ut his 
participation in the management task we a) a 
one-item measure of the subject’s perceptio of” his 
own task competence, (b) a two-item measu: of the 
subject’s perception of his personal responsi! iy for 
his group’s performance (@=.84), and (c) s six- 
item measure of the subject’s general feci s of 
satisfaction-dissatisfaction with his experienc: in the 
management task (æ = .87). Among those n sures 
concerning the subject’s perceptions of the le. com- 
petent groupmate were (a) a three-item mi e of 
the subject’s perception of the stimulus persi na- 
tive ability (æ = .95), (b) a four-item measur. f the 
subject’s perception of the stimulus person ffort 
(a= .94), (c) a three-item measure of the = ject’s 
perception of the competence of the stimulus erson 
(a= .86), and (d) a two-item measure of tì sub- 
ject’s attribution of responsibility for the grou; - per- 
formance to the stimulus person (a = .92). Thi items 
included in these measures were each presen ed in 
11-point Likert-type response continua. 

Debriefing 

Experimental participants were debriefed ir roups 
of four to six persons after the research project was 
completed, During these sessions the experimenter 
engaged the participants in open discussion al out the 


meaning of the research to which they hod con- 
tributed. Drawing on the subjects’ personal experi- 
ences in task-oriented groups, the experimeni«r called 
their attention to the similarity between the vari- 
ables which had been studied and those things which 
the subjects had described as important features of 
groups in which they had participated while in the 
Air Force. The necessity for the use of deception and 


5 The possibility of effects due to the confounding 
of the personalities of the confederates with their race 
might have been reduced substantially by employing 
several different confederates of each race. However, 
the training of confederates for this research required 
6 weeks of intensive Preparation including con- © 
siderable practice with naive subjects. Because of the 
high costs associated with training, the use of mul- 
tiple confederates was financially impractical. It 
seemed preferable to employ a methodologically im- 
Perfect procedure than to abandon the study of a 
type of variable which could be manipulated only 
through the use of highly trained confederates. 


EFFECTS OF HELPING 


tes were discussed along with 
rimental design. The results 
implications discussed with 
1ese discussions, which aver- 
participants were asked to 

s anonymously, All but one 
thought the research was 

e been conducted. Only one 
use of deception was inap- 

s personally bothered by it. 


the role of the conf 
basic concepts of 
vere described and 
she participants. A 
aged 1 hour in leng 
answer several qui 
person indicated t 
important and sho’ 
indicated that he 

propriate and that 


SULTS 


lent variables provided a 
lesign defined by (a) 
npetent stimulus person 
bject (personal helping) 
retent confederate (ob- 
whether the group suc- 
d (c) whether the re- 
white or black. Subjects 
t per cell to each of the 
ign. Scale scores derived 
ental questionnaire were 
of three-way analyses of 


The three ind 
X2x 2 fact 
whether the les 
was helped by 
or by the mor 
served helping ) 
ceeded or faile 

cipient of help 
‘were distribute: 
eight cells of th 
from the postex; 
submitted to a 
variance, 


Adequacy of th nipulations 


vrocedures were employed 

y of the manipulation of 
iables. First, a trained ob- 
tapes of each experimental 

) whether the confederates 
vropriate incompetent be- 
ther the confederates per- 
ibed helping behavior, and 
(c) whether the subject performed the pre- 
we helping behavior. On the basis of 
ihe, ratings six subjects were deleted and 
ee replaced. It is interesting to note 
F e majority of these subjects were re- 
Ee because of inappropriate confederate 
E rather than the failure to induce the 
Th ae helping on the part of the subject. 
es enon to this assessment of the video- 
E the subjects’ exposure to the intended 
A es, a second criterion for effectiveness 
Presentation was employed. The measures 
fag tied success and perceived helping 
Bio indicators of the experimental par- 
ere a Subjective experience. The subjects 
a hh a to respond to the question, “How 
= z p did you give this person during the 
ion Be Providing him with useful informa- 
» Showing him how to do his job better, 


Two independ 
0 assess the adi 
‘the independent 
Server viewed vi 
Session and rate« 
performed the 
haviors, (b) w 
formed the pres: 
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TABLE 1 
MEAN ATTRACTION SCORES 
Race of recipient pauls AS 
of help Personal Observed 

Black 

Group success 174.1 168.6 

Group failure 153.5 142,2 
White 

Group success 185.1 168.6 

Group failure 156.9 143.0 


Note. Maximum score = 220. 


catching errors, etc.?” on an 11-point response 
continuum ranging from “I helped him a lot” 
to “I didn’t help him at all.” If any subject 
responded to the helping item more than 1 
point away from the midpoint of the response 
scale in the opposite direction from the intent 
of the manipulation, he was deleted from 
further analysis. This criterion resulted in the 
deletion of nine subjects. Of these nine cases, 
only one was deleted because a subject in a 
helping condition perceived that he did not 
help. The other eight were deleted because 
subjects in the conditions where the second 
confederate was supposed to provide help re- 
ported that they too had provided help, Anal- 
ysis of variance based on the 55 subjects who 
were retained indicated a significant main ef- 
fect of the helping variation on perceived help- 
ing such that subjects in helping conditions 
perceived that they had helped more than did 
other subjects under the condition in which 
the second confederate had provided the help, 
F(1,47) = 31.77, p< .001.° Subjects who ex- 
perienced group success exhibited significantly 
greater perceived success than did subjects 
who had experienced group failure, F(1, 47) 
= 108.83, p < .001. , 

The results regarding the effects of the in- 
dependent variables on attraction for less 
competent stimulus persons are reviewed be- 
low in relation to the hypotheses. 


poe tees 

6 Subjects were deleted from the following cells: 
one from the personal help, group success, white 
recipient cell; two from the observed help, group 
success, white recipient cell; two from the oberg 
help, group success, black recipient cell; two from the 
observed help, group failure, black recipient cell; and 
two from the observed help, group failure, white 


recipient cell. 


1252 


Interpersonal Attraction | 


Hypothesis 1. A member of a cooperating 
interracial group who helps a less competently 
performing groupmate will develop greater at- 
traction for him than will a member of such 
a group in which help for a less competently 
performing groupmate is provided by a third 
group member. 3 

The data indicated a main effect for helping 
on the development of attraction for the stim- 
ulus person who was helped, F(1,47) = 3.44, 
p < .07. As Table 1 indicates, the pattern of 
results in each of the four possible compari- 
sons was for subjects to develop greater at- 
traction for a less competent stimulus person 

. whom “they personally helped than did sub- 
‘jects who merely observed a similarly less 
competent groupmate being helped by a third 
group member. It is notable that this eleva- 


tion in attraction appears to hold regardless of , 


the stimulus person’s race. Hypothesis 1 re- 
ceives support, although at somewhat greater 
than the .05 probability level. 

Hypothesis 2. A member of a cooperating 
interracial group which is successful will de- 
velop greater attraction for his groupmates 
than will a member of such a group that fails. 

The data showed a significant main effect 
for group success—failure on the development 
of interpersonal attraction, F(1,47) = 15.63, 
p < 001. Table 1 portrays: the direction of 
this effect: Members of successful groups did 
develop greatef attraction for their group- 
mates than did members of a failing group. 
Hypothesis 2 is confirmed, 

Hypothesis 3. The magnitude of the effect 
of helping a less competently performing 
group member on the development of attrac- 


TABLE 2 
MEAN SATISFACTION SCORES 


Source of help 
Race of recipient 
of help 


Personal Observed 
Black 
Group success 59.3 54.3 
Group failure 459 40,0 
White . 
Group success 59.1 56.7 
Group failure 50.5 414 


Note, Maximum score = 66. 
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did not approach significance, F (1, 47) = 01, 


tion for him is greater when the group suc- 
ceeds than when it fails. 

A Helping X Success—Failure interaction ~ 
would have been necessary to confirm this 
hypothesis. However, the relevant interaction 


p > 10. Hypothesis 3 is not supported. a 
Effects of race of person helped. No sig- 
nificant main effect for the race of the re- 
cipient of help on the development of attrac- 
tion for the recipient was observed, There 
were no significant interactions involving the 

race variable. 


Subjective Experience Variables 


Main effects for the helping variable, 
F(1,47) = 9.07, p < .01, and group success- 
failure, F(1,47) = 48.28, p < .001, on sub- 
jects’ feelings of satisfaction with the group 
experience were observed. As indicated in 
Table 2, the personal provision of help pro- 
duced significantly greater feelings of satis- 
faction than did the observation of help 
provided by another group member, Group 
success also produced significantly greater feel- 
ings of satisfaction than did group failure. No 
significant main effects for helping on any of 
the other subjective experience variables were 
observed. However, in every case, group suc- 


cess-failure produced significant differences on  § 
these remaining subjective experience vari- i 


ables. Members of successful groups perceived 
themselves to be more competent personally, 
F(1,47) 27.48, p < 001, and more per- 
sonally responsible for the performance of 
their group, F (1,47) = 16.32, p < .001, than 
did members of failing groups. Members of f 
successful groups perceived the less competent f 
groupmate (regardless of the source of his i 
help) to be more competent, F (1, 47) = 16.70, | 
p <.001; to possess greater native ability, 1 
F(1,47) = 13.77, p < .001; to have demon- 
strated greater effort, F(1,47) = 17.44, p < 
.001; and to be more responsible for the 
team’s performance, F(1,47) = 13.77, p< 
.001, than did members% failing groups. { 


i! 


DiScussion 


As an unintended consequence of ir 
ing task-interdependent groups a‘- 
proach to enhancing the quality of 
interaction in desegregated settine 


r 


/ s0nious interracial behavior. This risk arises 
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administrators, teachers, and others entertain 
> the risk-of inhibiting the development of har- 


secause of the unfortunate coincidence of sev- 

ixcumstances: (a) Disadvantaged ethnic 

ies may enter newly desegregated set- 

` with initial performance handicaps, (b) 
there is evidence that more competently per- 
forming persons are liked and respected more 
than less competent persons, and (c) more 
competent members of interdependent groups 
are likely to help their less competent group- 
mates. Hence, in establishing interdependent, 
interracial, task-oriented groups whose higher 
achieving white members help initially lesser 
achieving black members, we may be creating 
the conditions for increasing negative cross- 
racial and cross-ethnic behavior by providing 
situations in which whites’ expectations for 
less competent behavior by blacks may be 
confirmed. 

The evidence from this study indicates 
that such fears are unwarranted. The results 
provide support for the inference that per- 
sonally helping a less competent comember of 
a cooperating group leads to increased liking 
and respect for him. Moreover, inspection of 
the cell means suggests that this effect holds 
when the recipients of help are black as well 
as when they are white. When white subjects 
were induced to help a less competently per- 
forming black group member, they developed 
greater respect and liking for him than when 
they simply observed a similarly less compe- 
tent black group member being helped by 
another member of the group. These findings 
suggest that the risk to which we alluded 
above is an unlikely possibility and that 
cooperating, interracial groups can be used 
effectively in desegregated settings as a means 
of enhancing cross-racial harmony. 

Subjects who were induced to help the less 
competent group members felt significantly 
greater satisfaction with the group experience 

Hi \an did subjects who merely observed a third 
Negrétpmate provide such helping. That positive 
est and good feelings might result aoa the 
token com™ Of help had previously been only a 
to commenif speculation (Rosenhan, 1972). On 
ing a full Amc £ the present findings, it would ap- 
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pear that at least some forms of helping pro- 
duce feelings of satisfaction in the helper. 

This discovery suggests a possible interpre- 
tation of the helping-attraction relationship: 
that feelings of satisfaction may be the medi- 
ating variable. The possibility that a parallel 
relationship, namely that between success and 
attraction, is mediated by positive affect has 
been suggested by other writers (Blanchard, 
Adelman, & Cook, 1975; Byrne, 1971; Byrne 
& Wong, 1962; Clore & Byrne, 1974; Lott & 
Lott, 1965). A previous study in the present 
research program found increased satisfaction 
with the group experience to be associated 
with group success and suggested that this 
finding supported the explanation that posi- 
tive affect is generalized to fellow group mem- 
bers resulting in attraction for them (Blan- 
chard, Weigel, & Cook, 1975). The main ef- 
fect found in the present study for group 
success-failure on satisfaction, in company 
with the main effect for success—failure on 
attraction for groupmates, lends further sup- 
port to the affect-generalization interpreta- 
tion. If we assume that personal helping and 
group success are interchangeable sources of 
satisfaction, it seems reasonable to extend the 
affect-generalization interpretation still further 
to account for the helping—attraction relation- 
ship found in the present experiment. 

As noted in the introduction, Jecker and 
Landy (1969) interpreted the: helping—attrac- 
tion relationship as an instance of dissonance 
reduction. This explanation does not seem 
applicable to the parallel finding in this study. 
While participants in the Jecker and Landy 
experiment incurred personal costs when they 
were induced to help (i.e., to perform the dis- 
sonant act of giving up money they had pre- 
viously earned), in the present study no such 
costs were involved. Instead, helping in the 
context of the means-interdependent group is 
consistent with the helper’s self-interest: To 
the extent that he helps his groupmates, he 
furthers the achievement of the group goals. 
Given the helper’s objectives, helping is a con- 
sonant rather than a dissonant act. 

There was no indication in the present study 
that the race of the stimulus person exerted 
any effects on the expression of attraction 
toward him. Neither were any interactions 
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involving the race variable observed. Such 
results lend further support to the argument 
that cooperating, means-interdependent groups 
function to “individualize” their members in 
such a manner as to facilitate the unimpeded 
development of intermember attraction ( Blan- 
chard, Adelman, & Cook, 1975; Blanchard, 
Weigel, & Cook, 1975). 

While an interaction between the effect of 
helping and the effect of group success—failure 
on the development of attraction was pre- 
dicted, none was observed. It had been sug- 
gested that group failure might promote a 
search for causes for the group’s poor perform- 
ance and that helping a less competent group 
member under such conditions might have the 
effect of emphasizing his incompetence. In 
this context the less competent group member 
might then become a target for blame for the 
group’s failure. If such a process were operat- 
ing, one would expect to find that under group 
failure, subjects would attribute more of the 
responsibility for the group’s performance to 
the less competent groupmate, relative to 
their judgment of their own responsibility, 
than they would under conditions of group 
success. Instead, it was found that subjects 
tended to attribute significantly less responsi- 
bility for their group’s performance to them- 
selves and to the less competent group mem- 
ber under conditions of group failure than 
under group success. Consistent with a recent 
review of this research literature, no self- 
serving or ego-protective biases in the attribu- 
tion of causality for failure were observed 
(Miller & Ross, 1975), 

A second component of the process under- 
lying this predicted, but unconfirmed, inter- 
action involves the effect of helping a less 
competent group member under conditions of 
group failure on the helper’s judgment of the 
recipient’s level of competence. We thought it 
possible that personally helping someone un- 
der these conditions might serve to highlight 
the recipient’s incompetence. However, no 
effects for helping on the helper’s perceptions 
of the recipient’s level of competence, effort, 
or native ability were found. 

The results of this study provide useful 
guidance for the management of racial deseg- 
regation. While previous research has indi- 
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cated that less competent behavior exhibited 


a 
the development of respect and fojic- 3 | 
them, the present findings suggest that when. ~\ 


members of a more advantaged ethnic niger > b 
ity help such persons as a natural c nsequendh.,, 


of their joint membership in interdep: dent, 
cooperating groups, this barrier to harmonious 
interethnic relations may be overcome. 
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The relation of the Kohlberg moral stage to prejudiced and tolerant ethnic 
attitudes was investigated in seven majority lower-middle-class samples and 
one upper-middle-class sample totaling 173 city and suburban children, ages 
7-13. One group of 22 suburban 10-year-olds was followed for 3 years. Chil- 
dren played a board game requiring spontaneous comments about minorities 
and were interviewed twice later. A significant inverse relationship between 
percentage of negative statements (translated as prejudice) and moral stage 


(translated as ability to respect) was found for all groups (rs from —.52 to 
—.73 and p=<.001; for longitudinal study, r=—.63 with IQ partialed out 
and —.6 with age partialed out and p= <.001). Thus, prejudice is related to 
moral development and social understanding. 


Human rationality implies that attitudes are 
to some extent derived from underlying values 
and structures of thought. Thus, the tendency 
to devalue a racial or ethnic group could very 
well be connected to, a cognitive inability to 
respect people in general. The idea of respect 
being owed to someone is linked to the idea of 
justice being owed. The child’s cry of No fair! 
shows strong (if superficial) interest in issues of 
justice. Increasing facility in dealing with these 
issues typically results from growth in logic, 
empathy, and ability to take the roles or view- 
points of others (Aronfreed, 1968). If these de- 
velopments provide bases for respect, do they 
not also provide for decrease of prejudice? 

The purpose of this study is to explore the 
connection, first, between attitudes of ethnic 
or racial prejudice and, second, between issues 
of respect as indexed by the particular patterns 
of fairness in moral judgment described by 
Kohlberg (1975). It is hypothesized that Kohi- 
berg’s hierarchical stages of moral judgment 
are inversely correlated with quantitative de- 
gree of ethnic prejudice at ages 7-13, 

But are children really prejudiced? Because 
the work of cognitive psychologists has shown 
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the structures of thinking to be differen «t dif 
ferent ages, one must conclude that the heng 
typic appearance of prejudices is not b: cd ol 
the same thought structures in childho« 


adulthood. The tentativeness with whic! 
children in this study explored their o n d 
veloping ideas also casts doubt on i.e ol 
assumption that prejudice is a trait a 
at home as certainly and trustingly as n 
milk, and almost as early. 


SOCIAL LEARNING AND PSYCHODYNAN iC 
RESEARCH 


Previous psychological research on pre judicey 
has been mainly of two types, both of which 
downplay conscious rationality (reviewed by 
Proshansky, 1966). The first relies on the social 
learning hypothesis that ethnic stereotypes 
and corresponding attitudes of prejudice are 
learned by imitation of socializing agents. This 
hypothesis partially explains why city children 
reveal more aggressive and less) patronizing 
prejudices than suburbanites. But other im- 
portant facts are not explained, such as agé 
changes of individuals and variation within 
families. 

Is it true that children merely adopt the f 
titudes of their parents? Although it is © 
that environmental effects are pervasiy 
strong, parents’ and children’s attitud 
late poorly, averaging less than .40 in 
est studies (Bird, Monachesi, & Bur” 


ABILITY TO RESPECT 


Frenkel-Brunswik, 1951; Radke-Yarrow, Tra- 
ger, & Miller, 1952). Many young children are 
more prejudiced than their parents, a finding 
hard to explain if their own experiences and 
thought processes are not contributing. Typi- 
cal of statements collected in the present re- 
search is that of a 7-year-old city girl: “My 
mother thinks since years I don’t know the 
word nigger. She’s no help when they bother 
me so terrible. She would put me to bed, but 
there’s lots of them all over, so why not call 
them what they are? I hate them.” Other 
children describe their parents as prejudiced. 
“My parents are always talking about hippie 
rats. I’m not prejudiced, but if I was, it would 
be their fault,” says a black boy of 13. 

The second and more complex view, the 
psychodynamic, seeks to account for internal 
emotional processes in the child that result in 
prejudice. Strong attitudes of prejudice repre- 
sent hostile and aggressive tendencies that are 
repressed or inhibited from being expressed 
toward in-group members and are then dis- 
placed onto an out group. This theory seems to 
grasp an essential component of the more path- 
ological adult prejudices, but it is misleading in 
assuming a resemblance between adults and 
children and how ingrained the prejudices may 
be in each case. Prejudice is more than a per- 
sistent, unitary trait or simply the result of 
being reared in a harsh or authoritarian family, 

Authoritarianism was defined as an early 
appearing “central structure with a determin- 
ing role in personality” by Adorno, Frenkel- 
Brunswik, Levinson, and Sanford (1950). Al- 
though the research behind their study was 
much criticized during the next 10 years, the 
notion of prejudice as a semipermanent trait 

hat itself organizes thinking has somehow sur- 
vived. This is in the face of consistently low 
correlations (accounting for less than 15% of 
the variance) found between early and late 
childhood for personality, attitude, and other 
noncognitive traits (Kagan & Moss, 1961; 
Mischel, 1969; Peterson, 1968; Vernon, 1964). 


But no longitudinal study has yet traced atti- 


ades of ethnic and racial prejudice- 

Negrit is exceedingly difficult to gather good data 
choosi kaol children’s attitudes. The 1974nation- 
Fey turvey of the U. S. Department of Health, 
a commend, and Welfare’s Office of Education 


ing a full Amdrawn midway because its too-direct 
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questions aroused opposition from schools and 
parents. That attitude scales no longer yield 
trustworthy data in an age of self-conscious- 
ness about prejudice is attested to by the ex- 
tensive tests of Cook (1966). Surveys on the 
effects of desegregation often use unstandard- 
ized interviews (Coles, 1967; Dwyer, 1958), 
ranking of friends (Bradley, 1964), or socio- 
grams as behavioral measures of prejudice 
(St. John, 1975; McPartland, Note 1). The 
results are entirely equivocal. Singer’s (1966) 
findings on New York City were contradicted 
the following year by Dentler and Elkins 
(1967). According to St. John’s (1975) recent 
survey of all available evidence (covering 
achievement and self-image, as well as attitudes 
to others); contact is as likely to have a 
negative as a positive effect on accepting blacks, 
whereas for the blacks themselves, the results 
of integration are mixed or positive. 

The lack of longitudinal studies tracing 
racial attitude changes from cognitive de- 
velopment or the effects of schooling is de- 
plorable (Smith, 1969), The National Institute 
of Education has now given six contracts for 
wide-ranging anthropological studies of deseg- 
regation. Because prejudice is a negative and 
defensive sc attitude, these studies will no 
doubt stress the valid social learning and psy- 
chodynamic assumptions that children strive 
for status, imitate some people, and scapegoat 
others. But because the countervailing contri- 
bution of the child’s cognitive-social develop- 
ment has so far been neglected, it is the focus 
of the present report. 


CoGNItivE=DEVELOPMENTAL RESEARCH 


Research in cognitive development has pro- 
vided a significant challenge to both matura- 
tionist and behaviorist approaches to socializa- 
tion. The emphasis on universal patterns and 
the focus on form of thinking rather than in- 
tensity, content, or stimulation by specific 
event offers a useful framework in which to 
view individual differences. 

Piaget (1926) showed that children predict- 
ably organize their thinking, although not in 
the same way as adults, and that the rules and 
patterns of this organization depend upon stage 
of cognitive development. Kohlberg (1969) ex- 
tended this by demonstrating that stages of 
development occur also in social judgments per- 
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taining to moral issues. He asserted that there 
is a universal core sense of justice in persons of 
all cultures that develops in both cognitive and 
moral adequacy by stage transformations. He 
found empirically, in a 20-year longitudinal 
study, six invariantly ordered stages. The 
bases of decision making are, at Stage 1, quasi- 
physical predicted consequences; at Stage 2, 
instrumental hedonism and naive egalitarian- 
ism; at Stage 3, being helpful and gaining 
others’ approval; at Stage 4, maintaining social 
order; at Stage 5, a social contracts system 
recognizing an arbitrary starting point; and at 
Stage 6, individual conscience informed by uni- 
versal principles. 

The stage theory in cognitive-developmen- 
tal psychology requires an invariant sequence, 
internal organization of each stage, qualitative 
difference between stages, and universality of 
applicability (Kessen, 1962). The theory does 
not require that all children progress at a simi- 
Jar rate or reach all of the stages. This definition 
has been applied primarily to physical cogni- 
tion, but evidence has accumulated that social 
thought becomes organized in a parallel man- 
ner (Colby, 1972; Cowan, Langer, Heavenrich, 
& Nathanson, 1969; Feffer & Gorevitch, 1960; 
Swinson, 1966; Selman, Note 2). 

‘The present study is an extension of the work 
of Piaget and Kohlberg. Collecting comments 
on morality and on ethnic and racial groups 
from the same children allows not only a test of 
statistical correlation but also a close examina- 
tion for underlying assumptions about people 
in general and certain groups of people in par- 
ticular. This permits some extensions in the 
descriptions of the moral stages by Kohlberg, 
showing them to be also stages in each child’s 
growing ability to respect persons. (See Downie 
& Telfer, 1973, fora discussion of the basic con- 
tribution of respect to the development of 
morality.) 


Method 


In order to discover the possible relation between 
moral stage and prejudice in a variety of environments 
and developmental levels, a design was adopted involv- 
ing six city and suburban samples ages 7, 9, and 13 
years. Twenty-two 10-year-olds were also studied longi- 
tudinally during three school years. To test further for 
age, sex, or environmental effects on the correlations 
obtained, a sample of fifty-three 12-year-old boys from 
a wealthy suburb was added, bringing the total of 
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children to 176. As they played a speci vised game | 
and were interviewed on their attit they were | 
(truthfully) told that their recorded vents would 
contribute to the development of a n cial studies 
curriculum. The investigator was : chologist- 
counselor and social studies consultant suburban 
system, and a camp director in the city she was a 
familiar and friendly figure to all the ren (also 
showing movies before the studies) is tant, for 
“unconditional acceptance” is a factor uting to 
reliability when a sensitive topic is dis: In the 
wealthy suburb she was less familiar, but hildren 
were more sophisticated. 
Subjects 

Two lower-middle-class schools in a bedr: iburb 
supplied 53 children to compare with 48 chi from 
13 schools located in all but three neighbor! of a 
medium-size industrial city. Each child was int: ewed 
three or more times. The 26 girls and 27 boys i: » the 
suburb and the 26 girls and 22 boys from the at- 
tended three grades of school; second (N = 16 irb, 
11 city), fourth (W = 21 suburb, 21 city), or nth 


(N = 16 suburb, 16 city). One third of the city 
was carried out in two of its best (but also lower-n. dle- 
class) schools, and the rest on all members of grou; col- 
lected by age in a citywide day camp in which tl in- 
vestigator had been the director 

Of the city sample 25 were gentile whites, 5 » ert 
Jewish, 13 were black, and 5 were Oriental; 13 were í: om 
professional families, 7 from management, 14 fom 
skilled workers, and 14 from labor or welfare. Of ih 
suburban samples (for both cross-sectional and lon, i 
tudinal studies), 58 were white gentiles, 7 were Jewis 
and 10 were black; 20 were from professional familic: 
10 from management, 31 from skilled workers, and 1: 
from labor or welfare. 

The group studied longitudinally consisted of subur 
ban fifth graders only. They were classmates in the sami 
schools. They were more extensively tested and thei 
families were interviewed for the initial phase of ar 
ongoing study of 12 boys and 10 girls, 

The final group of 53 boys from a wealthy subur 
had been measured for moral stage in an investigatio 
by a Kohlberg colleague 2 months before. Through hi 
intercession, the investigator was given access to th 
school’s play areas to create rapport and was allowe | 
two 40-minute periods per child to test for prejudic 
Median family income in this suburb was almost dou): 
that of the city. Three of the boys were from Jewish 
professional or management families and the others were 
white gentiles; 20 were from professional families, 18 
from managerial, 9 from skilled workers, and 3 from 
labor or welfare, 

City and suburb spent the same relatively larg 
baie per pul on their schools, although the subu 

an median family income was $3,400 higher, s 
urban schools had 3% black children T ba 
from the city, whereas the city schools were 21%, } 
Lower-middle-class suburban schools were ci 7 
bring about some social class comparabili 
slightly over half of both groups were lowe 
middle class). However, being 


ing 


a policemar 
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The Game of Comments 


Ficure 1. The game of comments, 


involves a different family lifestyle in a suburb than in 
i city, Speaking to this point, Lauman (1970) sug- 
sested that not the effects of occupational equality but 
rather those of “general lifestyle” be compared because 
secular trends have weakened the historic bases of 
criptive status: religion, ethnicity, and social class. 
Grade school children are particularly “classless” un- 
less they are from such a deprived culture as that of 
housing projects, 


Instruments 


A game (see Figure 1) and two interview schedules 
(see Tables 1 and 2) for different age ranges served to 
measure prejudice, As a measure of moral stage, Kohl- 
berg’s clinical interview involves probing the statements 
‘bout morality spontaneously offered in solving story 
problems. The chief strength of all of these techniques 
lies in allowing the child to structure freely his or her 
own remarks, The interview schedules to ascertain pre- 
judice consist of peer-composed language only. 

The game. A game called “Comments,” played on a 
board with dice, was created to take one school period 
when played by four classmates. These groups were ran- 
domly chosen by the teacher. A token “man,” represent- 
ing an American ethnic group member pictured on the 


~ ) boird, moves according to throws of the dice toward 


l 


A “home” at the board’s center. Arriving there is defined 
“becoming a full American.” Children comment on 


all \he groups (lindian, Irish, Italian, Jewish, Mexican, 
Neg, Oriental, and Puerto Rican) while they are 


which to represent. Further comments about 
the pictured groups are required whenever a child’s 
token comes to rest on a color-coded space. A suggestion 
to comment on “problems they encountered in becom- 
ing a full American” was almost never heeded. Instead, 


personal views or bits of information were offered. No 
child objected to the recording of his or her views. 

Most of the spaces on the board are uncolored, About 
one sixth are red, calling for a comment on “any group” 
hefore play can proceed. Because during pretesting 
children avoided commenting on blacks and Jews, some 
spaces (one sixth at random) were painted blue or yel- 
low to require comments on these groups. (No child in- 
quired, Why those?) The previous friendships among 
the players, the “small group dynamics,” the somewhat 
forbidden subject matter of many comments, and the 
desire to win the game all fostered a growing excitement 
that reduced inhibitions. Although the first comments 
were typically neutral (“Negroes have kinky hair”), 
the first negative comment triggered others, and no 
group clearly continued to censor its comments. Fur- 
thermore, by the end of the game, when tension was 
highest, many children had to search for a new remark. 
Previously unspoken thoughts emerged, such as “Blacks 
are dirty, or I think you can’t tell if they are clean or 
dirty.” “Water looks white on Sam’s hand.” “Jewish 
people don’t give you sandwiches. They’re selfish. Mrs, 
Kaplan says, ‘Give it to David.’ ” 

The interview. Some days or weeks after the game, 
during 40-minute one-to-one interviews, each child was 
asked to respond to the items on one or the other inter- 
view schedule, each consisting of 20 comments collected 
during the game from either 7—9-year-olds or 10-13- 
year-olds, Provocative remarks both positive and nega- 
tive were selected for the schedules, with the groups 
represented in proportion to the number of comments 
made about them during the game. Although the 20 
comments provide 20 interview sections to be separately 
scored, spontancity is fostered by allowing the time 
spent on cach to be only roughly equal. Each comment 
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TABLE 1 
INTERVIEW SCHEDULE OF SELECTIONS FROM 
Game Comments FOR LOWER GRADES 
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TABLE 2° 


INTERVIEW SCHEDULE OF SELECTIONS FROM 
Game COMMENTS ror UPPER GRADES 


. Negroes are not the same as whites. 

` Twouldn’t mind being a Negro; I think it would be 

nice, sometimes. 

. Jews are different. 

Indians wear clothes that look like rags or cloths. 

Negroes fight and have fat noses. 

. White people with moustaches I can’t stand. 

. Mexicans are lazy and wear big hats. 

Jews fight and chase other children. 

. White people should be nice to everybody, especi- 

ally because they might be rich. 

10. The best kind of family is Italian; they make good 
jokes. 

11. Mixing different kinds of white kids is good in 
school; Irish, Spanish, like that. 

12. Black kids say, “Let’s mess around.” 

13. Chinese are good and bad Chinks. 

14. My parents tell me all about black and white people 
and what to do. 

15. Puerto Ricans are poor. 

16. Indians only cared about Indians; they were selfish. 

17. White people were mean and prejudice made the 
slums, but I wouldn’t go there. 

18. I fight sometimes. (How much? Any fights in the 
last 3 months?) 

19. Irish kids are tough or strong. 

20. Putting a few black kids in a class doesn’t help 

anything. 


Carano Ne 


is introduced by, “During the game, somebody about 
your age said. . . . What do you think about that?” 
(Added questions ask opinions on own attitudes and 
frequency of fights with schoolmates.) 

The method of measuring prejudice in the upper 
grades was compared to the long-used Æ scale method 
(Adorno et al., 1950) by twice administering the age- 
appropriate schedule (% negative) and the Æ scale 
to the group studied longitudinally, Pearson correla- 
tions for fifth- and seventh-grade testings are given 
in Table 3. Although the fifth-grade game-plus- 
interview predicted prejudice in seventh grade (mea- 
sured either way) better than the Æ scale did, both ap- 
pear to be measuring the same attitudes. The Æ scale 
is clearly a test of prejudice. Its sharp statements 
aroused active opposition from seventh graders. “This 
can make me prejudiced.” Scores were lowered even 
though the investigator urged honest answers. 

The Kohlberg Measure of Moral Maturity. During a 
second private interview, children were presented with 
Kohlberg’s Dilemmas 3, 4, and 10, involving issues of 
rights, duties, punishment, the value of life, property, 
law, family, and so forth, In resolving the story dilem- 
mas, children tell how these issues are perceived, 
thereby revealing their structures of moral judgment. 
The answers are extensively probed to ensure that they 
represent the child’s best thinking and not situational 
factors that might favor, for example, verbally fluent 
children. Test-retest reliability for Kohlberg measures 
used in six published studies has ranged from .8 to .9 
(Lieberman & Selman, 1975; Lieberman, Note 3). The 
present study’s self-correlation of moral stage even 


. Jews close their stores on unexpected days. 

. Many Indians bave different beliefs from whites or 

Americans. 

Lots of Negroes don’t go to school, even when it is 

against the law. 

Many white people are nice outside, but not inside. 

. Jews swear alot. “i 

Tt takes the whites (or Americans) to stop unfair 

things in the world. 

. Negroes are dirty, and you can’t tell if they’re clean 
or dirty. They are strong and fight well. (Latter - 
statement only, if Negro child present.) 

. The whites didn’t start the world. 

. Indians are violent people. They like to attack and 
start trouble. 

10. The U.S. belonged to the Indians, really, not the 

whites. 

11. Mexicans drink too much and smuggle pot. 

12. Negroes have guns and rob people, and start riots, 

13. Indians wear wierd clothes, like a little thing that 
hangs down. $ 

14. Sometimes fighting is OK. (Do you get into fights 
at school? Have you had a fight in the last 3 
months? How many?) 

15. My parents influence my attitudes to other groups. 
(Do your friends? Do things taught in school? Does 
TV? What about just your own self, the things you 
experience?) 

16. I wouldn’t want to be born Indian or 
because it’s harder to get jobs. 

17. Hardly anybody teases Jews much, but their noses 
are big. 

18. All Negroes are the same. 

19. Mexicans sneak over to get jobs in this country, 

20. ‘The blacks should like the whites, no matter what 

has happened. 


x pupe w he 


© 00 


Negro, 


after 32 months (group studied longitudinally) is .79 
(p = .001). 

Validity is shown in the current study, as in others 
(Haan, Smith, & Block, 1968; Kohlberg, Turiel, & 
Rest, 1969; Turiel, 1966), when over half the issue 
scores in each test are at a single moral stage, with a 


TABLE 3 


CORRELATIONS BETWEEN INTERVIEW AND Æ SCALE 
Scores: LONGITUDINAL STUDY (n = 22) 


Pearson 


Grade and test correlation Probability 


5th grade E and 


% negative scores -80 001 
7th grade E and 

% negative scores 15 001 / 
%, negative Sth and 

% negative 7th OL 00, 
E 5th and E 7th „88 001 
% negative Sth and 

p -88 

E 5th and os 

% negative 7th 49 OL 


normal distribution of major and minor scores to ad- 
jacent stages only. Philosophical arguments and cross- 
cultural studies (Kohlberg, 1969; Kohlberg & Meyer, 
1973) support the test’s validity. Predictive validity 
should refer to moral behavior. Although obviously a 
connection exists between judgment and behavior, it is 
largely unexplored. Hartshorne and May (1928) showed 

moral action among children to be generally situation 
` specific, unstable over time, and nondevelopmental in 
relation to age, but judgment stage was not measured. 
Prejudice as a form of judgment seems closer to moral 
judgment than certain behaviors that have significantly 
-correlated with Kohlberg moral stage, such as cheating 
(Krebs & Kohlberg, 1973) and taking part in a protest 
“(Haan et al., 1968). 

Coding. Interview and game comments were in all 
cases so similar (the one-to-one nature of the interview 
hoving made no difference) that it was possible to score 
‘hem as one. All comments were numbered and typed 
on a blind sheet to be scored by two raters. The raters 
first noted whether each comment (a) was stereotypic 
involving only personal anecdotes or unsupported pro- 
nouncements), (b) showed directional thinking, or (c) 
zave only information. Interrater agreement on these 
designations was .74. 

Percentages of negative comments were also calcu- 
lated from the total number of statements per child 
(mean = 30 statements) and this percentage was taken 
as the index of prejudice. Differences in percentage of 
positive and neutral comments were judged, but these 
eemed to be too distorted by the intent to be “cool” 
and to suppress enthusiasm in pre- and early adoles- 
cence to be reported here. They were, however, counted 
in the interrater agreement -of .78 on percentage of 
negativity-positivity. 

Written criteria for scoring a response negative refer- 
red to (a) spontaneous negative remarks, (b) agreement 
with negative remarks offered by a peer, and (c) nega- 
tive comments made in response to probes of an initi- 
ally neutral comment. Any such answer was scored as 
negative unless some strongly positive context (adja- 
cent remarks) overwhelmed it in the judgment of the 
rater, The several overlapping examples provided were: 
(a) hostile—“Blacks are hippies. They wear rags."’— 
(b) intolerant—Indians should dress decently, not al- 
ways cause trouble.” — (c) patronizing—“Jews are all 
brains. Too bad nobody likes brains who don’t do any- 
thing.” —(d) negatively stereotyping—“Chinese have 
buck teeth.” — (e) humorous—‘Chinese are chinks.” 

The investigator and another person trained by Kohl- 
berg scored the Measure of Moral Maturity with agree- 
ment of .82, using the issue method described in the 
Scoring Manual (Note 4). In this method, either pure 
or a major-plus-minor moral stage score is assigned to 
each issue appearing in each dilemma, to each issue ap- 
| pearing across dilemmas, and then to each dilemma as a 
whole. This permits one estimate of a child’s intended 
meanings to be checked against another, so that his or 
her basic assumptions are better understood, The moral 
maturity score is the sum of the weighted percentages 
of usage of each stage, 
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TABLE 4 


PERCENTAGE OF NEGATIVE COMMENTS AT 
Eacu Morat STAGE 


Two Three F 


Grade Description One 
2 City 45 26 
2 Suburb - 30 19 
4 City 44 20 
4 Suburb 37 13 
7 City 47 25 12 
7 Suburb (18) 13 6 8 
5 Suburb, 
longitudinal study 35 18 (7) 
7 (same) 27 13 6 5 
6 Affluent suburb (ty) 18 15: 
Totals 


36% 19% 11% 6% 


Nolte, For the numbers in parentheses, N = <5. 


Results 


Many more negative remarks are offered by 
the low-stage children at all ages. Table 4 shows 
that 36% of the comments are negative toward 
a racial or ethnic group at Moral Stage 1, 19% 
at Stage 2, 11% at Stage 3, and 6% at Stage 4, 
as the average of all samples. The overall in- 
verse correlation between moral stage and this 
index of prejudice is .58. This correlation for all 
eight groups is significant at the .001 level, even 
with age partialed out. The fact that the stage 
effects in Table 4 are much greater than the age 
effects probably means that the investigator’s 
rapport with the children plus the excitement 
of the game inhibited the hiding of prejudice. 
Because hiding could increase with age, just as 
moral stage tends to increase with age, it 
seemed advisable to partial out age as a possi- 
ble background cause of the correlations ob- 
tained. 

Likewise, partialing out IQ (tested only on 
the children in the longitudinal study) did not 
affect the significance of the results. Moral 
stage apparently depends as much upon other 
factors, such as empathy (related to attention 
and ego strength) and role taking or social 
awareness (Selman, Note 2), as upon intelli- 
gence measured by the Stanford-Binet. 

Sex effects have been minor in studies of 
childhood moral stage. That they are absent or 
minor in the present data is indicated by the 
correspondence of the correlations for sixth- 
grade boys with the other correlations of mixed 
groups in Table 5. Girls do fight less and express 
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TABLE 5 
PEARSON COEFFICIENTS BETWEEN MORAL 
STAGE AND PREJUDICE 


Moral With age With IQ 
stage with partialed partialed 


Grade N prejudice out out 


Cross-sectional study 


2nd 27 —.69 —.59 
4th 42 —.63 —.61 
Tth 32 =.52 —.51 
City 
total — 60 
Suburb 
total —.61 
Longitudinal study 
Sth 22 =13 -15  —.64 
Tth 22 —.71 —.70 = 163 
(same) 
Boys in affluent suburb 
6th 55 —.68 
—.58 —.54 


Total 


prejudices less forcefully, but they express as 
many prejudices. Table 6 shows the correla- 
tions between moral stages, IQ, school achieve- 
ment, and directed thinking (as opposed to 
stereotyping) at two grade levels for the same 
22 students. 

The effects of society’s differential treatment 
of blacks appear in Table 7, in which the ob- 
jects of the children’s negative remarks are in- 
dicated. Most prejudice is expressed by and 
toward black people. The second most pre- 
judice is expressed toward Jews. A seventh 
grader in the city reported, “A kid will come in 
the classroom and yell out, ‘Any Jews here? 
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What are you?’ so he can tell jokes, you know. 
But only Jews never tell, because they know; 
what kids think.” The small group of Chinese 
receive little negativity and show even less. 
Another finding is that hostile comments and 
tendency to fight are highly correlated. 


City Versus Suburb 


Although three fourths of the suburban sec 
ond graders were at Moral Stage 2, all but tw: 
city children of the same age were at Stage 1 
The difference of almost one stage between th 
two is never made up during grade school 
nor does it widen. The average child of 10 in 
this industrial city resembles in moral reason- 
ing and inclination to share another’s viewpoint 
a 7-year-old from a suburban school that has a 
slight population majority of lower-middy- 
class families. The average difference in stage of 
moral judgment is one stage over the 5-year age 
span represented. Similarly, the average in- 
crease during the 32 months of the study of U : 
suburban longitudinal group is two thirds o! 
stage. 

Concerning style of reasoning, the tendenci s 
of city and suburban second graders, when to 1 
to offer any comment, are alike only in th t 
28%29% of the comments are information |. 
Those offered in the city include more ret r- 
ences to classmates who are present and show 
less curiosity and directional thinking. 3y- 
fourth grade, informational comments are half 
of the second-grade level for both groups. The 
difference in being analytical widens. White 
city children in both fourth and seventh grades 
keep about half of their remarks on a stereoty- 
pic “pronouncement” level (“Italians love 
spaghetti” —499% and 469%), whereas the 18% 


TABLE 6 
CORRELATIONS OF MORAL STAGE AND IQ WITH OTHER VARIABLES (LONGITUDINAL GROUP) 


Moral stage in Moral stage in Stanford Binet 
19 


Item Grade Sth grade 7th grade 
Moral stage Sth 1.00 ct bc .60** 
7th 19%" 1.00 p99 
Percentage of ethnic negatives 5th —.73** —.71** — .46* 
Tth —.64** —71** = Ai* 
School achievement Sth 61"* .36* 70** 
Percentage of directed thinking Sth 25 39 .50* 
7th Ad 45 .49* 
Note. N = 22. 
*p = <.01. 
* p= <.001, 
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TABLE 7 


PREJUDICED EXPRESSIONS OF CHILDREN (PERCENTAGE OF TOTAL COMMENTS) 
| z x Prejudices toward 
\ Blacks Jews White Americans _Orientalsand Indians 
| Grade Suburb City Suburb City Suburb City “Suburb City 
| Cross-sectional study 
| By 2nd J” 
Whites 23 56 12 33 0 6 21 23 
| Blacks 51 31 0 12 7 71 7 42 
By 4th 
| hites 21 61l 6 39 4 0 19 26 
| Slacks 38 28 3 18 33 74 31 39 
By 7th 
| Whites 18 31 12 28 9 23 9 21 
Blacks 30 14 10 42 33 57 15 35 
Longitudinal study 
Sth 
) S9 Whites 23 -- 7 — 11 — 12 — 
Blacks 42 — 10 — 31 = 46 
k By 7th 
Whites 14 — 8 — 9 — 8 
Blacks 33 — — 33 = — 


if this type of nonanalytic remark from subur- 

an fourth graders drops to 12% by seventh 

crade. Thus, whereas city children in seventh 

grade give half social reasoning comments and 

half pronouncements or anecdotes, suburban 

n seventh grade offer five times more 

soning comments than stereotypic 

vocements, These percentages are drawn 

stal of game and interview data. 

y, correlations of style (analytic versus 

»typic) with moral stage are not signifi- 

it, One explanation may be that the warmth 

vad chiding that result from city children’s in- 

olvement with each other fosters moral de- 

velopment as much as does curiosity, at the 

f grade school level. Nevertheless, the choice of 

. more analytic style must have strong effects 

on reaching higher stages later, since these de- 

pend on dispassion and the use of formal oper- 
ational thinking (Kohlberg, 1975). 

City second graders make nearly one-third 
negative comments, twice as many as suburban 
counterparts, But prejudice in the city drops 
off about 10% among whites and 9% among 
blacks by seventh grade. In the suburbs at this 
{junior high age, it is reduced about 14% from 
the second grade level. The small decline in 
egativity among all groups, evident in Table 
6, would be somewhat greater except for the 


A. 


attitudes toward “white Americans” gained 
from older siblings’ comments on the war in 
Vietnam and urban blight. 

When the primary source of ethnic and racial 
attitudes was asked, over one third in both 
places denied any influence other than theirown 
ideas and experiences. (Although these data 
are only as good as the insight of those offering 
them, too often children are not given the, op- 
portunity to describe their motives.) In all, 
24% of suburban children admitted primary 
parental influences, whereas only 5% said 
peers initiated their ideas; 16% in the city 
cited parents, and 14% peers. School lessons 
and TV received small notice. The opinions of 
older siblings seemed to receive more respect 
than those of parents. 


Discussion 


The hypothesis is confirmed. The presence of 
a link between prejudice and the degree of 
clarity or maturity of moral reasoning is dem- 
onstrated. Thus, the element of ignorance that 
accompanies negativity in prejudice is typi- 
cally more than a lack of general intelligence or 
a lack of knowledge of specific facts about hu- 
man psychology or ethnic history. Certain proc- 
ess factors in the drawing of mature moral 
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conclusions, often including emphathic role 
taking, are absent. “Didactic instruction about 
prejudice. . . to teach youngsters to be cogni- 
tively sophisticated about prejudice, in the 
absence of their becoming cognitively so- 
phisticated generally,” (p. 102) a solution 
referred to by Glock, Wuthnow, Piliavin, and 
Spencer (1975), will be insufficient unless moral 
development is sought through active wrest- 
ling with dilemmas involving unfairness. 
Glock et al., working in three high schools, 
found prejudice to be the result primarily of 
cognitive unsophistication. The Æ scale evi- 
dence of frustration and aggression did not 
correlate with anti-Semitism at this age level. 
Against the theory that childhood prejudice 
largely arises from either the imitation of par- 
ents’ prejudices or the authoritaritan traits 
implanted by poor parenting, a cognitive-de- 
yelopmental theory would stress that in child- 
hood, regardless of environment, there is a 
rudimentary sense of justice and a naive inter- 
pretation of crude and external differences be- 
tween groups. Although broad social influences 
are important, prejudice can begin with physi- 
cal differences, which imply to the young child 
psychological and moral differences. Two white 
7-year-olds illustrate a mixture of social and 
cognitive influences while during the game they 
say: 
Ela: I don’t want to be a Negro; they’re not good. 
Me neither. Some people don’t like them; they 
look different. 
Their hair goes boing, Here’s how Roy [black] 
walks [demonstrates duck walk]. Negroes are 
brown and black—who likes it? 


Don: 


Ela: 


Two black 7-year-olds say: 


Ina: Jews are different. Maybe they speak another 
language. Their eyes are shut up more than ours 
are, like this [squints]. Jews are kind of like 
other people but I don’t like them. They call 
you names. I don’t like Indians because most of 
them don’t wear pretty clothes. But Navajos 
have pony tails. . . . You should like all people. 

Jim: Chinese are Chinks. They are funny looking. 


But all the theories of prejudice have some 
application. Often they act in concert. When 
groups already categorized as funny looking or 
inferior engage in normal fighting or behave 
badly, the prejudice of young observers is aug- 
mented. This is an outcome of reasoning. As the 
elementary school years proceed, vivid impres- 
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sions from certain symbolic and iconic repr 
sentations also have a strong influence on cor 
ceptions of status and social role. This is a 
outcome of contagion of feeling and social lear 
ing. “The Blacks live in the slums, where the 
have a million bugs, and a giant leak in th: 
faucet. And rats with little shiny eyes con 
out; I saw it in a filmstrip,” says a white sv 
ban fourth grader. With the same impressior 
many black children stereotype themselve: « 
inferior, in spite of the normal ego need to i1 
self and own-group first. This is an outcom« i 
personality. “What I can’t be is a drunk, | ik 
them big black dudes on the fire escape by us 
confides a black 12-year-old boy. 

Concerning the origins of prejudice, ti 
boy’s statement illustrates not only frank ho 
tility but also the overlap in the explanat« 
power of contingent or externally imposed 
cial conditioning, a lack in the conscious abil’ 
to respect (shown by low moral reasoning), 

a personality factor of negativity in self-co 
cept. All can add to prejudice. Anger and figh 
also cemented it in the groups studied. 

Again it must be asked whether the derog 
tory remarks of children constitute prejud 
in the adult sense. The role of cognitive imn 
turity makes this unlikely. The tentativen 
of many of the judgments made, the tie ` 
tween them and moral development, and i 
changes in the longitudinally studied childs 
reveal the strong role of conscious thous! 
Also it is clear that although young child. 
(and others) are affected by the content of « 
tudes present in the environment, new idi 
that emerge with changes in the structure 
moral judgment provide a rationale of rigl 
and duties that can block or cast doubt | 
naive expressions. 

This conclusion disagrees with previ» 
studies. Goodman (1964) referred to a stud 
Kutner (1958) that led her to conclude +i 
“true prejudice begins as early as seven yeas 
age.” Kutner’s “clearly prejudiced” group 
7-year-olds were observed only in their : 
quiescence to ethnic stereotypes provided | 


the teacher, which other children rejected. Y 
he concluded after finding cognitive correla 
of prejudice, such as signs of rigidity, that p 


judice permeates all aspects of be 
is an integral part of personality. 
(1971) thorough study of preschoolers 
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ranged beyond its own data to repeat All- 
| port’s (1954) speculation that prejudiced child- 
| ren will go through a phase of “undifferentiated 
Ù negativity” in grade school. This the present 
) study disconfirms. An acknowledged source for 
| Porter was a study by Radke, Trager, and 
« Davis in 1949, which concluded that the pre- 
" judices of 7-year-olds are “partly crystallized.” 
The authors explained the term crystallized by 
| observing that children of 7 are able to offer 
|’ philosophical rationales. But the nature of these 
1 rationales and whether they evolve was not 
(investigated, 
’ An attempt is now made to show how Kohl- 

berg’s moral stages can be described as “stages 
i of respect for persons” based on the interviews 


i 
s of this study. They show that very young, or 
ylowest-stage, children define respect as obedi- 
» ence. The higher level of negative group atti- 
ytudes is based on confusion from concreteness 
cof thinking and fear of imagined power. Reified 


labels and a fixed occupational status, not 
awareness of motives, determine judgments 
and feelings in the following example from a 
10-year-old suburban boy: 


If a slave owner mistreated his slave, and the slave came 

ato me for help, I would make him my slave. But I 
șWouldn’ t feel sorry, because his owner had paid for him, 
ee he was a slave, 


il Another 10-year-old boy, also at Stage 1, 
ahowed the power and consequences orienta- 
ittion of the stage in his reply to the interviewer’s 
F tatement, “Many Indians have different be- 
tiliefs than ours,” “You ought to try to change 
hem, or they might try to scalp you, because 
hey woulda? t like you.” Three years later, on 
hicaring the same statement while at Stage 2, 
1 said, “Let the Indians do what they want. 
After all, they sold New York to us. If they 
fant to start trouble, let them. Maybe it’s a 
rotest.” 
4 Children at Stage 2 equate respect with 
egalitarian attidues that stress mechanical bal- 
ancing. Now self-interested reference to experi- 
nce with other groups dominates reasoning. 
‘Segregation is not right, I guess, but I know 
he whites better, and I would not worry about 
eople I did not know. So I go along with the 
whites,” says a 13-year-old white boy (IQ of 
/130). This is the stage of the majority of school 
children. The lesson that “all men are created 
equal” is understood as ‘everybody has differ- 
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ent plans, and everybody can argue for his,” 
which is a kind of respect. To use others or to be 
prejudiced against them is fair, if the others 
permit it. There is no rationale for solving 
shared problems beyond “a war and then a 
peace treaty” (for urban rioting; solution of a 
13-year-old suburban boy.) 

At the conventional Stages 3 and 4, children 
become attracted to an abstract respect for 
people in general. They experience mutuality of 
viewpoint as the basis of relationships. They 
infer some obligation to make a show of con- 
cern for others, but decisions about. prejudice 
that exists in the environment are usually 
based on in-group conformity. Distant persons 
and enemies are ignored. An enemy can still be 
“a gook” or a nonperson at Stage 3 because 
complex political, economic, institutional, and 
psychological causes of social problems are not 
understood. “Shocks” from unexpected human 
behavior produce denunciations and denial of 
personal involvement. 

A change in one suburban boy’s answer to 
inquiries about fighting in school from, “No- 
bees going to do it for you,” at age 10, Stage 

, to the following Stage 3 answer 3 years later 
ca the typical increase in optimism in as- 
sumptions about human nature at Stage 3. 


I don’t fight any more, I never did, much. You have 
to be respected by acting good, not in the way of being 
tough. But other kids are suckered in by that. There's 
a few like that in every school, a few who can’t make 
friends, and who want to try it the tough way. Hspeci- 
ally black people get friends by acting tough. It’s not 
friendship, and not necessary. 


At Stage 4 sentiment is less important and 
human character is respected where recognized. 
Gaps in understanding behavior are filled in by 
hypotheses, and social models are constructed 
to include institutions and their functioning. 
The effort to be objective makes for fewer pre- 
judices than at Stage 3, but the views of a local 
majority are still overweighted. There is a 
quest for a set of accepted rules to tie together 
the disparate social elements and sentimental 
aspirations of Stage 3. But any perception of a 
single hypothesis of relations may foster a 
rigidity of judgment that favors prejudice, as 
in the following statement of a 13-year-old 
suburban white girl: 


The Jaw must represent that there are different re- 
ligions and beliefs. But, a law against euthanasia would 
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be just to protect Catholics from doctors, and that seems 
unfair to the rest of us. I think Catholics should have 
their own countries and laws. 


The Stages 5 and 6 described by Kohlberg 
are beyond the understanding of children and 
most adolescents, but because they assume uni- 
versal ethical principles that apply to all, their 
eventual achievement should be an educational 
goal. The earlier stages must first be traversed. 
To facilitate the step-by-step process, a Socra- 
tic and questioning “moral education” could 
involve dilemmas derived from “ethnic 
studies.” It seems probable that more progress 
in the stages of respect during childhood would 
reduce prejudice in the population. 

The importance of these stages is that their 
internal organization of structures causes 
learning about other groups to be selective. The 
learning is not simply externally organized by 
association with events in the outside world or 
affective events in the unconscious mind. The 
internal organizers are the concepts of persons 
and their logical identity and value, and cer- 
tain patterns of thinking or relational schemata 
that are akin to those described by Piaget for 
physical thinking. The structural organizers of 
social thinking are more difficult to know and 
are only suggested here. But because they affect 
orientation to social justice, it is not surprising 
that there is a correlation between them and 
prejudice, 
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Social Comparison and the Inference of Ability Difference 


Joseph M. Schwartz and William P. Smith 
Vanderbilt University 


Two experiments are reported, evaluating an uncertainty reduction model for 
social comparison of ability. According to a parametric metaphor, uncertainty 
about ability difference between self and other should be greater where (a) 
other’s mean performance is similar rather than dissimilar to own performance 
and (b) variance in performances of other is large rather than small. The first 
experiment indicated greater persistence in comparison where the mean differ- 
ence in performance was smaller, and greater persistence where the variance in 
the other’s performance was larger. The second experiment provided evidence 
that the persistence effects were indeed mediated by uncertainty, as indicated 
by judgments of ability differences. Subjects who found they were inferior to 
the other in performance were somewhat more reluctant to reach the conclusion 
that there was an ability difference than were subjects with relatively superior 
performance, but performance rank did not affect persistence at comparison. 
Implications of these findings for the Festinger theory of social comparison 


are discussed. 


Festinger’s (1954) theory of social com- 
parison processes states that persons learn 
about their abilities via comparison with oth- 
ers of similar ability. As Jones and Regan 
(1974) have pointed out, the predicted pref- 
erence for comparison with others of similar 
ability serves the function of increasing the 
comparer’s certainty in predicting what his 
own ability will permit him to do. Of course, 
in order to select another of similar ability 
from an array of potential referent individuals, 
the comparer must have previously assessed 
his own ability relative to such others. Jones 
and Regan note that social comparison may 
also serve the function of determining for the 
comparer just what his ability level is relative 
to others. The present analysis is concerned 
with this latter function. 

If reduction of uncertainty regarding rela- 
tive position along an ability continuum is a 
function of performance comparison, we can 
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begin to specify those variables that ought to 
affect such comparison. If a person, P, is 
interested in assessing the discrepancy in abil- 
ity between himself and another, O, three 
variables are hypothesized to be critical to 
the degree of uncertainty experienced, and 
hence to the social comparison process: (a) 
the magnitude of the mean performance dif- 
ference between P and O, (b) the variability 
of each person’s performance, and (c) the 
number of opportunities for comparison. 
These variables suggest a parametric meta- 
phor for the performance comparison process 
based on the formula for a ¢ test for the dif- 
ference of two means. The value of the dif- 
ference ¢ is assumed to reflect subjective cer- 
tainty about a hypothesis of ability difference: 


(Šp — Xo) — (up — to) 
Sot 
Ng- No 


Under the null hypothesis of no ability dif- 
ference, p — o = 0, Xp = P’s performance 
mean, X, = 0’s performance mean, Sp? = vari- 
ance of P’s performance, S,? = variance of 0’s 
performance, Vp = number of observations of 
P’s performance by P, N, = the number of ob- 
servations of 0’s performance by P. It should 
be noted that it is not suggested that people 
compute ¢ tests when comparing their own 
abilities to those of others, but that the in- 


laitt = 
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formation contained in the above formula is 
combined in some rational manner and, for 
the moment, it seems worthwhile to look at 
the ability evaluation process from the per- 
spective of the relationships implied by the 
above formula, 

Three basic relationships can now be speci- 
fied: (a) P’s confidence in a given ability 
difference should vary directly with the mag- 
nitude of the difference between P’s perform- 
ance and 0’s performance. The larger the 
mean performance difference, the more confi- 
dent P will be that there is an ability differ- 
ence. (b) The more variable either person’s 
performance, the less confident P can be that 
any mean performance difference represents 
an underlying ability difference, given some 
minimal overlap of their performance distri- 
butions. (c) As the number of occasions for 
comparing his performance with that of an- 
other person increases, P’s confidence that any 
mean performance difference reflects an abil- 
ity difference should increase. 

From the present perspective it might be 
predicted that those who perform similarly to 
the comparer would create greater interest in 
social comparison than would those who per- 
form more discrepantly. A greater number of 
comparisons would be required to achieve 
subjective certainty if mean performance dif- 
ferences were small, Thus, when the objective 
of social comparison is the determination of 
relative ability, the source of this greater 
interest in similar others lies in the greater 
ambiguity of the information they provide. 
This greater interest should not be confused 
with the demonstrated precomparison pref- 
erence for similar others where outcome pre- 
diction is the objective of comparison (Jones 
& Regan, 1974). Presumably in this latter 
context similar others are selected as compari- 
son targets because subjective certainty may 
be most easily achieved via comparison with 
them, and not, as in the case addressed in 
the present analysis, because similar” others 
themselves are loci of uncertainty. 

Support for the Festinger (1954) predic- 
tion of preference for similar others has been 
claimed by several investigators in addition to 
Jones and Regan (1974). For example, Dreyer 
(1954) found some tendency for subjects to 
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persist longer at comparison with similarly 
performing others than with dissimilar others; 
however, the focus of the Dreyer investigation 
was on level of aspiration, that is, on perform- 
ance prediction. Gruder (1971), Wheeler 
(1966), and Wheeler, Shaver, Jones, Goethals, 
Cooper, Robinson, Gruder, and Butzine 
(1969) all found some tendency for subjects 
to prefer performance information about oth- 
ers of performance rank similar to their own. 
However, given that subjects in these latter 
investigations already knew the performance 
rank order in the group prior to comparison 
choice, the function served by their seeking 
further, more specific performance informa- 
tion is unclear. In any case, in none of these 
experiments was any evidence presented re- 
garding subjects’ certainty about ability eval- 
uations as a consequence of social compari- 
son, 


EXPERIMENT 1 


The purpose of Experiment 1 was to ex- 
plore the effects of direction of performance 
difference (valence: subject > other and sub- 
ject < other), mean performance difference 
(magnitude: low and high), and performance 
variability (variance: low and high) on the 
amount of comparative information sought by 
subjects in making a determination of relative 
ability. Although the present model does not 
explicitly include the valence factor, previous 
speculation about self-esteem maintenance in 
social comparison settings (Friend & Gilbert, 
1973; Hakmiller, 1966) suggested the possible 
importance of its inclusion, 

Briefly, subjects were given up to 20 trials 
of false feedback about their own and an- 
other’s performance on a reaction time task. 
Subjects were told to reach a decision as to 
who was more able at the task, the subject or 
his comparison other. The number of trials of 
feedback utilized by subjects prior to reaching 
this decision was the dependent variable of 
paramount interest. It was predicted that (a) 
subjects receiving low-magnitude feedback 
would seek more comparative information 
than those receiving high-magnitude feedback, 
and (b) subjects receiving low-variance feed- 
back would seek less comparative information 
than those receiving high-variance feedback, 
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No specific predictions were made for the 
valence factor. 


Method 
Subjects 


Sixty-four undergraduates participated in the study 
in groups of three or four to fulfill a course require- 
ment. 


Procedure 


After being seated in individual booths, subjects 
were told the research concerned how people ex- 
tract problem-solving information from complex 
stimuli, and that the experiment would consist of a 
performance phase (Phase 1) and a feedback phase 
(Phase 2). Phase 1 instructions informed subjects 
they would be presented with slides of arrays of 
single-digit numbers printed in various colors. Sub- 
jects were to add the numbers of the color named 
at the bottom of the array as quickly as possible 
without sacrificing accuracy, The purported focus of 
Phase 1 was on individual differences in how fast 
persons could solve these problems. A fully func- 
tional reaction time apparatus (telegraph keys and 
clock-timers) was provided, ostensibly to allow mea- 
surement of time to solution accurately. The true 
purpose of this apparatus was to verify that sub- 
jects were performing the task as instructed. 

The completion of 20 error-free trials constituted 
Phase 1, After a 10-minute break during which 
Phase 1 performances were purportedly scored, sub- 
jects were told that Phase 2 was concerned with 
assessing their ability to detect performance differ- 
ences among themselves. Each subject received 20 
feedback sheets face down, one for each Phase 1 
trial. On each sheet appeared two scores, supposedly 
representing the subject’s performance and the 
performance of another group member with whom 
the subject would compare across the 20 trials. Score 
feedback was in fact constructed in advance by the 
experimenter. Subjects were told that as a control 
for variations in problem difficulty, scores would 
not be given in seconds, but in “standard form.” 
These standard scores ranged from —30 (a slow time) 
to +30 (a fast time). Zero was given as the mean 
for Vanderbilt students. 

On signal from the experimenter, subjects exam- 
ined the first of the 20 feedback sheets. This sheet 
was then collected and the signal to examine the 
next sheet given, and so on. Subjects were instructed 
to continue examining feedback sheets until they 
felt they could reasonably judge whether they or the 
other whose scores they were seeing had performed 
better during Phase 1. This judgment (better or 
worse) was recorded on a “decision sheet” provided 
for that purpose. The primary dependent variable 
(hereafter persistence) was the number of feedback 
sheets examined prior to decision. As subjects were 
asked to look at a minimum of 5 feedback sheets 
before rendering a judgment, persistence scores could 
range from 5 to 20. In addition, subjects filled out a 
1S-point confidence scale on the reverse side of the 
decision sheet immediately after making their deci- 
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sion. The lower and upper poles of this continuum 
were labeled “not at all confident” and “very confi- 
dent,” respectively. 


Time Score Feedback 


Overview of experimental manipulations. Experi- 
ment 1 was a 2 X 2 X 2 completely crossed between- 
subjects design. The experimental manipulations were 
effected by systematically adjusting the relationship 
between the subject’s own scores and those of his 
comparison other. 

Construction of feedback. The experimental feed- 
back consisted of two separate 20-trial schedules, one 
for the subject’s performance and one for the com- 
parison other’s performance, A subject’s own scores 
were always constructed so that the mean across 
any block of 5 consecutive trials (and hence across 
the full 20 trials) was equal to zero, that is, equal 
to the population mean. The variance of subject’s 
own scores was such that no score varied more than 
+3 from the schedule mean. 

The three experimental manipulations were cre- 
ated by varying certain properties of the compari- 
son other’s feedback schedule. The magnitude manip- 
ulation was accomplished by indicating that the 
comparison other’s performance was on the average 
1 or 4 points different from the subject’s. The valence 
manipulation simply reflected the sign of the differ- 
ence between the subject and the comparison other 
(subject > other, subject < other), The variance 
manipulation involved varying the degree of disper- 
sion of scores about the comparison other’s schedule 
mean, The range of scores was greater in the high- 
than in the low-variance conditions, For any block 
of five consecutive trials in the high-variance condi- 
tions, two scores were within +3 points of the 
schedule mean, two were within +4-6 points, and 
one was within +7-10 points. In the low-variance 
conditions all scores were within +3 points of the 
schedule mean. 

Feedback for both the subject’s and the other’s 
performance was constructed in 5-trial units. Each 
combination of the three experimental manipulations 
had as its source a separate pool of such 5-trial units, 
each unit containing the experimental manipulations, 
Representative 5-trial blocks for each condition ap- 
pear in Figure 1. Four such units were randomly 
selected to create 20 trial schedules. Thus, the actual 
set of scores for the subject and the other varied 
across subjects within a given experimental condi- 
tion while still remaining faithful to the rules gen- 
erating the various manipulations. The only other 
constraint imposed on the construction of feedback 
schedules was that the first score of any feedback 
schedule equal the mean for the entire feedback 
schedule. Thus, the valence and magnitude manipu- 
lations were in force on Trial 1, while the variance 
manipulation was not fully effective until Trial 5. 


Results 
Persistence 


Analysis of variance performed on persist- 
ence data yielded the predicted main effects 
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F> of magnitude, F(1, 56) = 17.73, p< .001, 
“and variance, F(1, 56) = 7:01, p< 01; ans 
=. spection of the means indicates that while 
` } low-magnitude subjects sought more com- 
Yan parative information than high-magnitude 
| subjects (Ms = 11.59 and 7.66, respectively), 
the decision sheet and therefore are not in- 

ied in the analysis of these data. Confi- 


low-variance subjects sought less information 
C. 

lence ratings produced main effects of valence, 
F(1, 54) = 6.04, p< .02, and magnitude, 
F(1, 54) = 13.45, p< .001. Subjects who 
performed better than their comparison others 
were more confident of their decision than 
those who performed worse, while subjects in 
high-magnitude conditions were more confi- 
dent than those in low-magnitude conditions. 


Confidence ratings 


Two subjects inadvertently failed to fill out 
the confidence ratings on the reverse side of 


sv j 
than high-variance subjects (Ms = 8.34 and 
10.11, respectively). 
Neither the main effect for variance nor any 
i the higher order interactions were signifi- 
cant, 


Discussion 


The results of Experiment 1 clearly sup- 
port the analysis of social comparison pro- 
î cesses guiding this research. As the model pre- 
dicts, more performance information was 
sought when the comparison other was close 
in ability to, rather than divergent in ability 
from, the subject. Furthermore, increasing 
the variability in the performance of the com- 
parison other made any given ability differ- 
ence more difficult to “read.” This is reflected 
in the greater persistence of high- as opposed 
to low-variance subjects. These findings are 
consistent with our contention that persons 
interested in a determination of ability dif- 
ference persist with a given comparison only 
¢ to the extent that it does not provide a clear 
basis for drawing an inference of ability dif- 
ference. Hence one compares more with 
“similar” others because they are a source of 
uncertainty, not because they afford “subjec- 
tively precise” standards for ability evalua- 
tion. 

Analysis of the confidence data may help 
clarify the mechanisms underpinning the per- 
sistence effects, High-variance subjects seek 
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Ficure 1. Sample feedback patterns, negative valence 
conditions only. 


more information than low-variance subjects 
but are ultimately equally confident of their 
decision. On the other hand, low-magnitude 
subjects seek more information than high- 
magnitude subjects but still remain less confi- 
dent of their decision. Thus it appears that 
an additional few trials of feedback enabled 
high-variance subjects to reach a confident 
decision. Low-magnitude subjects (i.e., those 
comparing with a similar other) remain rela- 
tively unsure of their decision despite having 
engaged in lengthier comparison. This effect, 
then, is congruent with our implication of the 
similar performing other as source of am- 
biguity. Perhaps instructions stressing the 
value of reaching a rapid decision may have 
forced low-magnitude subjects to terminate 
their information search prematurely. Finally, 
given the absence of valence effects on persist- 
ence, the finding that negative valence sub- 
jects were less confident of their decision 
than positive valence subjects is viewed as 
self-presentational. If indeed negative valence 
subjects were less certain of their ability dif- 
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ference judgment, why then did they not seek 
more comparative information? 

In any case the confidence ratings from 
Experiment 1 are hard to interpret because 
subjects made the ratings on the basis of dif- 
ferent amounts of comparative information. 
The crucial question is how the variables of 
difference magnitude and performance vari- 
ance affect judgments of ability difference, 
given the same amount of performance feed- 
back. 


EXPERIMENT 2 


In Experiment 1 we assessed the effect of 
some performance difference parameters on 
the amount of comparative information sought 
by subjects in making determinations of rela- 
tive ability. Experiment 2 had as its primary 
concern the investigation of the social judg- 
ments presumably underpinning the persist- 
ence effects in Experiment 1. Specifically, sub- 
jects were asked to rate the likelihood of 
three ability inferences—whether they were of 
equal ability, more able, or less able than a 
comparison other, These ratings were made at 
five different points during the 20 feedback 
trials. This enabled us to examine subjects’ 
judgments of their ability relative to the other 
as a function of the direction and magnitude 
of the mean performance difference, of vari- 
ance in performance, and of trials. 

It was predicted that likelihood estimates 
of the three ability inferences would become 
stronger (either more likely or less likely) 
over the five repeated measures, the direction 
of this increase being determined by the direc- 
tion of the mean difference (valence). Posi- 
tive valence feedback (subject > other) 
should increase the likelihood of an inference 
of more ability and decrease the likelihood of 
an inference of same or less ability. Negative 
valence feedback (subject < other) should 
encourage an inference of less ability, and 
discourage inferences of more or same ability. 
High-magnitude feedback (mean difference of 
4) was expected to yield stronger inferences 
than low-magnitude feedback (mean differ- 


ence of 1). Low-variance feedback was ex- 
pected to yield stronger inferences than high- 
variance feedback. ere 
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Method 
Subjects 


Sixty-four undergraduates volunteered for the 
study in order to fulfill a course requirement. 


Procedure 


Both the Phase 1 procedures and the independent 
variables as manipulated by the construction of time- 
score feedback were virtually identical in both 
Experiments 1 and 2. Phase 2 procedures in Experi- 
ment 2 differed from those in Experiment 1 only in 
that (a) subjects were informed that the experi- 
menter was concerned with their judgments of dif- 
ferences among themselves in their ability, and (b) 
subjects in Experiment 2 examined all 20 feedback 
sheets and responded to questionnaires requiring these 
judgments after Feedback Sheets 1, 5, 10, 15, and 20. 

Dependent measures. Subjects responded to three 
questions probing their evaluation of their own and 
the comparison other’s ability after Trials 1, 5, 10, 15, 
and 20. Subjects were asked whether they had about 
the same level of ability, more ability, or less ability 
than the comparison other. All responses were marked 
along 15-cm scales labeled “highly unlikely” at one 
pole and “highly likely” at the other, Intermediate 
labels of “slightly likely,” “moderately likely,” and 
“very likely” were also included. 


Results 


Data were analyzed via analysis of variance 
procedures for a 2 X 2 X 2 X 5 factorial de- 
sign with three between-subjects factors 
(valence, magnitude, and variance) and one 
within-subjects factor (repeated measures). 


Same Ability Ratings 


Analysis of subjects’ inferences of same 
ability indicated that those in the high-mag- 
nitude conditions generally considered this 
hypothesis less likely than did those in the 
low-magnitude conditions: main effect for 
magnitude, F(1, 56) = 8.92, p < .004. A re- 
peated measures main effect also appeared, 
F(4, 224) = 2.52, p = .042, but is seriously 
qualified by the presence of several interac- 
tions. 

While subjects in the low-magnitude con- 
ditions maintained moderate likelihood ratings 
of same ability across all repeated measures, 
those in the high-magnitude conditions showed 
a decrease in likelihood estimates over trials: 
Magnitude x Repeated Measures interaction 
F(4,224) = 3.34, p = 001. 

A Variance X Repeated Measures interac- 
tion also appeared, F(4, 224) = 3.00, oS 
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.019; however, variance effects are best under- 
stood through examination of the three-way 
interaction of magnitude, variance, and re- 
peated measures, F(4, 224) = 2.53, p = .041, 
which is graphed in Figure 2. Additional anal- 
ysis of the data revealed that for Repeated 
Measures 2 through 5 (there was no variance 
manipulation at Repeated Measure 1) the 
main effect of variance approaches statistical 
significance, F(1, 56) = 3.47, p= 062, and 
the MagnitudeX Variance x Repeated Mea- 
sures interaction is stronger, F(3, 168) = 3.71, 
p= .013. 

Examination of Figure 2, and of the results 
of simple effects F tests, reveals that (a) high- 
magnitude subjects generally gave lower likeli- 
hood estimates than low-magnitude subjects 
at each repeated measure (p <.12 on the 
first measure, p < .08 on the second, and p 
< .02 for the remaining measures); (b) low- 
variance subjects gave lower likelihood esti- 
mates than high-variance subjects early in the 
session (p= .022 for second repeated mea- 
sure); but (c) magnitude and variance inter- 
act on the last repeated measure, F(1, 56) = 
3.60, = .06, such that for high-magnitude 
subjects the difference between low and high 
variance has disappeared entirely at the fifth 
repeated measure, whereas for low-magnitude 
subjects, the variance difference has grown 
stronger. 


Ability Difference Ratings 


The primary analysis of the directional 
ability judgments (more able and less able) 
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Ficure 2. Judgments of same ability; Magnitude X 
Variance X Repeated Measures interaction. 
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Ficure 3. Relative ability judgment confirmed 
by feedback; Valence X Variance X Repeated Mea- 
sures interaction. 


was accomplished by conjointly analyzing 
judgments of greater ability for positive val- 
ence subjects and judgments of lesser ability 
for negative valence subjects. Thus, for all 
subjects in the current analysis, time-score 


‘feedback confirms the relative ability judg- 


ment under consideration. Analysis of these 
data indicated that, in general, high-magnitude 
subjects gave higher likelihood estimates than 
did low-magnitude subjects (main effect for 
magnitude, F[1, 56] = 14.49, p < 001) and 
likelihood estimates increased over repeated 
measures (main effect for repeated measures, 
F[4, 224] = 11.67, p<.001). Likelihood 
estimates increased at a somewhat greater rate 
over repeated measures in the high-magnitude 
condition than in the low-magnitude condi- 
tion: Magnitude x Repeated Measures inter- 
action, F(4, 224) = 2.54, p = .04, 

Valence and variance effects appeared only 
in interaction: Valence X Repeated Measures, 
F (4,224) = 2.94, p = .021; Valence X Vari- 
ance X Repeated Measures, F (4, 224) = 4.23, 
p = .033. Subjects in the positive valence con- 
dition gave higher likelihood estimates than 
subjects in the negative valence condition 
early in the session (p < .01 at the first mea- 
sure, p = .055 at the second), but not later, 
Suggesting an initial reluctance of subjects to 
accept the implication of relatively inferior 
performance, Variance also appears to have 
operated in a unitary fashion only during the 
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initial portion of the session (p = .026 at the 
second measure), low-variance subjects mak- 
ing stronger likelihood estimates than high- 
variance subjects. However, on the fifth re- 
peated measure, both the valence and variance 
factors appear in interaction, F (1, 56) = 3.67, 
p = .057. As can be seen in Figure 3, this in- 
teraction seems to be due to the surprisingly 
conservative position taken by the positive 
valence-high-variance subjects, Overall this 
three-way interaction may be seen as due to 
a combination of uncertainty engendered by 
the variance manipulation and self-presenta- 
tion effects during the early trials by negative 
valence subjects. 


DISCUSSION 


The current research has been concerned 
with one social comparison question, the de- 
termination of the rank order of ability within 
a social group. When this question is of in- 
terest, we have argued that social comparison 
may be likened to a statistical process analo- 
gous to the computation of a £ test. A person’s 
underlying goal is the reduction of uncertainty 
regarding the relative ability of himself and 
others. This uncertainty is viewed as being a 
function of the mean performance difference 
between person and comparison other, the 
variance of the performance difference, and 
the number of performance comparisons con- 
ducted. It is represented internally by likeli- 
hood estimates attached to each of three rela- 
tive ability hypotheses—those of similar abil- 
ity, more ability, and less ability. 

This analysis received considerable support 
from the two experiments reported herein. We 
predicted that in Experiment 1 low-magnitude 
subjects would seek more comparative in- 
formation prior to an ability difference deci- 
sion than would high-magnitude subjects. In 
Experiment 2 we suggested that low-magni- 
tude subjects would consider the hypothesis of 
ability similarity more plausible than high- 
magnitude subjects and would consider a di- 
rectional ability hypothesis confirmed by the 
performance data less plausible than high- 
magnitude subjects. These predictions re- 
ceived strong support in both studies. It might 
t, in the absence of the data from 
2, the magnitude effect on per- 

periment 1 might be taken as 


be noted 
Experim 
sistence i 
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evidence of some inherent superiority of the 
similar (i.e. low-magnitude) other; after all, 
subjects chose to compare more with them 
than with dissimilar others. Experiment Ls 
however, clearly demonstrates that, at least 
for judgments of relative ability, others of 
different ability allow stronger inferences 
to be made; hence it is plausible to conclude 
that this is the source of the persistence effects 
in Experiment 1. 

Although the issue of performance vari- 
ability has not appeared in prior discussions 
of the social comparison of abilities, it is ac- 
corded a central position in the present analy- 
sis. If reduction of uncertainty about relative 
ability is a primary social comparison motive, 
factors that influence the ease with which this 
uncertainty can be reduced provide critical 
tests of the validity of this assertion. It was 
predicted and found in Experiment 1 that 
high-variance feedback as opposed to low- 
variance feedback would engender lengthier 
performance comparison. It was anticipated 
that in Experiment 2 the difference between 
high- and low-variance feedback would be 
reflected in higher likelihood estimates of simi- 
lar ability for the former than the latter sub- 
jects; the results of Experiment 2 were gen- 
erally supportive of this contention, On both 
judgments of ability similarity and judgments 
of ability difference congruent with valence of 
time-score feedback, the simple main effect of 
variance was significant at the second re- 
peated measure. High-variance subjects gave 
higher likelihood estimates than low-variance 
subjects on ratings of same ability, while the 
opposite held true for judgments of more 
ability and less ability for positive valence 
and negative valence subjects, respectively. 
That these variance effects attenuate over 
time on ratings of ability similarity among 
high-magnitude subjects but not among low- 
magnitude subjects is quite easily explained 
within the current framework, Quite simply, 
the larger the mean performance difference, 
the less likely it is that performance variabil- 
ity will mask a “significant” ability difference. 

It may be noted that these findings ap- 
pear to complement those of Trope and Brick- 
man (1975), who also used a probabilistic in- 
formation-processing model (though Bayesian 
in origin, as opposed to our classical model) 


at 
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to predict subjects’ choices of tasks when at- 
tempting to diagnose their ability level. They 
found that subjects chose tasks which maxi- 
mized the difference in performance between 
persons standing at different ability levels. 
Presumably this choice was guided by an at- 
tempt to minimize resultant uncertainty. The 
present studies addressed the issues of judg- 
mental uncertainty and persistence in com- 
parison once persons are actually faced with 
small versus large and stable versus unstable 
performance differences. Our findings suggest 
that small and unstable differences between 
persons in performance indeed create greater 
uncertainty about relative ability, and hence 
encourage more extensive information search 
than do large and stable differences. 

Tt should be noted, finally, that although we 
have implicated the “similar other” as a 
source of ambiguity in our experimental situa- 
tion, this in no way contradicts the assertion 
that under some circumstances the similar 
other is a superior source of comparative in- 
formation, e.g., as a standard for prediction 
(Jones & Regan, 1974). It appears that a 
crucial factor in social comparison is likely to 
be the nature of the ability-related question 
being addressed, It is through careful atten- 
tion to the nature of the ability question being 
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asked that the social comparison process can 
be clarified. 
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The gaze direction of both the subject and his interviewer were varied inde- 
pendently, such that either looked at the other’s eyes or at the other’s knees in 
a 2 % 2 design. A repeated measures design was used with nonverbal behaviors 
of Japanese males as dependent variables. When the interviewers gazed at the 
subjects’ eyes, the response latencies of the subjects were shortened and their 
torso movements were reduced, Results from other studies of Japanese non- 
verbal behavior suggest that this pattern of response indicates greater subject 
involvement in the interaction, The magnitude of these effects was unchanged Vad 
the interviewer's gaze and making eye 
knees. It was thus reasoned that effects 
produced by eye contact in other research may have resulted from subjects’ 
knowledge that they were being gazed at and not from the unique properties 
ed at the interviewer's eyes, the hand 
self-manipulations of the subjects increased, reflecting the upsetting effects of 
monitoring the interviewer's face during interaction. The consequences of gaz- 
ing at another’s eyes thus appear to be different from having one’s eyes*gazed 


whether the subject was reciprocating 
contact or looking at the interviewer's 


of eye contact. When the subjects gaz 


at by another. 


A growing amount of research has been 
concerned with what Ellsworth and Ludwig 
(1972) refer to as the “interactive influence 
of visual behavior.” The focus of these stud- 
ies has been upon the social stimulus prop- 
erties of visual behavior rather than upon its 
informational or regulatory function during 
interactions. From this perspective, research- 
ers looking at physiological dependent mea- 
sures have concluded either that being gazed 
at (Kleinke & Pohlen, 1971) or that eye con- 
tact (McBride, King, & James, 1965; Nichols 
& Champness, 1971) produces increases in 
arousal. Studies using other types of depen- 
dent measures show similar results, Thus, 
Ellsworth, Carlsmith, and Henson (1972) 
found that being stared at produced faster 
escape responses. In Ellsworth and Carlsmith 
(1973), consistent eye contact resulted in 
lesser amounts of shock being administered to 
confederates. In these experiments the arousal 
function either of being stared at or of eye 


‘This study was supported by a Social Science Re- 
search Council research fellowship to the first author. 
Requests for reprints should be sent to Michael 
H. Bond, who is now at Chung Chi College, Chinese 
University of Hong Kong, Shatin, N.T., Hong Kong. 
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and Eye Contact During Interviews: 
Japanese Nonverbal Behavior 


contact was used as an intervening variable 
to explain the findings. A 

However, in none of these studies is it pos- "| 
sible to separate the arousal properties of eye 1 
contact from those of being gazed at. In the 
act of noticing the gaze of another, one must 
make eye contact, if only briefly. In Ells- j 
worth and Carlsmith (1973) the confederate 
either gazed at or averted his gaze from a: 
gazing subject. The resultant decrease in 
shock delivered to the gazing confederate i 
could thus have come from the aversive prop- ( 
erties for the subject of being gazed at in this } 
situation rather than from the aversive prop- 
erties of eye contact. In order to separate the | 
effects of being gazed at from those of mutual , 
gazing, the gazing behavior of both inter- 
actants must be manipulated. The present 
study attempts such a separation. 

If being gazed at is just as arousing as eye 
contact, then much of the work on eye con- 
tact might profitably be related to research on 
social facilitation where the audience visually 
monitors the subject as he performs. In both ! 
types of research people are being observed 
by others with consequent changes in be- 
havior frequently related to a mechanism of 
heightened arousal, drive level, or involve- 
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ment (e.g., Ellsworth & Ludwig, 1972; Za- 
jonc, 1965). In audience situations this 
heightened arousal is believed by many to 
derive from a greater concern by the subject 
abeut the consequences of good or bad per- 
formance (e.g., Cottrell, 1972). This concern 
bears obvious similarities to the evaluation 
apprehension of experimental subjects studied 
by Rosenberg (1969), but extends outside the 
laboratory to all social situations involving 
evaluation. Increases in this concern about 
evaluation may also be the source of the 
arousal effects seen in much of the eye con- 
tact research, for people typically evaluate 
performance of others visually. Thus, in re- 
search on both audience and eye contact ef- 
fects, the functional stimulus may be the per- 
ception of being gazed at. 

If, on the other hand, eye contact produces 
effects over and above those produced by 
being gazed at, different conclusions would 
follow. Perhaps as Argyle and Dean (1965) 
speculate, eye contact is an intimacy-produc- 
ing event in its own right with motivating 
characteristics beyond or different from those 
of being looked at by another. Control of 
both subject and confederate gazing should 
allow an assessment of this alternative in com- 
parison to the previous hypothesis. 

This control should also show if the target 
of the subject’s gaze affects the subject’s non- 
verbal presentation. Kendon ( 1967) has 
pointed out that, when talking to someone, 
people generally avoid looking at others. Pre- 
sumably monitoring the other’s reaction is too 
demanding when one must also formulate 
speech as part of a conversation, If this rea- 
soning is correct, then the stressful effects of 
having to look at an interviewer during an 
interview should have effects on the subject’s 
nonverbal behavior, The design of the present 
study should also provide an answer to this 
question. 

A previous study of Japanese nonverbal be- 
havior (Bond & Shiraishi, 1974) suggested 
which nonverbal channels might be monitored 
for the effects of the gazing variables. In that 
study forward-leaning, as opposed to back- 
ward-leaning, interviewers elicited from sub- 
jects fewer torso movements, faster response 
latencies, more gestures and self-manipula- 
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tions of the hands, longer total speech, shorter 
average pauses, and more gestures of the head, 
In the absence of any other guidelines, these 
dependent variables were selected for analysis 
in the present study, 


METHOD 


Subjects 


Male juniors (N =16) in the Sociology Depart- 
ment at Kwansei Gakuin University volunteered to 
help us train interviewers in the counseling program, 


Confederates 


Two male graduate students 1 aged 25 and 30 years 
served as interviewers. In order to ensure that their 
behavior was constant in all four experimental condi- 
tions, they were first carefully instructed in all types 
of nonverbal behaviors. They then practiced assum- 
ing a sitting position which appeared natural for an 
interviewer, but which enabled them to remain rela- 
tively motionless during the 4-min. interview, for ex- 
ample, feet flat on the floor, back firmly against the 
back of the chair, and so on. The experimenters 
stressed the importance of constancy in nonverbal 
presentation under all gazing conditions to the in- 
terviewers. A number of practice sessions were held to 
help the interviewers adapt to the various gazing con- 
ditions. The experiment began only after both con- 
federates reported feeling comfortable in all condi- 
tions. 

The interviewers memorized the two series of in- 
terview questions so that they could maintain their 
gaze without referring to their interview sheets dur- 
ing the interview, 


Independent Variables 


Subjects were recruited to help the counseling pro- 
gram at the university in training graduate student 
counselors. Upon arriving for the supposed training 
session, they were given a typewritten instruction 
sheet explaining that we wanted to test our inter- 
viewers under various types of interviewee behavior, 
It was explained that the unusual forms of gazing 
asked of the subjects would help prepare the coun- 
selors for the unexpected behavior of clients in the 
field. 

They were then asked to have two interviews dur- 
ing which they were to gaze 100% of the time at the 
interviewer's eyes in one interview (S:), his knees in 
the other (So). A verbal check was made by the ex- 
perimenter (the second author) to ensure that sub- 
jects clearly understood their task in the first and in 
the second interviews, Subjects accepted the rationale 
for the request with all agreeing to cooperate, Be- 
tween interviews, the subjects were reminded to 
change their target of gazing for the second interview, 


+The authors wish to thank Daisuke Shiraishi 
and Yasuo Iwata for their conscientious service as 
confederates, 
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The confederate interviewers also varied the target 
of their gazing, However, this change took place dur- 
ing the interview rather than across the interviews. 
At the end of 2 min., the experimenter loudly closed 
a window outside the interview room as a signal to 
the interviewer to change the object of his gaze from 
the general area of the interviewee’s knees (Io) to 
his eyes (Lı) or vice versa. The interviewer changed 
the tilt of his head along with the target of his gaze. 
Thus, in the I, condition his forehead was tilted for- 
ward and down 30° from the horizontal; in the Iı 
condition, his head was held upright. 


Design 


Each subject had two 4-min. interviews during 
each of which he received the two types of gazing 
from his interviewer in one sequence the first time, 
in the reverse sequence the second. Thus each subject 
experienced 2 min. of eye contact (S:-l), of gazing 
at the interviewer (Sı-Io), of being gazed at by the 
interviewer (So-I,), and of no gazing by either party 
(So-I,). There were two interviewers and two differ- 
ent sets of questions. Therefore the use of 16 sub- 
jects permitted a design fully counterbalanced against 
time with repeated measures across the independent 
variables (i.e, two subject-gazing instructions, two 
interviewer-gazing sequences, two confederates, and 
two interview topics). 


Procedure 


Upon arrival, the subject was met by the experi- 
menter (the second author) who gave him written 
instructions followed by a verbal check of the sub- 
ject’s understanding. The experimenter then took the 
subject to an adjoining room where he introduced 
the first of the subject's two interviewers. The ex- 
perimenter seated the subject and asked him not to 
move his chair during the interview. He then left 
the room. 

The subject was interviewed by the interviewer 
who maintained the target of his gaze until he heard 
the experimenter’s signal. Interviewer and interviewee 
sat face to face, the front of their armless chairs 120 
cm apart. The tape recorder was placed to the right 
of the interviewer, so that nothing came between the 
interactants. The camera was hidden 10 ft. (3.05 m) 
from the subject at about 45° to the plane of his 
shoulders. 

The interviewer asked a fixed series of questions 
about one of two topics, student life or family ex- 
periences, The questions began with basic informa- 
tion and then became more open-ended, After each 
of the subject’s answers to the simple questions, the 
interviewer proceeded immediately to the next ques- 
tion. After the subject’s responses to the more open- 
ended questions, the interviewer began a 3 count. If 
he completed this count, he moved on to the next 
question. Subjects were given a 10-count period to 
begin their answers. If no response (pause filler not 
included as a response) occurred within this period, 

the interviewer asked the next question, Overall, this 
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procedure allowed sufficient experimental control 

without the interview appearing unnatural. by 
The experimenter terminated the interview after 4 

min. He escorted the subject to the adjoining room 

and gave him some confidential ratings to fill out, 

These ratings consisted of 10 bipolar scales measur- N 

ing the subject’s feelings during the interview and 16 

bipolar scales used for evaluating the personality of ; 

the interviewer (Bond & Hwa Young Ho, Note 1). x { 

This information was supposedly used to help us | 


evaluate the interviewers as counselors. 

After both interviews, the experimenter (the sec- 
ond author) questioned the subjects to see if any 
were aware that the interviewers’ gazing behavior 
was programmed. No subject showed any such sus- 
picion, with most subjects saying that they were con- 
centrating on following their own task instructions 
(ie, to gaze constantly either at the interviewer’s 
eyes or knees). No subjects indicated any awareness 
of the hidden videotape camera. 


Dependent Variables 


The nonverbal measures from the Bond and Shirai- 
shi (1974) study are described below. 

Torso movement, This measure was any perceptible 
movement of the torso, as in swaying or during a 
posture shift. Frequencies were taken from the vidco- 
tape, 

Average response latency. This measure was the 
average time taken by the subject to make a re- 
sponse to the interviewer’s questions. Pause fillers 
were not scored as responses. The data were recorded 
by a stopwatch from the audiotape. 

Gestures of the head. This measure noted any 
movement of the head, vertically or horizontally, It 
was decided not to include this measure here as the 
subject instructions would interfere with free head 
movement. 

Self-manipulations of the hand. This measure, the 
amount of time the hand(s) was (were) in moving 
contact with some other part of the body, was taken 
from the videotape by a stopwatch. 

Gesture of the hands, This measure formerly noted 
a gesture of the hands scored only if it did not take 
place as part of a self-manipulation. The definition 
was expanded here to include any large movement of 
the hand, arm, or forearm, including those which re- 
sulted in a self-manipulation, such as raising the hand 
from the lap to scratch the ear, This change resulted 
in the inclusion of those large movements of the 
hand which had previously been excluded when they 
terminated in a self-manipulation. 

Pause average. A pause was operationally defined 
as any discernible time during a subject’s answer 
when no sound was made, This measure was taken 
from the audiotape by a stopwatch. 

Total talking time, The amount of time spent by 
each subject in answering questions was taken from 
the audiotape by a stopwatch. 

In Argyle and Dean’s (1965) equilibrium theory 
of intimacy, a number of variables are predicted to 
change in a compensatory way if the amount of eye 
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contact between interactants is increased. Thus, smiles 
and physical proximity should decrease. These two 
variables were therefore also measured from the 
videotape. 

Physical proximity. As subjects were asked not to 
move their chairs, the only way to change the dis- 
tance between themselves and their interviewer was 
by leaning forward or backward, Mehrabian’s (1969) 
operational definition was used with the subject being 
scored twice in each condition at 30 sec and 90 sec 
on the videotape, The two scores were added. 

Smile. No operational definition beyond the usual 
linguistic coding was attempted for smiling. 


Instruments 


A Seiko stopwatch, calibrated to .1 sec, was used 
for all timing measures. The speech and video records 
were made by a 5-in. (12.70 cm) Sony tape recorder 
and .5 in. (1.27 em) Sony video recorder, respectively. 
Scoring 

Care was taken to prevent bias during scoring. 
Thus, whenever possible given the measures being 
taken, the subject's head (and hence his gaze con- 
dition) was covered on the television screen during 
playback, The interviewer’s head was never visible 
on the tape, so his condition could not be known to 
the scorers.* 


RESULTS 
Reliability of Nonverbal Behaviors 


Two raters scored 40 2-min. samples of each 
nonverbal measure used in this study. Inter- 
judge reliability coefficients were then cal- 
culated for the 2-min. intervals. As the range 
of scores for the variable of forward lean was 
small, the percentage of agreement was com- 
puted (see Table 1). 


Treatment of Data 


The nonverbal measures were analyzed us- 
ing a 2 X 2 (Interviewer’s Gaze X Subject’s 
Gaze) analysis of variance for repeated mea- 
sures, 


Confounding Variables 


One obvious problem in truly interactive 
experiments is control of the confederates’ be- 
havior. If the experimental conditions of gaz- 
ing affect the nonverbal behavior of the con- 
federates, then the independent variables are 
systematically confounded. Despite attempts 
to control such bias (see Method section), 
systematic differences are clearly possible. 

As only the subjects were videotaped, it 
was impossible to check for systematic differ- 
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TABLE 1 
RELIABILITIES OF NONVERBAL BEHAVIORS 
IN Two-Minute INTERVALS 


Variable 
Self-manipulation of hands 89 
Gesture of the hands -96 
Torso movement 17 
No. smiles .99 
Forward lean 54 
Speech total time 99 
Pause average 97 


Response latency average 99 


ences in interviewer body movement. How- 
ever, the audiotape provided information 
about the interviewer’s paralinguistic behay- 
ior, Speech disturbances (Kasl & Mahl, 1965), 
number of pauses, and length of pauses fol- 
lowing the subjects’ last utterance were all 
measured and found to be unrelated to ex- 
perimental conditions. This partial evidence 
suggests that the effects described below were 
a function of the independent variables and 
not of some other confounded behaviors on 
the interviewers’ part, 


Manipulation Check 


The videotape record clearly showed sub- 
jects fixating the region of the interviewers’ 
knees during the Sọ condition (see Table 2), 
It was impossible to check whether subjects 
were exactly fixating the interviewers’ eyes in 
the S; condition from the videotape. However, 
interviewers were asked to estimate the per- 
centage of time the subjects maintained eye 
contact in the S,-I, condition. The average 
percentage was 81. 


Interviewer Gazing 


When the subjects’ eyes were being fixated 
by the interviewers, the subjects made fewer 
torso movements, F(1,15) = 5.81, p < .05, 
and gave shorter average response latencies, 
F(1, 15) = 6.58, p < .025, than when the in- 
terviewers fixated the subjects’ knees (see 
Table 3). Despite the suggestion of an inter- 
action for response latency averages in Table 
2, the F value for the interaction is only .66. 


2 The authors extend their hearty thanks to Hwa 
Young Ho, Yasuo Iwata, Daisuke Shiraishi, and 
Toshiko Takeuchi who helped score the tapes, 
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TABLE 2 


AVERAGE RESPONSE LATENCY IN SECONDS AS A 
FUNCTION OF THE TARGET OF SUBJECT 
AND INTERVIEWER GAZE 


Interviewer target 


Subject a 
target Knees Eyes M 
Knees 2.45 2.57 2.51 
Eyes 2.61 1.88 2.25 
M 2.53 2.23 2.38 


As a double check for the possibility of an 
interaction, differences between Ip and J, con- 
ditions were calculated for each subject at 
both Sọ and Sı. An analysis of variance on 
these difference scores showed no greater im- 
pact for the interviewer-gazing variable at Sı 
compared to So. 


Subject Gazing 

Subjects show more self-manipulations of 
the hands when gazing at the interviewer's 
eyes than when gazing at the interviewer’s 
knees, F(1,15) = 7.72, P< 01 (see Table 
4). 


Interviewer Gazing Versus Eye Contact 
(Soli vs. Sl) 

There were no interactions between the sub- 
ject- and interviewer-gazing variables which 
might haye suggested a difference between Sol, 
and $,I; conditions, As a further check, ¢ 
tests were run for all eight dependent vari- 
ables comparing responses in these two condi- 
tions. No result was significant even when the 
more liberal alpha level of .10 was chosen to 
avoid making Type II errors. 


Order Effects 
This study was not designed to examine the 
effects of interviewer gaze sequence either on 


TABLE 3 


Torso MOVEMENTS IN A Two-Minute PERIOD 
AS A FUNCTION OF THE TARGET OF SUBJECT 
AND INTERVIEWER GAZE 


j Interviewer target 
Subject per taana 


target Knees Eyes M 
Knees 41 2.4 Ei 
Eyes 63 24 44 
M 5.2 24 3.8 
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TABLE 4 
SeL¥-MANIPULATIONS OF THE HAND IN SECONDS 
DURING A Two-MinuTE PERIOD AS A 
FUNCTION OF THE TARGET OF SUBJECT 
AND INTERVIEWER GAZE 


Interviewer target 
Subject ects 


target Knees Eyes M 

Knees 30.3 24,3 vK) 
Eyes 40.3 46.0 43.3 
M 35.4 35.2 35.3 


subjects’ ratings or on their nonverbal be- 
havior. It is probable that the requirements of 
controlling their gaze during the interviews 
left the subjects too preoccupied to assess the 
impact of changes in the target of interviewer 
gazing (i.e., Iy-l vs. I,-Io). 

Nonetheless, order effects would be of in- 
terest if they were present. Consequently, 
analyses were run on the rating measures and 
on the nonverbal measures to answer this 
question, The repeated measures design made 
it impossible to assess interactions between 
interviewer-gazing order and either the target 
of the subjects’ gaze or the target of the inter- 
viewers’ gaze, However, main effects of in- 
terviewer-gazing order could be tested. 

There were no order effects for the non- 
verbal measures and only two at the .05 level 
for the 26 rating measures. As these two ef- 


fects could easily have been chance findings, . 


they are not considered further. 


Interviewer Effects 


Interactions between the identity of the 
two interviewers and either the subject-gazing 
variable or the interviewer-gazing variable 
would suggest that the effects for these vari- 
ables were dependent on the characteristics of 
the specific interviewer and not on the inde- 
pendent variables. The repeated measures de- 
sign meant that this possibility could only be 
assessed in the first or in the second inter- 
views but not across both interviews. Con- 
sidering the first and the second interviews 
separately, there were no interactions between 
the identity of the interviewer and the inde- 
pendent variables for any of the nonyerbal 
measures showing effects. Thus the results re- 
ported above appear to be independent of 
interviewer identity. 


A 
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Discussion 


Three main findings emerged from this ex- 
periment. First, being gazed at is an effective 
variable in changing interviewees’ nonverbal 
behavior. Second, the impact of an inter- 
viewer's gazing at the subjects? eyes was just 
as great whether the subjects were simul- 
taneously giving eye contact or looking at the 
interviewer's knees. Finally, the subjects’ be- 
havior changed when they looked at the inter- 
viewer’s eyes, but this change is functionally 
distinct from that produced when the inter- 
viewers gazed at the subjects? eyes. Each of 
these conclusions is discussed below, 

Some mention should be made here of those 
variables showing no effects, that is, pause 
average, total talking time, forward lean, and 
number of smiles, In the case of the first two, 
it turned out that the interviewers failed to 
limit subject pauses to 3 sec as specified in 
the procedure and as had been done in Bond 
and Shiraishi (1974). This unintended change 
made the measures of pause average and total 
talking time noncomparable to those of the 
earlier study. This alteration of procedure 
may explain why no effects were seen for 
these two variables, In the case of smiles and 
forward Jean, there was no reason to expect 
effects for Japanese subjects across these mea- 
sures. The variable of smiles has proved un- 
responsive to a variety of manipulations in 
Japan, although it does appear sensitive in 
Western subjects (Rosenfeld, 1966; Schultz 
& Barefoot, 1974). The variable of forward 
lean was not reliably enough measured in this 
study to assess its responsiveness adequately, 


Interviewer Gazing 


Although not significant at the .05 level, 
there was a tendency for the interviewer-gaz- 
ing variable to reduce the subjects’ hand 
gestures, F(1, 15) = 3.25, p <.10, As men- 
tioned before, interviewer gazing also resulted 
in fewer torso movements. At a purely de- 
scriptive level, these two results indicate that 
there may be some concrete basis to the meta- 
phorical suggestion of being frozen by an- 
other’s gaze (Laing, 1965; Sartre, 1943/ 
1956). Where escape is impossible, as in this 
situation, one’s gross movements can be im- 
mobilized by a gazing other. 


Taken as a group, what do the reductions 
in torso movements, gestures of the hand, and 
average response latencies indicate about the 
meaning of being gazed at for Japanese sub- 
jects? Unfortunately the design of the present 
experiment prevented the use of the subject’s 
ratings to answer this question because the 
target of the interviewer gazing was varied 
within each interview, whereas the ratings 
were taken after each interview, However, 
other studies have included ratings of the in- 
terviewer along with most of the measures of 
subject nonverbal behavior used here. 

Fewer torso movements were made by Jap- 
anese males to forward-, as opposed to back- 
ward-, leaning interviewers (Bond & Shiraishi, 
1974). In the same study, shorter response 
latencies were given by males and females to 
these same forward-leaning interviewers. These 
forward-leaning interviewers were in turn 
rated as more talkative, more polite, more 
liberal, and more attractive by their inter- 
viewees. In a different study, males and fe- 
males showed shorter response latencies to a 
higher, as opposed to an equal-, status inter- 
viewer (Bond & Hwa Young Ho, Note 1). 
There the higher status interviewer was per- 
ceived as sturdier and more cautious, No 
studies have found any effects for the new 
measure of hand gestures, 

All the above ratings are positive charac- 
teristics for Japanese subjects, The nonverbal 
effects seen in this experiment have thus been 
found in other studies where the interviewers 
eliciting the nonverbal effects have been rated 
positively on an evaluative dimension. This 
indirect reasoning suggests that the non- 
verbal responses seen here to the gazing in- 
terviewer indicate a positive response to his 
gazing, This conclusion is consistent with an 
unpublished study using Japanese females 
where an interviewer gazing 90% of the time 
was regarded as more polite and as more 
humble than an interviewer gazing only 10% 
of the time (Bond & Kodama, Note 2). 

These results are consistent with findings 
from studies in Western countries. Increasi 
amounts of gaze have produced more positi 
evaluations (e.g., Cook & Smith, 1975), 
though the direction of this evaluation 
pends upon the nature of the interaction (El 
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worth & Carlsmith, 1968). In their review of 
the literature of visual interaction, Ellsworth 
and Ludwig (1972) conclude that the es- 
sential effect of being gazed at is to increase 
one’s involvement in the interaction. The 
functional relationships in Japan for shorter 
response latencies, fewer torso movements, 
and fewer hand gestures indicate greater at- 
traction and are consistent with a conclusion 
of greater involvement in the interaction with 
the gazing interviewer of the present study. 
Thus, the Japanese response to the gaze in an 
interview situation appears to be the same 
as the Western response, although the non- 
verbal channels related to this attraction prob- 
ably differ depending on cultural display rules 
(Ekman, 1972). 


Interviewer Gazing Versus Eye Contact 


- The response to an interviewer gazing at 
the subject’s eyes was the same whether the 
subject reciprocated by looking at the inter- 
viewer’s eyes, thus making eye contact, or 
looked instead at the interviewer’s knees. 
The fact that eye contact was maintained 
only about 80% of the time by the subjects 
in no way qualifies this finding, because in 
similar interview situations Japanese males 
gave eye contact only about 20% of the time 
to a steadily gazing male interviewer (Bond 
& Shiraishi, 1974; Bond & Hwa Young Ho, 
Note 1). Thus the fourfold increase in this 
base rate should have afforded an adequate 
assessment of the subjects’ nonverbal behavior 
during eye contact. 

Before drawing any conclusions about lack 
of differences between eye contact and being 
gazed at, some comment should be made 
about the nature of this experimental situa- 
tion. First of all, the subject’s gazing behav- 
ior was determined by the experimenter’s in- 
structions and was not free to vary. Second, 
the interview format used here produced a 
highly structured interaction between pre- 
viously unacquainted persons. And finally, it 
is probable that the definition of the situation 
as an interview made an evaluative dimension 
salient to the subjects. A change across any 
of these dimensions could produce differences 
in nonverbal behavior between conditions of 
eye contact and those of being gazed at. 


MICHAEL H. BOND AND HIROSHI KOMAT 


In some respects, however, the experimen- 
tal situation used here is similar to that used 
in other experiments because, in most studies, 
subjects are unacquainted and the interac- 
tions are tightly structured. It is arguable, 
therefore, that having one’s eyes gazed at was 
the functional stimulus for arousal in much 
previous research on eye contact. 

Argyle and Dean (1965) have hypothesized 
that eye contact increases the intimacy level 
of an interaction, If this increased intimacy 
has nonverbal consequences as they have 
maintained, these effects were not tapped by 
the variables measured here. In this situation, 
at least, a state of eye contact appears to 
result in no greater impact than one of being 
gazed at. 

As mentioned before, this finding suggests 
a link between some experiments on eye 
contact and those on audience effects where 
performance is visually monitored. Specif- 
ically, it is possible that the perception of 
being observed is the functional stimulus in 
both cases. When being observed, one’s per- 
formance becomes available for evaluation 
and arousal (or involvement) ensues. In op- 
position, Zajone (Note 3) maintains that the 
mere presence of a conspecific is required and 
that audience effects are seen even in an evalu- 
ation-free situation. It may, however, be dif- 
ficult to eliminate the evaluative component 
from the gaze, given that throughout our life 
much of our activity has been assessed vis- 
ually by others. It is interesting in this respect 
that Cottrell, Wack, Sekerak, and Rittle 
(1968) were able to eliminate audience ef- 
fects by blindfolding the audience. Although 
Zajonc does not regard this experiment as an 
adequate test of the mere presence hypothesis, 
Cottrell et al.’s study suggests the effective- 
ness of the gazing variable in audience situa- 
tions. If a gazing other produces a generalized 
evaluation apprehension, then it should be 
possible to produce audience effects merely 
by having an audience gaze at a performer 
even where the correctness of the performance 
can only be evaluated nonvisually. This ques- 
tion is currently being examined. 


TARGETS OF GAZING 


Subject Gazing 


The requirement to gaze at the inter- 
viewer’s eyes served to increase the subjects’ 
self-manipulations of the hand. This increase 
was no greater when the interviewer was also 
looking at his partner’s eyes than when the 
interviewer was looking at his partner’s knees. 
Again there was nothing functionally unique 
about the condition of eye contact in this par- 
ticular situation. 

In Bond and Shiraishi (1974), increases in 
hand self-manipulations along with shorter re- 
sponse latencies and fewer torso movements 
formed a part of the nonverbal pattern elic- 
ited by forward-leaning interviewers. How- 
ever, in the present study this pattern of 
response was cleanly divided between the sub- 
ject- and interviewer-gazing variables, sug- 
gesting that a multidimensional response was 
given the forward-leaning interviewer in that 
previous study, It is quite possible that in- 
creases in hand self-manipulations indicate a 
disruption produced by information overload, 
as Kendon (1967) and others might suggest. 
This conclusion would be consistent with re- 
sults from the Bond and Shiraishi study com- 
paring. forward-leaning to backward-leaning 
interviewers; forward-leaning interviewers 
have a greater visual and auditory prominence 
possibly leading to stimulus overload for their 
interviewees. Relevant here is the finding 
from Bond and Hwa Young Ho (Note 1) 
that more self-manipulations of the hand are 
associated for males with perceiving the in- 
terviewer as more active. This perception is 
consistent with the hypothesis of stimulus 
overload. 

Apparently, however, the effects of such in- 
formation overload can be quite distinct from 
those produced by experimental manipula- 
tions which seem to increase subject involve- 
ment. In Bond and Shiraishi (1974), the 
forward-leaning interviewers apparently pro- 
duced both information overload and involve- 
ment; in the present study the effects of 
information overload and of greater involve- 
ment are associated with different manipula- 
tions. Different nonverbal channels appear to 
be sensitive to different features of the social 
situation. 
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» “™ Effect of Incentives and Education on Resource Conservation 
Decisions in a Simulated Commons Dilemma 


Paul C. Stern 
Elmira College 


Uncontrolled growth in a finite world creates conflicts between individual and 
collective good and between short-term and long-term good. Two strategies 
are defined for escaping this double dilemma: (a) making group-oriented be- 
havior pay off for individuals through incentives or strengthened group ties 
and (b) inducing people to act from a long-term perspective by educating 
them about the probable consequences of their acts. Forty-eight subjects played 
versions of a four-person commons game, modeled on a carpool, in which 
forms of incentive and education were manipulated. Incentives in the form of 
price increases produced more conservation and extended the life of resources, 
while direct payoffs and a rationing system were ineffective. The effect of the 
price incentive increased with its magnitude, Education by spot messages was 
ineffective, but detailed information about long-term consequences significantly 
extended the life of resources. This effect appeared early, before incentives 
showed any influence. While both strategies were effective in the laboratory, 
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reasons are given to expect less success in the outside world. 


Time is running out for the world. Over- 
population, pollution, and the depletion of non- 
renewable resources are the result of uncon- 
trolled growth in a finite environment. These 
processes are the product of a pervasive conflict 
between the individual short-term good, which 
demands continued growth, and the collective 
long-term good, which requires restraint. 
Hardin (1968) has described the world situa- 
tion as a “tragedy of the commons.” He com- 
pares our situation to that of a group of 
shepherds grazing their flocks on a common 
range. Each is motivated to add an animal to 
his or her flock, since the net profit would be 
the full value of an animal less a small cost due 
to competition from the new animal for the 
available grass. Since each shepherd profits 
from adding animals, there is an inexorable 
trend toward overgrazing of the range, which 
ends only with the ruin of all the shepherds. 
Hardin argues that the logic of the commons 
operates whenever people have unlimited 
access to a cheap but finite resource. The world 
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supplies of food, water, clean air, and mineral 
resources are examples of commons which 
people are currently exploiting. The logic of 
the commons dictates that, through the ra- 
tional actions of individuals pursuing their own 
well-being, these resources will all be exploited 
until they can no longer support the popula- 
tion. Hardin calls this situation tragic because 
individual rationality leads with certainty to 
collective ruin, The search for an escape from 
this dilemma is a growing concern among 
social scientists. 

A wide variety of social dilemmas appear to 
have formal similarities to the commons situa- 
tion. A conflict between individual and group 
rationality exists in armies, where each in- 
dividual stands to gain by deserting in combat, 
in labor organization, where individuals can 
gain the benefits of a union without paying 
dues, on sinking ships, where individual ra- 
tionality dictates that people rush to save 
themselves while risking everyone’s lives, and, 
in general, in any situation in which many 
individuals must act together to secure a com- 
mon good (Olson, 1965). In all these situations, 
individuals can profit from the effort of others 
without making the corresponding effort them- 
selves, yet if everyone reasons this way, there 
is no profit for anyone. These collective action 
problems have been compared with the com- 
mons dilemma, and the similarity of both of 
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these to the classical prisoners’ dilemma has 
also been noted (Buckley, Burns, & Meeker, 
1974; Dawes, 1975; Hamburger, 1973). As will 
be pointed out, the commons dilemma also 
differs in crucial respects from both the general 
problem of collective action and from the 
prisoners’ dilemma. 
The prisoners’ dilemma has, however, pro- 
vided one possible theoretical model for the 
commons situation (Dawes, 1975; Dawes, 
McTavish, & Shaklee, 1975). In a prisoners’ 
dilemma, individuals are faced with a choice 
between two acts, cooperation and defection. 
In the prototypical situation, these represent 
keeping conspiratorially silent about a crime 
and tuming state’s evidence against one’s ac- 
complice(s), respectively. The payoffs to each 
individual are such that no matter what choice 
others make, more is earned by defection than 
by cooperation. However, defection by every- 
one pays each player less than would be earned 
if each cooperated. Thus, the prisoners’ di- 
Jemma reproduces the conflict between indi- 
vidual and group rationality which is a promi- 
nent characteristic of the collective action and 
commons situations. 

Analysis of the commons as a prisoners’ 
dilemma suggests that Hardin’s tragedy may 
be escaped if people can somehow be induced 
to base their actions on group rationality, One 
way to achieve this end is to provide posit ive or 
negative incentives to individuals so that 
group-oriented behavior becomes more in- 
dividually rational (e.g., Grzelak & Tyszka, 
1974; Kelley & Grzelak, 1972). Real world 
examples of such incentives include tax ad- 
vantages for families which control fertility, 
charges for mining minerals, remission of tolls 
to carpools, and mandated pollution controls 
on automobiles. Another approach emphasiz- 
ing group rationality involves strengthening 
individuals’ ties to their groups. For example, 
intragroup communication has been found to 
facilitate cooperation in n-person prisoners’ 
dilemmas (Dawes et al., 1975; Grzelak & 
Tyszka, 1974), and structural factors such as 

power, cohesion, and status may also influence 
individuals to give more consideration to group 
interests (Buckley et al., 1974). 

While the model of the commons as a 
prisoners’ dilemma accurately reflects conflict 
between individual and group rationality, and 
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while it suggests some possible solutions to the 
dilemma, it is nevertheless an incomplete 
model of the commons. Stern (Note 1) has 
identified four properties of the commons 
which are not represented in the prisoners’ 
dilemma. 

First, individuals in a commons can m 
choice which frees them from the immediate 
effects of their fellows’ actions. For example, 
someone who buys a gas-guzzling new car 
knows approximately what this decision cost, 
and the cost is unaffected by the neighbors’ 
car-buying decisions, Thus, in a commons, the 
alternative to cooperation is independence, 
rather than defection; a player choosing in- 
dependence is unaffected by what other players 
choose. The special value of independence to 
some people may lead them to behave differ 
ently in a commons from the way they mig! 
act in a prisoners’ dilemma. 

Second, research on the commons diles ama 
should allow the possibility of intragroup con 
trols, such as an agreement to punish nonco- 
operators. The prisoners’ dilemma game rules 
do not allow players to form agencies with 
governmental functions. 

Third, intergroup relationships are impor- 
tant in a commons. The behavior of a small, 
face-to-face group may not generalize to a 
situation in which outside individuals and 
groups, with whom communication is difficult, 
can materially influence a group’s outcome. 

Finally, and most important for this dis- 
cussion, every commons situation involves the 
depletion of a finite resource. People in a com- 
mons are thus in a temporal interaction with 
their environment: As people consume re- 
sources, they change the environment in ways 
which make further consumption more and 
more costly. These changes may take the form 
of higher prices (as with mined minerals, when 
lower and lower grade ore must be refined) or 
decreased benefits from exploiting the com- 
mons (as when untreated sewage begins to 
noticeably foul the polluter’s immediate en- 
vironment). In the language of gaming, the 
payoffs change as a function of the players’ 
past actions. This property of the commons 
situation is not necessarily present in collective 
action problems, and is never part of a pris- 
oners’ dilemma game. It is the changing en- 
vironment in a commons which creates the 
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RESOURCE CONSERVATION DECISIONS 


conflict between short-term and long-term ra- 
ta tionality. While this conflict is implicit in the 
work of theoretical writers on commons di- 
lemmas, it has largely been absent from the 
empirical research, which has relied on static 
models, such as the prisoners’ dilemma. 

From the viewpoint of a static model, it is 
clear that the increasing costs of using re- 
sources eventually make conservation indi- 
vidually rational. A crucial question, however, 
is whether conservation will increase before the 
life support systems are irreparably damaged. 
A dynamic model which simulates the chang- 
ing environment can be used to study man- 
environment interactions in the face of de- 
clining resources and to evaluate which of 
several proposed policies can significantly 
extend the lifetime of a finite pool of resources. 
The only relevant available research (Ruben- 
stein, Watzke, Doktor, & Dana, 1975) suggests 
that both incentives and scarcity can increase 
conservation, but provides no data on the 
effect of declining resources on conservation, 

A dynamic model also suggests an additional 
strategy for solving the commons dilemma: 
‘Tragedy may be escapable if people can be 
induced to base their actions on long-term 
rather than on short-term rationality. This 
strategy might involve an educational program 
in which people are informed about the prob- 
able long-term consequences of their actions. 

Kelley and Grzelak (1972) have presented 
we evidence that cooperative players in a pris- 
oners’ dilemma have a longer time perspective 
than noncooperative subjects. It is plausible 
that information which encourages a long-term 
viewpoint might lead to increased conservation 

in a commons situation. 
Two strategies have been suggested for 
‘ averting the tragedy of the commons: People 
may be induced to behave according to group 
rationality (by the use of incentives or strength- 
ened ties to a group) or they may be induced 
-to behave in accordance with a long-range 
perspective (through education about the 
probable long-term consequences of their acts). 
In the present study, both strategies are used 
to influence behavior in a laboratory commons 
game modeled on a carpool. The carpooling 
situation has many of the formal properties 
which have been identified in the commons 
dilemma (Stern, Note 1), and carpools can be 
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readily simulated in the lab with small num- 
bers of subjects. Tt is hypothesized that posi- 
tive incentives (reduced costs to drivers in car- 
pools) and negative incentives (increased prices 
or gasoline rationing) increase conservation and 
prolong the life of resources. This effect is 
predicted to occur sooner when freedom to 
consume is directly restricted (rationing) than 
when indirect control (pricing) is used. In- 
formation about long-term consequences (spot 
messages and/or detailed information about 
how consumption determines prices or rations) 
is expected to bring about an early and per- 
sistent increase in conservation and to extend 
the life of the resources. 


METHOD 
Subjects 


The subjects were 48 undergraduate students drawn 
from classes in Freshman Arts and Sciences and In- 
troductory Psychology. They were told they would be 
paid for their participation. 

One commons game was played in each cell of a 
2 X 3 X 2 design (the manipulated variables were ra- 
tioning vs. pricing, type of influence attempt, and 
amount of information given about the game). 


Structure of Games 


The basic commons game (pricing, no influence at- 
tempt, some information) involved four players who 
made choices between X (which represented driving 
alone) and Y (which represented joining a carpool), 
Players choosing Y were required to choose one among 
them to be YW (representing the driver), with the 
others becoming YZ (riders), Group members had the 
option to quit the group and choose X during the 
process of deciding who was to be YW. ‘These choices 
were made on each of a series of trials, 

At the start of each trial, each player was paid a 
salary of 11 tokens (1 token = three cents of real 
money) and the cost of X, YW, or YZ was paid out of 
this salary, according to the player’s choice for that 
trial. These costs were announced before each trial, after 
having been determined on the following basis: A 
= cost of driving a car = 5 tokens at the start of the 
game; B = cost of membership in a carpool = 5 
tokens; C = cost of driving in a carpool = 1 token, A 
represents tangible costs such as gasoline and deprecia- 
tion of a car, while B and C represent intangible costs 
such as inconvenience and loss of individual freedom. 
Players choosing X paid A, those choosing YZ paid B, 
and those choosing YW paid A + B + C. Thus, the 
costs of X, YZ, and YW on the first trial were 5, 5, and 
11, respectively. These costs imply that as the games 
began, the cost of riding in a carpool (YZ) was equal to 
that of driving alone (X). In short, the games started 
with players having nothing to gain by joining a carpool: 
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In pricing games, the price of gasoline (reflected in 
the cost of driving and paid by players choosing X or 
YW) changed as a function of the group’s consumption 
of gasoline. A tally was made for each player who chose 
other than YZ on a trial, and a cumulative total of 
these tallies was recorded. Tallies represented units of 
fuel consumed. The price of gasoline was increased by 
one token on the next trial after the total tallies passed 
these values: 6, 12, 17, 21, 25, 28, 31, 33, 35, 36, 37, 38. 
‘Thus, prices increased as a function of tallies, increasing 
more and more rapidly as gasoline resources became 
depleted. Subjects were not told in advance when the 
prices would change, nor were they told that the game 
would be terminated after the third trial on which 
cumulative tallies exceeded 26. 

Rationing games differed from the above in that 
prices never changed (X paid 5, YZ paid 5, and YW 
paid 11, throughout the game). However, in addition 
to the salaries received at the beginning of each trial, 
players also received tickets which represented rationing 
coupons. Each player began the game with two tickets 
which were fully transferable, except that no player was 
allowed to hold more than two tickets at once. Any 
player choosing X or YW (i.e. driving) on a given trial 
was required to spend a ticket, The total number of 
tickets in a game began at eight (two per player) and 
decreased by one on the trial that began after the cumu- 
lative tallies had passed these values: 6, 12, 17, 25, 31, 
36. Thus, used tickets were not always replaced. The 
effect of this system was to make rationing more severe 
as a function of resource use and finally to deprive some 
players of the freedom to drive. Any player having no 
tickets was required to get a ride (i.e., become YZ) or 
else forfeit the next trial’s salary. This simulates a 
likely consequence of losing a gasoline ration. 

Three levels of the influence attempts variable were 
used. The no-influence condition present in one-third 
of the games has already been described. In one-third 
of the games, a positive incentive was included in the 
form of an additional rule. Players were told that if 
three or four of them chose Y, the cost of YW would 
be decreased by two tokens, This is analogous to the 
practice of offering free parking or lower tolls to par- 
ticipants in carpools. In the remaining third of the 
gan ga Se were introduced before Trials 3, 

, and 14, 

messages constituted an attempt to influence 
reminding them about the long-term con- 

their behavior. The messages were as 

In pricing games, before Trial 3, “You will 

-ala y and this will be used to pay for the 
designated choice. The more times 
anything other than YZ, the more the 
crease.” Before Trials 7 and 10, “The 
ve tallies accu. wlated, the higher the payments go.” 
Re Trial 14, Your salary can stretch only so far. 
sed, the higher the payments. Save 
ı rationing games, players were told 
“Yc ı will receive a salary to pay for 


you 


tallies v 


uture, 
ial 3, 
-sated choice, You will also receive a ticket 
¿nable you to be X or YW. The more times 
e anything other than YZ, the fewer tickets 
ve,” Before Trials 7 and 10, “The more 
lated, the fewer tickets received.” Before 
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‘Trial 14, “Your allotment of tickets can only go so far. 
The fewer used, the longer they'll last. Save for the 


future.” The influence attempts involved no change in 


the basic games other than those described here. 

Information was also manipulated. Players in the 
some information games were given the information 
that the cost of X and YW (or availability of tickets) 
depended on tallies and that these costs would increase 
(or tickets decrease) “once the number of tallies passes 
a predetermined point.” Players in the full information 
games were given this information, as well as the exact 
schedule of when the prices (or rations) would change 
as a function of tallies. Thus, full information subjects 
had more complete information about the long-term 
consequences of their actions. The added information 
was especially 
jects were made aware that the prices (or rations) 
would change slowly at first and would then change 
more rapidly, The value of early conservation was tht 
made known to those who would think about it. 


Procedure 


The subjects were seated around a small 
told they were about to play a game, ‘The simi! 
a carpool was not mentioned, so that attitudes atx 
how one should behave in a carpool would not influence: 
choices. The game leader distributed copies of the rules 
for the variant of the game to be played, read them 
aloud, and encouraged the subjects to ask questions 
about the game. The leader and players also went over 
a series of scoring examples to aid in understanding thi 
meaning of tokens, tallies, and, when appropriate. 
tickets, The rules contained information about salaries 
costs of X, YZ, and YW; rations (if appropriate); the 
procedures for deciding to be X, YZ, or YW; and either 
some or full information about when the costs (or 
rations) would change. In games involving positive 
incentives, the incentive was included as one of the rules 
of the game. In all games, players were given the vague 
information that the game would end “after the tallies 
reach a certain number.” 

At the start of each trial, the leader announced the 
value of X, YZ, and YW for that trial. When all players 
had chosen either X or Y secretly, they announced their 
choices to the group. When tokens (and tickets, if ap- 
propriate) had been collected from the players who 
chose X, those who chose Y were allowed 3 min. to 
choose one among them to be YW. If only one player 
chose Y, the rules stipulated that the player automa 
tically became X. r 

At the end of the meeting, the group leader collecte 
the appropriate payments. The leader announced the 
number of YZs and tallies for the trial and the totul 
number of tallies accumulated thus far in the gamc. 
Trials continued until the end of the game was reached 
Free communication among players was allowed at all 
times, except that players who chose X were instructed 
not to communicate with the others while they were 
deciding who was to be YW. A postexperimental ques- 
tionnaire was administered when the game ended, and 
the subjects were debriefed. At this time the analogy 
to a carpool and the meaning of the tallies were ex- 
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plained. No player suspected the relationship to any 
commons situation before the debriefing. 

A measure of resource conservation was obtained by 
recording the percentage of YZ or YW choices (ie., 
choices to carpool) made in each of six stages of the 
games. Each trial was included in a stage of the game 
defined by the number of tallies recorded before that 
trial began (0-6, 7-12, 13-17, 18-21, 22-25, or over 25) 
Thus, trials which occurred under identical condition. 
of gasoline price or availability of rationing coupons 
were grouped together in the same stage. The measures 
of conservation thus represented the percentage of 
players joining a carpool during each stage of each game, 

A second dependent measure was an index of the rate 
of resource depletion and the lifetime of the gasoline 
resources. Since resources are represented by tallies and 
time by trials, this measure was the number of trials 
needed to use a given number of tallies. Specifically, a 
record was made of the trials on which the 10th, 20th, 
and 30th tallies were used, (Several games ended with 
only 29 tallies used, Since the 30th tally would neces- 
sarily have been used on the next trial, this was assumed 
to have occurred for the purpose of data analysis.) The 
number of the trial on which the 10th tally was made 
represented the lifetime of the first 10 tallies, and the 
lifetimes of the next blocks of tallies were determined by 
subtraction, 


RESULTS 


Both dependent measures were subjected to 
four-way analyses of variance, in which the 
three manipulated variables were combined 
with a time variable representing either the 
six stages of the game or the three blocks of 
tallies, depending on the dependent measure 
being analyzed.t 

Positive incentives for conservation had no 
effect either on the frequency of conservation 
or on the lifetime of resources. Negative in- 
centives in the form of higher prices or smaller 
gasoline rations produced effects with the pas- 
sage of time. The time variable was significantly 
related to increased conservation, F(5, 10) 
= 5.02, p < .05, and possibly to a decreased 
rate of resource depletion, F(2, 4) = 6.12, 
p < Ì0. The comparison between rationing 
and pricing showed more frequent conserva- 
tion, F(1, 2) = 28.81, p < .05, and longer 
resource life, F(1, 2)= 7441, p< .05, to 
occur in pricing games, which is contrary to 
prediction. In addition, the interaction of 
rationing-pricing with time approached sta- 
tistical significance in its effect on conserva- 
tion, (5, 10) = 2.73, p < .10. Closer examina- 
tion of these data suggested that rationing and 
pricing did not produce differing amounts of 
conservation until relatively late in the games, 
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Ficurt 1. Cumulative resource consumption 
in rationing and pricing games. 


A similar result was found in the data on the 
lifetime of resources: rationing-pricing inter- 
acted with time at the’.10 level, F(2, 4) 
= 4.92, with rationing and pricing having 
significantly different effects only in the last 
block of tallies (by Newman-Keuls test, 
q: = 5.92, p < .05). Thus, pricing was more 
effective than rationing, but the effect did not 
appear until later stages of the game (see 
Figure 1). 

Edueation in the form of spot messages did 
not significantly influence either conservation 
or the lifetime of resources. However, when 
full information was given about the long-term 
consequences of action, resources lasted longer, 
E(1, 2) = 20.25, p< .05. The effect of in- 
formation on the frequency of conservation was 
less clear, amounting only to a weak trend, 
F(1, 2) = 2.97, p < .25. The effect of informa- 
tion was relatively stable throughout the game, 
and did not interact with time (see Figure 2), 


*Two games (both involving pricing and full in- 
formation) were halted due to a shortage of time before 
the rules called for them to end. These games lasted so 
long because a stable pattern of carpooling had evolved 
early in the game, and few tallies were used, The games 
were ended after 16 consecutive trials in which all 
players carpooled, when 22 and 23 tallies, respectively, 
had been used. For data analysis, it was assumed that 
the players in these games would have continued to 
carpool on every trial until the scheduled end of the 
game, 
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Ficure 2. Cumulative resource consumption 
in some and full information games. 


Discussion 


Great care must be taken in using these 
findings to draw broad conclusions about the 
effect of rationing, the effect of education, and 
so on concerning resource conservation. The 
present results probably depend on the par- 
ticular versions of education and rationing used 
in this study, the relative values of incentives, 
and other operational details. For example, the 
failure of a positive incentive to influence con- 
servation may have been due to the small size 
of the incentive used: Subjects were never 
observed to even mention the positive incen- 
tives while discussing their choices during the 
game. Other findings may also depend as much 
on the way variables were operationalized as 
on the variables themselves, in ways which can 
best be determined by further research. 

The negative incentives used in this study 
(rationing and pricing) effectively increased 
conservation and extended the life of resources, 
as predicted. Examination of the data suggests 
that this effect was due entirely to the pricing 
condition. The rate of resource consumption, 
represented by the slopes of the curves in 
Figures 1 and 2, decreased in pricing games as 
resources were depleted, but remained almost 

unchanged in rationing games (see Figure 1). 
A Newman-Keuls test showed that the last 10 
tallies lasted longer than the first in pricing 
games (qs = 6.55, 7 < .05) but not in rationing 
games (g: = -42) Thus, the rationing system 
used here appexrs to have been completely 
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ineffective, in stark contrast to the prediction 
that rationing, being a more direct constraint 
than pricing, would be more effective. 

One possible explanation for these results is 
that the rationing system used here was not 
perceived as a constraint. For one thing, only 
2 of the 24 players in rationing games ever ran 
out of tickets, and these only on the last trial 
of the game. If freedom is the ability to drive 
alone, it was nearly unrestricted by this ration- 
ing system. Postexperimental questionnaires 
showed no difference between subjects in 
pricing and rationing games in the degree to 
which they felt restricted by the rules of the 
game. Thus, this rationing system may not 
have imposed any appreciable cost on indi- 
vidually oriented behavior. A rationing system 
that restricted options early in the game may 
have effectively increased conservation. 

Another possible explanation of the ineffec- 
tiveness of rationing is that in rationing games 
it cost as much to ride in a carpool as it did 
to drive alone, Thus, it paid in the short term 
to drive alone as long as this option was open. 
(If a price structure like this seems unrealistic, 
it can be noted that in pilot work, when pay- 
offs favored carpooling any more than this, the 
frequency of carpooling increased to the point 
where strategies to increase it further were 
superfluous.) Thus, the choice to drive alone 
may have been dictated by short-term. self- 
interest, with carpooling appearing rational 
only to players who realized that it would pay 
off in the long run by prolonging the game. The 
present results suggest that for rationing to 
work (a) people must either perceive rationing 
as an immediate constraint on action or (b) 
people must learn to take a long-range view. 

If it can be assumed that the rationing sys- 
tem used in this study was not perceived as a 
constraint during most of the game, the ration- 
ing games can serve as an approximate baseline 
against which to compare the effects of incen- 
tives such as price increases on conservation. 
Compared to rationing games, pricing games 
do not produce a significant decrease in cumula- 
tive resource consumption until the end of the 
ninth trial. That is, after Trial 9 and later 
trials, cumulative tallies in pricing games are 
significantly lower than in rationing games 
(p< .05 by Mann-Whitney U test). Ap- 
parently, incentives such as price increases are 
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effective in proportion to their magnitude at 


W#a the moment (see Figure 1). At Trial 9, when 


the advantage of pricing games first becomes 
apparent, the average cost of driving alone in 
a pricing game was eight units. Interestingly, 
this is the earliest point in the game at which 
the average payoff for players in a four-person 
carpool is greater than that for a player driving 
alone. That is, price increases seem to become 
significantly effective at about the point when 
they first make group-oriented behavior con- 
sistent with the short-term welfare of indi- 
viduals. Players in the pricing games seem to 
have acted in their best individual interest 
“while maintaining a short-term perspective. 
The data on the educational strategy suggest 
that information about the probable long-range 
consequences of behavior may significantly in- 
crease conservation in a commons situation. 
While spot messages about long-range con- 
sequences were ineffective and were probably 
not even noticed, detailed information about 
the long-term effects of consumption ap- 
parently increased conservation and extended 
the lifetime of available resources. It was sug- 
gested earlier in this article that the strategy 
of giving information works by inducing people 
to base their actions on long-term considera- 
tions. If this is so, information should increase 
conservation even when incentives are too weak 
to induce group-oriented behavior. An exami- 
nation of the data indicates that full informa- 
tion significantly decreased consumption of re- 
sources as early as the third trial (Mann- 
Whitney U = 3, p < .02). The significant 
difference between some information and full 
information games continued through the 
eighth trial, after which the difference, though 
still apparent graphically (see Figure 2), be- 
came statistically nonsignificant. The loss of 
statistical significance may be due to the in- 
creasing disparity between rationing and pric- 
ing games which increased the variability 
+ within each information condition. 
~ These results suggest that both incentives 
and an educational strategy can be used to in- 
crease resource conservation. Furthermore, 
these strategies for change seem to operate as 
one would expect from theory. That is, in- 
centives seem to be effective in proportion to 
their magnitude and only when they are strong 
enough to make group-oriented behavior in- 
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dividually rational on a short-term basis. The 
informational strategy, on the other hand, can 
increase conservation even when such action 
is not individually rational in the short term, 
This strategy does seem to induce people to 
take a longer term point of view. The educa- 
tional strategy thus seems particularly promis- 
ing for increasing conservation before the costs 
of consumption become prohibitive for most 
individuals. 

Incentives can effectively restrict consump- 
tion while people maintain an essentially in- 
dividualistic, short-term perspective. Educa- 
tion seems capable of changing people in the 
direction of taking a long-term point of view. 
If these conclusions hold, the educational 
Strategy has greater promise for inducing 
people to conserve resources when incentives 
are not used or cannot be used in practice. It 
would be heartening to conclude from these 
findings that a program of education emphasiz- 
ing the long-term consequences of resource con- 
sumption could have a significant influence on 
conservation on a world or national scale, There 
are, however, some crucial differences between 
the model and game which generated these 
results and the real world. First, education 
about the structure of a game with known rules 
is much more credible than education about the 
real world system, whose rules are unknown. 
This would be true even if all expert predictions 
about the future of our resources were in 
agreement. Second, there is an inescapable 
difference between game time and real time. 
The fact that players can be induced to look 
ahead to the end of a 1-hr. laboratory game 
gives little assurance that they can be induced 
to act on the basis of considerations that are 
delayed by years or decades. Third, while a 
small group can influence the world system 
simulated in this game, it can have no such 
effect on the real life analog. No amount of 
carpooling by four people can change the price 
of gasoline. Action by large groups, or large 
numbers of small groups, is necessary to change 
the real world system, but the potential for 
collective action in a large group is much less 
than in a small one (Olson, 1965). Thus, while 
the present results support the analysis of the 
commons dilemma presented here, and while 
they suggest strategies to avoid the dilemma, 
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escape from the tragedy of the real life com- 
mons is much more problematical. 


REFERENCE NOTE 


1, Stern, P. C. Psychological research and the steady state 
society. Unpublished manuscript, 1975. (Available 
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This study investigated the relationship between the structure of moral reason- 
ing and specific moral choice. Subjects whose structural stage was determined 
by Kohlberg’s Moral Judgment Interview were asked to resolve dilemmas in- 
volving the cases of Watergate and Lt. Calley. The consistent finding was that 
persons at each higher stage of moral structure more often made choices that 
were consistent with human rights and less often chose alternatives that were 
designed to maintain conventions or institutions. The gradual and consistent 
(monotonic) nature of this pattern suggests a model of cognitive development 
which emphasizes the role of individual concepts within the general stage 


structures. 


Throughout the history of structuralism 
the issue of structure and choice has almost 
always been discussed in the context of dis- 
tinguishing the two concepts. Piaget’s original 
work began when he noticed that there was a 
consistency in children’s reasoning about 
Binet IQ problems, though such consistency 
appeared to be absent from their solutions or 
choices. The methodological revolution occa- 
sioned by Piaget was to orient psychologists 
to the underlying form or structure of reason- 
ing rather than final behavior or choice. 

But why should we be interested in study- 
ing the structure of reasoning? It is because, 
ultimately, structure is related to choice. 
While any set of cognitive developmental 
stages is valid merely because it follows an 
invariant sequence, such stages would be of 
little interest if they did not lead to specific 
types of decisions. Piaget’s discovery of formal 
operations as a set of structures would be of 
less interest were it not for the fact that hav- 
ing achieved formal operations an individual 
is able to solve multivariate problems that 
were unsolvable earlier. Likewise, in the social 
realm, we would be less interested in the 
structure of moral reasoning were it not for 
the prospect that the development of such 
reasoning leads one to make decisions that are 
more just. 

The person who has most clearly articu- 
lated the structural nature of moral reasoning 
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is Kohlberg (1969, 1973). He has traced the 
development of moral structures through a 
series of six stages which, due to their logical 
interrelationships, form an invariant and cul- 
turally universal sequence, Moral structures 
are measured by having individuals reason 
about several moral dilemmas (cases in which 
two or more conventional values conflict). A 
typical dilemma involves a husband, Heinz, 
who must decide whether to steal a drug to 
save his dying wife after all reasonable at- 
tempts to obtain the drug have failed. 

At each stage the dilemma is resolved using 
the principle which defines that stage. For ex- 
ample, a Stage 3 individual may respond, 
“Heinz should steal the drug because that is 
what any good, loving husband would do.” 
Such reasoning (i.e., the subject’s verbatim 
response) is considered surface reasoning or 
content. By analyzing a series of surface re- 
sponses, it is possible to determine the basis 
on which an individual generally considers ac- 
tions to be morally right or wrong, This basis 
is the underlying principle or structure, In 
the case of Stage 3 persons, the moral right- 
ness of an act is determined by the degree to 
which it can commonly be expected to main- 
tain harmony between people, a principle 
known as interpersonal concordance. Notice, 
however, that the structure of moral reason- 
ing is defined independently of the resultant 
choice, Another individual, also at Stage 3, 
could use the same underlying structure to 
argue that Heinz should not steal the drug 
because he would cause disharmony between 
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himself and the moral standards of his com- 
munity. 

Since structure and choice are conceptually 
independent, it had been thought that all 
arguments could be defended at any stage. 
However, the logic of Kohlberg’s theory sug- 
gests that as the structure of moral reasoning 
develops, it leads to a single, most just con- 
clusion. We have seen, for example, that 
Stage 3 reasoning could be used to defend op- 
posing choices in the Heinz dilemma, depend- 
ing on the reference group with which one 
wished to maintain harmony. As reasoning 
develops, its scope becomes more universal. 
By Stage 5 moral dilemmas are resolved with 
a consideration for human rights, rights which 
are seen as existing prior to and forming the 
basis of any human society. Thus, in the 
Heinz situation the structure of Stage 5 rea- 
soning can lead to only one choice. Logically, 
it must advocate stealing the drug since the 
wife’s right to life takes precedence over con- 
ventions protecting property. In this case the 
logic of the moral structure actually deter- 
mines the nature of the moral choice. 

The notion that form can determine choice 
also receives support from moral philosophy. 
Kohlberg (1973) takes the view that at the 
highest stage of development, all rational per- 
sons would, given the same information, come 
to the same conclusion in any moral dilemma, 
If morality is the just resolution of everyone’s 
claim, there should be one solution to each 
situation that most equitably achieves this. 
A procedure suggested by Kohlberg (bor- 
rowed in part from Rawls, 1971) is to sup- 
port only those claims that could be accepted 
were the actor given an equal chance of be- 
coming any of the participants in the situa- 
tion. Decisions made in this way are maxi- 
mally protective of individual rights, since all 
rational persons would want to secure their 
own basic rights (life, liberty, etc.) before 
securing luxuries or conventions. 

While a relationship between structure and 
choice seems likely, there has been no re- 
search designed specifically to test it. How- 
ever, evidence collected in other studies in- 
volving the stages of moral reasoning is rele- 

vant here. These studies are of two types. 
Those that have tr: ıted moral judgment as a 
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continuous variable have found a linear cor- 
relation between higher stage reasoning and a 
cluster of attitudes that have been called lib- 
eral (Fishkin, Keniston, 
1973), libertarian (Rest, 1976), and nonex- 
ploitively permissive (Jurich & Jurich, 1974), 
The feature common to these clusters is that 
individual rights are preferred over the main- 
tenance of institutions, except where institu- 
tions are themselves furthering rights. 

Another type of study has measured moral 
judgment as a discrete variable. Most of 
these have been experiments in which persons 
at different stages responded to the same situ- 
ational conditions with different overt be- 
haviors. The consistent results of these studies 
have been that subjects at the highest stage 
included in the sample (usually Stage 5) per- 
formed actions which were more consistent 
with the rights and welfare of other indi- 
viduals, especially in situations where the en- 
vironment provided either no incentive or 
actually punished one for doing so, Haan, 
Smith, and Block (1968) showed that persons 
who defended the principle of free speech 
despite imminent arrest were found largely 
among Stage 5 and 6 subjects, though to a 
lesser extent among Stage 2 subjects as well. 
Jaquette (Note 1) found that Stage 5 sub- 
jects were more likely than lower stage sub- 
jects to aid a distress victim in the context of 
ambiguous cues. Krebs (1967) found that 
Stage 4 subjects more often refrained from 
cheating even in the absence of an authority 
than did subjects at Stage 2. F. inally, conven- 
tional persons (Stages 3 and 4) are less likely 
to engage in delinquent behavior than are 
those at the preconventional level (Stages 1 
and 2; Kohlberg & Freundlich, Note 2). 

The major conclusion that can be drawn 
from these studies is that an empirical rela- 
tionship between structure and both atti- 


tudinal and behavioral choice exists, How- 


ever, it is not clear whether the effect is 
abrupt or continuous. Do Stage 5, and in 
some cases Stage 4, persons support choices 
based on.rights, while lower stage persons do 


not, or do individuals at each higher stage 
endorse the choice consistent with rights in- 
creasingly more often? Findings in line with 
the abrupt position would support the view 
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of development in which right answers are 
known only by those who have apprenticed 
through the lower stages. On the other hand, 
findings that support the continuous model 
would suggest that the conclusions derived 
from highest stage reasoning are increasingly 
perceived as more adequate as one develops. 

In order to validate the existence of a rela- 
tionship between structure and choice in 
moral reasoning and to investigate its abrupt 
or continuous nature, subjects of known 
moral stage were asked to make choices in a 
number of structured dilemmas. The dilemmas 
were designed to pit a substantive or pro- 
cedural right against some other concern. 
They involved the issues of Watergate * and 
Lt. Calley, two well known instances of con- 
flicting moral concerns. It was hypothesized 
that alternatives consistent with rights are 
endorsed more often at Stage 5 than at lower 
stages. No prediction was made concerning 
the abrupt or continuous nature of the pat- 
tern of response across the entire set of stages. 
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, Subjects 


The sample consisted of 372 persons, predomi- 
nantly college students in a variety of locations. Al- 
though it was not possible to take either a truly 
random or stratified sample, an effort was made to 
cover the entire spectrum of institutions. Included 
were 40 freshmen, 50 seniors, and 40 education grad- 
uate students at Harvard University; 20 freshmen 
and 20 seniors at Boston State College; 25 freshmen 
and 30 sophomores at Newton Junior College; and 
52 juniors and seniors at St. Anselm’s College (Man- 
chester, New Hampshire). To add greater geographi- 
cal distribution, 75 seniors at two Catholic prep 
schools in rural areas of Illinois and Michigan and 20 
school supervisors from the rural environs of At- 
lanta, Georgia, and Richmond, Virginia, were also 
included. 

As a whole, the sample was distinguished by its 
age (most subjects were 17 to 25 years old), region 
(predominantly New England), and 1972 voting rec- 
ord (237 favored McGovern, 161 favored Nixon). 
Sex was almost evenly distributed. To control for 
the possible effect of different subcultural milieus, the 
data were analyzed separately by institutions as well 
as together, 


Measures 


Each subject completed a written moral judgment 
interview and a week later completed the Watergate- 
Calley questionnaire. The moral judgment interview 
is a semistructured measure which presents the sub- 
ject with three dilemmas in story form (Should 
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Heinz steal the drug? Should Heinz be punished? 
Should a boy disobey his father who has broken a 
promise to him?). Each story is followed by a num- 
ber of open-ended probe questions. Responses are 
classified according to one of five stages and an over- 
all stage score derived. (Since Stage 6 is so rare it is 
not included in standard scoring.) Protocols were 
scored by the author using Kohlberg’s standard 
manual (Kohlberg, Colby, Speicher-Dubin, & Lieber- 
man, Note 3). A sample of 40 protocols was also 
scored by a second experienced scorer. Reliability to 
within half a stage was 95%. The convergent validity 
of the interview can be found in Kohlberg (1969). 

The Watergate-Calley questionnaire consisted of 20 
forced-choice questions which provided two and 
sometimes three response categories.2 The questions 
presented situations in which a substantive or pro- 
cedural right was in conflict with another concern 
(such as loyalty, sincerity, authority), The situa- 
tions were chosen both for their public saliency and 
for the clearness with which they presented conflict- 
ing issues.* The questions relating to Lt. Calley were 
selected from items used by Kelman and Lawrence 
(1972). They were prefaced with an objective sum- 
mary of the My Lai events, 

To substantiate the claim that particular choices 
were most consistent with the concept of rights, six 
judges, none of whom were familiar with Kohlberg’s 
theory, were asked to complete the questionnaire 
using the instructions in the next section, Three of 
the judges described themselves as conservatives on a 
scale of political ideology and had voted for Nixon, 
while the other three placed themselves in liberal 
categories and had favored McGovern, Both the 
ideology scale and the measure of candidate prefer- 
ence are described below (see Other Measures sec- 
tion). All judges showed at least 50% Stage 5 rea- 
soning. The definition of rights embodied in the 
following instructions is based on philosophic discus- 
sions by Ross (1930), Frankena (1955), Hart (1955), 
Rawls (1971), and Nozick (1974). 


Instructions for Resolving Moral Situations 


Following are the instructions given to the judges: 
“Below are a number of situations with alternative 


1 For a discussion of this study as it explains the 
Watergate events, see Candee (1975). 

2 Copies of the questionnaire are available from the 
author. 

8The Watergate affair (1972-1974) involved a 
series of events stemming from the efforts of Presi- 
dent Nixon and members of his reelection committee 
to cover up the committee’s involvement in a number 
of illegal campaign practices including a break-in at 
the national headquarters of the Democratic Party, 
The Lt. Calley case occurred during the Vietnam 
War. It involved his being tried and convicted, in 
1971, for the murder of 22 civilians in a village which 
had been suspected of Vietcong activity, Calley’'s 
main defense was that he was acting on orders previ- 
ously given by his superiors, 


one 
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answers. Choose your answer in accordance with the 
instructions. This is not necessarily a measure of your 
personal opinion. It is a measure of the implications 
of the instructions. The situations are taken from 
actual aspects of the Watergate and Lt. Calley af- 
fairs. Use your knowledge of these events to resolve 
ambiguities you may find in the questions.” 

Judgment of right and wrong. In deciding whether 
an actor is right or wrong, consider the human 
rights that are explicitly involved or that may be 
reasonably implied in the situation. The basic human 
rights include (a) life (protection from unwanted 
deprivation of life), (b) liberty (freedom of action, 
speech, association, belief, and freedom from un- 
provoked harm or intrusion), (c) procedural rights 
(fair trial, habeas corpus, equal consideration for a 
job, etc.), (d) political rights (vote, hold office), and 
(e) property. Lesser rights (including those some- 
times known as special rights) involve rights derived 
from contracts, promises, family, and job relations. 

1. If a claim to rights exists on one side of an 
issue and only the upholding of a convention or self- 
interest exists on the other, decide in favor of the 
side which protects rights. 

2. If a claim to rights exists on both sides of an 
issue, decide in accordance with the order listed 
above. For example, life takes precedence over all 
other rights, while liberty prevails over all but life. 

3. Consider the motives of the actor. Try to assess 
whether the actor’s intent was to (a) protect basic 
rights or (b) protect an institution, convention, or 
self-interest. Use objective data, primarily the actor’s 
publicly stated intention, 

4. An action which violates lesser rights (see hier- 
archy above) is justified only if (a) it is performed 
for the publicly stated purpose of protecting more 
basic rights and (b) the action itself is reasonably 
related to those goals. To be reasonable an action 
must either (a) have a fair probability of being suc- 
cessful or (b) be virtually the only recourse available 
because either the more usual channels have failed 
or because basic rights would be jeopardized by the 
time such channels could be attempted, 

5. Summary: If, on balance, an action was de- 
signed to protect basic rights (pursuant to the above 
qualifications), decide that the actor was right, not 
guilty, and so on. If, however, an action involved 
the violation of any rights without the justified at- 
tempt to protect more basic rights, consider the actor 
wrong, guilty, and so on, 

Duty. Duties are correlative with rights. Therefore 
one’s first duty is to insure the protection of basic 
rights, when they are immediately threatened, The 
order of duties follows the order of rights (see Judg- 
ment of right and wrong section above). In addi- 
tion, there are certain other prima facie duties in- 
cluding the duty to treat all persons with human dig- 
nity, to be open and honest, to obey just institutions, 

and to act so as to increase the general welfare. 
These duties should honored except in cases where 
they would actually promise human rights. 

Principles of equality md universality. Rights that 
are basic to human bei: gs are to be applied univer- 
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sally (to everyone) and equally, This is particularly 
pertinent in the case of criminal and procedural 
rights and duties. All persons must be prosecuted or 


defended in a manner consistent with their alleged ,' 


ant 


crime without respect to such morally irre) 
criteria as power, status, race, sex, or creed. 

Responsibility. An individual is considered 
sponsible for his/her actions except in circum 
where he/she could not reasonably have ant 
the actual outcome, This requires that the in 
make a considerable and pointed effort to determine 
the nature and probable consequences of his/her act, 
It is the actor himself, armed with reasonably 
definite knowledge, who must ultimately decide the 
propriety of his own action, 

The choices I expected on the basis of these in- 
structions appear in Table 1, Reliability of the judges 
with this criterion was as follows: for 11 of the 20 
items (Questions 1, 2, 4, 6, 7, 8, 9, 15, 16, 17, and 
19), agreement was 100%. For 6 other items (Ques- 
tions 3, 5, 12, 13, 14, and 20), there was agreement 
by five of the six judges. Of the remaining 3 items, 
Question 10 showed equal 67% agreement among 
both liberal and conservative judges, Question 18 re- 
ceived 100% agreement among conservatives but only 
33% among liberals, and Question 11 (dealing with 
Daniel Ellsberg) received 100% agreement from lib- 
erals but 0% from conservatives. Thus, with only 
three exceptions, the foregoing instructions were suc- 
cessful in producing a reasonably nonpartisan con- 
sensus on the implications of the concept of rights. 


Other Measures 


In order to obtain a composite measure of each 
subject’s use of the rights alternatives, I constructed 
a Rights Index. This was the percentage of choices 
consistent with rights over the entire 20 questions. 
A general measure of political ideology was obtained 
by having subjects rate themselves, relative to others 
their own age, on a 9-point scale from highly con- 
servative to radical. Candidate preference was simply 
a self-report of the subject having favored Richard 
Nixon or George McGovern in the 1972 election. Sub- 
jects who gave no choice were eliminated from tallies 
using that variable. An approximation of the effect of 
different school milieus was obtained by ranking the 
schools according to academic entrance standards, 


RESULTS 


The hypothesis of this study was strongly 
confirmed. There was a highly significant 
linear main effect of moral stage on the 20- 
question Rights Index. Means for Stages 2—5 
were 48, .57, .70, and .86, respectively, 
F (3,356) = 59.37, p < .001, linear trend p 
< .001. 

We can now interpret the data for the in- 
dividual Watergate-Calley questions as pre- 
sented in Table 1. A significant relationship 
between moral structure and choice, as mea- 


A 
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sured by chi-square, was found for 17 of the 
20 questions. What is most important is that 
if each case the predominant pattern was 
monotonic. That is, with only several excep- 
tions found among the relatively few Stage 2 
subjects, the percentage of persons endorsing 
the alternative consistent with rights was 
greater at each higher stage. A conservative 
measure of this association is Kendall’s tau 
(rank correlation). In all cases for which chi- 
square was significant, tau was also. Of pri- 
mary importance is the fact that in no case 
was the correlation negative. Among the 17 
questions with significant chi-squares, there 
were no cases in which higher stage subjects 
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preferred the alternative less consistent with 
rights. 

To help establish the validity of the re- 
sults, the effect of moral structure on choice 
was compared with other variables which 
might also be expected to account for these 
findings. Since Watergate was a partisan po- 
litical issue, the most likely competing vari- 
able would be one’s candidate preference in 
the Watergate-tainted election of 1972. Ac- 
cordingly, in Table 2, the results are pre- 
sented separately for persons supporting 
Richard Nixon and those supporting his op- 
ponent, George McGovern. Although Nixon 
supporters tended to excuse both the Water- 


TABLE 1 


PERCENTAGE OF Subjects CHOOSING RIGHTS ALTERNATIVES ON WATERGATE-CALLEY QUESTIONNAIRE 


Question 


Moral 


1. Do you approve or disapprove of Lt. Calley 
having been brought to trial? (Approve) 

. Should American officers be convicted for war 
crimes ordered by their superiors? (Should) 
3. Do you consider Calley’s actions: Right—what 

any good soldier would do under the cir- 
cumstances; Wrong—but hard for him to know 
right from wrong; Wrong—clear violation of 
military code; Wrong—violation of morality 
regardless of military code? (Wrong—violation 
of morality) 

4. Do you consider Calley guilty or innocent of 
murder? (Guilty) 

. E. Howard Hunt sincerely believed he was 
helping the country. Was he right to participate 
in the Watergate break-in? (No) 

6. Should money contributed for general use in 
the Nixon campaign have been given to the 
Watergate defendants to pay for their law: 
and feed their families? (Should not) 

7. Should the Watergate defendants have been 
allowed to conduct a public campaign to raise 
money for their defense? (Yes) _ 

8. Considering his duty as an officer in CRP 
(Committee to Reelect the President) should 
Magruder have admitted the burglars were 
hired by CRP or tried to cover up? 
(Admitted) $ , 

9. Herbert Kalmbach asked his superior, “Is this 
a proper assignment and am I to go through 
with it?” He was told it was. If it later turns 
out that Kalmbach was collecting hush money, 
would you consider him responsible? (Yes) 

10. If the money was used as hush money, would 

you consider Kalmbach guilty of a crime? (Yes) 

11. Daniel Ellsberg stole top secret papers that 

belonged to the Pentagon because they ‘ 
contained information about the Vietnam War 
that he felt the public should know. Was _ 
Ellsberg right to steal the Pentagon Papers? 
(Yes) 


=) 


in 


2 SESAN tau 

0 45 78 89 36.29** 3" 
20 22 53 84 38.72** 46** 
20 39 54 82 36.00** Adee 
60 40 66 97 27.86"" 38" 
64 64 75 89 13.92** D M hans 
66 65 82 87 18.22** 20" 
46 33 53 88 53.97" Jon 
40 53 17 94 46.43** Sor 
19 26 34 62 27.27" 25% 
19 33 40 54 12.08* Akg 
16 21 43 MM 30.10** -294+ 


(lable continued) 
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Question 


. Ellsberg stole the Pentagon Papers; Hunt and 
Liddy stole from Ellsberg’s psychiatrist. Were 
the two crimes basically the same or different? 
(Different) 

. If impeachment alone were held today, based 
on what you know at this time, would you be 
for or against impeachment? (Hor) 

. If the President were already impeached and a 
vote were held today on removing him from 
Hee would you be for or against removal? 

For) 

15. John Caufield was the man who delivered the 

hush money to the Watergate burglars. 

Caufield knew the purpose of the money but 

had been told by John Dean that instructions 

to deliver the funds had come “from the Oval 

Office.” Was Caufield right to have delivered 

the money? (No) 

Was it permissible to have a nurse take files 

from Elisberg’s psychiatrist to obtain more in- 

eee on his theft of the Pentagon Papers? 
No) 

. John Ehrlichman stated that taking such files 
is common when investigating a political 
suspect. Was it permissible? (No) 

. Jobn Ebrlichman was in part responsible for 
preventing leaks of military secrets. Was he 
right to investigate Ellsberg concerning the 
Pentagon Papers? (Yes) 

Hunt and Liddy’s break-in at Ellsberg’s 

psychiatrist justified? (No) 

20. Was Ehrlichman, who had hired Hunt and 

Liddy, responsible for their exploits? (Yes) 


10. 


N» Questions 1-4, 15 
N Questions 5-14, 16-20 


, Percentages are for choice consistent with rights, 

Jes on moral judgment interview, The wording of 
"The df = 3. For questions with more than two choices, 
© The number of respondents varies slightly from question 
ap <05. 

"tp < 001. 
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gate participants and Lt. Calley while Mc- 
Governites tended to condemn them, the 
pattern of choice by moral stage was the same 
within both camps. In the vast majority of 
cases there was greater support for the rights 
alternative among subjects at each higher 
stage within the same political group. 

A similar analysis controlling for school 
was done for two of the larger subsamples. 
Within both the Harvard and Boston State 
groups, all questions followed the predicted 
pattern, No individual analysis was done for 
the other institutions. 

An overall test of the effect of these com- 
peting variables on the pattern of content 
ied by using the 20-question 
ye correlations of this index 


choices was ob 
Rights Index, I 
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41 43.93** 


30 60 96 43.18** Ja 


60 83 100 534e 


69 20.35** 


26.25" 


56 


3.17 
4.58 
76 


(74) 
(93) 


86 


(27) 
(61) 


2.87 


ut which subject gave 25% of his/her 
shortened for presentation here. 

tE nt with rights were combined during analysis, 
question due to subject omissions, do not knows, and so on. 


with moral stage, political ideology, 1972 can- 
didate preference, and a dimension of school 
prestige is presented in Table 3. As can be 
seen, the relation of moral stage to the Rights 
Index is greater than with any of the other 
variables. When candidate preference, politi- 
cal ideology, and school are entered first into 
a regression, the multiple correlation with th 
Rights Index is 57 (R* = .32), F(3,306) = 
50.09, p < .01. However, the inclusion of 
moral stage increases the multiple correlation 
to .70 (R?= 49), which is a highly signifi- 
cant addition, F(4, 305) = 76.62, p < .01. 
Of course, when moral stage is entered first 
into the regression, it absorbs nearly all the 
variance itself. The conclusion to be drawn 
from these results is that the relationship be- 
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TABLE 
PERCENTAGE OF Subjects CHOOSING RIG 
[ORAL STAGE AND CAN 
72 candidate preference 
McGovern ‘Nixon 
Moral stage Moral stage 

Question 3 + Ts 3 + 5 

L. “Calley trial Apiro 51 95 26 59 it 
2. Convict officers (Should) 27 94 5 33 50 
3. Calley’s actions (Wrong—morality) 44 85 15 33 71 
4. Calley verdict (Guilty) 44 100 28 42 86 
5. Hunt (Wrong) 66 88 63 78 91 
6. General funds (Should not give) 68 90 67 81 89 
7. Public defense fund (Yes) 34 89 36 48 80 
8. Magruder (Admit hired) 54 96 52 56 81 
9, Kalmbach (Responsible) 31 66 20 12 45 
10. Kalmbach (Guilty) 43 55 20 14 50 
11. Ellsberg (Right to steal) 37 62 5 22 10 
12. Hunt, Ellsberg crimes (Different) 58 88 33 80 67 
13. Impeachment (For) 86 97 40 64 100 
14. Removal (For) 91 95 24 46 83 
15. Caufield (Wrong) 85 100 73 76 100 
16. Take Ellsberg’s file (Wrong) 97 100 80 93 91 
17. Common excuse (Wrong) 95 100 64 77 91 
N^ anonsi 1-4, 15 (27) (7) 
N Questions 5 -14 1617, AGI a 


Note, EET are for choice co 
responses on moral judgment interview, The ears 
which were significant in Table 1 are presented here. S 


subject gave 25% of his/her 
e. Only those questions 


age 2 aie were eliminated from this table due ‘to their infrequency. 


a The number of respondents varies slightly from question to question due to subject omissions, do not knows, and so on, 


tween moral stage and Watergate-Calley 
choices is real and cannot be attributed to 
other factors. 

The tendency of higher stage persons to 
choose the alternative consistent with rights 
does not mean that, in fact, they share greater 
consensus. Often, as in Questions 2, 8, 9, and 
11, there was more actual agreement among 
Stage 3 persons of both political camps than 
among those at Stage 5, At lower stages, per- 
sons are unaware of situational subtleties that 
higher stage subjects find meaningful. For ex- 
ample, in the Kalmbach situation (Question 


TABLE 3 


INTERCORRELATIONS OF MORAL, POLITICAL, 
AND EDUCATIONAL VARIABLE: 


n 
> 


1. Rights Index == 
2. Moral development 

3. Candidate preference 

4 

5 


. Political ideology 
. School prestige 


Note. The N = 372. 


| bee 
Ooo AN 


Numbers are product-moment 


_ correlations. 


9), supplementary open-ended responses indi- 
cate that while Stage 3 subjects simply ex- 
cused Kalmbach because he acted as a good 
subordinate, Stage 5 subjects saw a complex 
mix of dutiful behavior and one’s ultimate re- 
sponsibility to decide personally the propriety 
of one’s acts. Differences in the interpretation 
of facts that were irrelevant at Stage 3 pro- 
voked considerable disagreement at Stage 5. 
However, the tendency of higher stage sub- 
jects from both political camps to see Kalm- 
bach as increasingly more responsible sug- 
gests that the consensus at Stage 3 is unstable 
and dissolves as individuals develop, 


Discussion 


The major result of this study was that per- 
sons at each higher stage of moral structure 
more often made decisions in moral dilemmas 
that were consistent with human rights and 
less often chose alternatives which were de- 
signed to maintain conventions or institu- 
tions. These findings support the view that 
moral structures are, at least partially, de- 
terminative of moral choice. Furthermore, 
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since the observed pattern was in all cases 
monotonic, there is empirical support for 
Kohlberg’s claim that at the highest stage of 
reasoning all persons, given the same informa- 
tion, should reach the same answer. This is 
not to say that there was perfect agreement 
among Stage 5 individuals, Since disputes 
over information exist, there will never be 
consensus in the real world. Furthermore, 
even Stage 5 persons have not reached full 
development. However, the tendency at each 
higher stage to choose the answer consistent 
with rights more often, suggests the psycho- 
logical validity of the philosophic claim that 
such answers are in a logical or objective 
sense right. 

A process of decision making that accounts 
for these results rests on the assumption that 
nearly everyone sees the core of highest stage 
reasoning. For example, people at all stages 
recognize that life is more basic than either 
law or property as evidenced by the common 
response to Kohlberg’s standard Heinz di- 
lemma, “Property can be replaced but life 
cannot.” In that story nearly everyone would 
like to save the woman’s life. However, at 
lower stages factors which are irrelevant in 
terms of a philosophically valid moral de- 
cision may be given greater weight. A Stage 2 
individual may want to justify helping the 
wife, but weigh the inconvenience of a 
lengthy trial, or even a jail term, equally if 
not greater. A Stage 3 person’s altruism may 
be restrained because stealing is not con- 
sistent with being a good citizen, One aspect 
of development is that at each higher stage, 
an individual adopts, eliminates, and reor- 
ganizes concepts to increase the likelihood of 
coming to decisions which in some sense he or 
she knows to be right. In doing so the indi- 
vidual achieves structures that are more equil- 
ibrated both within themselves and with the 
outside world. 


` Relation of Structure to Concepts 


While the foregoing model accounts for the 
general relationship between structure and 
choice, it does not explain the mechanism by 
which structure and choice are related within 
each of the stages. That mechanism involves 
Piaget’s (1970) definition of a structure as a 
whole or integrated system of elements which 
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is self-regulating in the sense that it provides 
the transformational rules by which any ele- 
ment in the structure can be generated from 
any other without appealing to extrastruc- 
tural elements, As a transformational rule 
the principle of each moral stage defines a 
set of equivalent concepts and imposes some 
limits on their substitutability. For example, 
the principle of Stage 3, interpersonal con- 
cordance, dictates that one can substitute any 
action which “a good citizen should do” as 
long as it is not based on conditions that were 
rejected by development through Stage 1 
(physical power) or Stage 2 (blatant self-in- 
terest). The monotonic nature of our results 
suggests that the logical relation of rights to 
conventions imposes more stringent conditions 
on concept substitutability at each higher 
stage. While “protecting rights” is but one 
of many “things a good citizen should do” at 
Stage 3 (others include being sincere, loyal, 
and dutiful), it is logically paramount at 
Stage 5. While many Stage 2 and 3 subjects 
could defend a choice not consistent with 
rights, few could at Stage 5. 


Moral Stages and Political Reasoning 


We turn now to the interaction effect of 
moral reasoning and candidate preference on 
political choice. In comparing political camps 
we find that the greatest difference involved 
the most explicitly partisan issue, impeach- 
ment (Questions 13 and 14, Table 2), fol- 
lowed by the issues of Kalmbach (Questions 
9 and 10) and Ellsberg (Questions 11 and 
12). These situations seem to be distinguished 
by their complexity, relative obscurity, and 
institutional nature. Few people know the 
procedures involved in either campaign fund- 
raising or in national security research. Where 
the facts are most ambiguous, individuals are 
most influenced by their general political ori- 
entations in interaction with their moral rea- 
soning. By comparison, the issue in Question 
3, the right to a public defense fund, is com- 
mon and independent of any particular con- 
text. In this case the effect of political prefer- 
ence was nil. 
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To test the attribution hypothesis of self-perception theory directly, students 
were induced to attribute their behavior in a university course to extrinsic 
outcomes (ie. a high grade). Attitudes were correlated with noninstrumental 
behaviors when attributions were not induced and were correlated with grades 


when attributions were induced. Evidence suggests students used their behavior 
to infer intrinsic features of their courses when those features were not made 


salient by direct questioning and when attributions were not induced. More- = 


over, subjects reported less favorable attitudes when induced to make extrinsic 
attributions and when extrinsic information about the course was presented 


prior to intrinsic information, The results are discussed in terms of the effect 
of extrinsic attribution on inhibiting the generation and recall of intrinsic 


information when formulating an attitude. 


Despite support for a general attribution ap- 
proach to attitudes (Bem, 1972; Nisbett & 
Valins, 1971), variables believed critical to the 
process have been rarely manipulated. One fre- 
quently cited but untested assumption is that 
people use their behavior to infer their attitude 
when “they have good reason to believe their 
behavior toward a stimulus is produced pri- 
marily by their feelings about the stimulus” 
(Nisbett & Valins, 1971, p. 66). People are 
said to view their behavior as least intrinsically 
determined when salient and plausible external 
“causes” are present. The extrinsic features of a 
situation (reward, social demands) are assumed 
to lead to extrinsic attributions, diminishing an 
attribution of the behavior to internal/intrinsic 
features of the situation (liking, enjoyment). 
The typical experiment supporting this as- 
f attribution does not manipulate 


sumption í 
the at ion of behavior to external causes 
directly stead allows the behavior to be 
made nditions which are more or less 
extern ‘rolled. In executing such experi- 
ment ution is presumed and also 
believe ‘o the attitudes inferred. 

Tt is q that an attribution is not 
necessary t the knowledge of one’s 
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are sufficient to derive an attitude, Consider, 
for instance, a situation in which a person is 
given a salient reward for doing an interesting 
task (Calder & Staw, 1975; Deci, 1971; Lepper, 
Greene, & Nisbett, 1973). The attribution hy- 
pothesis of self-perception theory predicts that 
a person given payment finds the task less in- 
teresting than one not given payment. Presum- 
ably the rewarded person attributes his activity 
to the externally mediated payment, obscuring 
intrinsic reasons for it. A person not given pay- 
ment has no convenient explanation for his be- 
havior and seeks reasons consistent with per- / 
forming it. The search takes him to personal or 
task characteristics and he concludes, “I did it. 
because it is an interesting task, and I enjoy. 
tasks like this one.” 

An alternative interpretation of the above 
situation need not assume the person attrib- 
utes his behavior to extrinsic outcomes. The 
external cues themselves provide information 
about the value of the task. A person curious 
about the experimenter’s offer of money might 
conclude that the experimenter thinks the tas 
is not enjoyable and requires an inducement. 
In line with such reasoning, Staw, Calder, and 
Hess (Note 1) found that when payment 
given by an experimenter was not normally as- 
sociated with doing a particular task, the self- 
perception results obtained. When, however, 
the given payment was normally associated 
with a task, the self-perception results did not 
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obtain. It is quite plausible that payment was 
a -being used as a cue about the inherent value of 
oe % the task; unusual payment may suggest a less 
: than desirable task. 

The ambiguity of an attribution interpreta- 
tion of attitude inference derives primarily 
from the historical practice of not manipulating 

» the attribution process directly and also from a 
tradition of rarely Measuring attributions, as 
Calvert-Boyanowski and Leventhal (1975) 
pointed out. The effect of attribution on atti- 
tude inference can thus neither be demon- 
strated nor verified. 

In addition to being ambiguous, attribution 
studies have made no attempts to examine the 
process by which either attribut ions or behavior 
affect attitudes. Most Writings about attribu- 
tion seem to imply some kind of direct infer- 
ence process, in which behavior is something 
which needs to be explained. Behavior attrib- 
a, -uted to extrinsic causes, of course, needs no 

< explanation. In the absence of an extrinsic at- 
ee! tribution, however, the person might impute 
F. some characteristic to himself (an intrinsic 
motivation, a liking) or to the situation (an 
interesting, pl sasurable, or challenging fea- 
ture) to explain his behavior. Neither of these 
Processes has been directly examined in at- 
tribution studies, however, 

The present experiment was designed to ex- 
amine whether attribution is necessary to atti- 
tude inference from behavior and to explore 
Some processes by which behavior affects atti- 
a tudes. In so doing, this‘experiment expands on 
a methodology introduced by Salancik and 
Conway (1975) for directly examining theoret- 
ical questions about information-processing 
approaches to attitude (including attribution 
theory). The methodology involves manipulat- 
ing a person’s cognitions during an inquiry into 
his attitudes. The general idea is to structure 
questions so that in answering these questions 
4. a person brings into his own awareness the ex- 
WY perimentally relevant information. Questions 
framed in such ways that respondents answer 
them as experimentally intended are used to 
shape a person’s attitudes and attributions di 
rectly. Consider asking a person, “Did you 
actively campaign for candidate A during the 
last election?” ssuming a no response, the 
person’s recollection of his behavior may lead 
him to think he is not very favorable to the 
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candidate. To vary the evaluation, the next 
question might be, “Many people who actively 
campaigned for candidate A were contacted by 
his campaign committee, Were you con- 
tacted?” A no answer provides an externally 
generated explanation for not campaigning 
and thus the person should be less likely to 
infer an attitude from the behavior. 


Study Design and Theoretical Predictions 


The present st udy concerns students’ atti- 
tudes toward a course taken at their university 
in a prior semester, Questionnaires were de- 
signed to make students aware of (a) their 
course-related behaviors, (b) intrinsic course 
features, (c) extrinsic course outcomes, and (d) 
extrinsic attributions, These questionnaires 
were experimentally sequenced to vary what 
subjects were aware of before expressing an at- 
titude. Intrinsic features were either made 
salient or not. Extrinsic features were either 
made salient or not. Within conditions of sali- 
ent eXtrinsic features, students were either 
induced or not induced to attribute their 
course behaviors to the achievement of extrin- 
sic outcomes. Finally, when both intrinsic (1) 
and extrinsic (E) features were made salient, 
two orders were imposed (I-E and E-I). 

To measure the use of behavior to infer at- 
titudes, correlations between subjects’ behavior 
and their attitudes are computed, If subjects 
use their behavior to derive an attitude, high 
positive correlations should obtain; if not, zero 
correlations, Tests of the attribution hypothe- 
sis are made by examining differences in mean 
correlations between conditions. 

The experimental manipulations of extrinsic 
outcomes and attributions were done to test 
directly the attribution argument. If the attri- 
bution assumption is not nec ary and the 
mere salience of extrinsic features are sufficient 
to create attitudes, then attitude behavior cor- 
relations should be lower when extrinsic fea- 
tures are made salient, If, however, attribution 
is critical to attitude inference, then attitude 
behavior correlations should be lower only 
when students are induced to attribute their 
behavior to the extrinsic outcomes of their 
courses, 

The manipulations and Measurement of in- 
trinsic features were made to examine some 
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alternative processes by which attitudes derive 
from behavior. One possible way behavior is 
used to infer attitudes is as a salient and direct 
source of information about feelings. Behavior 
in this sense is a reaction to a situation and as- 
sumed to indicate subjective impressions. If 
there is no salient extrinsic reason for the be- 
havior, a person may simply assume or impute 
an internal reason for the behavior, such as lik- 
ing. Essentially, the person imputes character- 
istics to himself directly from the behavior. 
The empirical implications of this assumption 
are: (a) Behavior is correlated with attitudes 
whenever an extrinsic attribution is not made 
before measuring the attitude, and (b) behavior 
and attitudes are uncorrelated with subjects’ 
reports of the intrinsic features of their courses. 
A variation on the above process is one which 
suggests a person directly infers an attitude 
from his behavior but does so because he has 
no other bases for judging his attitude. Other 
potential information about a situation may be 
less salient, less easily recalled, and hence less 
available for developing an attitude. As Nis- 
bett and Valins (1971) have said, “If we knew 
our feelings about objects independently of our 
behavior toward them, observation of our be- 
havior could not influence our feelings” (p. 75). 
To test this possibility in the present study, 
some students are specifically primed to recall 
concrete aspects of their courses which they 
found interesting or enjoyable (intrinsic fea- 
tures salient), Since they could derive an atti- 
tude from this information, their need for in- 
ferring one from behavior should be less and 
the attitude-behavior correlations should be 
lower. 
An alternative process by which behavior 
might affect attitude is as a cue used to impute 
_ characteristics to the situation. A person who 
behaves in a positive way in a class (by doing 
all the readings) may assume some characteris- 
tics of the course (e.g., an interesting professor) 
led him to so behave and he imputes those 
characteristics to the course. The person then 
derives an attitude from the characteristics he 
imputes, creating a correlation between atti- 
tude and behavior. If the behavior is already 
attributed to extrinsic factors, of course, there 
would be no need to impute intrinsic character- 
istics. This process makes quite different pre- 
dictions than the first one, Behavior should be 
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correlated not only with attitudes but also with 
subjects’ reports about the intrinsic features of}. 
the course, as long as an extrinsic attribution >. 
does not precede those reports. To test this, 
subjects are given different orders for recalling 
intrinsic and extrinsic features of their courses. 
Some are primed to recall intrinsic features be- 
fore extrinsic features and before an induced 
attribution. Others are primed to recall intrin- 
sic features after the extrinsic attribution in- 
ducement. If behavior affects the characteriz- 
ing of courses, it follows that attitude-behavior 
correlations are higher for the I-E groups than 
the E-I groups. 

Thus far we have discussed the effect of attri- 
bution on the use of behavioral information in 
forming an attitude—the correlation of be- 4 
havior and attitude reports. In the context of 
the present study, the attribution hypothesis 
has less to suggest about the mean level of at- 
titudes which students express. Since students 
may behave in either positive or negative ways w, 
with regard to their courses, the attribution 
hypothesis only addresses the issue of whether 
they develop positive or negative attitudes rel- 
ative to some unknown mean. While we had » 
no a priori expectations about mean level of 
attitudes, there were major differences among 
groups; these are presented in the Results sec- 
tion and are discussed after the presentation. 


METHOD top 
Subjects 


The subjects were 120 undergraduates in commerce * 
who participated as part of a course requirement; they 
were distributed randomly into eight experimental con- 
ditions that varied ques in a single questionnaire 
about a course taken the previous semester (5 weeks 
earlier). Subjects were run in group sessions of about 10 
each. Participation took about 30 min. 


Procedure 


When subjects arrived for the experiment, the ex- x 
perimenter asked them to write the names, numbers, & 
and grades of four courses they took in the previous 
semester. The experimenter kept the list and directed 
the subjects to a seat in a classroom three seats removed 
from other subjects in the room. The experimenter re- 
turned to the front desk, looked at a random number 
table, and selected one of the subject's listed courses, 
writing the course number on an experimental question- 
naire. After all subjects were seated, the experimenter 
explained the purpose of the study as one of trying to 
find out what students thought about their courses and 


— 


~are: (positive /grade related) “I took not 


what their experiences were, To this end, they were to 
answer a short questionnaire about one course from the 
previous semester, They were told that the course was 
to be randomly selected from the ones they listed, so a 
fair representation of courses could be as . They 
were told to fill out the questionnaire only for the s 
lected course and think about that course before re- 
sponding, They were to answer every question and to do 
only one page at a time in the order of the pages. The 
essentials of these instructions were printed on the fac- 
ing page of the questionnaire. Each subject was then 
handed his questionnaire and asked to write his name in 
a provided space. 


Selection of Course for valuation 


Although the experimenter said the course for evalu- 
ation was to be selected randomly from the subject’s 
list, in fact the experimenter always selected the highest 
graded course. In the case of tied grades, a random selec- 
tion between the two, three, or four highest graded 
courses was made, The reason for selecting a high- 
graded course was to make the manipulation of extrinsic 
attribution effective. The manipulation involved induc- 
ing subjects to believe they did what they did in the 
course in order to get a good grade. If in fact they did 
not get a good grade, the attribution has no credibility. 
‘Thus, courses evaluated were those receiving high 
grades. The average grade of all courses was 3.496, 
compared to a grade point average for all subjects of 
3.953. Only 5 of 120 subjects reported a course grade 
below their grade point average. 


Experimental Questionnaire 


The experimental questionnaire was presented as a 
single stapled booklet composed of five separable sec- 
tions: (a) behavior questions, (b) intri ic feature ques- 
tions, (c) extrinsic feature questions, (d) extrinsic at- 
tribution inducement questions, and (¢) attitude ques- 
tions. Each of these is described in turn. 

The behavior questionnaire consisted of 26 questions 
about the students’ course-relevant behavior., Four 
types of behavior were used, varying as to whether it 
was a positive behavior or a negative behavior and vary- 
ing as to whether it was a behavior related to getting 
grades in a course. A pretest sample had been asked to 
sort behaviors in terms of whether they thought the be- 
havior was relevant to getting a good grade. The be- 
haviors were defined as grade related and unrelated on 
this basis. Theoretically, it was expected that the unre- 
lated behaviors should be used more in inference of at- 
titude than the grade-related behaviors, since the latter 
were more likely to be attributed to extrinsic causes in 
the absence of experimental inducement.! The four 
types of behavior were randomly sequenced in the ques- 
tionnaire on three pages, with an 11-point scale from 
“Never” to “Always” accompanying each. Examples 
on readings 
in preparation for this course”; (negative/grade re- 
lated) “Of the cases, problems, or papers assigned in this 
course, I did less than was required”; (positive /unre- 
lated) “I met with other students of this course to dis- 
cuss the subject matter of the course”; (negative/unre- 
lated) “I was unattentive in clas 
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The intrinsic features questionnaire provided subjects 
with an opportunity to recall and describe aspects of the 
course which they found intrinsically enjoyable. Four 
questions were asked, each requesting the student to 
describe an incident. The first asked; “Describe as con- 
cretely as you can an aspect of the instructor's class- 
room lectures which you found to be particularly en- 
joyable in and of itself (if any),” Other questions were 
phrased similarly with regard to aspects of “the reading 
material,” “the course’s subject matter,” and “being in 
this course (inside or outside the classroom), A space 
for writing each concrete incident was provided along 
with a 44-dot scale. On the scales, subjects were to 
judge “How often did such incidents or aspects of the 
classroom lectures (ete.) occur during the semester?” 
End phrases to the scales were “Hardly ever” and “Al- 
most always.” The purpose of the intrinsic features 
questionnaire was to make nonextrinsic aspects of the 
course salient to the students, 

‘The extrinsic features questionnaire was intended to 
make extrinsic aspects of their courses salient to stu- 
dents, Six questions assess which the 
course provided an opportunity for specific outcomes, 
such as credit toward graduation, grades, and so on. 
The first question asked, “To what extent did the course 
provide you with an opportunity to meet university 
educational requirements?” A scale followed with end 
labels “Provided such an opportunity” and “Did not 
provide such an opportunity.” The other questions 
were phrased in the same way but with regard to an op- 


portunity “to obtain a good grade,” “to prepare for a 
career,” “to obtain training or knowledge relevant to 


ly relevant distinction between 
taken 


1The theoretic: 
grade-related and grade-unrelated behaviors 
on face validity in analyzing the data from this study, 
However, empirically, the two kinds of behaviors were 
related to attitudes in both similar and dissimilar ways. 
For groups given explicit extrinsic attribution induce- 
ments, attitudes were more correlated with grade-re- 
lated behaviors than with grade-unrelated behaviors 
(436 vs. 160). For other groups, the grade-related be- 
haviors were less correlated with attitudes than the 
grade-unrelated behaviors only when intrinsic features 
of the course were experimentally primed (.270 vs. 563), 
Otherwise, the two kinds of behaviors related to atti- 
tudes in much the same way (.68 vs. .62). The ave 
correlation between grade-related and grade-unrelated 
behavior was .5325, with no substantial variation be- 
tween groups. These facts are not easily summarized ex- 
cept to suggest that behaviors are differentially recol- 
lected in forming attitudes when explicit attributions 
are induced, The distinction between “positive” and 
“negative” behaviors was validated empirically in every 
case, Behaviors defined as positive were always posi- 
tively associated with attitudes and negative behaviors 
were always negatively associated with attitudes. One 
other interesting aspect of the behavior students report 
is that it is unrelated to the grades they received in the 
courses (an average correlation of .06), consistent with 
other studies this author has conducted when there 
were no sampling restrictions on grades. 
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getting a job,” “to obtain credit applicable to gradua- 
tion,” and “to obtain credit applicable to your major.” 

The extrinsic attribution questionnaire was designed 
to prime subjects to attribute their course activities to 
the extrinsic outcomes of the course, The questionnaire 
was introduced with, “In previous studies on course 
activities we have found that students generally work 
fairly hard to do well in their courses, and that they gen- 
erally try to do well to the best of their individual abili- 
ties, We are trying to learn of some of the reasons why 
students take courses and why they try to do well in 
them. Therefore, below, we are going to ask you some 
questions about your own reasons for taking the course 
which you are describing.” ‘There followed five yes-no 
questions which pretesting indicated students had a 
high probability of answering yes The first, and most 
important, question asked, “Students often take courses 
because they would like to get a good grade in them, Did 
you want to get a good grade in this course?” This ques- 
tion form was used for all items intended to induce an 
extrinsic attribution. It is obvious that the form does 
not really ask students to say they took the course to 
get a good grade. Rather, it alludes to an implied attri- 
bution by asking the student to indicate that he wanted 
a good grade from the course, afler pr senting the con- 
clusion that students take courses because they want to 
get a good grade. Pretesting had indicated that this 
particular form of inducement was effective in getting 
students to make extrinsic attributions and that most 
students were willing to respond yes to the question as 
put (98.3%). For a strong manipulation, it was neces- 
sary that students respond yes to the questions. The 
more direct form of asking “did you take this course in 
order to get a good grade” resulted in fewer students 
answering yes, The remaining items requested extrinsic 
attributions with regard to “credit toward graduation,” 
“benefit your career goals,” “improve or maintain your 
grade point average," and “in response to your parents’ 
or friends’ suggestions.” 

The attitude questionnaire consisted of questions re- 
garding students’ attitudes toward the instructor, the 
course material, the lectures, and the subject matter. 
The most important ones for this experiment concern 
the students’ liking, enjoyment, and interest in the 
course and material, Each question was answered on & 
7-point scale with appropriate end labels. Directly fol- 
lowing the attitude questionnaire, subjects were asked 
about their extrinsic or intrinsic attributions and about 
factual details of the course. 


Experimental Variations 


Experimental variations were created by differential 
sequencing of the various sections of the questionnaire. 
The different sequences were theoretically justified by 
our interest in assessing how attribution affected the use 
of behavioral information in attitude inferences, Eight 
sequences were created; in each case, the subject began 
by filling out the behavior questionnaire and ended by 
filling out the attitude questionnaire. What intervened 
between the behavior questionnaire and the attitude 
questionnaire (italicized) represents the experimental 
variations. The following sequences were used ; for each, 
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A represents attitude questionnaire; B, behavior ques- 
tionnaire; Z, intrinsic saliency manipulation; Æ, extrin- 


ducement: 


(1) 
(2) 
(3) 
(4) 
(5) 
(6) 
(7) 
(8). 


The above sequences vary along four dimensions. For 
some subjects, intrinsic aspects of the course are made 
salient (4, 5, 6, 7, and 8); for others, they are not (1, 2, 
and 3). For some subjects, extrinsic aspects of the course 
are made salient (2, 3, 4, 5, 6, and 7) ; for others, they are 
not (Land 8). Within groups for which extrinsic features 
are salient, for some an attribution inducement is made 
(2, 4, and 6) and for others it is not (3, 5, and 7). The 
only other variation is the order of the intrinsic and ex- 
trinsic saliency manipulation for groups given both prior 
to the attitude questionnaire (4, 5, 6, and 7). Two 
groups receive the I-E sequence (4 and 5), and two the 
E-I sequence (6 and 7 Rach group had 15 students in 
it. 
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Dependent Variables 


The major dependent variable is the correlation of 
subjects’ attitudes with their behaviors. It is assumed 
that subjects using their behavior to infer their attitude 
have more or less favorable attitudes when they report 
behaving positively or negatively toward the course. 
For this analysis, the grade/unrelated behavior ques- 
tions are combined by summing the positive behaviors 
and subtracting the summed negative behaviors. ‘The 
attitude score is the sum of four questions about how 
much the students like the course, though it was inter- 
esting, liked the material, and thought the course was 
enjoyable, 


Results 

In the analyses presented below, only the 
course-related but grade-unrelated behaviors 
are used (see Footnote 1). Attribution refers 
to the manipulation of extrinsic attribution. 
The distinction between attribution and non- 
attribution groups is based on the presentation 
of the extrinsic attribution questionnaire prior 


to the measurement of attitude. The distinc- ° 


tion between salient and nonsalient groups 1s 
based on the presentation of the relevant ques- 
tionnaires prior to attitude measurements. 


Manipulation Checks 


Since the variations in question sequence 
concern the relative saliency of intrinsic and 


sic saliency manipulation; and af, the attribution in- a 
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extrinsic features of the courses, an attempt 


-was made to assess the effects of those manipu- 


“lations. Unfortunately, no direct method is 
‘available for tapping a person’s consciousness. 
To assess saliency indirectly, we examined 
whether subjects attributed their behavior to 
extrinsic or intrinsic factors. Presumably, at- 
, tributions would be based on salient features. 
The question assessing extrinsic attribution 
read, “To what extent was your behavior in 
the course due to your interest in getting a good 
grade?” The intrinsic attribution question 
‘read, “To what extent was your behavior due 
“fo your interest in the subject matter of the 
course?” Intrinsic attribution scores were sub- 
tracted from extrinsic attribution scores to 
“measure the extent to which subjects made ex- 
trinsic more than intrinsic attributions. Sub- 
jects in general attributed their behavior to 
grades more than to interest, with means of 
30,70 and 28.17 (on 40-point scales) and an 
< extrinsic bias of 2.54. 
The extrinsic saliency manipulation did not 
affect subjects’ attributions, and there was only 
\ slight tendency for intrinsic saliency subjects 
‘to make less extrinsic attributions than intrin- 
sic nonsalient subjects, F(1, 112) = 3.88, 
p < 07. The order of intrinsic and extrinsic 
+ saliency manipulations, however, did affect 
subjects’ attributions, The mean extrinsic at- 
tribution bias of I-E groups was —1.32 com- 
vared with a mean of 4,01 for the E-I groups. 
The significant difference, #(1, 112) = 14.28, 
p <.01, suggests that the features presented 
“first to subjects were more salient to them 
when making attributions. 
The attribution questions also attest to the 
effectiveness of the attribution manipulation. 
, Groups 2, 4, and 6 were all induced to make ex- 
trinsic attributions prior to the attitude scales, 
while Groups 3, 5, and 7 were not. The former 
groups made more extrinsic biased attributions, 
xwith means of 3.45 and .34, respectively, F(1, 
412) = 8.22, p < .01. The difference between 
attribution and no-attribution groups was 
greatest when intrinsic features were not made 
, salient (Group 2 vs. 3, M = 6.07 vs. —.06) 
and least when intrinsic features were not sali- 
ent (Groups 4 and 6 vs, 5 and 7; M = 2.14 vs. 
.54). The interaction between intrinsic saliency 
and extrinsic attribution was marginally sig- 
nificant, F(1, 112) = 3.88, p < .07. 
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TABLE 1 


TTTUDES WITH BEHAVIOR FOR 
UXPERIMENTAL GROUPS 
(n per cell = 15) 


trinsic features of course Not salient 


Salient 
Attributed (4) 33 
Intrinsic—extrinsic (2) .08 
Extrinsic—intrinsic (6) .07 
Not attributed (5) .60 
Intrinsic-extrinsic (3) .67 
Extrinsic-intrinsic (7) AS 
Not salient (8) .64 


(1) .57 


Altitudes as a Function of Behavior 


The correlations between the summed atti- 
tude scales and the reports of grade-unrelated 
behaviors for the various groups are presented 
in Table 1. To test the significance of the differ- 
ence between groups, the correlations were 
transformed into z, transforms and an anal- 
ysis of variance was made of the differences 
between groups with k and infinity degrees 
of freedom. The s transform equalizes the 
variances of the correlations so that they can be 
treated as means with variances of 1/n. In the 
analyses reported, contrasts of means were 
tested by writing orthogonal coefficients ap- 
propriate to the difference reported. Thus, for 
instance, attribution groups are weighted —1 
and nonattribution groups, 1, to test for the 
difference between them. 

As can be seen from Table 1, the attitudes of 
subjects were significantly correlated with their 
course behaviors for all groups except those 
given the extrinsic attribution manipulation. 
Moreover, there is a large and significant differ- 
ence between the attribution and nonattribu- 
tion groups, F(1, œ) = 5.595, p<.02, but 
no difference between the extrinsic salient and 
nonsalient groups, F(1, #) = 1.97, p < .20. 
There was a tendency for I-E groups to have 
higher correlations than E-I groups. There was 
also a tendency for correlations to be lower 
when intrinsic features were made salient prior 
to the attitude measure. Neither of these dif- 
ferences was significant nor substantial. 


Attitudes as a Function of Intrinsic Features 


The introduction argued that people may de- 
velop attitudes either directly from their be- 
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TABLE 2 


CORRELATIONS OF REPORTED INTRINSIC FEATURES 
WITH ATTITUDES AND BEHAVIOR 


Extrinsic features of course 


Attitudes 
Salient 
Attributed 61 
Intrinsic-extri Bs 16 
Extrinsic-intrinsic 12 
Not attributed -63 x 
Intrinsic-extrinsic .67 
Extrinsic-intrinsic +52 
Not salient 81 59 
(2) .16 
Not attributed (5) 42 
Intrinsi: extrin ic (3) .81 
Extrinsi 
Not salient (82 


havior or by processing intrinsic characteristics 
of an activity consistent with their behavior. 
Table 2 presents data relevant to both of these 
possibilities, including the correlation of in- 
trinsic features to attitudes and to behavior. 
Subjects’ reports of intrinsic features were mea- 
sured by summing their scaled responses to the 
four questions in the intrinsic features ques- 
tionnaire, 

In Table 2 it can be seen that attitudes were 
correlated with subjects’ reports of the intrinsic 
features of their courses for all groups except 
the one group induced to make an extrinsic at- 
tribution before reporting attitudes or intrinsic 
features. This group’s correlation of .16 is sig- 
nificantly different from the correlations of all 
other groups, .62, F(1, ©) = 5.24, p < .05. 
No other group differences are significant. 

In Table 2 it can be seen that the intrinsic 
features reported for a course were correlated 
with behavior primarily for those groups not 
primed to report intrinsic features before their 
attitudes and not induced to attribute their be- 
havior to extrinsic outcomes, with an average 
correlation of .815. If either intrinsic features 
were primed before attitude measurement or if 

attributions were induced, the intrinsic features 
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reported for courses were less correlated with A)” 
behavior, with an average correlation of .28 Seq: 
The difference between these two sets of cor- 
relations is significant, /(1, = 16.47, 
p < .001. There were no significant differences 
between I-E and E-I groups. 


Attitudes as a Function of Extrinsic Outcomes» 


In Table 3, the correlations of attitudes with 
grades are presented. Although the attribution 
hypothesis does not make explicit predictions 
about the effect of extrinsic outcomes on atti- Ses 
tudes, it is reasonable to assume that if atti 
tudes are not based on intrinsic features or 
behavior, they are based on the extrinsic out- 
comes to which the behavior was attributed. $ 

The attitudes of subjects in the attribution 
groups were correlated with the grades they re- 
ceived in the course. The correlations for the 
three attribution groups, .56, is significantly 
higher than the correlations for the three non- ~ 
attribution groups, —.06, F(1, œ) = 12.47, 
p < .001. In addition, the attitudes of subjects 
for whom intrinsic features were made salient 
were less highly correlated with their grades 
than were those of the nonsalient groups, F (1, 
æ) = 4.45, p < .05. No other differences were 
significant. 


Mean Levels of Altitude 


As mentioned in the introduction, the beori 
ical expectation regarding mean levels of atti- 
tude is less clear than that regarding the corres. 
lation of behavior with attitude. The fact that 


TABI 
CORRELATIONS OF ATTITUDES WITH COURSE GRADES 
FOR EIGHT EXPERIMENTAL GROUPS 

(n per cell = 15) 


4 


Intrinsic features 
of course 


ee S Not. 
Extrinsic features of course Salient salient 
Salient 
Attributed 36 
Intrinsic—extrinsic 79 
Extrinsic-intrinsic 4 
Not attributed oo (| 
Intrinsic-extrinsic =.01 
Extrinsic-intrinsi¢ —.03 
Not salient 10 59 


TABLE 4 


MEAN ATTITUDES Towarp Course ror Eicur 
EXPERIMENTAL Groups 
(x per cell = 1 


Intrinsic features 


of course 
Not 
Extrinsic features of course Salient salient 
Salient 

Attributed is 5.096 

Intrinsic-extrinsic 5.154 
Extrinsic-intrinsic 4.275 
Not attributed 5.814 

Intrinsic-extrinsic 5.388 
Extrinsic-intrinsic 4.982 


Not salient 


5.118 


Scores are based on average response to four attitude 
scaled 1-7, with 7 indicating more favorable attitudes 
course, 


some groups’ behavior correlated with their at- 
titudes and other groups’ behavior did not, 
should not, by itself, lead to mean differences 
in attitudes, as long as there were no differ- 
z ences between the groups in behavior. Within 
groups for which attitude and behavior were 
orrelated, subjects who behaved very posi- 
ively toward the course have a greater than 
“average attitude and subjects who behaved less 
positively have a less than average attitude. 
The net effect should be an average attitude 
= comparable to groups for which attitude and 
§ behavior were uncorrelated. Table 4 presents 
‘| the data relevant to these assumptions. 
As can be seen in Table 4, there were some 
major unexpected differences between the 
groups. The three nonattribution groups, re- 
“gardless of the salience of intrinsic information, 
expressed more favorable attitudes than the 
three attribution groups, with means of 5.395 
and 4.842, respectively, /(1, 112) = 4.47, 
A ¿p < 05. In addition, there were differences be- 
* tween groups questioned about the intrinsic 
and extrinsic features of the course in different 
orders, Groups receiving the I-E order had 
nore favorable attitudes than groups receiving 
the E-I order with means of 5.455 and 4.629, 
F(1, 112) = 6.66, p < .01. No other differences 
between groups were significant. 


5 DISCUSSION 


The analyses of the correlates of attitudes 
_ show several clear implications. It is clear that 
the saliency of extrinsic features of students’ 


ee 
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courses did not by itself affect the use of be- 
havioral information to derive an attitude, or 
their mean level of attitudes. The explicit in- 
ducement of attributions to extrinsic course 
outcomes, however, affected both the mean 
level of attitudes and the use of behavioral in- 
formation in deriving attitudes. It is also clear 
that when intrinsic features of courses were not 
primed-and made salient before subjects ex- 
pressed attitudes, they generated features con- 
stent with their behaviors (i.e., correlated). 
Finally, it is clear that the intrinsic character- 
istics subjects report about a course were re- 
lated to their attitudes when those features 
were salient (by direct questioning or by infer 
ence from behavior). When, however, extrinsic 
attributions were induced and intrinsic course 
features were not made salient by direct que: 
tioning, attitudes were related to the extrinsic 
outcomes students received from their courses, 

From these results we can conclude that the 
attribution of behavior to extrinsic outcomes 
does affect the attitudes individuals develop 
about a situation, and that behavioral informa- 
tion affects attitudes when extrinsic attribu- 
tions are not made. 

However, more important than the fact that 
attribution affects attitudes is the question of 
how. How does behavior affect a person’s 
attitude? And why does extrinsic attribution 
interfere with its effects? The results of the 
present study offer some tentative answers to 
these questions. In the introduction | suggested 
several processes which might account for a 
correlation of behavior and attitude in the ab- 
sence of extrinsic attribution. I attempt here 
to examine these processes in light of the 
results. 

It seems we can dismiss the first processes 
which argued that a person simply attributes 
to himself an attitude consistent with his be- 
havior when the behavior is not already at- 
tributed to other causes (extrinsic). If subjects 
merely used their behavior to infer internal 
motivations or attitudes and did not attend to 
the characteristic features of the course itself, 
then their attitudes would have been correlated 
only with their behavior and not at all with 
their reports of the intrinsic features of the 
course. Instead, both were correlated with 
attitudes. 

It also seems that we can dismiss the argu- 
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ment that individuals impute some internal 
motivation or feeling to themselves when they 
have no other means for evaluating their 
courses. Subjects who were primed to recall 
interesting features about their courses (intrin- 
sic salient conditions) could have used that in- 
formation to infer an interest in the course and 
have less need to rely on their behavior for in- 
ferring an attitude. However, the manipula- 
tions of saliency for intrinsic features did not 
affect the correlations of attitudes with either 
behavior or with intrinsic features. 

It is less easy to dismiss the second process 
that was considered in the introduction, 
namely, that behavior is used to impute char- 
acteristics to a situation from which attitudes 
are then derived. Like the first process, this 
process suggests that (a) a person’s behavior is 
something which needs explaining and (b) in 
the absence of extrinsic reasons for the behav- 
ior, the person develops intrinsic reasons con- 
sistent with the behavior. However, while the 
first process suggests that the person attrib- 
utes a characteristic to himself consistent with 
the behavior (a motivation or liking) indepen- 
dent of the intrinsic characteristics of the situ- 
ation, this second process suggests that the 
person attributes characteristics to the situ- 
ation consistent with his behavior (an inter- 
esting job, a challenging professor). These 
imputed characteristics are then used in form- 
ing an attitude toward the situation. 

The results of the experiment are partially 
but not entirely consistent with this second 
process. First, one would have expected high 
correlations between behavior and the Sub- 
jects’ reports of intrinsic course features for 
every group for which an extrinsic attribution 
did not precede the intrinsic features question- 
naire. Of the six groups for which this was the 

ins 1, 3, 4, 5, 7, and 8), only three 


case (Gr 


showe icant correlations between be- 
havior eir reports of intrinsic course 
featur: 1, this process would have pre- 
dicted | lations between attitudes and 


rinsic features of the course. 
isistent with this prediction 
‘ons are not extremely high 
Third, this process would 
orrelations between atti- 

all groups for which an 

d not precede reports of 


reports c 
The resuli 
though the 
(.6 on the 
have predict 
tude and be! 
extrinsic attr 
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intrinsic course features sults are again 
somewhat consistent but erect; five of the! 
six groups had significant lations. 


seems to acy, 
e is one good 
ucial test of 
the imputa 


Although this second p 
count for the data fairly we 
reason for rejecting it. The nv 
an argument that behavior affec 
tion of intrinsic characteristics co: -<s from th 
data for the group receiving the ‘-I-E-ai-A 
sequence of questionnaires (Group '). This 
group received the attribution manipu: ‘ion 
prior to expressing their attitudes but after | 
porting on the intrinsic features of their courses. 
Since the attribution manipulation followed | 
the report of intrinsic course features, it shou a 
not have affected the use of behavior to inier 
intrinsic features, and hence an attitude. And _ 
while this group’s attitude-behavior corr AE 
tion was larger than those of other attribution | 
groups (.33 vs. .07 and .08), the difference was 
not substantial or significant. Moreover, | 16) 
correlation of this group’s behavior reports} 
with the reports of intrinsic course features vas | 
rather low (.10), a result not consistent with 
assuming that the unattributed behavior di- 


rectly affects the imputation of course | ‘ar 
acteristics. Because of the partial suppori nd? 
partial lack of support for this process, it «an 
only be concluded that it is an open ques’ on. 


Retrospective Saliency 


Although the data do not provide si rong} 
-support for assuming behavior affects attit udes} 
through a direct inference process, the data are 
quite clear in suggesting that extrinsic attribu -} 
tion affects attitudes in some way whenever it 
intervenes between the recall of behavior and 
the expression of an attitude. This fact sug- 
gests a third process by which behavior affects 
attitudes, namely, a retrospective recall of in- 
trinsic characteristics as*or'*! ity viog 
That is, while behavior : pute 
tions of features to a situ 
how individuals recollect 
forming an attitude. The high « 


5 ow 
lations w4 


tween reports of behavior and re orts © mg 
trinsic features after subjects exp: s on atte 
tude (.81 and .82) suggest this ray be the 


case. And the correlatigns of attitudes with 
both behavior and with intrinsic features fot 
nonattribution groups may result fri mjndividi 
uals recalling some subset of intrinsic charac 
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teristics associated with their behavior, produc- 
ing moderate correlations with both rather 
than high correlations with either. 

Some retrospective processes may also ac- 
count for why attribution subjects seem to 
develop attitudes more related to their extrin- 
sic outcomes. The most salient behavioral ex- 
periences they may have about the course may 
be those associated with the achievement of 
grades. More than just forming attitudes cor- 
related with grades, these same subjects tend 
to form attitudes correlated with their reports 
of grade-related behaviors (see Footnote 1) but 
not their grade-unrelated behaviors, If experi- 
ences associated with the achievement of ex- 
trinsic outcomes are made salient by inducing 
an extrinsic attribution, then perhaps the in- 
trinsic characteristics of the situation (associ- 
ated with grade-unrelated behaviors) are made 
less salient and less easily recalled. This pos- 

sibility would also account for the findings that 
attribution groups had less favorable attitudes 
in addition to having attitudes less dependent 
on their grade-unrelated behaviors. With their 
attention focused on the extrinsic outcomes 
and experiences of the course, they may simply 
be unable to recall intrinsic features from which 
favorable attitudes develop. 

The form of most attitude questions offers 
another clue as to why subjects induced to 
make extrinsic attributions had less positive 
attitudes. The attitude questions used, them- 
selves, may direct a person to respond with 
reference, to the intrinsic features of the situa- 
tion. Thus, questions of the form “Did you like 
that course?” and “Did you find that course in- 
teresting?” invite the respondent to reflect 
upon himself and the course in relation to him- 
self. They direct a person to recall information 
about the intrinsic features of the situation. 
On the other hand, a related but somewhat dif- 
ferent question might be “Was that course 

| worthwhile?” ‘This form of question can be 
Sanswered by reference to either information 
about the course (intrinsic features) or in- 
formation about one’s outcomes from the 
course (extrinsic features). If the relative sali- 
ence of intrinsic and extrinsic experiences is 
what distinguishes attribution from nonattri- 
bution subjects in the present experiment, then 
the response differences observed for the liking- 
interest type questions would not occur for 


1311 


worthwhile type questions. Indeed, they did 
not. Questions asking subjects to judge how 
much they got out of the course, how it met 
with expectations, and how useful it was to 
them personally produced no significant differ- 
ences between attribution and nonattribution 
groups. These three questions, of course, can 
all be answered with regard to either the in- 
trinsic or extrinsic aspects of a course experi- 
ence, As such, they are not biased against sub- 
jects for whom extrinsic features may be mòre 
salient than intrinsic one 
The possibility that salient extrinsic experi- 
ences may inhibit the recollection of intrinsic 
task information is also consistent with some 
findings reported in the literature, For example, 
Staw et al. (Note 1) found that paying people 
to do a task decreased task enjoyment when no 
norm for payments ed but did not do so 
when payment wa normally associated with 
the task, a result also found by Kruglanski, 
Riter, Amitai, Margolin, Shabtai, and Zaksh 
(1975), A nonnormative reward may be more 
salient than a normative reward and, as a con- 
sequence, inhibit the recall of intrinsic informa- 
tion when formulating an attitude. A saliency 
interpretation is also consistent with Ross's 
(1975) finding that a salient reward inhibits 
intrinsic motivation in children but a nongali- 
ent reward does not, Ross explicitly focused 
children’s attention on a reward for an interest- 
ing task. It is possible that making the reward 
salient, in addition to affecting attributions, 
also inhibited the recollection of intrinsic in- 
formation about the task, thereby resulting in 
less favorable attitudes toward the task. 
Although some form of retrospective process 
affecting the relative saliency of intrinsic and 
extrinsic information seems to account for the 
present results, further experimental work 
clearly is needed. Attribution theory has done 
very well in demonstrating some rather inter- 
esting and unique phenomena about individ- 
ual attitudes, but attribution researchers have 
done rather poorly in either testing the critical 
assumptions of their theories or understanding 
how the phenomena arise. With regard to fur- 
ther investigation, one interesting feature of the 
retrospective saliency argument described 
above is that the inhibitory effect of extrinsic 
attribution on the use of behavior to infer atti- 
tude should not be dependent on the extrinsic 
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value of the attribution. If the result is due to 
focusing a person’s attention to some particular 
aspect of his experience, then one should be 
able to focus attention equally well by forcing 
an intrinsic attribution. That is, any salient 
attribution of behavior in a situation could 
diminish the correlation between attitude and 
behavior. 
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